Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 46.588
Filtrar
Mais filtros











Intervalo de ano de publicação
1.
Ger Med Sci ; 22: Doc07, 2024.
Artigo em Inglês | MEDLINE | ID: mdl-39224664

RESUMO

Objective: The study aimed to investigate the subjective method of estimating linear breast dimensions in comparison to the objective method. Methods: The reproducibility and accuracy of the subjective method of estimating linear breast dimensions during a simplified breast shape analysis were examined. Four linear breast dimensions including the distance from the sternal notch to the nipple, distance from the nipple to the inframammary fold, distance from the nipple to the midline and under-breast width were evaluated based on subjective estimates. Images from 100 women with natural breasts and without any history of breast surgery were reviewed by two examiners three times each. The cases were obtained from a large database of breast images captured using the Vectra Camera System (Canfield Scientific Inc., USA). The subjective data were then compared with the objective linear data from the Vectra Camera System in the automated analysis. Statistical evaluation was conducted between the three repeated estimates of each examiner, between the two examiners and between the objective and subjective data. Results: The intra-individual variations of the three subjective estimates were significantly greater in one examiner than in the other. This trend was consistent across all eight parameters in the majority of the comparisons of the standard deviations and variation coefficients, and the differences were significant in 14 out of 16 comparisons (p<0.05). Conversely, in the comparison between the subjective and objective data, the estimates were closer to the measurements in one examiner than the other. In contrast to the reproducibility observed, the assessment of the accuracy revealed that the examiner who previously presented with less reproducibility of the estimated data overall showed better accuracy in comparison to the objective data. The overall differences were inconsistent, with some being positive and others being negative. Regarding the distances from the sternal notch to the nipple and breast width, both examiners underestimated the values. However, the deviations were at different levels, particularly when considering the objective data from the Vectra Camera System as the gold standard data for comparison. Regarding the distance from the nipple to the inframammary fold, one examiner underestimated the distance, while the other overestimated it. An opposite trend was noted for the distance from the nipple to the midline. There were no differences in the estimates between the right and left sides of the breasts. The correlations between the measured and estimated distances were positive: as the objective distances increased, the subjective distances also increased. In all cases, the correlations were significant. However, the correlation for the breast width was notably weaker than that for the other distances. Conclusions: The error assessment of the subjective method reveals that it varies significantly and unsystematically between examiners. This is true when assessing the reproducibility as well as the accuracy of the method in comparison to the objective data obtained with an automated system.


Assuntos
Mama , Humanos , Feminino , Mama/anatomia & histologia , Mama/diagnóstico por imagem , Reprodutibilidade dos Testes , Adulto , Pessoa de Meia-Idade , Variações Dependentes do Observador , Idoso , Adulto Jovem , Fotografação/métodos
2.
BMC Health Serv Res ; 24(1): 1094, 2024 Sep 19.
Artigo em Inglês | MEDLINE | ID: mdl-39300401

RESUMO

BACKGROUND: Dizziness and vertigo are common referrals to Ear Nose Throat (ENT) outpatient services however these services have long waitlists for assessment. Primary contact physiotherapy-led vestibular clinics are recognized as improving access to care. This pilot study investigated agreement between physiotherapists and an ENT medical practitioner for diagnostic and management decisions in patients attending a primary contact physiotherapy-led vestibular clinic. METHODS: Prospective blinded inter-rater agreement study undertaken in an ENT primary contact physiotherapy-led vestibular clinic. Participants were adults referred to ENT from general practitioners, triaged (Category 2 or 3) to the primary contact physiotherapy-led vestibular clinic with clinical symptoms consistent with vestibular disorder. Primary outcome measures included agreement of diagnoses and management decisions made by an ENT medical practitioner and Physiotherapist based on a vestibular physiotherapy assessment. Adverse events were reviewed 11 months post data collection. Gwet's first order agreement co-efficient (AC1) calculated inter-rater reliability between physiotherapy and ENT. RESULTS: Fifty-one participants were recruited consecutively from the primary contact physiotherapy-led vestibular clinic. Physiotherapy and ENT had a substantial agreement (AC1 0.613) on diagnosis. AC1 between physiotherapy and ENT for recommending Magnetic resonance imaging (0.810) and computerized tomography (0.935) both indicated near perfect agreement. There was moderate to near-perfect agreement regarding management recommendations between physiotherapy and ENT. Substantial agreement (AC1 0.720) was found for recommendations for ENT input, near perfect agreement (AC1 0.933) for neurology input and moderate agreement (AC1 0.574) for physiotherapy input. There were no adverse events from physiotherapist's management decision, based on final recommendations undertaken 11-months post data collection. CONCLUSIONS: Physiotherapists and ENT medical practitioner made comparable diagnostic and management decisions, based on physiotherapy and audiology hearing assessment, for adults with signs of vestibular dysfunction, within an ENT primary contact physiotherapy-led vestibular clinic. This study provides support for this type of Physiotherapy-led service in managing patients referred to an ENT service with vestibular dysfunction.


Assuntos
Fisioterapeutas , Humanos , Projetos Piloto , Estudos Prospectivos , Masculino , Feminino , Pessoa de Meia-Idade , Adulto , Modalidades de Fisioterapia , Idoso , Otolaringologia/normas , Doenças Vestibulares/diagnóstico , Doenças Vestibulares/terapia , Encaminhamento e Consulta , Variações Dependentes do Observador , Reprodutibilidade dos Testes , Vertigem/diagnóstico , Vertigem/terapia
3.
J Pathol Clin Res ; 10(5): e12395, 2024 Sep.
Artigo em Inglês | MEDLINE | ID: mdl-39294925

RESUMO

The gold standard for enrollment and endpoint assessment in metabolic dysfunction-associated steatosis clinical trials is histologic assessment of a liver biopsy performed on glass slides. However, obtaining the evaluations from several expert pathologists on glass is challenging, as shipping the slides around the country or around the world is time-consuming and comes with the hazards of slide breakage. This study demonstrated that pathologic assessment of disease activity in steatohepatitis, performed using digital images on the AISight whole slide image management system, yields results that are comparable to those obtained using glass slides. The accuracy of scoring for steatohepatitis (nonalcoholic fatty liver disease activity score ≥4 with ≥1 for each feature and absence of atypical features suggestive of other liver disease) performed on the system was evaluated against scoring conducted on glass slides. Both methods were assessed for overall percent agreement with a consensus "ground truth" score (defined as the median score of a panel of three pathologists' glass slides). Each case was also read by three different pathologists, once on glass and once digitally with a minimum 2-week washout period between the modalities. It was demonstrated that the average agreement across three pathologists of digital scoring with ground truth was noninferior to the average agreement of glass scoring with ground truth [noninferiority margin: -0.05; difference: -0.001; 95% CI: (-0.027, 0.026); and p < 0.0001]. For each pathologist, there was a similar average agreement of digital and glass reads with glass ground truth (pathologist A, 0.843 and 0.849; pathologist B, 0.633 and 0.605; and pathologist C, 0.755 and 0.780). Here, we demonstrate that the accuracy of digital reads for steatohepatitis using digital images is equivalent to glass reads in the context of a clinical trial for scoring using the Clinical Research Network scoring system.


Assuntos
Hepatopatia Gordurosa não Alcoólica , Humanos , Hepatopatia Gordurosa não Alcoólica/patologia , Ensaios Clínicos como Assunto , Reprodutibilidade dos Testes , Biópsia , Fígado/patologia , Interpretação de Imagem Assistida por Computador/métodos , Variações Dependentes do Observador
4.
Arch Dermatol Res ; 316(8): 608, 2024 Sep 06.
Artigo em Inglês | MEDLINE | ID: mdl-39240381

RESUMO

Line-field confocal optical coherence tomography (LC-OCT) is a new technology for skin cancer diagnostics. However, the interobserver agreement (IOA) of known image markers of keratinocyte carcinomas (KC), including basal cell carcinoma (BCC) and squamous cell carcinoma (SCC), as well as precursors, SCC in situ (CIS) and actinic keratosis (AK), remains unexplored. This study determined IOA on the presence or absence of 10 key LC-OCT image markers of KC and precursors, among evaluators new to LC-OCT with different levels of dermatologic imaging experience. Secondly, the frequency and association between reported image markers and lesion types, was determined. Six evaluators blinded to histopathologic diagnoses, assessed 75 LC-OCT images of KC (21 SCC; 21 BCC), CIS (12), and AK (21). For each image, evaluators independently reported the presence or absence of 10 predefined key image markers of KCs and precursors described in an LC-OCT literature review. Evaluators were stratified by experience-level as experienced (3) or novices (3) based on previous OCT and reflectance confocal microscopy usage. IOA was tested for all groups, using Conger's kappa coefficient (κ). The frequency of reported image marker and their association with lesion-types, were calculated as proportions and odds ratios (OR), respectively. Overall IOA was highest for the image markers lobules (κ = 0.68, 95% confidence interval (CI) 0.57;0.78) and clefting (κ = 0.63, CI 0.52;0.74), typically seen in BCC (94%;OR 143.2 and 158.7, respectively, p < 0.001), followed by severe dysplasia (κ = 0.42, CI 0.31;0.53), observed primarily in CIS (79%;OR 7.1, p < 0.001). The remaining seven image-markers had lower IOA (κ = 0.06-0.32) and were more evenly observed across lesion types. The lowest IOA was noted for a well-defined (κ = 0.07, CI 0;0.15) and interrupted dermal-epidermal junction (DEJ) (κ = 0.06, CI -0.002;0.13). IOA was higher for all image markers among experienced evaluators versus novices. This study shows varying IOA for 10 key image markers of KC and precursors in LC-OCT images among evaluators new to the technology. IOA was highest for the assessments of lobules, clefting, and severe dysplasia while lowest for the assessment of the DEJ integrity.


Assuntos
Carcinoma Basocelular , Carcinoma de Células Escamosas , Queratinócitos , Ceratose Actínica , Variações Dependentes do Observador , Neoplasias Cutâneas , Tomografia de Coerência Óptica , Humanos , Neoplasias Cutâneas/diagnóstico por imagem , Neoplasias Cutâneas/patologia , Neoplasias Cutâneas/diagnóstico , Tomografia de Coerência Óptica/métodos , Carcinoma Basocelular/diagnóstico por imagem , Carcinoma Basocelular/patologia , Carcinoma Basocelular/diagnóstico , Carcinoma de Células Escamosas/diagnóstico por imagem , Carcinoma de Células Escamosas/patologia , Queratinócitos/patologia , Ceratose Actínica/diagnóstico por imagem , Ceratose Actínica/patologia , Ceratose Actínica/diagnóstico , Microscopia Confocal/métodos , Lesões Pré-Cancerosas/diagnóstico por imagem , Lesões Pré-Cancerosas/patologia , Feminino , Masculino , Idoso , Pessoa de Meia-Idade
5.
BMC Med Educ ; 24(1): 962, 2024 Sep 03.
Artigo em Inglês | MEDLINE | ID: mdl-39227811

RESUMO

BACKGROUND: This study aimed to answer the research question: How reliable is ChatGPT in automated essay scoring (AES) for oral and maxillofacial surgery (OMS) examinations for dental undergraduate students compared to human assessors? METHODS: Sixty-nine undergraduate dental students participated in a closed-book examination comprising two essays at the National University of Singapore. Using pre-created assessment rubrics, three assessors independently performed manual essay scoring, while one separate assessor performed AES using ChatGPT (GPT-4). Data analyses were performed using the intraclass correlation coefficient and Cronbach's α to evaluate the reliability and inter-rater agreement of the test scores among all assessors. The mean scores of manual versus automated scoring were evaluated for similarity and correlations. RESULTS: A strong correlation was observed for Question 1 (r = 0.752-0.848, p < 0.001) and a moderate correlation was observed between AES and all manual scorers for Question 2 (r = 0.527-0.571, p < 0.001). Intraclass correlation coefficients of 0.794-0.858 indicated excellent inter-rater agreement, and Cronbach's α of 0.881-0.932 indicated high reliability. For Question 1, the mean AES scores were similar to those for manual scoring (p > 0.05), and there was a strong correlation between AES and manual scores (r = 0.829, p < 0.001). For Question 2, AES scores were significantly lower than manual scores (p < 0.001), and there was a moderate correlation between AES and manual scores (r = 0.599, p < 0.001). CONCLUSION: This study shows the potential of ChatGPT for essay marking. However, an appropriate rubric design is essential for optimal reliability. With further validation, the ChatGPT has the potential to aid students in self-assessment or large-scale marking automated processes.


Assuntos
Educação em Odontologia , Avaliação Educacional , Estudantes de Odontologia , Humanos , Reprodutibilidade dos Testes , Avaliação Educacional/métodos , Educação em Odontologia/métodos , Educação em Odontologia/normas , Feminino , Singapura , Masculino , Cirurgia Bucal/educação , Variações Dependentes do Observador
6.
PLoS One ; 19(9): e0310486, 2024.
Artigo em Inglês | MEDLINE | ID: mdl-39269960

RESUMO

PURPOSE: To assess the reproducibility of radiomic features (RFs) extracted from dynamic contrast-enhanced computed tomography (DCE-CT) scans of patients diagnosed with hepatocellular carcinoma (HCC) with regards to inter-observer variability and acquisition timing after contrast injection. The predictive ability of reproducible RFs for differentiating between the degrees of HCC differentiation is also investigated. METHODS: We analyzed a set of DCE-CT scans of 39 patients diagnosed with HCC. Two radiologists independently segmented the scans, and RFs were extracted from each sequence of the DCE-CT scans. The same lesion was segmented across the DCE-CT sequences of each patient's scan. From each lesion, 127 commonly used RFs were extracted. The reproducibility of RFs was assessed with regard to (i) inter-observer variability, by evaluating the reproducibility of RFs between the two radiologists; and (ii) timing of acquisition following contrast injection (inter- and intra-imaging phase). The reproducibility of RFs was assessed using the concordance correlation coefficient (CCC), with a cut-off value of 0.90. Reproducible RFs were used for building XGBoost classification models for the differentiation of HCC differentiation. RESULTS: Inter-observer analyses across the different contrast-enhancement phases showed that the number of reproducible RFs was 29 (22.8%), 52 (40.9%), and 36 (28.3%) for the non-contrast enhanced, late arterial, and portal venous phases, respectively. Intra- and inter-sequence analyses revealed that the number of reproducible RFs ranged between 1 (0.8%) and 47 (37%), inversely related with time interval between the sequences. XGBoost algorithms built using reproducible RFs in each phase were found to be high predictive ability of the degree of HCC tumor differentiation. CONCLUSIONS: The reproducibility of many RFs was significantly impacted by inter-observer variability, and a larger number of RFs were impacted by the difference in the time of acquisition after contrast injection. Our findings highlight the need for quality assessment to ensure that scans are analyzed in the same physiologic imaging phase in quantitative imaging studies, or that phase-wide reproducible RFs are selected. Overall, the study emphasizes the importance of reproducibility and quality control when using RFs as biomarkers for clinical applications.


Assuntos
Carcinoma Hepatocelular , Meios de Contraste , Neoplasias Hepáticas , Variações Dependentes do Observador , Tomografia Computadorizada por Raios X , Humanos , Carcinoma Hepatocelular/diagnóstico por imagem , Carcinoma Hepatocelular/patologia , Neoplasias Hepáticas/diagnóstico por imagem , Neoplasias Hepáticas/patologia , Masculino , Feminino , Tomografia Computadorizada por Raios X/métodos , Reprodutibilidade dos Testes , Pessoa de Meia-Idade , Idoso , Adulto , Radiômica
7.
Saudi Med J ; 45(9): 911-918, 2024 Aug.
Artigo em Inglês | MEDLINE | ID: mdl-39218467

RESUMO

OBJECTIVES: To determine the diagnostic efficiencies of multiple diffusion-weighted imaging (DWI) techniques for hepatic fibrosis (HF) staging under the premise of high inter-examiner reliability. METHODS: Participants with biopsy-confirmed HF were recruited and divided into the early HF (EHF) and advanced HF (AHF) groups; healthy volunteers (HVs) served as controls. Two examiners analyzed intravoxel incoherent motion (IVIM) using the IVIM-DWI and diffusion kurtosis imaging (DKI) models. Intravoxel incoherent motion-DWI, DKI, and diffusion tensor imaging parameters with intraclass correlation coefficients (ICCs) of ≥0.6 were used to create regression models: HVs vs. EHF and EHF vs. AHF. RESULTS: We enrolled 48 HVs, 59 EHF patients, and 38 AHF patients. Mean, radial, and axial kurtosis; fractional anisotropy; mean, radial, and axial diffusivity; and α exhibited excellent reliability (ICCs: 0.80-0.98). Fractional anisotropy of kurtosis, f, and apparent diffusion coefficient showed good reliability (ICCs: 0.69-0.92). The real (0.58-0.67), pseudo- (0.27-0.76), and distributed diffusion coefficients (0.58-0.67) showed low reliability. In the HVs versus (vs.) EHF model, α (p=0.008) and ADC (p=0.011) presented statistical differences (area under curve [AUC]: 0.710). In the EHF vs. AHF model, α (p=0.04) and distributed diffusion coefficient (p=0.02) presented significant differences (AUC: 0.758). CONCLUSION: Under the premise of high inter-examiner reliability, DWI and IVIM-derived stretched-exponential model parameters may help stage HF.


Assuntos
Imagem de Difusão por Ressonância Magnética , Cirrose Hepática , Humanos , Cirrose Hepática/diagnóstico por imagem , Cirrose Hepática/patologia , Imagem de Difusão por Ressonância Magnética/métodos , Feminino , Masculino , Pessoa de Meia-Idade , Adulto , Reprodutibilidade dos Testes , Variações Dependentes do Observador
8.
BMC Musculoskelet Disord ; 25(1): 743, 2024 Sep 16.
Artigo em Inglês | MEDLINE | ID: mdl-39285360

RESUMO

BACKGROUND: The positioning error of femoral tunnel was the key factor leading to the failure of anterior cruciate ligament (ACL) reconstruction. This study aimed to propose a new femoral tunnel classification to guide revision ACL reconstruction. METHODS: Totals of 150 patients with ACL reconstruction failure from 2017 to 2023 were enrolled in this retrospective study. According to the tunnel diameter, shape, posterior wall and the positioning relationship with the Lateral Intercondylar Ridge on the three-dimensional CT imaging, we divided the femoral tunnels into four types: Type I off-target type, Type II straddled type, Type III anatomical type, and Type IV irregular type. Finally, explored the inter-observer reliability within two groups of doctors (Group A, 12 high seniorities; Group B, 12 low seniorities), and evaluated the intra-observer reliability within 6 doctors after two months. Clinical evaluation was performed using the Lysholm score, Tenger activity score, Pivot Shift and anterior knee laxity measurements. RESULTS: Among 150 cases of femoral tunnel three-dimensional CT reconstructed imaging, 144 cases were successfully included in the classification system, and 6 cases were confirmed as uncertain type. We measured the Kappa (κ) coefficient of group A was significantly higher than that of group B (κ 0.72 VS 0.68), and the κ coefficient of group A was still higher than group B (κ 0.69 VS 0.62) after further dividing Type III anatomical type into three subtypes. In addition, the κ coefficients of intra-observer reliability were all exceeded 0.73. Clinical follow-up showed that 9 patients had good knee joint motor function and stability after operation. CONCLUSION: The new femoral tunnel classification was reliable and had clinical guiding significance based on three-dimensional CT imaging. LEVEL OF EVIDENCE: Level III.


Assuntos
Lesões do Ligamento Cruzado Anterior , Reconstrução do Ligamento Cruzado Anterior , Fêmur , Imageamento Tridimensional , Tomografia Computadorizada por Raios X , Humanos , Reconstrução do Ligamento Cruzado Anterior/métodos , Feminino , Masculino , Fêmur/diagnóstico por imagem , Fêmur/cirurgia , Estudos Retrospectivos , Adulto , Imageamento Tridimensional/métodos , Lesões do Ligamento Cruzado Anterior/cirurgia , Lesões do Ligamento Cruzado Anterior/diagnóstico por imagem , Adulto Jovem , Reoperação , Reprodutibilidade dos Testes , Ligamento Cruzado Anterior/cirurgia , Ligamento Cruzado Anterior/diagnóstico por imagem , Pessoa de Meia-Idade , Adolescente , Variações Dependentes do Observador , Articulação do Joelho/diagnóstico por imagem , Articulação do Joelho/cirurgia , Falha de Tratamento
9.
Prog Orthod ; 25(1): 35, 2024 Sep 16.
Artigo em Inglês | MEDLINE | ID: mdl-39279025

RESUMO

OBJECTIVES: This study aimed to assess the accuracy of machine learning (ML) models with feature selection technique in classifying cervical vertebral maturation stages (CVMS). Consensus-based datasets were used for models training and evaluation for their model generalization capabilities on unseen datasets. METHODS: Three clinicians independently rated CVMS on 1380 lateral cephalograms, resulting in the creation of five datasets: two consensus-based datasets (Complete Agreement and Majority Voting), and three datasets based on a single rater's evaluations. Additionally, landmarks annotation of the second to fourth cervical vertebrae and patients' information underwent a feature selection process. These datasets were used to train various ML models and identify the top-performing model for each dataset. These models were subsequently tested on their generalization capabilities. RESULTS: Features that considered significant in the consensus-based datasets were consistent with a CVMS guideline. The Support Vector Machine model on the Complete Agreement dataset achieved the highest accuracy (77.4%), followed by the Multi-Layer Perceptron model on the Majority Voting dataset (69.6%). Models from individual ratings showed lower accuracies (60.4-67.9%). The consensus-based training models also exhibited lower coefficient of variation (CV), indicating superior generalization capability compared to models from single raters. CONCLUSION: ML models trained on consensus-based datasets for CVMS classification exhibited the highest accuracy, with significant features consistent with the original CVMS guidelines. These models also showed robust generalization capabilities, underscoring the importance of dataset quality.


Assuntos
Vértebras Cervicais , Aprendizado de Máquina , Variações Dependentes do Observador , Humanos , Vértebras Cervicais/crescimento & desenvolvimento , Masculino , Feminino , Criança , Cefalometria/métodos , Máquina de Vetores de Suporte , Adolescente , Determinação da Idade pelo Esqueleto/métodos , Conjuntos de Dados como Assunto
10.
PeerJ ; 12: e17903, 2024.
Artigo em Inglês | MEDLINE | ID: mdl-39221272

RESUMO

Background: The aim of the study was to assess the inter-rater and intra-rater agreement of measurements performed with the Luna EMG (electromyography) multifunctional robot, a tool for evaluation of upper limb proprioception in individuals with stroke. Methods: The study was conducted in a group of patients with chronic stroke. A total of 126 patients participated in the study, including 78 women and 48 men, on average aged nearly 60 years (mean = 59.9). Proprioception measurements were performed using the Luna EMG diagnostic and rehabilitation robot to assess the left and right upper limbs. The examinations were conducted by two raters, twice, two weeks apart. The results were compared between the raters and the examinations. Results: High consistency of the measurements performed for the right and the left hand was reflected by the interclass correlation coefficients (0.996-0.998 and 0.994-0.999, respectively) and by Pearson's linear correlation which was very high (r = 1.00) in all the cases for the right and the left hand in both the inter-rater and intra-rater agreement analyses. Conclusions: Measurements performed by the Luna EMG diagnostic and rehabilitation robot demonstrate high inter-rater and intra-rater agreement in the assessment of upper limb proprioception in patients with chronic stroke. The findings show that Luna EMG is a reliable tool enabling effective evaluation of upper limb proprioception post-stroke.


Assuntos
Eletromiografia , Variações Dependentes do Observador , Propriocepção , Robótica , Reabilitação do Acidente Vascular Cerebral , Acidente Vascular Cerebral , Extremidade Superior , Humanos , Masculino , Feminino , Pessoa de Meia-Idade , Propriocepção/fisiologia , Eletromiografia/métodos , Estudos Prospectivos , Acidente Vascular Cerebral/fisiopatologia , Acidente Vascular Cerebral/diagnóstico , Reprodutibilidade dos Testes , Extremidade Superior/fisiopatologia , Reabilitação do Acidente Vascular Cerebral/métodos , Reabilitação do Acidente Vascular Cerebral/instrumentação , Idoso , Adulto
11.
PLoS One ; 19(9): e0309684, 2024.
Artigo em Inglês | MEDLINE | ID: mdl-39231102

RESUMO

Behavioral pain scales have been helpful for standardized swine pain assessment. However, it is still unknown if observers' experience influences the scale score. We conducted a pilot study to investigate how three different levels of swine experience influenced how observers scored castration pain in piglets using Unesp-Botucatu Pig Composite Acute Pain Scale (UPAPS). We used a database from UPAPS scores from pigs undergoing surgical castration in a previous study. Scores were attributed by six observers with Little to no experience (n = 2), Some experience (n = 2) and Extensive experience (n = 2). Reliability was estimated using the intraclass correlation coefficient, agreement was investigated by Bland-Altman analysis, predictive capacity was estimated using the area under the curve (AUC), and statistical differences were tested using a regression model. We found that intra-experience levels reliability were satisfactory (Little to no: 0.72, Some: 0.81, Extensive: 0.84), but inter-experience reliability was lower (0.42). Little to no experience observers had poor agreement with other observers, with a bias toward underscoring UPAPS (bias of 0.94 vs. Some, 1.17 vs. Extensive). Predictive capacity was similar between all observers (AUC, Little to no: 71.94%, Some: 76.10%, Extensive: 79.09%, p > 0.05). Regression model confirmed underscoring of Little to no experience observers (mean ± standard error; Little to no: 1.09 ± 0.14; Some: 2.02 ± 0.23; Extensive: 2.25 ± 0.22; p < 0.05). We concluded that minimal experience, as Some experience observers have in the swine industry, is sufficient for them to score UPAPS in a similar way than more experienced observers. The present pilot study supports the enhancement and implementation of UPAPS on farm and laboratory settings by minimally qualified observers, improving swine welfare in the short and long term.


Assuntos
Dor Aguda , Medição da Dor , Animais , Projetos Piloto , Suínos , Masculino , Medição da Dor/métodos , Dor Aguda/psicologia , Reprodutibilidade dos Testes , Orquiectomia/efeitos adversos , Variações Dependentes do Observador , Castração , Humanos
12.
Arq Bras Oftalmol ; 88(1): e20230150, 2024.
Artigo em Inglês | MEDLINE | ID: mdl-39109742

RESUMO

PURPOSE: To develop a simple, subjective, and reliable grading scale for isotretinoin-induced meibography changes. METHODS: After analyzing meibography images obtained from systemic isotretinoin users, a grading scale was proposed and named "meibography health score." The score ranged from 1 to 3, with decreasing gland reflectivity and identifiable margins. A total of 11 medical professionals were asked to grade 10 meibography images obtained from isotretinoin users using the proposed scale and were divided into three groups: (A) ophthalmologists with experience with meibography, (B) ophthalmologists with no experience with meibography, and (C) radiologists. The kappa statistic was determined to test interrater reliability. RESULTS: The overall kappa was approximately 0.64. The kappa scores for Groups A, B, and C were 0.78, 0.59, and 0.90, respectively. Grade 2 had the lowest kappa scores (0.62, 0.35, and 0.82 for A, B, and C, respectively) and grade 3 the highest (0.78, 0.90, and 1.0 for A, B and C, respectively). Furthermore, Group C had the highest kappa scores and Group B the lowest. CONCLUSION: The meibography health score exhibited good interrater reliability, particularly in severe cases.


Assuntos
Acne Vulgar , Fármacos Dermatológicos , Isotretinoína , Glândulas Tarsais , Variações Dependentes do Observador , Humanos , Isotretinoína/efeitos adversos , Acne Vulgar/tratamento farmacológico , Reprodutibilidade dos Testes , Glândulas Tarsais/efeitos dos fármacos , Glândulas Tarsais/diagnóstico por imagem , Glândulas Tarsais/patologia , Fármacos Dermatológicos/efeitos adversos , Índice de Gravidade de Doença , Feminino , Masculino , Doenças Palpebrais/induzido quimicamente , Doenças Palpebrais/diagnóstico por imagem
13.
Andes Pediatr ; 95(3): 252-262, 2024 Jun.
Artigo em Espanhol | MEDLINE | ID: mdl-39093210

RESUMO

The measurement of isometric muscle torque with hand-held dynamometry is a technique little studied in the pediatric setting for the evaluation of maximal isometric muscle strength. OBJECTIVE: to determine the reliability of hand-held dynamometry to obtain the maximal isometric torque of upper and lower limb muscle groups in Chilean children and adolescents. PATIENTS AND METHODS: Crosssectional study. Seventy-two participants aged between 7 and 15 years were selected from a school in Talca. Maximal isometric torque was recorded in 15 muscle groups of upper and lower limbs through hand-held dynamometry. Intra- and inter-rater evaluation was used, applying the intraclass correlation coefficient (ICC) to determine the reliability of the tests and Bland-Altman plots to evaluate concordance. RESULTS: The results demonstrated good to excellent inter-rater reliability (ICC = 0.850.98) and intra-rater reliability (ICC = 0.87-0.98). Only two groups, hip extensors and abductors, showed good inter-rater reliability (ICC = 0.85 and ICC = 0.88, respectively); and one group, the ankle dorsiflexors, showed good intra-rater reliability (ICC = 0.87). 100% of the tests presented at least 95.8% inter- and intra-rater agreement on the Bland-Altman plots. CONCLUSION: The evaluation of isometric muscle torque using hand-held dynamometry is a reliable procedure for use in different growth periods.


Assuntos
Contração Isométrica , Dinamômetro de Força Muscular , Força Muscular , Torque , Humanos , Criança , Adolescente , Reprodutibilidade dos Testes , Masculino , Feminino , Estudos Transversais , Chile , Contração Isométrica/fisiologia , Força Muscular/fisiologia , Músculo Esquelético/fisiologia , Variações Dependentes do Observador , Extremidade Inferior/fisiologia
14.
BMC Musculoskelet Disord ; 25(1): 651, 2024 Aug 19.
Artigo em Inglês | MEDLINE | ID: mdl-39160504

RESUMO

PURPOSE: To assess the test-retest and inter-rater reliability of goniometry and fleximetry in measuring cervical range of motion in individuals with chronic neck pain. METHODS: A reliability study. Thirty individuals with chronic neck pain were selected. Cervical range of motion was measured by goniometry and fleximetry at two time points 7 days apart. To characterize the sample, we used the numerical pain rating scale, Pain-Related Catastrophizing Thoughts Scale, and Neck Disability Index. Intraclass correlation coefficient (ICC), standard error of measurement (SEM) and minimum detectable change (MDC) were calculated. Correlations between goniometry and fleximetry measurements were performed using Spearman's correlation coefficient (rho). RESULTS: For goniometry, we found excellent test-retest reliability (ICC ≥ 0.986, SEM ≤ 1.89%, MDC ≤ 5.23%) and inter-rater reliability (ICC ≥ 0.947, SEM ≤ 3.91%, MDC ≤ 10.84%). Similarly, we found excellent test-retest reliability (ICC ≥ 0.969, SEM ≤ 2.71%, MDC ≤ 7.52%) and inter-rater reliability (ICC ≥ 0.981, SEM ≤ 1.88%, MDC ≤ 5.20%) for fleximetry. Finally, we observed a strong correlation between the goniometry and the fleximetry for all cervical movements (rho ≥ 0.993). CONCLUSION: Goniometry and fleximetry measurements are reliable for assessing cervical range of motion in individuals with chronic neck pain.


Assuntos
Artrometria Articular , Vértebras Cervicais , Dor Crônica , Cervicalgia , Amplitude de Movimento Articular , Humanos , Cervicalgia/fisiopatologia , Cervicalgia/diagnóstico , Amplitude de Movimento Articular/fisiologia , Feminino , Reprodutibilidade dos Testes , Masculino , Dor Crônica/fisiopatologia , Dor Crônica/diagnóstico , Artrometria Articular/métodos , Adulto , Pessoa de Meia-Idade , Vértebras Cervicais/fisiopatologia , Medição da Dor/métodos , Variações Dependentes do Observador
15.
BMJ Open Ophthalmol ; 9(1)2024 Aug 17.
Artigo em Inglês | MEDLINE | ID: mdl-39153755

RESUMO

PURPOSE: The calliper function is used for manual measurements of full thickness macular holes (FTMHs). We aimed to investigate whether a reproducible difference could be detected beyond interobserver variability between two commonly used manufacturers in their manual calliper facility in spectral domain optical coherence tomography (OCT) for metrics related to FTMH. METHODS: This is a non-interventional, retrospective, observational study. Two independent observers examined 8 eyes (16 OCT) scans and 128 measurements (minimal linear diameter (MLD), basal diameter and hole height on both sides) of FTMHs, taken on Heidelberg Spectralis and Topcon Triton (OCT machines). The interobserver agreement and OCT machine agreement of measurements were analysed by Bland-Altman plots and intraclass correlation coefficient (ICC) analysis. Spectralis and Triton had 125 µm and 50 µm horizontal b-scan spacing, respectively. RESULTS: Overall, we report high absolute agreement in interobserver (ICC 0.991 (95% CI 0.985 to 0.995, p<0.001)) and OCT machine (ICC 0.993 (95% CI 0.987 to 0.996, p<0.001)) variability. Lower horizontal resolution in Triton compared with Spectralis leads to interobserver variability, in smaller horizontal measurements. Lower horizontal scanning density in Spectralis lead to relatively large interobserver variation if different reference scans were chosen, and consistently smaller MLD measurements than Triton. Vertical measurements without 1:1 scaling lead to inaccurate exaggerated oblique vertical measurements. Calliper function appears otherwise identically calibrated. CONCLUSIONS: We report excellent interobserver and OCT machine agreement in measurements. However, the paper shows several factors that could influence the reliability of measurements acquired in eyes with FTMHs, such as the dimension of the hole as well as different image resolution, density scanning protocols or vertical scaling of the OCT machines viewing platform.


Assuntos
Variações Dependentes do Observador , Perfurações Retinianas , Tomografia de Coerência Óptica , Humanos , Tomografia de Coerência Óptica/métodos , Perfurações Retinianas/diagnóstico por imagem , Estudos Retrospectivos , Reprodutibilidade dos Testes , Feminino , Masculino , Idoso , Pessoa de Meia-Idade
16.
Cancer Imaging ; 24(1): 109, 2024 Aug 18.
Artigo em Inglês | MEDLINE | ID: mdl-39155364

RESUMO

OBJECTIVES: This study aimed to investigate the intra- and inter-observer consistency of the Visually Accessible Rembrandt Images (VASARI) feature set before and after dichotomization, and the association between dichotomous VASARI features and the overall survival (OS) in glioblastoma (GBM) patients. METHODS: This retrospective study included 351 patients with pathologically confirmed IDH1 wild-type GBM between January 2016 and June 2022. Firstly, VASARI features were assessed by four radiologists with varying levels of experience before and after dichotomization. Cohen's kappa coefficient (κ) was calculated to measure the intra- and inter-observer consistency. Then, after adjustment for confounders using propensity score matching, Kaplan-Meier curves were used to compare OS differences for each dichotomous VASARI feature. Next, patients were randomly stratified into a training set (n = 211) and a test set (n = 140) in a 3:2 ratio. Based on the training set, Cox proportional hazards regression analysis was adopted to develop combined and clinical models to predict OS, and the performance of the models was evaluated with the test set. RESULTS: Eleven VASARI features with κ value of 0.61-0.8 demonstrated almost perfect agreement after dichotomization, with the range of κ values across all readers being 0.874-1.000. Seven VASARI features were correlated with GBM patient OS. For OS prediction, the combined model outperformed the clinical model in both training set (C-index, 0.762 vs. 0.723) and test set (C-index, 0.812 vs. 0.702). CONCLUSION: The dichotomous VASARI features exhibited excellent inter- and intra-observer consistency. The combined model outperformed the clinical model for OS prediction.


Assuntos
Neoplasias Encefálicas , Glioblastoma , Pontuação de Propensão , Humanos , Glioblastoma/mortalidade , Glioblastoma/diagnóstico por imagem , Masculino , Feminino , Estudos Retrospectivos , Pessoa de Meia-Idade , Neoplasias Encefálicas/mortalidade , Imageamento por Ressonância Magnética/métodos , Adulto , Idoso , Estimativa de Kaplan-Meier , Variações Dependentes do Observador
17.
Pediatr Surg Int ; 40(1): 233, 2024 Aug 19.
Artigo em Inglês | MEDLINE | ID: mdl-39158792

RESUMO

PURPOSE: This study evaluates the inter-rater agreements of both the Glans-Urethral Meatus-Shaft (GMS) hypospadias score and Hypospadias Objective Penile Evaluation (HOPE) score, aiming to standardize disease classification for consistent agreement in clinically relevant characteristics of hypospadias. METHODS: Photos of hypospadias in children were collected from two separate institutions. Three raters scored the photos using GMS and HOPE, excluding penile torsion and curvature assessment in HOPE due to photo limitations. RESULTS: A total of 528 photos were included. With GMS, Fleiss' multi-rater kappa showed an agreement of 0.745 for glans-urethral plate, 0.869 for meatus, and 0.745 for shaft. For HOPE scores, the agreements were 0.888 for position of meatus, 0.669 for shape of meatus, 0.730 for shape of glans, and 0.708 for the shape of the skin. The lower agreement in the shape of the meatus evaluation may be attributed to the lack of a quantitative classification method in HOPE. Experts rely on their subjective judgment based on the provided example photos and their index patient. CONCLUSIONS: While there is high agreement among experts when evaluating hypospadias using the GMS and HOPE scoring criteria, only the position of the meatus achieved nearly perfect agreement highlighting that the current scoring systems entail a subjective element in disease classification.


Assuntos
Hipospadia , Pênis , Uretra , Humanos , Hipospadia/classificação , Masculino , Lactente , Fotografação/métodos , Pré-Escolar , Variações Dependentes do Observador , Reprodutibilidade dos Testes , Criança
18.
Arch Dermatol Res ; 316(8): 543, 2024 Aug 20.
Artigo em Inglês | MEDLINE | ID: mdl-39162820

RESUMO

Actinic keratosis (AK) is a common precancerous skin condition predominantly affecting older males with fair skin and significant UV exposure. The clinical significance of AK is related to its potential for malignant transformation and progression to squamous cell carcinoma (SCC). Accurate diagnosis of AK is essential for adequate treatment, evaluation of therapeutic efficacy, and mitigating the risk of developing SCC. However, clinician variability due to the subjective nature of current diagnostic tools presents significant challenges to achieving consistent and reliable AK diagnoses. Thus, there is no universally accepted standard for measuring AK.This review evaluates current methods for evaluating and diagnosing AK, focusing on clinician variability through inter- and intraobserver agreement. Eight peer-reviewed studies investigating the reliability of various approaches for AK evaluation show substantial variability in interobserver or intraobserver agreement, with most methods demonstrating only slight to moderate reliability. Some suggest that consensus discussions and simplified rating scales can modestly improve diagnostic reliability. However, remaining variability and the lack of a universally accepted standard for measuring AK underscore the need for more robust and standardized diagnostic and evaluation methods.The review emphasizes the need for improved diagnostic tools and standardized methods to enhance the accuracy and reliability of AK assessments. It also proposes applying a novel examination approach using 1,3-dihydroxyacetone (DHA) staining which may improve the visualization and identification of AK lesions. Advancements in these areas have significant potential, promising better clinical practices and patient outcomes in AK management.


Assuntos
Ceratose Actínica , Neoplasias Cutâneas , Humanos , Ceratose Actínica/diagnóstico , Ceratose Actínica/patologia , Ceratose Actínica/terapia , Reprodutibilidade dos Testes , Neoplasias Cutâneas/diagnóstico , Neoplasias Cutâneas/patologia , Variações Dependentes do Observador , Carcinoma de Células Escamosas/diagnóstico , Carcinoma de Células Escamosas/patologia , Pele/patologia , Lesões Pré-Cancerosas/diagnóstico , Lesões Pré-Cancerosas/patologia
19.
Transl Vis Sci Technol ; 13(8): 3, 2024 Aug 01.
Artigo em Inglês | MEDLINE | ID: mdl-39087929

RESUMO

Purpose: To determine the reliability of a nine-point summary scale for grading intermediate age-related macular degeneration (AMD) image morphologic features based on the Early Treatment Diabetic Retinopathy Study (ETDRS) grid. Methods: Two trained graders independently divided spectral domain-optical coherence tomography (SD-OCT) scans into nine subfields and then graded each subfield for the presence of intraretinal hyperreflective foci (HRF), reticular pseudodrusen (RPD), and incomplete or complete retinal pigment epithelium and outer retinal atrophy (iRORA or cRORA). Grading results were assessed by summing the subfield grades into a nine-point summary score and also by using an eye-level binary grade for presence of the finding in any subfield. Gwet's first-order agreement coefficient (AC1) was calculated to assess intergrader agreement. Results: Images of 79 eyes from 52 patients were evaluated. Intergrader agreement was higher when the OCT grades were summarized with a nine-point summary score (Gwet's AC1 0.92, 0.89, 0.99, and 0.99 for HRF, RPD, iRORA, and cRORA, respectively) compared with the eye-level binary grade (Gwet's AC1 0.75, 0.76, 0.97, and 0.96 for HRF, RPD, iRORA, and cRORA, respectively), with significant differences detected for HRF and RPD. Conclusions: The use of a nine-point summary score showed higher reliability in grading when compared to the binary subfield- and eye-level data, and thus may offer more precise estimation of AMD disease staging. Translational Relevance: These findings suggest that a nine-point summary score could be a useful means of disease staging by using findings on OCT in clinical studies of AMD.


Assuntos
Degeneração Macular , Tomografia de Coerência Óptica , Humanos , Tomografia de Coerência Óptica/métodos , Idoso , Feminino , Masculino , Reprodutibilidade dos Testes , Degeneração Macular/diagnóstico por imagem , Degeneração Macular/patologia , Variações Dependentes do Observador , Pessoa de Meia-Idade , Idoso de 80 Anos ou mais , Epitélio Pigmentado da Retina/patologia , Epitélio Pigmentado da Retina/diagnóstico por imagem , Drusas Retinianas/diagnóstico por imagem , Drusas Retinianas/patologia , Índice de Gravidade de Doença
20.
Gait Posture ; 113: 534-542, 2024 Sep.
Artigo em Inglês | MEDLINE | ID: mdl-39173444

RESUMO

BACKGROUND: The gait variable minimum toe clearance (MTC) has been investigated concerning trip-related fall research in older adults. However, comparing studies is difficult due to the different methods used to measure MTC and shoe conditions, which may affect agreement. Measurement methods can include using a single virtual point (SVP), multiple virtual points (MVPS), or metatarsal head markers (marker-based). The shoe types used in MTC studies include standard shoes (SS), personal shoes (PS), and barefoot (BF) conditions. RESEARCH QUESTION: What is the agreement, inter and intra-rater reliability, and repeatability for the 3 commonly used methods of measuring MTC (SVP, MVPS, marker-based) under the 3 shoe conditions for optical motion capture systems (SS, PS, BF)? METHODS: Twelve healthy young adults (mean [SD] 23.8 [1.9] years,7 males) participated in this observational study. In a randomized order, participants completed 25 walking trials at self-selected normal and slow speeds in SS, PS, and BF conditions while infrared cameras recorded the maker trajectories. Each participant performed a familiarization trial for at least 1 minute before collecting data on each shoe condition. Statistical analyses included Bland-Altman 95 % limits of agreement (LOA) analyses, interclass correlation coefficient (ICC) analyses for inter- and intra-rater reliability, and the repeatability coefficient (RC). RESULTS: The SVP and MVPS had a tighter 95 % LOA than the marker-based method, particularly under SS and BF conditions. The inter-rater reliability was good to excellent under these shoe conditions. Intra-reliability for all methods under all shoe conditions was excellent (ICC >.90). The RC was very similar for each method, with none exceeding 1.02 cm. SIGNIFICANCE: The study provides estimates of the agreement between MTC methods and suggests that only SVP or MVPS produced similar results in SS/BF conditions. Additionally, a "true" change in MTC requires a difference greater than 1.02 cm.


Assuntos
Sapatos , Humanos , Masculino , Reprodutibilidade dos Testes , Feminino , Adulto Jovem , Adulto , Dedos do Pé/fisiologia , Variações Dependentes do Observador , Fenômenos Biomecânicos , Análise da Marcha/métodos , Marcha/fisiologia , Captura de Movimento
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA