Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 14 de 14
Filter
1.
J Biomed Inform ; 46(4): 734-43, 2013 Aug.
Article in English | MEDLINE | ID: mdl-23602781

ABSTRACT

A major goal of Natural Language Processing in the public health informatics domain is the automatic extraction and encoding of data stored in free text patient records. This extracted data can then be utilized by computerized systems to perform syndromic surveillance. In particular, the chief complaint--a short string that describes a patient's symptoms--has come to be a vital resource for syndromic surveillance in the North American context due to its near ubiquity. This paper reviews fifteen systems in North America--at the city, county, state and federal level--that use chief complaints for syndromic surveillance.


Subject(s)
Population Surveillance , Humans , North America , Syndrome
2.
J Am Med Inform Assoc ; 17(5): 595-601, 2010.
Article in English | MEDLINE | ID: mdl-20819870

ABSTRACT

OBJECTIVE: Standardized surveillance syndromes do not exist but would facilitate sharing data among surveillance systems and comparing the accuracy of existing systems. The objective of this study was to create reference syndrome definitions from a consensus of investigators who currently have or are building syndromic surveillance systems. DESIGN: Clinical condition-syndrome pairs were catalogued for 10 surveillance systems across the United States and the representatives of these systems were brought together for a workshop to discuss consensus syndrome definitions. RESULTS: Consensus syndrome definitions were generated for the four syndromes monitored by the majority of the 10 participating surveillance systems: Respiratory, gastrointestinal, constitutional, and influenza-like illness (ILI). An important element in coming to consensus quickly was the development of a sensitive and specific definition for respiratory and gastrointestinal syndromes. After the workshop, the definitions were refined and supplemented with keywords and regular expressions, the keywords were mapped to standard vocabularies, and a web ontology language (OWL) ontology was created. LIMITATIONS: The consensus definitions have not yet been validated through implementation. CONCLUSION: The consensus definitions provide an explicit description of the current state-of-the-art syndromes used in automated surveillance, which can subsequently be systematically evaluated against real data to improve the definitions. The method for creating consensus definitions could be applied to other domains that have diverse existing definitions.


Subject(s)
Communicable Diseases , Population Surveillance/methods , Group Processes , Humans , Syndrome , United States
3.
J Biomed Inform ; 42(5): 839-51, 2009 Oct.
Article in English | MEDLINE | ID: mdl-19435614

ABSTRACT

In this paper we describe an algorithm called ConText for determining whether clinical conditions mentioned in clinical reports are negated, hypothetical, historical, or experienced by someone other than the patient. The algorithm infers the status of a condition with regard to these properties from simple lexical clues occurring in the context of the condition. The discussion and evaluation of the algorithm presented in this paper address the questions of whether a simple surface-based approach which has been shown to work well for negation can be successfully transferred to other contextual properties of clinical conditions, and to what extent this approach is portable among different clinical report types. In our study we find that ConText obtains reasonable to good performance for negated, historical, and hypothetical conditions across all report types that contain such conditions. Conditions experienced by someone other than the patient are very rarely found in our report set. A comprehensive solution to the problem of determining whether a clinical condition is historical or recent requires knowledge above and beyond the surface clues picked up by ConText.


Subject(s)
Algorithms , Medical Informatics/methods , Medical Records , Natural Language Processing , Databases, Factual , Humans , Reproducibility of Results
4.
J Biomed Inform ; 41(4): 613-23, 2008 Aug.
Article in English | MEDLINE | ID: mdl-18166502

ABSTRACT

OBJECTIVE: To determine whether preprocessing chief complaints before automatically classifying them into syndromic categories improves classification performance. METHODS: We preprocessed chief complaints using two preprocessors (CCP and EMT-P) and evaluated whether classification performance increased for a probabilistic classifier (CoCo) or for a keyword-based classifier (modification of the NYC Department of Health and Mental Hygiene chief complaint coder (KC)). RESULTS: CCP exhibited high accuracy (85%) in preprocessing chief complaints but only slightly improved CoCo's classification performance for a few syndromes. EMT-P, which splits chief complaints into multiple problems, substantially increased CoCo's sensitivity for all syndromes. Preprocessing with CCP or EMT-P only improved KC's sensitivity for the Constitutional syndrome. CONCLUSION: Evaluation of preprocessing systems should not be limited to accuracy of the preprocessor but should include the effect of preprocessing on syndromic classification. Splitting chief complaints into multiple problems before classification is important for CoCo, but other preprocessing steps only slightly improved classification performance for CoCo and a keyword-based classifier.


Subject(s)
Artificial Intelligence , Diagnosis, Computer-Assisted/methods , Natural Language Processing , Pattern Recognition, Automated/methods , Population Surveillance/methods , Terminology as Topic , Syndrome
5.
J Biomed Inform ; 41(2): 224-31, 2008 Apr.
Article in English | MEDLINE | ID: mdl-18194876

ABSTRACT

The goals of automated biosurveillance systems are to detect disease outbreaks early, while exhibiting few false positives. Evaluation measures currently exist to estimate the expected detection time of biosurveillance systems. Researchers also have developed models that estimate clinician detection of cases of outbreak diseases, which is a process known as clinical case finding. However, little research has been done on estimating how well biosurveillance systems augment traditional outbreak detection that is carried out by clinicians. In this paper, we introduce a general approach for doing so for non-endemic disease outbreaks, which are characteristic of bioterrorist induced diseases, such as respiratory anthrax. We first layout the basic framework, which makes minimal assumptions, and then we specialize it in several ways. We illustrate the method using a Bayesian outbreak detection algorithm called PANDA, a model of clinician outbreak detection, and simulated cases of a windborne anthrax release. This analysis derives a bound on how well we would expect PANDA to augment clinician detection of an anthrax outbreak. The results support that such analyses are useful in assessing the extent to which computer-based outbreak detection systems are expected to augment traditional clinician outbreak detection.


Subject(s)
Artificial Intelligence , Disease Outbreaks/prevention & control , Disease Outbreaks/statistics & numerical data , Epidemiologic Measurements , Pattern Recognition, Automated/methods , Population Surveillance/methods , Public Health Informatics/methods , Algorithms , Bioterrorism/prevention & control , Humans
6.
Int J Med Inform ; 77(2): 107-13, 2008 Feb.
Article in English | MEDLINE | ID: mdl-17317291

ABSTRACT

OBJECTIVE: Determine whether agreement among annotators improves after being trained to use an annotation schema that specifies: what types of clinical conditions to annotate, the linguistic form of the annotations, and which modifiers to include. METHODS: Three physicians and 3 lay people individually annotated all clinical conditions in 23 emergency department reports. For annotations made using a Baseline Schema and annotations made after training on a detailed annotation schema, we compared: (1) variability of annotation length and number and (2) annotator agreement, using the F-measure. RESULTS: Physicians showed higher agreement and lower variability after training on the detailed annotation schema than when applying the Baseline Schema. Lay people agreed with physicians almost as well as other physicians did but showed a slower learning curve. CONCLUSION: Training annotators on the annotation schema we developed increased agreement among annotators and should be useful in generating reference standard sets for natural language processing studies. The methodology we used to evaluate the schema could be applied to other types of annotation or classification tasks in biomedical informatics.


Subject(s)
Artificial Intelligence , Documentation/standards , Emergency Service, Hospital/organization & administration , Documentation/methods , Hospitals, Religious , Humans , Medical Records Systems, Computerized , Natural Language Processing , Pennsylvania , Teaching
7.
J Biomed Inform ; 39(2): 196-208, 2006 Apr.
Article in English | MEDLINE | ID: mdl-16230050

ABSTRACT

Evaluating automated indexing applications requires comparing automatically indexed terms against manual reference standard annotations. However, there are no standard guidelines for determining which words from a textual document to include in manual annotations, and the vague task can result in substantial variation among manual indexers. We applied grounded theory to emergency department reports to create an annotation schema representing syntactic and semantic variables that could be annotated when indexing clinical conditions. We describe the annotation schema, which includes variables representing medical concepts (e.g., symptom, demographics), linguistic form (e.g., noun, adjective), and modifier types (e.g., anatomic location, severity). We measured the schema's quality and found: (1) the schema was comprehensive enough to be applied to 20 unseen reports without changes to the schema; (2) agreement between author annotators applying the schema was high, with an F measure of 93%; and (3) the authors made complementary errors when applying the schema, demonstrating that the schema incorporates both linguistic and medical expertise.


Subject(s)
Abstracting and Indexing/methods , Artificial Intelligence , Documentation/methods , Emergency Service, Hospital , Information Storage and Retrieval/methods , Medical Records Systems, Computerized , Natural Language Processing
8.
AMIA Annu Symp Proc ; : 141-5, 2006.
Article in English | MEDLINE | ID: mdl-17238319

ABSTRACT

OBJECTIVE: Determine how four contextual features (Validity, Certainty, Directionality, and Temporality) contribute to classification of respiratory syndrome-related clinical conditions as acute, chronic, or absent from manual annotations in Emergency Department Reports. Based on the results, we will direct our research towards automatic identification of the contextual features found to be discriminating. METHODS: A physician annotated all instances of 56 clinical conditions in 120 ED reports and encoded four contextual features for every annotation. We classified clinical conditions using the contextual features and measured agreement to reference standard classifications made by the physician using a weighted kappa (Kw). RESULTS: Kw was 0.518 when not using any of the features and 0.953 when using all of the features. CONCLUSION: Validity, Directionality, and Temporality all improved accuracy. Negation(Directionality) was the most important feature for improving accuracy. Using Certainty made the classification worse.


Subject(s)
Algorithms , Emergency Service, Hospital , Natural Language Processing , Respiratory Tract Diseases/classification , Acute Disease , Chronic Disease , Forms and Records Control , Humans , Medical Records , Pilot Projects
9.
Ann Emerg Med ; 46(5): 445-55, 2005 Nov.
Article in English | MEDLINE | ID: mdl-16271676

ABSTRACT

STUDY OBJECTIVE: Electronic surveillance systems often monitor triage chief complaints in hopes of detecting an outbreak earlier than can be accomplished with traditional reporting methods. We measured the accuracy of a Bayesian chief complaint classifier called CoCo that assigns patients 1 of 7 syndromic categories (respiratory, botulinic, gastrointestinal, neurologic, rash, constitutional, or hemorrhagic) based on free-text triage chief complaints. METHODS: We compared CoCo's classifications with criterion syndromic classification based on International Classification of Diseases, Ninth Revision (ICD-9) discharge diagnoses. We assigned the criterion classification to a patient based on whether the patient's primary diagnosis was a member of a set of ICD-9 codes associated with CoCo's 7 syndromes. We tested CoCo's performance on a set of 527,228 chief complaints from patients registered at the University of Pittsburgh Medical Center emergency department (ED) between 1990 and 2003. We performed a sensitivity analysis by varying the ICD-9 codes in the criterion standard. We also tested CoCo on chief complaints from EDs in a second location (Utah). RESULTS: Approximately 16% (85,569/527,228) of the patients were classified according to the criterion standard into 1 of the 7 syndromes. CoCo's classification performance (number of cases by criterion standard, sensitivity [95% confidence interval (CI)], and specificity [95% CI]) was respiratory (34,916, 63.1 [62.6 to 63.6], 94.3 [94.3 to 94.4]); botulinic (1,961, 30.1 [28.2 to 32.2], 99.3 [99.3 to 99.3]); gastrointestinal (20,431, 69.0 [68.4 to 69.6], 95.6 [95.6 to 95.7]); neurologic (7,393, 67.6 [66.6 to 68.7], 92.7 [92.6 to 92.8]); rash (2,232, 46.8 [44.8 to 48.9], 99.3 [99.3 to 99.3]); constitutional (10,603, 45.8 [44.9 to 46.8], 96.6 [96.6 to 96.7]); and hemorrhagic (8,033, 75.2 [74.3 to 76.2], 98.5 [98.4 to 98.5]). The sensitivity analysis showed that the results were not affected by the choice of ICD-9 codes in the criterion standard. Classification accuracy did not differ on chief complaints from the second location. CONCLUSION: Our results suggest that, for most syndromes, our chief complaint classification system can identify about half of the patients with relevant syndromic presentations, with specificities higher than 90% and positive predictive values ranging from 12% to 44%.


Subject(s)
Classification/methods , Emergency Service, Hospital/organization & administration , Triage/methods , Bayes Theorem , Emergency Service, Hospital/statistics & numerical data , Humans , International Classification of Diseases , Pennsylvania , Retrospective Studies , Sensitivity and Specificity , Terminology as Topic
10.
J Am Med Inform Assoc ; 12(6): 618-29, 2005.
Article in English | MEDLINE | ID: mdl-16049227

ABSTRACT

OBJECTIVE: To generate and measure the reliability for a reference standard set with representative cases from seven broad syndromic case definitions and several narrower syndromic definitions used for biosurveillance. DESIGN: From 527,228 eligible patients between 1990 and 2003, we generated a set of patients potentially positive for seven syndromes by classifying all eligible patients according to their ICD-9 primary discharge diagnoses. We selected a representative subset of the cases for chart review by physicians, who read emergency department reports and assigned values to 14 variables related to the seven syndromes. MEASUREMENTS: (1) Positive predictive value of the ICD-9 diagnoses; (2) prevalence of the syndromic definitions and related variables; (3) agreement between physician raters demonstrated by kappa, kappa corrected for bias and prevalence, and Finn's r; and (4) reliability of the reference standard classifications demonstrated by generalizability coefficients. RESULTS: Positive predictive value for ICD-9 classification ranged from 0.33 for botulinic to 0.86 for gastrointestinal. We generated between 80 and 566 positive cases for six of the seven syndromic definitions. Rash syndrome exhibited low prevalence (34 cases). Agreement between physician raters was high, with kappa > 0.70 for most variables. Ratings showed no bias. Finn's r was >0.70 for all variables. Generalizability coefficients were >0.70 for all variables but three. CONCLUSION: Of the 27 syndromes generated by the 14 variables, 21 showed high enough prevalence, agreement, and reliability to be used as reference standard definitions against which an automated syndromic classifier could be compared. Syndromic definitions that showed poor agreement or low prevalence include febrile botulinic syndrome, febrile and nonfebrile rash syndrome, respiratory syndrome explained by a nonrespiratory or noninfectious diagnosis, and febrile and nonfebrile gastrointestinal syndrome explained by a nongastrointestinal or noninfectious diagnosis.


Subject(s)
Bioterrorism/classification , International Classification of Diseases , Syndrome , Humans , Medical Informatics , Observer Variation , Predictive Value of Tests , Reproducibility of Results
11.
Artif Intell Med ; 33(1): 31-40, 2005 Jan.
Article in English | MEDLINE | ID: mdl-15617980

ABSTRACT

OBJECTIVE: Develop and evaluate a natural language processing application for classifying chief complaints into syndromic categories for syndromic surveillance. INTRODUCTION: Much of the input data for artificial intelligence applications in the medical field are free-text patient medical records, including dictated medical reports and triage chief complaints. To be useful for automated systems, the free-text must be translated into encoded form. METHODS: We implemented a biosurveillance detection system from Pennsylvania to monitor the 2002 Winter Olympic Games. Because input data was in free-text format, we used a natural language processing text classifier to automatically classify free-text triage chief complaints into syndromic categories used by the biosurveillance system. The classifier was trained on 4700 chief complaints from Pennsylvania. We evaluated the ability of the classifier to classify free-text chief complaints into syndromic categories with a test set of 800 chief complaints from Utah. RESULTS: The classifier produced the following areas under the ROC curve: Constitutional = 0.95; Gastrointestinal = 0.97; Hemorrhagic = 0.99; Neurological = 0.96; Rash = 1.0; Respiratory = 0.99; Other = 0.96. Using information stored in the system's semantic model, we extracted from the Respiratory classifications lower respiratory complaints and lower respiratory complaints with fever with a precision of 0.97 and 0.96, respectively. CONCLUSION: Results suggest that a trainable natural language processing text classifier can accurately extract data from free-text chief complaints for biosurveillance.


Subject(s)
Diagnosis, Computer-Assisted , Natural Language Processing , Triage/methods , Bayes Theorem , Humans , Neural Networks, Computer , Sensitivity and Specificity
12.
Stud Health Technol Inform ; 107(Pt 1): 487-91, 2004.
Article in English | MEDLINE | ID: mdl-15360860

ABSTRACT

Clinical conditions described in patients' dictated reports are necessary for automated detection of patients with respiratory illnesses such as inhalational anthrax and pneumonia. We applied MetaMap to emergency department reports to extract a set of 71 clinical conditions relevant to detection of a lower respiratory outbreak. We indexed UMLS terms in emergency department reports with MetaMap, filtered the indexed output with a specialized lexicon of UMLS terms for the domain, and mapped the clinical conditions of interest to concepts in the lexicon. We compared MetaMap's ability to accurately identify the conditions against a physician's manual annotations and evaluated incorrectly indexed features to determine what additional processing is necessary. MetaMap identified the clinical conditions with a recall of 0.72 and a precision of 0.56. Necessary processing beyond MetaMap's indexing includes finding validation, temporal discrimination, anatomic location discrimination, finding-disease discrimination, and contextual inference. Successful identification of clinical conditions in an emergency department report with MetaMap requires processing techniques specific to the clinical question of interest.


Subject(s)
Abstracting and Indexing , Natural Language Processing , Population Surveillance , Respiratory Tract Diseases/diagnosis , Unified Medical Language System , Bioterrorism , Disease Outbreaks/prevention & control , Emergency Service, Hospital , Humans , Information Storage and Retrieval , Medical Records , Vocabulary, Controlled
13.
J Biomed Inform ; 37(2): 120-7, 2004 Apr.
Article in English | MEDLINE | ID: mdl-15120658

ABSTRACT

Automatic detection of cases of febrile illness may have potential for early detection of outbreaks of infectious disease either by identification of anomalous numbers of febrile illness or in concert with other information in diagnosing specific syndromes, such as febrile respiratory syndrome. At most institutions, febrile information is contained only in free-text clinical records. We compared the sensitivity and specificity of three fever detection algorithms for detecting fever from free-text. Keyword CC and CoCo classified patients based on triage chief complaints; Keyword HP classified patients based on dictated emergency department reports. Keyword HP was the most sensitive (sensitivity 0.98, specificity 0.89), and Keyword CC was the most specific (sensitivity 0.61, specificity 1.0). Because chief complaints are available sooner than emergency department reports, we suggest a combined application that classifies patients based on their chief complaint followed by classification based on their emergency department report, once the report becomes available.


Subject(s)
Communicable Disease Control/methods , Disease Outbreaks/prevention & control , Fever/diagnosis , Information Storage and Retrieval/methods , Medical Records Systems, Computerized , Natural Language Processing , Population Surveillance/methods , Algorithms , Database Management Systems , Disease Notification/methods , Fever/epidemiology , Humans , Terminology as Topic
14.
J Biomed Inform ; 36(3): 177-88, 2003 Jun.
Article in English | MEDLINE | ID: mdl-14615227

ABSTRACT

A large number of biological agents can cause natural or bioterroristic disease outbreaks and each can present in a bewildering number of ways (e.g., a few cases versus many cases, confined to a building versus widely disseminated). This 'problem space' is a challenge for designers of early warning systems for disease outbreaks and the sheer size of this space is a barrier to progress. This paper addresses this problem by deriving nine categories of threats that represent a parsimonious characterization of the problem space. A literature search also identified one or more example outbreaks for each of the nine categories. These outbreaks have occurred in recent times and could be used by researchers in need of actual outbreak data for investigations of the role of different types of surveillance data and algorithms in outbreak detection. The methodological contribution of this research is a Criterion Set of threats for analysis and evaluation of detection systems. This set characterizes the problem space in a tractable manner with less loss of generality than analyses based on one or two selected diseases, which is representative of current analyses.


Subject(s)
Bioterrorism/classification , Bioterrorism/prevention & control , Disease Outbreaks/prevention & control , Population Surveillance/methods , Research Design , Risk Assessment/methods , United States/epidemiology
SELECTION OF CITATIONS
SEARCH DETAIL
...