Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 8 de 8
Filter
Add more filters










Database
Language
Publication year range
1.
Crisis ; 45(2): 136-143, 2024 Mar.
Article in English | MEDLINE | ID: mdl-37818627

ABSTRACT

Background: Suicide rates continue to rise for adolescents in the United States. 62% of teenagers use Instagram, and as technology and research in this domain advance, social media posts could provide insights into near-term adolescent risk states and could inform new strategies for suicide prevention. This study analyzed language in captions of teenagers' Instagram accounts in the 3 months before suicide and compared caption language to matched living controls. Method: The study identified 89 teenagers who died by suicide using obituaries and news reports and 89 matched living control teenagers. Linguistic Inquiry and Word Count (LIWC) software was used to test for differences in specific language categories across linguistic, psychological, and topical categories (e.g., word count, tone, grammar, affective, cognitive, social, punctuation marks, etc.). Results: Significant differences between suicide decedents and living controls were found. Adolescent suicide decedents used more words per sentence, more references to sadness, male individuals, drives, and leisure and fewer verbs and references to they, affiliation, achievement, and power. Limitations: Methodological limitations include the use of only public accounts, small sample size, occasional short posts, and lack of adjustment for multiple testing. Conclusion: Although the sample size is relatively small and only included youth with public accounts, we identified differences in Instagram caption language between adolescents who died by suicide as compared to living controls.


Subject(s)
Social Media , Suicide , Adolescent , Humans , Male , United States , Suicide/psychology , Linguistics , Language , Suicide Prevention
2.
JAMIA Open ; 6(4): ooad085, 2023 Dec.
Article in English | MEDLINE | ID: mdl-37799347

ABSTRACT

Objectives: To develop and test a scalable, performant, and rule-based model for identifying 3 major domains of social needs (residential instability, food insecurity, and transportation issues) from the unstructured data in electronic health records (EHRs). Materials and Methods: We included patients aged 18 years or older who received care at the Johns Hopkins Health System (JHHS) between July 2016 and June 2021 and had at least 1 unstructured (free-text) note in their EHR during the study period. We used a combination of manual lexicon curation and semiautomated lexicon creation for feature development. We developed an initial rules-based pipeline (Match Pipeline) using 2 keyword sets for each social needs domain. We performed rule-based keyword matching for distinct lexicons and tested the algorithm using an annotated dataset comprising 192 patients. Starting with a set of expert-identified keywords, we tested the adjustments by evaluating false positives and negatives identified in the labeled dataset. We assessed the performance of the algorithm using measures of precision, recall, and F1 score. Results: The algorithm for identifying residential instability had the best overall performance, with a weighted average for precision, recall, and F1 score of 0.92, 0.84, and 0.92 for identifying patients with homelessness and 0.84, 0.82, and 0.79 for identifying patients with housing insecurity. Metrics for the food insecurity algorithm were high but the transportation issues algorithm was the lowest overall performing metric. Discussion: The NLP algorithm in identifying social needs at JHHS performed relatively well and would provide the opportunity for implementation in a healthcare system. Conclusion: The NLP approach developed in this project could be adapted and potentially operationalized in the routine data processes of a healthcare system.

4.
Psychiatr Serv ; 74(1): 56-62, 2023 01 01.
Article in English | MEDLINE | ID: mdl-35652194

ABSTRACT

The disability determination process of the Social Security Administration's (SSA's) disability program requires assessing work-related functioning for individual claimants alleging disability due to mental impairment. This task is particularly challenging because the determination process involves the review of a large file of information, including objective medical evidence and self-reports from claimants, families, and former employers. To improve this decision-making process, SSA entered an interagency agreement with the Rehabilitation Medicine Department, Epidemiology and Biostatistics Section, in the Clinical Center of the National Institutes of Health, intending to use data science and informatics to develop decision support tools. This collaborative effort over the past decade has led to the development of the Work Disability-Functional Assessment Battery and has initiated an approach to applying natural language processing to the review of claimants' files for information on mental health functioning. This informatics research collaboration holds promise for improving the process of disability determination for individuals with mental impairments who make claims at the SSA.


Subject(s)
Disabled Persons , Mental Health , United States , Humans , United States Social Security Administration , Social Security , Disability Evaluation , Informatics
5.
JMIR Med Inform ; 10(3): e32245, 2022 Mar 18.
Article in English | MEDLINE | ID: mdl-35302510

ABSTRACT

Natural language processing (NLP) in health care enables transformation of complex narrative information into high value products such as clinical decision support and adverse event monitoring in real time via the electronic health record (EHR). However, information technologies for mental health have consistently lagged because of the complexity of measuring and modeling mental health and illness. The use of NLP to support management of mental health conditions is a viable topic that has not been explored in depth. This paper provides a framework for the advanced application of NLP methods to identify, extract, and organize information on mental health and functioning to inform the decision-making process applied to assessing mental health. We present a use-case related to work disability, guided by the disability determination process of the US Social Security Administration (SSA). From this perspective, the following questions must be addressed about each problem that leads to a disability benefits claim: When did the problem occur and how long has it existed? How severe is it? Does it affect the person's ability to work? and What is the source of the evidence about the problem? Our framework includes 4 dimensions of medical information that are central to assessing disability-temporal sequence and duration, severity, context, and information source. We describe key aspects of each dimension and promising approaches for application in mental functioning. For example, to address temporality, a complete functional timeline must be created with all relevant aspects of functioning such as intermittence, persistence, and recurrence. Severity of mental health symptoms can be successfully identified and extracted on a 4-level ordinal scale from absent to severe. Some NLP work has been reported on the extraction of context for specific cases of wheelchair use in clinical settings. We discuss the links between the task of information source assessment and work on source attribution, coreference resolution, event extraction, and rule-based methods. Gaps were identified in NLP applications that directly applied to the framework and in existing relevant annotated data sets. We highlighted NLP methods with the potential for advanced application in the field of mental functioning. Findings of this work will inform the development of instruments for supporting SSA adjudicators in their disability determination process. The 4 dimensions of medical information may have relevance for a broad array of individuals and organizations responsible for assessing mental health function and ability. Further, our framework with 4 specific dimensions presents significant opportunity for the application of NLP in the realm of mental health and functioning beyond the SSA setting, and it may support the development of robust tools and methods for decision-making related to clinical care, program implementation, and other outcomes.

6.
Int J Med Inform ; 147: 104351, 2021 03.
Article in English | MEDLINE | ID: mdl-33401169

ABSTRACT

BACKGROUND: Secondary use of Electronic Health Records (EHRs) has mostly focused on health conditions (diseases and drugs). Function is an important health indicator in addition to morbidity and mortality. Nevertheless, function has been overlooked in accessing patients' health status. The World Health Organization (WHO)'s International Classification of Functioning, Disability and Health (ICF) is considered the international standard for describing and coding function and health states. We pioneer the first comprehensive analysis and identification of functioning concepts in the Mobility domain of the ICF. RESULTS: Using physical therapy notes at the National Institutes of Health's Clinical Center, we induced a hierarchical order of mobility-related entities including 5 entities types, 3 relations, 8 attributes, and 33 attribute values. Two domain experts manually curated a gold standard corpus of 14,281 nested entity mentions from 400 clinical notes. Inter-annotator agreement (IAA) of exact matching averaged 92.3 % F1-score on mention text spans, and 96.6 % Cohen's kappa on attributes assignments. A high-performance Ensemble machine learning model for named entity recognition (NER) was trained and evaluated using the gold standard corpus. Average F1-score on exact entity matching of our Ensemble method (84.90 %) outperformed popular NER methods: Conditional Random Field (80.4 %), Recurrent Neural Network (81.82 %), and Bidirectional Encoder Representations from Transformers (82.33 %). CONCLUSIONS: The results of this study show that mobility functioning information can be reliably captured from clinical notes once adequate resources are provided for sequence labeling methods. We expect that functioning concepts in other domains of the ICF can be identified in similar fashion.


Subject(s)
Machine Learning , Neural Networks, Computer , Electronic Health Records , Humans , Natural Language Processing
7.
J Am Med Inform Assoc ; 28(3): 516-532, 2021 03 01.
Article in English | MEDLINE | ID: mdl-33319905

ABSTRACT

OBJECTIVES: Normalizing mentions of medical concepts to standardized vocabularies is a fundamental component of clinical text analysis. Ambiguity-words or phrases that may refer to different concepts-has been extensively researched as part of information extraction from biomedical literature, but less is known about the types and frequency of ambiguity in clinical text. This study characterizes the distribution and distinct types of ambiguity exhibited by benchmark clinical concept normalization datasets, in order to identify directions for advancing medical concept normalization research. MATERIALS AND METHODS: We identified ambiguous strings in datasets derived from the 2 available clinical corpora for concept normalization and categorized the distinct types of ambiguity they exhibited. We then compared observed string ambiguity in the datasets with potential ambiguity in the Unified Medical Language System (UMLS) to assess how representative available datasets are of ambiguity in clinical language. RESULTS: We found that <15% of strings were ambiguous within the datasets, while over 50% were ambiguous in the UMLS, indicating only partial coverage of clinical ambiguity. The percentage of strings in common between any pair of datasets ranged from 2% to only 36%; of these, 40% were annotated with different sets of concepts, severely limiting generalization. Finally, we observed 12 distinct types of ambiguity, distributed unequally across the available datasets, reflecting diverse linguistic and medical phenomena. DISCUSSION: Existing datasets are not sufficient to cover the diversity of clinical concept ambiguity, limiting both training and evaluation of normalization methods for clinical text. Additionally, the UMLS offers important semantic information for building and evaluating normalization methods. CONCLUSIONS: Our findings identify 3 opportunities for concept normalization research, including a need for ambiguity-specific clinical datasets and leveraging the rich semantics of the UMLS in new methods and evaluation measures for normalization.


Subject(s)
Datasets as Topic , Electronic Health Records , Terminology as Topic , Unified Medical Language System , Deep Learning , Natural Language Processing , Semantics , Vocabulary, Controlled
8.
BMC Public Health ; 19(1): 1288, 2019 Oct 15.
Article in English | MEDLINE | ID: mdl-31615472

ABSTRACT

BACKGROUND: Human activity and the interaction between health conditions and activity is a critical part of understanding the overall function of individuals. The World Health Organization's International Classification of Functioning, Disability and Health (ICF) models function as all aspects of an individual's interaction with the world, including organismal concepts such as individual body structures, functions, and pathologies, as well as the outcomes of the individual's interaction with their environment, referred to as activity and participation. Function, particularly activity and participation outcomes, is an important indicator of health at both the level of an individual and the population level, as it is highly correlated with quality of life and a critical component of identifying resource needs. Since it reflects the cumulative impact of health conditions on individuals and is not disease specific, its use as a health indicator helps to address major barriers to holistic, patient-centered care that result from multiple, and often competing, disease specific interventions. While the need for better information on function has been widely endorsed, this has not translated into its routine incorporation into modern health systems. PURPOSE: We present the importance of capturing information on activity as a core component of modern health systems and identify specific steps and analytic methods that can be used to make it more available to utilize in improving patient care. We identify challenges in the use of activity and participation information, such as a lack of consistent documentation and diversity of data specificity and representation across providers, health systems, and national surveys. We describe how activity and participation information can be more effectively captured, and how health informatics methodologies, including natural language processing (NLP), can enable automatically locating, extracting, and organizing this information on a large scale, supporting standardization and utilization with minimal additional provider burden. We examine the analytic requirements and potential challenges of capturing this information with informatics, and describe how data-driven techniques can combine with common standards and documentation practices to make activity and participation information standardized and accessible for improving patient care. RECOMMENDATIONS: We recommend four specific actions to improve the capture and analysis of activity and participation information throughout the continuum of care: (1) make activity and participation annotation standards and datasets available to the broader research community; (2) define common research problems in automatically processing activity and participation information; (3) develop robust, machine-readable ontologies for function that describe the components of activity and participation information and their relationships; and (4) establish standards for how and when to document activity and participation status during clinical encounters. We further provide specific short-term goals to make significant progress in each of these areas within a reasonable time frame.


Subject(s)
Data Collection , Medical Informatics , Humans
SELECTION OF CITATIONS
SEARCH DETAIL
...