Your browser doesn't support javascript.
loading
Text mining for disease surveillance in veterinary clinical data: part two, training computers to identify features in clinical text.
Davies, Heather; Nenadic, Goran; Alfattni, Ghada; Arguello Casteleiro, Mercedes; Al Moubayed, Noura; Farrell, Sean; Radford, Alan D; Noble, P-J M.
Affiliation
  • Davies H; Institute of Infection, Veterinary and Ecological Sciences, University of Liverpool, Liverpool, United Kingdom.
  • Nenadic G; Department of Computer Science, Manchester University, Manchester, United Kingdom.
  • Alfattni G; Department of Computer Science, Manchester University, Manchester, United Kingdom.
  • Arguello Casteleiro M; Department of Computer Science, Manchester University, Manchester, United Kingdom.
  • Al Moubayed N; Department of Computer Science, Durham University, Durham, United Kingdom.
  • Farrell S; Department of Computer Science, Durham University, Durham, United Kingdom.
  • Radford AD; Institute of Infection, Veterinary and Ecological Sciences, University of Liverpool, Liverpool, United Kingdom.
  • Noble PM; Institute of Infection, Veterinary and Ecological Sciences, University of Liverpool, Liverpool, United Kingdom.
Front Vet Sci ; 11: 1352726, 2024.
Article in En | MEDLINE | ID: mdl-39239390
ABSTRACT
In part two of this mini-series, we evaluate the range of machine-learning tools now available for application to veterinary clinical text-mining. These tools will be vital to automate extraction of information from large datasets of veterinary clinical narratives curated by projects such as the Small Animal Veterinary Surveillance Network (SAVSNET) and VetCompass, where volumes of millions of records preclude reading records and the complexities of clinical notes limit usefulness of more "traditional" text-mining approaches. We discuss the application of various machine learning techniques ranging from simple models for identifying words and phrases with similar meanings to expand lexicons for keyword searching, to the use of more complex language models. Specifically, we describe the use of language models for record annotation, unsupervised approaches for identifying topics within large datasets, and discuss more recent developments in the area of generative models (such as ChatGPT). As these models become increasingly complex it is pertinent that researchers and clinicians work together to ensure that the outputs of these models are explainable in order to instill confidence in any conclusions drawn from them.
Key words

Full text: 1 Collection: 01-internacional Database: MEDLINE Language: En Journal: Front Vet Sci Year: 2024 Document type: Article Affiliation country: United kingdom Country of publication: Switzerland

Full text: 1 Collection: 01-internacional Database: MEDLINE Language: En Journal: Front Vet Sci Year: 2024 Document type: Article Affiliation country: United kingdom Country of publication: Switzerland