Your browser doesn't support javascript.
An Overview of Supervised Machine Learning Methods and Data Analysis for COVID-19 Detection.
Tchagna Kouanou, Aurelle; Mih Attia, Thomas; Feudjio, Cyrille; Djeumo, Anges Fleurio; Ngo Mouelas, Adèle; Nzogang, Mendel Patrice; Tchito Tchapga, Christian; Tchiotsop, Daniel.
  • Tchagna Kouanou A; Department of Computer Engineering, College of Technology, University of Buea, Buea, Cameroon.
  • Mih Attia T; Department of Training, Research Development and Innovation, InchTech's Solutions, Yaoundé, Cameroon.
  • Feudjio C; Department of Computer Engineering, College of Technology, University of Buea, Buea, Cameroon.
  • Djeumo AF; Department of Electrical and Electronic Engineering, College of Technology, University of Buea, Buea, Cameroon.
  • Ngo Mouelas A; Department of Training, Research Development and Innovation, InchTech's Solutions, Yaoundé, Cameroon.
  • Nzogang MP; Department of Training, Research Development and Innovation, InchTech's Solutions, Yaoundé, Cameroon.
  • Tchito Tchapga C; Ecole Nationale Supérieur Polytechnique, University of Yaounde 1, Yaoundé, Cameroon.
  • Tchiotsop D; Faculté de Médecine et des Sciences Biomédicales, University of Yaounde 1, Yaoundé, Cameroon.
J Healthc Eng ; 2021: 4733167, 2021.
Article in English | MEDLINE | ID: covidwho-1546589
ABSTRACT

Methods:

Our analysis and machine learning algorithm is based on most cited two clinical datasets from the literature one from San Raffaele Hospital Milan Italia and the other from Hospital Israelita Albert Einstein São Paulo Brasilia. The datasets were processed to select the best features that most influence the target, and it turned out that almost all of them are blood parameters. EDA (Exploratory Data Analysis) methods were applied to the datasets, and a comparative study of supervised machine learning models was done, after which the support vector machine (SVM) was selected as the one with the best performance.

Results:

SVM being the best performant is used as our proposed supervised machine learning algorithm. An accuracy of 99.29%, sensitivity of 92.79%, and specificity of 100% were obtained with the dataset from Kaggle (https//www.kaggle.com/einsteindata4u/covid19) after applying optimization to SVM. The same procedure and work were performed with the dataset taken from San Raffaele Hospital (https//zenodo.org/record/3886927#.YIluB5AzbMV). Once more, the SVM presented the best performance among other machine learning algorithms, and 92.86%, 93.55%, and 90.91% for accuracy, sensitivity, and specificity, respectively, were obtained.

Conclusion:

The obtained results, when compared with others from the literature based on these same datasets, are superior, leading us to conclude that our proposed solution is reliable for the COVID-19 diagnosis.
Subject(s)

Full text: Available Collection: International databases Database: MEDLINE Main subject: Data Analysis / COVID-19 Type of study: Diagnostic study / Prognostic study Limits: Humans Country/Region as subject: South America / Brazil Language: English Journal: J Healthc Eng Year: 2021 Document Type: Article Affiliation country: 2021

Similar

MEDLINE

...
LILACS

LIS


Full text: Available Collection: International databases Database: MEDLINE Main subject: Data Analysis / COVID-19 Type of study: Diagnostic study / Prognostic study Limits: Humans Country/Region as subject: South America / Brazil Language: English Journal: J Healthc Eng Year: 2021 Document Type: Article Affiliation country: 2021