Search | VHL Regional Portal

1.

A large-scale multivariate soccer athlete health, performance, and position monitoring dataset.

Midoglu, Cise; Kjæreng Winther, Andreas; Boeker, Matthias; Dahl Pettersen, Susann; Pedersen, Sigurd; Ragab, Nourhan; Kupka, Tomas; Hicks, Steven A; Bredsgaard Randers, Morten; Jain, Ramesh; Dagenborg, Håvard J; Pettersen, Svein Arne; Johansen, Dag; Riegler, Michael A; Halvorsen, Pål.

Sci Data ; 11(1): 553, 2024 May 30.

Article in English | MEDLINE | ID: mdl-38816403

ABSTRACT

Data analysis for athletic performance optimization and injury prevention is of tremendous interest to sports teams and the scientific community. However, sports data are often sparse and hard to obtain due to legal restrictions, unwillingness to share, and lack of personnel resources to be assigned to the tedious process of data curation. These constraints make it difficult to develop automated systems for analysis, which require large datasets for learning. We therefore present SoccerMon, the largest soccer athlete dataset available today containing both subjective and objective metrics, collected from two different elite women's soccer teams over two years. Our dataset contains 33,849 subjective reports and 10,075 objective reports, the latter including over six billion GPS position measurements. SoccerMon can not only play a valuable role in developing better analysis and prediction systems for soccer, but also inspire similar data collection activities in other domains which can benefit from subjective athlete reports, GPS position information, and/or time-series data in general.

Subject(s)

Athletic Performance , Soccer , Humans , Female , Geographic Information Systems , Athletes

2.

A dataset for predicting cloud cover over Europe.

Svennevik, Hanna; Hicks, Steven A; Riegler, Michael A; Storelvmo, Trude; Hammer, Hugo L.

Sci Data ; 11(1): 245, 2024 Feb 27.

Article in English | MEDLINE | ID: mdl-38413601

ABSTRACT

Clouds are important factors when projecting future climate. Unfortunately, future cloud fractional cover (the portion of the sky covered by clouds) is associated with significant uncertainty, making climate projections difficult. In this paper, we present the European Cloud Cover dataset, which can be used to learn statistical relations between cloud cover and other environmental variables, to potentially improve future climate projections. The dataset was created using a novel technique called Area Weighting Regridding Scheme to map satellite observations to cloud fractional cover on the same grid as the other variables in the dataset. Baseline experiments using autoregressive models document that it is possible to use the dataset to predict cloud fractional cover.

3.

Sperm motility assessed by deep convolutional neural networks into WHO categories.

Haugen, Trine B; Witczak, Oliwia; Hicks, Steven A; Björndahl, Lars; Andersen, Jorunn M; Riegler, Michael A.

Sci Rep ; 13(1): 14777, 2023 09 07.

Article in English | MEDLINE | ID: mdl-37679484

ABSTRACT

Semen analysis is central in infertility investigation. Manual assessment of sperm motility according to the WHO recommendations is the golden standard, and extensive training is a requirement for accurate and reproducible results. Deep convolutional neural networks (DCNN) are especially suitable for image classification. In this study, we evaluated the performance of the DCNN ResNet-50 in predicting the proportion of sperm in the WHO motility categories. Two models were evaluated using tenfold cross-validation with 65 video recordings of wet semen preparations from an external quality assessment programme for semen analysis. The corresponding manually assessed data was obtained from several of the reference laboratories, and the mean values were used for training of the DCNN models. One model was trained to predict the three categories progressive motility, non-progressive motility, and immotile spermatozoa. Another model was used in predicting four categories, where progressive motility was differentiated into rapid and slow. The resulting average mean absolute error (MAE) was 0.05 and 0.07, and the average ZeroR baseline was 0.09 and 0.10 for the three-category and the four-category model, respectively. Manual and DCNN-predicted motility was compared by Pearson's correlation coefficient and by difference plots. The strongest correlation between the mean manually assessed values and DCNN-predicted motility was observed for % progressively motile spermatozoa (Pearson's r = 0.88, p < 0.001) and % immotile spermatozoa (r = 0.89, p < 0.001). For rapid progressive motility, the correlation was moderate (Pearson's r = 0.673, p < 0.001). The median difference between manual and predicted progressive motility was 0 and 2 for immotile spermatozoa. The largest bias was observed at high and low percentages of progressive and immotile spermatozoa. The DCNN-predicted value was within the range of the interlaboratory variation of the results for most of the samples. In conclusion, DCNN models were able to predict the proportion of spermatozoa into the WHO motility categories with significantly lower error than the baseline. The best correlation between the manual and the DCNN-predicted motility values was found for the categories progressive and immotile. Of note, there was considerable variation between the mean motility values obtained for each category by the reference laboratories, especially for rapid progressive motility, which impacts the training of the DCNN models.

Subject(s)

Semen , Sperm Motility , Male , Humans , Semen Analysis , Neural Networks, Computer , World Health Organization

4.

Usefulness of Heat Map Explanations for Deep-Learning-Based Electrocardiogram Analysis.

Storås, Andrea M; Andersen, Ole Emil; Lockhart, Sam; Thielemann, Roman; Gnesin, Filip; Thambawita, Vajira; Hicks, Steven A; Kanters, Jørgen K; Strümke, Inga; Halvorsen, Pål; Riegler, Michael A.

Diagnostics (Basel) ; 13(14)2023 Jul 11.

Article in English | MEDLINE | ID: mdl-37510089

ABSTRACT

Deep neural networks are complex machine learning models that have shown promising results in analyzing high-dimensional data such as those collected from medical examinations. Such models have the potential to provide fast and accurate medical diagnoses. However, the high complexity makes deep neural networks and their predictions difficult to understand. Providing model explanations can be a way of increasing the understanding of "black box" models and building trust. In this work, we applied transfer learning to develop a deep neural network to predict sex from electrocardiograms. Using the visual explanation method Grad-CAM, heat maps were generated from the model in order to understand how it makes predictions. To evaluate the usefulness of the heat maps and determine if the heat maps identified electrocardiogram features that could be recognized to discriminate sex, medical doctors provided feedback. Based on the feedback, we concluded that, in our setting, this mode of explainable artificial intelligence does not provide meaningful information to medical doctors and is not useful in the clinic. Our results indicate that improved explanation techniques that are tailored to medical data should be developed before deep neural networks can be applied in the clinic for diagnostic purposes.

5.

VISEM-Tracking, a human spermatozoa tracking dataset.

Thambawita, Vajira; Hicks, Steven A; Storås, Andrea M; Nguyen, Thu; Andersen, Jorunn M; Witczak, Oliwia; Haugen, Trine B; Hammer, Hugo L; Halvorsen, Pål; Riegler, Michael A.

Sci Data ; 10(1): 260, 2023 05 09.

Article in English | MEDLINE | ID: mdl-37156762

ABSTRACT

A manual assessment of sperm motility requires microscopy observation, which is challenging due to the fast-moving spermatozoa in the field of view. To obtain correct results, manual evaluation requires extensive training. Therefore, computer-aided sperm analysis (CASA) has become increasingly used in clinics. Despite this, more data is needed to train supervised machine learning approaches in order to improve accuracy and reliability in the assessment of sperm motility and kinematics. In this regard, we provide a dataset called VISEM-Tracking with 20 video recordings of 30 seconds (comprising 29,196 frames) of wet semen preparations with manually annotated bounding-box coordinates and a set of sperm characteristics analyzed by experts in the domain. In addition to the annotated data, we provide unlabeled video clips for easy-to-use access and analysis of the data via methods such as self- or unsupervised learning. As part of this paper, we present baseline sperm detection performances using the YOLOv5 deep learning (DL) model trained on the VISEM-Tracking dataset. As a result, we show that the dataset can be used to train complex DL models to analyze spermatozoa.

Subject(s)

Semen , Sperm Motility , Spermatozoa , Humans , Male , Reproducibility of Results , Video Recording

6.

SinGAN-Seg: Synthetic training data generation for medical image segmentation.

Thambawita, Vajira; Salehi, Pegah; Sheshkal, Sajad Amouei; Hicks, Steven A; Hammer, Hugo L; Parasa, Sravanthi; Lange, Thomas de; Halvorsen, Pål; Riegler, Michael A.

PLoS One ; 17(5): e0267976, 2022.

Article in English | MEDLINE | ID: mdl-35500005

ABSTRACT

Analyzing medical data to find abnormalities is a time-consuming and costly task, particularly for rare abnormalities, requiring tremendous efforts from medical experts. Therefore, artificial intelligence has become a popular tool for the automatic processing of medical data, acting as a supportive tool for doctors. However, the machine learning models used to build these tools are highly dependent on the data used to train them. Large amounts of data can be difficult to obtain in medicine due to privacy reasons, expensive and time-consuming annotations, and a general lack of data samples for infrequent lesions. In this study, we present a novel synthetic data generation pipeline, called SinGAN-Seg, to produce synthetic medical images with corresponding masks using a single training image. Our method is different from the traditional generative adversarial networks (GANs) because our model needs only a single image and the corresponding ground truth to train. We also show that the synthetic data generation pipeline can be used to produce alternative artificial segmentation datasets with corresponding ground truth masks when real datasets are not allowed to share. The pipeline is evaluated using qualitative and quantitative comparisons between real data and synthetic data to show that the style transfer technique used in our pipeline significantly improves the quality of the generated data and our method is better than other state-of-the-art GANs to prepare synthetic images when the size of training datasets are limited. By training UNet++ using both real data and the synthetic data generated from the SinGAN-Seg pipeline, we show that the models trained on synthetic data have very close performances to those trained on real data when both datasets have a considerable amount of training data. In contrast, we show that synthetic data generated from the SinGAN-Seg pipeline improves the performance of segmentation models when training datasets do not have a considerable amount of data. All experiments were performed using an open dataset and the code is publicly available on GitHub.

Subject(s)

Deep Learning , Image Processing, Computer-Assisted , Algorithms , Artificial Intelligence , Image Processing, Computer-Assisted/methods , Neural Networks, Computer

7.

On evaluation metrics for medical applications of artificial intelligence.

Hicks, Steven A; Strümke, Inga; Thambawita, Vajira; Hammou, Malek; Riegler, Michael A; Halvorsen, Pål; Parasa, Sravanthi.

Sci Rep ; 12(1): 5979, 2022 04 08.

Article in English | MEDLINE | ID: mdl-35395867

ABSTRACT

Clinicians and software developers need to understand how proposed machine learning (ML) models could improve patient care. No single metric captures all the desirable properties of a model, which is why several metrics are typically reported to summarize a model's performance. Unfortunately, these measures are not easily understandable by many clinicians. Moreover, comparison of models across studies in an objective manner is challenging, and no tool exists to compare models using the same performance metrics. This paper looks at previous ML studies done in gastroenterology, provides an explanation of what different metrics mean in the context of binary classification in the presented studies, and gives a thorough explanation of how different metrics should be interpreted. We also release an open source web-based tool that may be used to aid in calculating the most relevant metrics presented in this paper so that other researchers and clinicians may easily incorporate them into their research.

Subject(s)

Artificial Intelligence , Benchmarking , Humans , Machine Learning , Software

8.

Impact of Image Resolution on Deep Learning Performance in Endoscopy Image Classification: An Experimental Study Using a Large Dataset of Endoscopic Images.

Thambawita, Vajira; Strümke, Inga; Hicks, Steven A; Halvorsen, Pål; Parasa, Sravanthi; Riegler, Michael A.

Diagnostics (Basel) ; 11(12)2021 Nov 24.

Article in English | MEDLINE | ID: mdl-34943421

ABSTRACT

Recent trials have evaluated the efficacy of deep convolutional neural network (CNN)-based AI systems to improve lesion detection and characterization in endoscopy. Impressive results are achieved, but many medical studies use a very small image resolution to save computing resources at the cost of losing details. Today, no conventions between resolution and performance exist, and monitoring the performance of various CNN architectures as a function of image resolution provides insights into how subtleties of different lesions on endoscopy affect performance. This can help set standards for image or video characteristics for future CNN-based models in gastrointestinal (GI) endoscopy. This study examines the performance of CNNs on the HyperKvasir dataset, consisting of 10,662 images from 23 different findings. We evaluate two CNN models for endoscopic image classification under quality distortions with image resolutions ranging from 32 × 32 to 512 × 512 pixels. The performance is evaluated using two-fold cross-validation and F1-score, maximum Matthews correlation coefficient (MCC), precision, and sensitivity as metrics. Increased performance was observed with higher image resolution for all findings in the dataset. MCC was achieved at image resolutions between 512 × 512 pixels for classification for the entire dataset after including all subclasses. The highest performance was observed with an MCC value of 0.9002 when the models were trained on the highest resolution and tested on the same resolution. Different resolutions and their effect on CNNs are explored. We show that image resolution has a clear influence on the performance which calls for standards in the field in the future.

9.

DeepFake electrocardiograms using generative adversarial networks are the beginning of the end for privacy issues in medicine.

Thambawita, Vajira; Isaksen, Jonas L; Hicks, Steven A; Ghouse, Jonas; Ahlberg, Gustav; Linneberg, Allan; Grarup, Niels; Ellervik, Christina; Olesen, Morten Salling; Hansen, Torben; Graff, Claus; Holstein-Rathlou, Niels-Henrik; Strümke, Inga; Hammer, Hugo L; Maleckar, Mary M; Halvorsen, Pål; Riegler, Michael A; Kanters, Jørgen K.

Sci Rep ; 11(1): 21896, 2021 11 09.

Article in English | MEDLINE | ID: mdl-34753975

ABSTRACT

Recent global developments underscore the prominent role big data have in modern medical science. But privacy issues constitute a prevalent problem for collecting and sharing data between researchers. However, synthetic data generated to represent real data carrying similar information and distribution may alleviate the privacy issue. In this study, we present generative adversarial networks (GANs) capable of generating realistic synthetic DeepFake 10-s 12-lead electrocardiograms (ECGs). We have developed and compared two methods, named WaveGAN* and Pulse2Pulse. We trained the GANs with 7,233 real normal ECGs to produce 121,977 DeepFake normal ECGs. By verifying the ECGs using a commercial ECG interpretation program (MUSE 12SL, GE Healthcare), we demonstrate that the Pulse2Pulse GAN was superior to the WaveGAN* to produce realistic ECGs. ECG intervals and amplitudes were similar between the DeepFake and real ECGs. Although these synthetic ECGs mimic the dataset used for creation, the ECGs are not linked to any individuals and may thus be used freely. The synthetic dataset will be available as open access for researchers at OSF.io and the DeepFake generator available at the Python Package Index (PyPI) for generating synthetic ECGs. In conclusion, we were able to generate realistic synthetic ECGs using generative adversarial neural networks on normal ECGs from two population studies, thereby addressing the relevant privacy issues in medical datasets.

Subject(s)

Electrocardiography , Neural Networks, Computer , Computer Simulation , Datasets as Topic , Humans , Privacy

10.

Explaining deep neural networks for knowledge discovery in electrocardiogram analysis.

Hicks, Steven A; Isaksen, Jonas L; Thambawita, Vajira; Ghouse, Jonas; Ahlberg, Gustav; Linneberg, Allan; Grarup, Niels; Strümke, Inga; Ellervik, Christina; Olesen, Morten Salling; Hansen, Torben; Graff, Claus; Holstein-Rathlou, Niels-Henrik; Halvorsen, Pål; Maleckar, Mary M; Riegler, Michael A; Kanters, Jørgen K.

Sci Rep ; 11(1): 10949, 2021 05 26.

Article in English | MEDLINE | ID: mdl-34040033

ABSTRACT

Deep learning-based tools may annotate and interpret medical data more quickly, consistently, and accurately than medical doctors. However, as medical doctors are ultimately responsible for clinical decision-making, any deep learning-based prediction should be accompanied by an explanation that a human can understand. We present an approach called electrocardiogram gradient class activation map (ECGradCAM), which is used to generate attention maps and explain the reasoning behind deep learning-based decision-making in ECG analysis. Attention maps may be used in the clinic to aid diagnosis, discover new medical knowledge, and identify novel features and characteristics of medical tests. In this paper, we showcase how ECGradCAM attention maps can unmask how a novel deep learning model measures both amplitudes and intervals in 12-lead electrocardiograms, and we show an example of how attention maps may be used to develop novel ECG features.

Subject(s)

Deep Learning , Electrocardiography , Knowledge Discovery , Models, Cardiovascular , Adult , Aged , Algorithms , Cardiologists , Data Accuracy , Diagnosis, Computer-Assisted , Female , Heart Diseases/diagnosis , Heart Diseases/physiopathology , Humans , Male , Middle Aged , Sex Determination Analysis

11.

Kvasir-Capsule, a video capsule endoscopy dataset.

Smedsrud, Pia H; Thambawita, Vajira; Hicks, Steven A; Gjestang, Henrik; Nedrejord, Oda Olsen; Næss, Espen; Borgli, Hanna; Jha, Debesh; Berstad, Tor Jan Derek; Eskeland, Sigrun L; Lux, Mathias; Espeland, Håvard; Petlund, Andreas; Nguyen, Duc Tien Dang; Garcia-Ceja, Enrique; Johansen, Dag; Schmidt, Peter T; Toth, Ervin; Hammer, Hugo L; de Lange, Thomas; Riegler, Michael A; Halvorsen, Pål.

Sci Data ; 8(1): 142, 2021 05 27.

Article in English | MEDLINE | ID: mdl-34045470

ABSTRACT

Artificial intelligence (AI) is predicted to have profound effects on the future of video capsule endoscopy (VCE) technology. The potential lies in improving anomaly detection while reducing manual labour. Existing work demonstrates the promising benefits of AI-based computer-assisted diagnosis systems for VCE. They also show great potential for improvements to achieve even better results. Also, medical data is often sparse and unavailable to the research community, and qualified medical personnel rarely have time for the tedious labelling work. We present Kvasir-Capsule, a large VCE dataset collected from examinations at a Norwegian Hospital. Kvasir-Capsule consists of 117 videos which can be used to extract a total of 4,741,504 image frames. We have labelled and medically verified 47,238 frames with a bounding box around findings from 14 different classes. In addition to these labelled images, there are 4,694,266 unlabelled frames included in the dataset. The Kvasir-Capsule dataset can play a valuable role in developing better algorithms in order to reach true potential of VCE technology.

Subject(s)

Capsule Endoscopy , Intestinal Diseases/pathology , Intestine, Small/pathology , Machine Learning , Humans

12.

Machine Learning-Based Analysis of Sperm Videos and Participant Data for Male Fertility Prediction.

Hicks, Steven A; Andersen, Jorunn M; Witczak, Oliwia; Thambawita, Vajira; Halvorsen, Pål; Hammer, Hugo L; Haugen, Trine B; Riegler, Michael A.

Sci Rep ; 9(1): 16770, 2019 11 14.

Article in English | MEDLINE | ID: mdl-31727961

ABSTRACT

Methods for automatic analysis of clinical data are usually targeted towards a specific modality and do not make use of all relevant data available. In the field of male human reproduction, clinical and biological data are not used to its fullest potential. Manual evaluation of a semen sample using a microscope is time-consuming and requires extensive training. Furthermore, the validity of manual semen analysis has been questioned due to limited reproducibility, and often high inter-personnel variation. The existing computer-aided sperm analyzer systems are not recommended for routine clinical use due to methodological challenges caused by the consistency of the semen sample. Thus, there is a need for an improved methodology. We use modern and classical machine learning techniques together with a dataset consisting of 85 videos of human semen samples and related participant data to automatically predict sperm motility. Used techniques include simple linear regression and more sophisticated methods using convolutional neural networks. Our results indicate that sperm motility prediction based on deep learning using sperm motility videos is rapid to perform and consistent. Adding participant data did not improve the algorithms performance. In conclusion, machine learning-based automatic analysis may become a valuable tool in male infertility investigation and research.

Subject(s)

Infertility, Male/diagnosis , Semen Analysis/methods , Spermatozoa/physiology , Humans , Machine Learning , Male , Microscopy, Video , Neural Networks, Computer , Reproducibility of Results , Sperm Motility

ABSTRACT

Subject(s)

ABSTRACT

ABSTRACT

Subject(s)

ABSTRACT

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

SEND TO:

SELECTION OF CITATIONS

SEARCH DETAIL