Search | VHL Regional Portal

1.

A review of machine learning methods for cancer characterization from microbiome data.

Teixeira, Marco; Silva, Francisco; Ferreira, Rui M; Pereira, Tania; Figueiredo, Ceu; Oliveira, Hélder P.

NPJ Precis Oncol ; 8(1): 123, 2024 May 30.

Article in English | MEDLINE | ID: mdl-38816569

ABSTRACT

Recent studies have shown that the microbiome can impact cancer development, progression, and response to therapies suggesting microbiome-based approaches for cancer characterization. As cancer-related signatures are complex and implicate many taxa, their discovery often requires Machine Learning approaches. This review discusses Machine Learning methods for cancer characterization from microbiome data. It focuses on the implications of choices undertaken during sample collection, feature selection and pre-processing. It also discusses ML model selection, guiding how to choose an ML model, and model validation. Finally, it enumerates current limitations and how these may be surpassed. Proposed methods, often based on Random Forests, show promising results, however insufficient for widespread clinical usage. Studies often report conflicting results mainly due to ML models with poor generalizability. We expect that evaluating models with expanded, hold-out datasets, removing technical artifacts, exploring representations of the microbiome other than taxonomical profiles, leveraging advances in deep learning, and developing ML models better adapted to the characteristics of microbiome data will improve the performance and generalizability of models and enable their usage in the clinic.

2.

Optimized reconstruction of the absorption spectra of kidney tissues from the spectra of tissue components using the least squares method.

Pinheiro, Maria R; Fernandes, Luís E; Carneiro, Isa C; Carvalho, Sónia D; Henrique, Rui M; Tuchin, Valery V; Oliveira, Hélder P; Oliveira, Luís M.

J Biophotonics ; : e202300466, 2024 Feb 06.

Article in English | MEDLINE | ID: mdl-38318753

ABSTRACT

With the objective of developing new methods to acquire diagnostic information, the reconstruction of the broadband absorption coefficient spectra (µa [λ]) of healthy and chromophobe renal cell carcinoma kidney tissues was performed. By performing a weighted sum of the absorption spectra of proteins, DNA, oxygenated, and deoxygenated hemoglobin, lipids, water, melanin, and lipofuscin, it was possible to obtain a good match of the experimental µa (λ) of both kidney conditions. The weights used in those reconstructions were estimated using the least squares method, and assuming a total water content of 77% in both kidney tissues, it was possible to calculate the concentrations of the other tissue components. It has been shown that with the development of cancer, the concentrations of proteins, DNA, oxygenated hemoglobin, lipids, and lipofuscin increase, and the concentration of melanin decreases. Future studies based on minimally invasive spectral measurements will allow cancer diagnosis using the proposed approach.

3.

Mortality prediction using medical time series on TBI patients.

Fonseca, João; Liu, Xiuyun; Oliveira, Hélder P; Pereira, Tania.

Comput Methods Programs Biomed ; 242: 107806, 2023 Dec.

Article in English | MEDLINE | ID: mdl-37832428

ABSTRACT

BACKGROUND AND OBJECTIVE: Traumatic Brain Injury (TBI) is one of the leading causes of injury-related mortality in the world, with severe cases reaching mortality rates of 30-40%. It is highly heterogeneous both in causes and consequences making more complex the medical interpretation and prognosis. Gathering clinical, demographic, and laboratory data to perform a prognosis requires time and skill in several clinical specialties. Artificial intelligence (AI) methods can take advantage of existing data by performing helpful predictions and guiding physicians toward a better prognosis and, consequently, better healthcare. The objective of this work was to develop learning models and evaluate their capability of predicting the mortality of TBI. The predictive model would allow the early assessment of the more serious cases and scarce medical resources can be pointed toward the patients who need them most. METHODS: Long Short Term Memory (LSTM) and Transformer architectures were tested and compared in performance, coupled with data imbalance, missing data, and feature selection strategies. From the Medical Information Mart for Intensive Care III (MIMIC-III) dataset, a cohort of TBI patients was selected and an analysis of the first 48 hours of multiple time series sequential variables was done to predict hospital mortality. RESULTS: The best performance was obtained with the Transformer architecture, achieving an AUC of 0.907 with the larger group of features and trained with class proportion class weights and binary cross entropy loss. CONCLUSIONS: Using the time series sequential data, LSTM and Transformers proved to be both viable options for predicting TBI hospital mortality in 48 hours after admission. Overall, using sequential deep learning models with time series data to predict TBI mortality is viable and can be used as a helpful indicator of the well-being of patients.

Subject(s)

Artificial Intelligence , Brain Injuries, Traumatic , Humans , Time Factors , Brain Injuries, Traumatic/diagnosis , Prognosis , Critical Care

4.

Transformers for Energy Forecast.

Oliveira, Hugo S; Oliveira, Helder P.

Sensors (Basel) ; 23(15)2023 Aug 01.

Article in English | MEDLINE | ID: mdl-37571622

ABSTRACT

Forecasting energy consumption models allow for improvements in building performance and reduce energy consumption. Energy efficiency has become a pressing concern in recent years due to the increasing energy demand and concerns over climate change. This paper addresses the energy consumption forecast as a crucial ingredient in the technology to optimize building system operations and identifies energy efficiency upgrades. The work proposes a modified multi-head transformer model focused on multi-variable time series through a learnable weighting feature attention matrix to combine all input variables and forecast building energy consumption properly. The proposed multivariate transformer-based model is compared with two other recurrent neural network models, showing a robust performance while exhibiting a lower mean absolute percentage error. Overall, this paper highlights the superior performance of the modified transformer-based model for the energy consumption forecast in a multivariate step, allowing it to be incorporated in future forecasting tasks, allowing for the tracing of future energy consumption scenarios according to the current building usage, playing a significant role in creating a more sustainable and energy-efficient building usage.

5.

Evaluating the ability of an artificial-intelligence cloud-based platform designed to provide information prior to locoregional therapy for breast cancer in improving patient's satisfaction with therapy: The CINDERELLA trial.

Kaidar-Person, Orit; Antunes, Marilia; Cardoso, Jaime S; Ciani, Oriana; Cruz, Helena; Di Micco, Rosa; Gentilini, Oreste D; Gonçalves, Tiago; Gouveia, Pedro; Heil, Jörg; Kabata, Pawel; Lopes, Daniela; Martinho, Marta; Martins, Henrique; Mavioso, Carlos; Mika, Martin; Montenegro, Helena; Oliveira, Helder P; Pfob, André; Rotmensz, Nicole; Schinköthe, Timo; Silva, Giovani; Tarricone, Rosana; Cardoso, Maria-Joao.

PLoS One ; 18(8): e0289365, 2023.

Article in English | MEDLINE | ID: mdl-37535564

ABSTRACT

BACKGROUND: Breast cancer therapy improved significantly, allowing for different surgical approaches for the same disease stage, therefore offering patients different aesthetic outcomes with similar locoregional control. The purpose of the CINDERELLA trial is to evaluate an artificial-intelligence (AI) cloud-based platform (CINDERELLA platform) vs the standard approach for patient education prior to therapy. METHODS: A prospective randomized international multicentre trial comparing two methods for patient education prior to therapy. After institutional ethics approval and a written informed consent, patients planned for locoregional treatment will be randomized to the intervention (CINDERELLA platform) or controls. The patients in the intervention arm will use the newly designed web-application (CINDERELLA platform, CINDERELLA APProach) to access the information related to surgery and/or radiotherapy. Using an AI system, the platform will provide the patient with a picture of her own aesthetic outcome resulting from the surgical procedure she chooses, and an objective evaluation of this aesthetic outcome (e.g., good/fair). The control group will have access to the standard approach. The primary objectives of the trial will be i) to examine the differences between the treatment arms with regards to patients' pre-treatment expectations and the final aesthetic outcomes and ii) in the experimental arm only, the agreement of the pre-treatment AI-evaluation (output) and patient's post-therapy self-evaluation. DISCUSSION: The project aims to develop an easy-to-use cost-effective AI-powered tool that improves shared decision-making processes. We assume that the CINDERELLA APProach will lead to higher satisfaction, better psychosocial status, and wellbeing of breast cancer patients, and reduce the need for additional surgeries to improve aesthetic outcome.

Subject(s)

Artificial Intelligence , Breast Neoplasms , Female , Humans , Breast Neoplasms/surgery , Cloud Computing , Intelligence , Patient Satisfaction , Prospective Studies

6.

Single Modality vs. Multimodality: What Works Best for Lung Cancer Screening?

Sousa, Joana Vale; Matos, Pedro; Silva, Francisco; Freitas, Pedro; Oliveira, Hélder P; Pereira, Tania.

Sensors (Basel) ; 23(12)2023 Jun 15.

Article in English | MEDLINE | ID: mdl-37420765

ABSTRACT

In a clinical context, physicians usually take into account information from more than one data modality when making decisions regarding cancer diagnosis and treatment planning. Artificial intelligence-based methods should mimic the clinical method and take into consideration different sources of data that allow a more comprehensive analysis of the patient and, as a consequence, a more accurate diagnosis. Lung cancer evaluation, in particular, can benefit from this approach since this pathology presents high mortality rates due to its late diagnosis. However, many related works make use of a single data source, namely imaging data. Therefore, this work aims to study the prediction of lung cancer when using more than one data modality. The National Lung Screening Trial dataset that contains data from different sources, specifically, computed tomography (CT) scans and clinical data, was used for the study, the development and comparison of single-modality and multimodality models, that may explore the predictive capability of these two types of data to their full potential. A ResNet18 network was trained to classify 3D CT nodule regions of interest (ROI), whereas a random forest algorithm was used to classify the clinical data, with the former achieving an area under the ROC curve (AUC) of 0.7897 and the latter 0.5241. Regarding the multimodality approaches, three strategies, based on intermediate and late fusion, were implemented to combine the information from the 3D CT nodule ROIs and the clinical data. From those, the best model-a fully connected layer that receives as input a combination of clinical data and deep imaging features, given by a ResNet18 inference model-presented an AUC of 0.8021. Lung cancer is a complex disease, characterized by a multitude of biological and physiological phenomena and influenced by multiple factors. It is thus imperative that the models are capable of responding to that need. The results obtained showed that the combination of different types may have the potential to produce more comprehensive analyses of the disease by the models.

Subject(s)

Lung Neoplasms , Humans , Lung Neoplasms/diagnostic imaging , Lung Neoplasms/pathology , Artificial Intelligence , Early Detection of Cancer/methods , Tomography, X-Ray Computed/methods , Lung/pathology

7.

Machine learning-based approaches for cancer prediction using microbiome data.

Freitas, Pedro; Silva, Francisco; Sousa, Joana Vale; Ferreira, Rui M; Figueiredo, Céu; Pereira, Tania; Oliveira, Hélder P.

Sci Rep ; 13(1): 11821, 2023 07 21.

Article in English | MEDLINE | ID: mdl-37479864

ABSTRACT

Emerging evidence of the relationship between the microbiome composition and the development of numerous diseases, including cancer, has led to an increasing interest in the study of the human microbiome. Technological breakthroughs regarding DNA sequencing methods propelled microbiome studies with a large number of samples, which called for the necessity of more sophisticated data-analytical tools to analyze this complex relationship. The aim of this work was to develop a machine learning-based approach to distinguish the type of cancer based on the analysis of the tissue-specific microbial information, assessing the human microbiome as valuable predictive information for cancer identification. For this purpose, Random Forest algorithms were trained for the classification of five types of cancer-head and neck, esophageal, stomach, colon, and rectum cancers-with samples provided by The Cancer Microbiome Atlas database. One versus all and multi-class classification studies were conducted to evaluate the discriminative capability of the microbial data across increasing levels of cancer site specificity, with results showing a progressive rise in difficulty for accurate sample classification. Random Forest models achieved promising performances when predicting head and neck, stomach, and colon cancer cases, with the latter returning accuracy scores above 90% across the different studies conducted. However, there was also an increased difficulty when discriminating esophageal and rectum cancers, failing to differentiate with adequate results rectum from colon cancer cases, and esophageal from head and neck and stomach cancers. These results point to the fact that anatomically adjacent cancers can be more complex to identify due to microbial similarities. Despite the limitations, microbiome data analysis using machine learning may advance novel strategies to improve cancer detection and prevention, and decrease disease burden.

Subject(s)

Colonic Neoplasms , Microbiota , Rectal Neoplasms , Stomach Neoplasms , Humans , Colonic Neoplasms/diagnosis , Stomach Neoplasms/diagnosis , Machine Learning , Microbiota/genetics

8.

Synthesizing 3D Lung CT scans with Generative Adversarial Networks.

Ferreira, Artur; Pereira, Tania; Silva, Francisco; Vilares, Ana T; Silva, Miguel C; Cunha, Antonio; Oliveira, Helder P.

Annu Int Conf IEEE Eng Med Biol Soc ; 2022: 2033-2036, 2022 07.

Article in English | MEDLINE | ID: mdl-36085795

ABSTRACT

In the healthcare domain, datasets are often private and lack large amounts of samples, making it difficult to cope with the inherent patient data heterogeneity. As an attempt to mitigate data scarcity, generative models are being used due to their ability to produce new data, using a dataset as a reference. However, synthesis studies often rely on a 2D representation of data, a seriously limited form of information when it comes to lung computed tomography scans where, for example, pathologies like nodules can manifest anywhere in the organ. Here, we develop a 3D Progressive Growing Generative Adversarial Network capable of generating thoracic CT volumes at a resolution of 1283, and analyze the model outputs through a quantitative metric (3D Muli-Scale Structural Similarity) and a Visual Turing Test. Clinical relevance - This paper is a novel application of the 3D PGGAN model to synthesize CT lung scans. This preliminary study focuses on synthesizing the entire volume of the lung rather than just the lung nodules. The synthesized data represent an attempt to mitigate data scarcity which is one of the major limitations to create learning models with good generalization in healthcare.

Subject(s)

Thorax , Tomography, X-Ray Computed , Adaptation, Psychological , Generalization, Psychological , Humans , Lung/diagnostic imaging

9.

On the way for the best imaging features from CT images to predict EGFR Mutation Status in Lung Cancer.

Silva, Pedro; Pereira, Tania; Teixeira, Marco; Silva, Francisco; Oliveira, Helder P.

Annu Int Conf IEEE Eng Med Biol Soc ; 2022: 2659-2662, 2022 07.

Article in English | MEDLINE | ID: mdl-36085894

ABSTRACT

Artificial Intelligence-based tools have shown promising results to help clinicians in diagnosis tasks. Radio-genomics would aid in the genotype characterization using information from radiologic images. The prediction of the mutations status of main oncogenes associated with lung cancer will help the clinicians to have a more accurate diagnosis and a personalized treatment plan, decreasing the need to use the biopsy. In this work, novel and objective features were extracted from the lung that contained the nodule, and several machine learning methods were combined with feature selection techniques to select the best approach to predict the EGFR mutation status in lung cancer CT images. An AUC of 0.756 ± 0.055 was obtained using a logistic regression and independent component analysis as feature selector, supporting the hypothesis that CT images can capture pathophysiological information with great value for clinical assessment and personalized medicine of lung cancer. Clinical Relevance - Radiogenomic approaches could be an interesting help for lung cancer characterization. This work represents a preliminary study for the development of computer-aided decision systems to provide a more accurate and fast characterization of lung cancer which is fundamental for an adequate treatment plan for lung cancer patients.

Subject(s)

ErbB Receptors , Lung Neoplasms , Artificial Intelligence , ErbB Receptors/genetics , Humans , Lung Neoplasms/diagnostic imaging , Lung Neoplasms/genetics , Lung Neoplasms/pathology , Mutation , Tomography, X-Ray Computed/methods

10.

Unsupervised Approach for Malignancy Assessment of Lung Nodules in Computed Tomography Scans Using Radiomic Features.

Teixeira, Marco; Pereira, Tania; Silva, Francisco; Cunha, Antonio; Oliveira, Helder P.

Annu Int Conf IEEE Eng Med Biol Soc ; 2022: 2037-2040, 2022 07.

Article in English | MEDLINE | ID: mdl-36086366

ABSTRACT

Lung cancer is the leading cause of cancer death worldwide. Early low-dose computed tomography (CT) screening can decrease its mortality rate and computer-aided diagnoses systems may make these screenings more accessible. Radiomic features and supervised machine learning have traditionally been employed in these systems. Contrary to supervised methods, unsupervised learning techniques do not require large amounts of annotated data which are labor-intensive to gather and long training times. Therefore, recent approaches have used unsupervised methods, such as clustering, to improve the performance of supervised models. However, an analysis of purely unsupervised methods for malignancy prediction of lung nodules from CT images has not been performed. This work studies nodule malignancy in the LIDC-IDRI image collection of chest CT scans using established radiomic features and unsupervised learning methods based on k-Means, Spectral Clustering, and Gaussian Mixture clustering. All tested methods resulted in clusters of high homogeneity malignancy. Results suggest convex feature distributions and well-separated feature subspaces associated with different diagnoses. Furthermore, diagnosis uncertainty may be explained by common characteristics captured by radiomic features. The k-Means and Gaussian Mixture models are able to generalize to unseen data, achieving a balanced accuracy of 87.23% and 86.96% when inference was tested. These results motivate the usage of unsupervised approaches for malignancy prediction of lung nodules, such as cluster-then-label models. Clinical Relevance- Unsupervised clustering of radiomic features of lung nodules in chest CT scans can differentiate between malignant and benign cases and reflects experts' diagnosis uncertainty.

Subject(s)

Lung Neoplasms , Precancerous Conditions , Humans , Lung/diagnostic imaging , Lung/pathology , Lung Neoplasms/diagnostic imaging , Lung Neoplasms/pathology , Radionuclide Imaging , Tomography, X-Ray Computed/methods

11.

A Random Forest-based Classifier for MYCN Status Prediction in Neuroblastoma using CT Images.

Pereira, Tania; Silva, Francisco; Claro, Pedro; Carvalho, Diogo Costa; Dias, Silvia Costa; Torrao, Helena; Oliveira, Helder P.

Annu Int Conf IEEE Eng Med Biol Soc ; 2022: 3854-3857, 2022 07.

Article in English | MEDLINE | ID: mdl-36086471

ABSTRACT

Neuroblastoma (NB) is the most common extracranial solid tumor in childhood. Genomic amplification of MYCN is associated with poor outcomes and is detected in 16% of all NB cases. CT scans and MRI are the imaging techniques recommended for diagnosis and disease staging. The assessment of imaging features such as tumor volume, shape, and local extension represent relevant prognostic information. Radiogenomics have shown powerful results in the assessment of the genotype based on imaging findings automatically extracted from medical images. In this work, random forest was used to classify the MYCN amplification using radiomic features extracted from CT slices in a population of 46 NB patients. The learning model showed an area under the curve (AUC) of 0.85 ± 0.13, suggesting that radiomic-based methodologies might be helpful in the extraction of information that is not accessible by human naked eyes but could aid the clinicians on the diagnosis and treatment plan definition. Clinical relevance - This approach represents a random forest-based model to predict the MYCN amplification in NB patients that could give a faster, earlier, and repeatable analysis of the tumor along the time.

Subject(s)

Neuroblastoma , Area Under Curve , Humans , N-Myc Proto-Oncogene Protein/genetics , N-Myc Proto-Oncogene Protein/metabolism , Neuroblastoma/diagnostic imaging , Neuroblastoma/genetics , Tomography, X-Ray Computed

12.

Learning Models for Traumatic Brain Injury Mortality Prediction on Pediatric Electronic Health Records.

Fonseca, João; Liu, Xiuyun; Oliveira, Hélder P; Pereira, Tania.

Front Neurol ; 13: 859068, 2022.

Article in English | MEDLINE | ID: mdl-35756926

ABSTRACT

Background: Traumatic Brain Injury (TBI) is one of the leading causes of injury related mortality in the world, with severe cases reaching mortality rates of 30-40%. It is highly heterogeneous both in causes and consequences, complicating medical interpretation and prognosis. Gathering clinical, demographic, and laboratory data to perform a prognosis requires time and skill in several clinical specialties. Machine learning (ML) methods can take advantage of the data and guide physicians toward a better prognosis and, consequently, better healthcare. The objective of this study was to develop and test a wide range of machine learning models and evaluate their capability of predicting mortality of TBI, at hospital discharge, while assessing the similarity between the predictive value of the data and clinical significance. Methods: The used dataset is the Hackathon Pediatric Traumatic Brain Injury (HPTBI) dataset, composed of electronic health records containing clinical annotations and demographic data of 300 patients. Four different classification models were tested, either with or without feature selection. For each combination of the classification model and feature selection method, the area under the receiver operator curve (ROC-AUC), balanced accuracy, precision, and recall were calculated. Results: Methods based on decision trees perform better when using all features (Random Forest, AUC = 0.86 and XGBoost, AUC = 0.91) but other models require prior feature selection to obtain the best results (k-Nearest Neighbors, AUC = 0.90 and Artificial Neural Networks, AUC = 0.84). Additionally, Random Forest and XGBoost allow assessing the feature's importance, which could give insights for future strategies on the clinical routine. Conclusion: Predictive capability depends greatly on the combination of model and feature selection methods used but, overall, ML models showed a very good performance in mortality prediction for TBI. The feature importance results indicate that predictive value is not directly related to clinical significance.

13.

The Influence of a Coherent Annotation and Synthetic Addition of Lung Nodules for Lung Segmentation in CT Scans.

Sousa, Joana; Pereira, Tania; Neves, Inês; Silva, Francisco; Oliveira, Hélder P.

Sensors (Basel) ; 22(9)2022 Apr 30.

Article in English | MEDLINE | ID: mdl-35591132

ABSTRACT

Lung cancer is a highly prevalent pathology and a leading cause of cancer-related deaths. Most patients are diagnosed when the disease has manifested itself, which usually is a sign of lung cancer in an advanced stage and, as a consequence, the 5-year survival rates are low. To increase the chances of survival, improving the cancer early detection capacity is crucial, for which computed tomography (CT) scans represent a key role. The manual evaluation of the CTs is a time-consuming task and computer-aided diagnosis (CAD) systems can help relieve that burden. The segmentation of the lung is one of the first steps in these systems, yet it is very challenging given the heterogeneity of lung diseases usually present and associated with cancer development. In our previous work, a segmentation model based on a ResNet34 and U-Net combination was developed on a cross-cohort dataset that yielded good segmentation masks for multiple pathological conditions but misclassified some of the lung nodules. The multiple datasets used for the model development were originated from different annotation protocols, which generated inconsistencies for the learning process, and the annotations are usually not adequate for lung cancer studies since they did not comprise lung nodules. In addition, the initial datasets used for training presented a reduced number of nodules, which was showed not to be enough to allow the segmentation model to learn to include them as a lung part. In this work, an objective protocol for the lung mask's segmentation was defined and the previous annotations were carefully reviewed and corrected to create consistent and adequate ground-truth masks for the development of the segmentation model. Data augmentation with domain knowledge was used to create lung nodules in the cases used to train the model. The model developed achieved a Dice similarity coefficient (DSC) above 0.9350 for all test datasets and it showed an ability to cope, not only with a variety of lung patterns, but also with the presence of lung nodules as well. This study shows the importance of using consistent annotations for the supervised learning process, which is a very time-consuming task, but that has great importance to healthcare applications. Due to the lack of massive datasets in the medical field, which consequently brings a lack of wide representativity, data augmentation with domain knowledge could represent a promising help to overcome this limitation for learning models development.

Subject(s)

Lung Neoplasms , Tomography, X-Ray Computed , Diagnosis, Computer-Assisted , Humans , Image Processing, Computer-Assisted , Lung/diagnostic imaging , Lung Neoplasms/diagnostic imaging , Thorax

14.

Multiple instance learning for lung pathophysiological findings detection using CT scans.

Frade, Julieta; Pereira, Tania; Morgado, Joana; Silva, Francisco; Freitas, Cláudia; Mendes, José; Negrão, Eduardo; de Lima, Beatriz Flor; Silva, Miguel Correia da; Madureira, António J; Ramos, Isabel; Costa, José Luís; Hespanhol, Venceslau; Cunha, António; Oliveira, Hélder P.

Med Biol Eng Comput ; 60(6): 1569-1584, 2022 Jun.

Article in English | MEDLINE | ID: mdl-35386027

ABSTRACT

Lung diseases affect the lives of billions of people worldwide, and 4 million people, each year, die prematurely due to this condition. These pathologies are characterized by specific imagiological findings in CT scans. The traditional Computer-Aided Diagnosis (CAD) approaches have been showing promising results to help clinicians; however, CADs normally consider a small part of the medical image for analysis, excluding possible relevant information for clinical evaluation. Multiple Instance Learning (MIL) approach takes into consideration different small pieces that are relevant for the final classification and creates a comprehensive analysis of pathophysiological changes. This study uses MIL-based approaches to identify the presence of lung pathophysiological findings in CT scans for the characterization of lung disease development. This work was focus on the detection of the following: Fibrosis, Emphysema, Satellite Nodules in Primary Lesion Lobe, Nodules in Contralateral Lung and Ground Glass, being Fibrosis and Emphysema the ones with more outstanding results, reaching an Area Under the Curve (AUC) of 0.89 and 0.72, respectively. Additionally, the MIL-based approach was used for EGFR mutation status prediction - the most relevant oncogene on lung cancer, with an AUC of 0.69. The results showed that this comprehensive approach can be a useful tool for lung pathophysiological characterization.

Subject(s)

Emphysema , Lung Neoplasms , Diagnosis, Computer-Assisted/methods , Emphysema/pathology , Fibrosis , Humans , Lung/diagnostic imaging , Lung/pathology , Lung Neoplasms/diagnostic imaging , Lung Neoplasms/pathology , Radiographic Image Interpretation, Computer-Assisted , Tomography, X-Ray Computed/methods

15.

Towards Machine Learning-Aided Lung Cancer Clinical Routines: Approaches and Open Challenges.

Silva, Francisco; Pereira, Tania; Neves, Inês; Morgado, Joana; Freitas, Cláudia; Malafaia, Mafalda; Sousa, Joana; Fonseca, João; Negrão, Eduardo; Flor de Lima, Beatriz; Correia da Silva, Miguel; Madureira, António J; Ramos, Isabel; Costa, José Luis; Hespanhol, Venceslau; Cunha, António; Oliveira, Hélder P.

J Pers Med ; 12(3)2022 Mar 16.

Article in English | MEDLINE | ID: mdl-35330479

ABSTRACT

Advancements in the development of computer-aided decision (CAD) systems for clinical routines provide unquestionable benefits in connecting human medical expertise with machine intelligence, to achieve better quality healthcare. Considering the large number of incidences and mortality numbers associated with lung cancer, there is a need for the most accurate clinical procedures; thus, the possibility of using artificial intelligence (AI) tools for decision support is becoming a closer reality. At any stage of the lung cancer clinical pathway, specific obstacles are identified and "motivate" the application of innovative AI solutions. This work provides a comprehensive review of the most recent research dedicated toward the development of CAD tools using computed tomography images for lung cancer-related tasks. We discuss the major challenges and provide critical perspectives on future directions. Although we focus on lung cancer in this review, we also provide a more clear definition of the path used to integrate AI in healthcare, emphasizing fundamental research points that are crucial for overcoming current barriers.

16.

3D Breast Volume Estimation.

Gouveia, Pedro F; Oliveira, Hélder P; Monteiro, João P; Teixeira, João F; Silva, Nuno L; Pinto, David; Mavioso, Carlos; Anacleto, João; Martinho, Marta; Duarte, Inês; Cardoso, Jaime S; Cardoso, Fatima; Cardoso, Maria João.

Eur Surg Res ; 63(1): 3-8, 2022.

Article in English | MEDLINE | ID: mdl-34038908

ABSTRACT

INTRODUCTION: Breast volume estimation is considered crucial for breast cancer surgery planning. A single, easy, and reproducible method to estimate breast volume is not available. This study aims to evaluate, in patients proposed for mastectomy, the accuracy of the calculation of breast volume from a low-cost 3D surface scan (Microsoft Kinect) compared to the breast MRI and water displacement technique. MATERIAL AND METHODS: Patients with a Tis/T1-T3 breast cancer proposed for mastectomy between July 2015 and March 2017 were assessed for inclusion in the study. Breast volume calculations were performed using a 3D surface scan and the breast MRI and water displacement technique. Agreement between volumes obtained with both methods was assessed with the Spearman and Pearson correlation coefficients. RESULTS: Eighteen patients with invasive breast cancer were included in the study and submitted to mastectomy. The level of agreement of the 3D breast volume compared to surgical specimens and breast MRI volumes was evaluated. For mastectomy specimen volume, an average (standard deviation) of 0.823 (0.027) and 0.875 (0.026) was obtained for the Pearson and Spearman correlations, respectively. With respect to MRI annotation, we obtained 0.828 (0.038) and 0.715 (0.018). DISCUSSION: Although values obtained by both methodologies still differ, the strong linear correlation coefficient suggests that 3D breast volume measurement using a low-cost surface scan device is feasible and can approximate both the MRI breast volume and mastectomy specimen with sufficient accuracy. CONCLUSION: 3D breast volume measurement using a depth-sensor low-cost surface scan device is feasible and can parallel MRI breast and mastectomy specimen volumes with enough accuracy. Differences between methods need further development to reach clinical applicability. A possible approach could be the fusion of breast MRI and the 3D surface scan to harmonize anatomic limits and improve volume delimitation.

Subject(s)

Breast Neoplasms , Breast/diagnostic imaging , Breast/surgery , Breast Neoplasms/diagnostic imaging , Breast Neoplasms/surgery , Female , Humans , Imaging, Three-Dimensional/methods , Magnetic Resonance Imaging/methods , Mastectomy/methods

17.

An Interpretable Approach for Lung Cancer Prediction and Subtype Classification using Gene Expression.

Ramos, Bernardo; Pereira, Tania; Moranguinho, Joao; Morgado, Joana; Costa, Jose Luis; Oliveira, Helder P.

Annu Int Conf IEEE Eng Med Biol Soc ; 2021: 1707-1710, 2021 11.

Article in English | MEDLINE | ID: mdl-34891615

ABSTRACT

Lung cancer is the deadliest form of cancer, accounting for 20% of total cancer deaths. It represents a group of histologically and molecularly heterogeneous diseases even within the same histological subtype. Moreover, accurate histological subtype diagnosis influences the specific subtype's target genes, which will help define the treatment plan to target those genes in therapy. Deep learning (DL) models seem to set the benchmarks for the tasks of cancer prediction and subtype classification when using gene expression data; however, these methods do not provide interpretability, which is great concern from the perspective of cancer biology since the identification of the cancer driver genes in an individual provides essential information for treatment and prognosis. In this work, we identify some limitations of previous work that showed efforts to build algorithms to extract feature weights from DL models, and we propose using tree-based learning algorithms that address these limitations. Preliminary results show that our methods outperform those of related research while providing model interpretability.Clinical Relevance: The machine learning methods used in this work are interpretable and provide biological insight. Two sets of genes were extracted: a set that differentiates normal tissue from cancerous tissue (cancer prediction), and a set of genes that distinguishes LUAD from LUSC samples (subtype classification).

Subject(s)

Lung Neoplasms , Algorithms , Gene Expression , Humans , Lung Neoplasms/diagnosis , Lung Neoplasms/genetics , Machine Learning

18.

Attention Based Deep Multiple Instance Learning Approach for Lung Cancer Prediction using Histopathological Images.

Moranguinho, Joao; Pereira, Tania; Ramos, Bernardo; Morgado, Joana; Costa, Jose Luis; Oliveira, Helder P.

Annu Int Conf IEEE Eng Med Biol Soc ; 2021: 2852-2855, 2021 11.

Article in English | MEDLINE | ID: mdl-34891842

ABSTRACT

Deep Neural Networks using histopathological images as an input currently embody one of the gold standards in automated lung cancer diagnostic solutions, with Deep Convolutional Neural Networks achieving the state of the art values for tissue type classification. One of the main reasons for such results is the increasing availability of voluminous amounts of data, acquired through the efforts employed by extensive projects like The Cancer Genome Atlas. Nonetheless, whole slide images remain weakly annotated, as most common pathologist annotations refer to the entirety of the image and not to individual regions of interest in the patient's tissue sample. Recent works have demonstrated Multiple Instance Learning as a successful approach in classification tasks entangled with this lack of annotation, by representing images as a bag of instances where a single label is available for the whole bag. Thus, we propose a bag/embedding-level lung tissue type classifier using Multiple Instance Learning, where the automated inspection of lung biopsy whole slide images determines the presence of cancer in a given patient. Furthermore, we use a post-model interpretability algorithm to validate our model's predictions and highlight the regions of interest for such predictions.

Subject(s)

Lung Neoplasms , Neural Networks, Computer , Algorithms , Humans

19.

The Impact of Interstitial Diseases Patterns on Lung CT Segmentation.

Silva, Francisco; Pereira, Tania; Morgado, Joana; Cunha, Antonio; Oliveira, Helder P.

Annu Int Conf IEEE Eng Med Biol Soc ; 2021: 2856-2859, 2021 11.

Article in English | MEDLINE | ID: mdl-34891843

ABSTRACT

Lung segmentation represents a fundamental step in the development of computer-aided decision systems for the investigation of interstitial lung diseases. In a holistic lung analysis, eliminating background areas from Computed Tomography (CT) images is essential to avoid the inclusion of noise information and spend unnecessary computational resources on non-relevant data. However, the major challenge in this segmentation task relies on the ability of the models to deal with imaging manifestations associated with severe disease. Based on U-net, a general biomedical image segmentation architecture, we proposed a light-weight and faster architecture. In this 2D approach, experiments were conducted with a combination of two publicly available databases to improve the heterogeneity of the training data. Results showed that, when compared to the original U-net, the proposed architecture maintained performance levels, achieving 0.894 ± 0.060, 4.493 ± 0.633 and 4.457 ± 0.628 for DSC, HD and HD-95 metrics, respectively, when using all patients from the ILD database for testing only, while allowing a more effficient computational usage. Quantitative and qualitative evaluations on the ability to cope with high-density lung patterns associated with severe disease were conducted, supporting the idea that more representative and diverse data is necessary to build robust and reliable segmentation tools.

Subject(s)

Lung Diseases, Interstitial , Tomography, X-Ray Computed , Databases, Factual , Humans , Lung/diagnostic imaging , Lung Diseases, Interstitial/diagnostic imaging , Thorax

20.

Ensemble Strategies for EGFR Mutation Status Prediction in Lung Cancer.

Malafaia, Mafalda; Pereira, Tania; Silva, Francisco; Morgado, Joana; Cunha, Antonio; Oliveira, Helder P.

Annu Int Conf IEEE Eng Med Biol Soc ; 2021: 3285-3288, 2021 11.

Article in English | MEDLINE | ID: mdl-34891942

ABSTRACT

Lung cancer treatments that are accurate and effective are urgently needed. The diagnosis of advanced-stage patients accounts for the majority of the cases, being essential to provide a specialized course of treatment. One emerging course of treatment relies on target therapy through the testing of biomarkers, such as the Epidermal Growth Factor Receptor (EGFR) gene. Such testing can be obtained from invasive methods, namely through biopsy, which may be avoided by applying machine learning techniques to the imaging phenotypes extracted from Computerized Tomography (CT). This study aims to explore the contribution of ensemble methods when applied to the prediction of EGFR mutation status. The obtained results translate in a direct correlation between the semantic predictive model and the outcome of the combined ensemble methods, showing that the utilized features do not have a positive contribution to the predictive developed models.

Subject(s)

ErbB Receptors , Lung Neoplasms , ErbB Receptors/genetics , Humans , Lung , Lung Neoplasms/genetics , Mutation , Tomography, X-Ray Computed

ABSTRACT

ABSTRACT

ABSTRACT

Subject(s)

ABSTRACT

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

SEND TO:

SELECTION OF CITATIONS

SEARCH DETAIL