Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 4 de 4
Filter
Add more filters










Database
Language
Publication year range
1.
Biometrics ; 78(4): 1414-1426, 2022 12.
Article in English | MEDLINE | ID: mdl-34407216

ABSTRACT

We introduce binacox, a prognostic method to deal with the problem of detecting multiple cut-points per feature in a multivariate setting where a large number of continuous features are available. The method is based on the Cox model and combines one-hot encoding with the binarsity penalty, which uses total-variation regularization together with an extra linear constraint, and enables feature selection. Original nonasymptotic oracle inequalities for prediction (in terms of Kullback-Leibler divergence) and estimation with a fast rate of convergence are established. The statistical performance of the method is examined in an extensive Monte Carlo simulation study, and then illustrated on three publicly available genetic cancer data sets. On these high-dimensional data sets, our proposed method outperforms state-of-the-art survival models regarding risk prediction in terms of the C-index, with a computing time orders of magnitude faster. In addition, it provides powerful interpretability from a clinical perspective by automatically pinpointing significant cut-points in relevant variables.


Subject(s)
Proportional Hazards Models , Computer Simulation , Monte Carlo Method , Prognosis
2.
J Clin Med ; 8(9)2019 Sep 19.
Article in English | MEDLINE | ID: mdl-31546961

ABSTRACT

Hospital admission of patients with sickle-cell disease (SCD) presenting with a vaso-occlusive crisis (VOC) can be justified by pain refractory to usual outpatient care and/or the occurrence of a complication. Yet, the trajectories of vital parameters and standard biomarkers throughout a non-complicated VOC has not been established. In this observational cohort study, we describe the course of routine parameters throughout 329 hospital stays for non-complicated VOC. We used a new spline-based approach to study and visualize non-specific time-dependent variables extracted from the hospital clinical data warehouse. We identified distinct trends during the VOC for hemoglobin level, leukocytes count, C-Reactive Protein (CRP) level and temperature. Hemoglobin decreased after admission and rarely returned to steady state levels before discharge. White blood cell counts were elevated at admission before immediately decreasing, whereas eosinophils increased slowly throughout the first five days of the stay. In over 95% of non-complicated VOC-related stays, the CRP value was below 100 mg/L within the first day following admission and above normal after 48 hours, and the temperature was below 38 °C throughout the entire stay. Knowing the typical trajectories of these routine parameters during non-complicated VOC may urge the clinicians to be more vigilant in case of deviation from these patterns.

3.
BMC Med Res Methodol ; 19(1): 50, 2019 03 06.
Article in English | MEDLINE | ID: mdl-30841867

ABSTRACT

BACKGROUND: Choosing the most performing method in terms of outcome prediction or variables selection is a recurring problem in prognosis studies, leading to many publications on methods comparison. But some aspects have received little attention. First, most comparison studies treat prediction performance and variable selection aspects separately. Second, methods are either compared within a binary outcome setting (where we want to predict whether the readmission will occur within an arbitrarily chosen delay or not) or within a survival analysis setting (where the outcomes are directly the censored times), but not both. In this paper, we propose a comparison methodology to weight up those different settings both in terms of prediction and variables selection, while incorporating advanced machine learning strategies. METHODS: Using a high-dimensional case study on a sickle-cell disease (SCD) cohort, we compare 8 statistical methods. In the binary outcome setting, we consider logistic regression (LR), support vector machine (SVM), random forest (RF), gradient boosting (GB) and neural network (NN); while on the survival analysis setting, we consider the Cox Proportional Hazards (PH), the CURE and the C-mix models. We also propose a method using Gaussian Processes to extract meaningfull structured covariates from longitudinal data. RESULTS: Among all assessed statistical methods, the survival analysis ones obtain the best results. In particular the C-mix model yields the better performances in both the two considered settings (AUC =0.94 in the binary outcome setting), as well as interesting interpretation aspects. There is some consistency in selected covariates across methods within a setting, but not much across the two settings. CONCLUSIONS: It appears that learning withing the survival analysis setting first (so using all the temporal information), and then going back to a binary prediction using the survival estimates gives significantly better prediction performances than the ones obtained by models trained "directly" within the binary outcome setting.


Subject(s)
Anemia, Sickle Cell/diagnosis , Anemia, Sickle Cell/therapy , Outcome Assessment, Health Care/statistics & numerical data , Patient Readmission/statistics & numerical data , Cohort Studies , Humans , Logistic Models , Machine Learning , Multivariate Analysis , Neural Networks, Computer , Outcome Assessment, Health Care/methods , Prognosis , Proportional Hazards Models , Reproducibility of Results , Support Vector Machine , Survival Analysis
4.
Stat Methods Med Res ; 28(5): 1523-1539, 2019 05.
Article in English | MEDLINE | ID: mdl-29658407

ABSTRACT

We introduce a supervised learning mixture model for censored durations (C-mix) to simultaneously detect subgroups of patients with different prognosis and order them based on their risk. Our method is applicable in a high-dimensional setting, i.e. with a large number of biomedical covariates. Indeed, we penalize the negative log-likelihood by the Elastic-Net, which leads to a sparse parameterization of the model and automatically pinpoints the relevant covariates for the survival prediction. Inference is achieved using an efficient Quasi-Newton Expectation Maximization algorithm, for which we provide convergence properties. The statistical performance of the method is examined on an extensive Monte Carlo simulation study and finally illustrated on three publicly available genetic cancer datasets with high-dimensional covariates. We show that our approach outperforms the state-of-the-art survival models in this context, namely both the CURE and Cox proportional hazards models penalized by the Elastic-Net, in terms of C-index, AUC( t) and survival prediction. Thus, we propose a powerful tool for personalized medicine in cancerology.


Subject(s)
Models, Statistical , Neoplasms/genetics , Precision Medicine , Algorithms , Humans , Monte Carlo Method , Neoplasms/mortality , Prognosis , Proportional Hazards Models
SELECTION OF CITATIONS
SEARCH DETAIL
...