Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 18 de 18
Filtrar
1.
Metabolomics ; 20(2): 35, 2024 Mar 05.
Artigo em Inglês | MEDLINE | ID: mdl-38441696

RESUMO

INTRODUCTION: Longitudinal biomarkers in patients with community-acquired pneumonia (CAP) may help in monitoring of disease progression and treatment response. The metabolic host response could be a potential source of such biomarkers since it closely associates with the current health status of the patient. OBJECTIVES: In this study we performed longitudinal metabolite profiling in patients with CAP for a comprehensive range of metabolites to identify potential host response biomarkers. METHODS: Previously collected serum samples from CAP patients with confirmed Streptococcus pneumoniae infection (n = 25) were used. Samples were collected at multiple time points, up to 30 days after admission. A wide range of metabolites was measured, including amines, acylcarnitines, organic acids, and lipids. The associations between metabolites and C-reactive protein (CRP), procalcitonin, CURB disease severity score at admission, and total length of stay were evaluated. RESULTS: Distinct longitudinal profiles of metabolite profiles were identified, including cholesteryl esters, diacyl-phosphatidylethanolamine, diacylglycerols, lysophosphatidylcholines, sphingomyelin, and triglycerides. Positive correlations were found between CRP and phosphatidylcholine (34:1) (cor = 0.63) and negative correlations were found for CRP and nine lysophosphocholines (cor = - 0.57 to - 0.74). The CURB disease severity score was negatively associated with six metabolites, including acylcarnitines (tau = - 0.64 to - 0.58). Negative correlations were found between the length of stay and six triglycerides (TGs), especially TGs (60:3) and (58:2) (cor = - 0.63 and - 0.61). CONCLUSION: The identified metabolites may provide insight into biological mechanisms underlying disease severity and may be of interest for exploration as potential treatment response monitoring biomarker.


Assuntos
Pneumonia , Streptococcus pneumoniae , Humanos , Metabolômica , Proteína C-Reativa , Biomarcadores , Triglicerídeos
2.
Clin Pharmacol Ther ; 115(4): 795-804, 2024 04.
Artigo em Inglês | MEDLINE | ID: mdl-37946529

RESUMO

Virtual patient simulation is increasingly performed to support model-based optimization of clinical trial designs or individualized dosing strategies. Quantitative pharmacological models typically incorporate individual-level patient characteristics, or covariates, which enable the generation of virtual patient cohorts. The individual-level patient characteristics, or covariates, used as input for such simulations should accurately reflect the values seen in real patient populations. Current methods often make unrealistic assumptions about the correlation between patient's covariates or require direct access to actual data sets with individual-level patient data, which may often be limited by data sharing limitations. We propose and evaluate the use of copulas to address current shortcomings in simulation of patient-associated covariates for virtual patient simulations for model-based dose and trial optimization in clinical pharmacology. Copulas are multivariate distribution functions that can capture joint distributions, including the correlation, of covariate sets. We compare the performance of copulas to alternative simulation strategies, and we demonstrate their utility in several case studies. Our work demonstrates that copulas can reproduce realistic patient characteristics, both in terms of individual covariates and the dependence structure between different covariates, outperforming alternative methods, in particular when aiming to reproduce high-dimensional covariate sets. In conclusion, copulas represent a versatile and generalizable approach for virtual patient simulation which preserve relationships between covariates, and offer an open science strategy to facilitate re-use of patient data sets.


Assuntos
Modelos Estatísticos , Simulação de Paciente , Humanos , Simulação por Computador
3.
JAC Antimicrob Resist ; 3(4): dlab175, 2021 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-34859221

RESUMO

BACKGROUND: Collateral effects of antibiotic resistance occur when resistance to one antibiotic agent leads to increased resistance or increased sensitivity to a second agent, known respectively as collateral resistance (CR) and collateral sensitivity (CS). Collateral effects are relevant to limit impact of antibiotic resistance in design of antibiotic treatments. However, methods to detect antibiotic collateral effects in clinical population surveillance data of antibiotic resistance are lacking. OBJECTIVES: To develop a methodology to quantify collateral effect directionality and effect size from large-scale antimicrobial resistance population surveillance data. METHODS: We propose a methodology to quantify and test collateral effects in clinical surveillance data based on a conditional t-test. Our methodology was evaluated using MIC data for 419 Escherichia coli strains, containing MIC data for 20 antibiotics, which were obtained from the Pathosystems Resource Integration Center (PATRIC) database. RESULTS: We demonstrate that the proposed approach identifies several antibiotic combinations that show symmetrical or non-symmetrical CR and CS. For several of these combinations, collateral effects were previously confirmed in experimental studies. We furthermore provide insight into the power of our method for multiple collateral effect sizes and MIC distributions. CONCLUSIONS: Our proposed approach is of relevance as a tool for analysis of large-scale population surveillance studies to provide broad systematic identification of collateral effects related to antibiotic resistance, and is made available to the community as an R package. This method can help mapping CS and CR, which could guide combination therapy and prescribing in the future.

4.
CPT Pharmacometrics Syst Pharmacol ; 10(4): 350-361, 2021 04.
Artigo em Inglês | MEDLINE | ID: mdl-33792207

RESUMO

Pharmacometric modeling can capture tumor growth inhibition (TGI) dynamics and variability. These approaches do not usually consider covariates in high-dimensional settings, whereas high-dimensional molecular profiling technologies ("omics") are being increasingly considered for prediction of anticancer drug treatment response. Machine learning (ML) approaches have been applied to identify high-dimensional omics predictors for treatment outcome. Here, we aimed to combine TGI modeling and ML approaches for two distinct aims: omics-based prediction of tumor growth profiles and identification of pathways associated with treatment response and resistance. We propose a two-step approach combining ML using least absolute shrinkage and selection operator (LASSO) regression with pharmacometric modeling. We demonstrate our workflow using a previously published dataset consisting of 4706 tumor growth profiles of patient-derived xenograft (PDX) models treated with a variety of mono- and combination regimens. Pharmacometric TGI models were fit to the tumor growth profiles. The obtained empirical Bayes estimates-derived TGI parameter values were regressed using the LASSO on high-dimensional genomic copy number variation data, which contained over 20,000 variables. The predictive model was able to decrease median prediction error by 4% as compared with a model without any genomic information. A total of 74 pathways were identified as related to treatment response or resistance development by LASSO, of which part was verified by literature. In conclusion, we demonstrate how the combined use of ML and pharmacometric modeling can be used to gain pharmacological understanding in genomic factors driving variation in treatment response.


Assuntos
Antineoplásicos/metabolismo , Neoplasias/tratamento farmacológico , Farmacogenética/instrumentação , Carga Tumoral/efeitos dos fármacos , Animais , Antineoplásicos/farmacologia , Teorema de Bayes , Variação Biológica da População/genética , Variações do Número de Cópias de DNA/genética , Desenvolvimento de Medicamentos/métodos , Descoberta de Drogas/métodos , Genômica , Humanos , Aprendizado de Máquina , Camundongos , Modelos Animais , Neoplasias/patologia , Valor Preditivo dos Testes , Resultado do Tratamento , Carga Tumoral/genética
5.
Behav Res Methods ; 52(6): 2657-2673, 2020 12.
Artigo em Inglês | MEDLINE | ID: mdl-32542441

RESUMO

In meta-analysis, heterogeneity often exists between studies. Knowledge about study features (i.e., moderators) that can explain the heterogeneity in effect sizes can be useful for researchers to assess the effectiveness of existing interventions and design new potentially effective interventions. When there are multiple moderators, they may amplify or attenuate each other's effect on treatment effectiveness. However, in most meta-analysis studies, interaction effects are neglected due to the lack of appropriate methods. The method meta-CART was recently proposed to identify interactions between multiple moderators. The analysis result is a tree model in which the studies are partitioned into more homogeneous subgroups by combinations of moderators. This paper describes the R-package metacart, which provides user-friendly functions to conduct meta-CART analyses in R. This package can fit both fixed- and random-effects meta-CART, and can handle dichotomous, categorical, ordinal and continuous moderators. In addition, a new look ahead procedure is presented. The application of the package is illustrated step-by-step using diverse examples.


Assuntos
Projetos de Pesquisa , Humanos , Resultado do Tratamento
6.
Clin Pharmacol Ther ; 107(2): 397-405, 2020 02.
Artigo em Inglês | MEDLINE | ID: mdl-31400148

RESUMO

A limited understanding of intersubject and intrasubject variability hampers effective biomarker translation from in vitro/in vivo studies to clinical trials and clinical decision support. Specifically, variability of biomolecule concentration can play an important role in interpretation, power analysis, and sampling time designation. In the present study, a wide range of 749 plasma metabolites, 62 urine biogenic amines, and 1,263 plasma proteins were analyzed in 10 healthy male volunteers measured repeatedly during 12 hours under tightly controlled conditions. Three variability components in relative concentration data are determined using linear mixed models: between (intersubject), time (intrasubject), and noise (intrasubject). Biomolecules such as 3-carboxy-4-methyl-5-propyl-2-furanpropanoate, platelet-derived growth factor C, and cathepsin D with low noise potentially detect changing conditions within a person. If also the between component is low, biomolecules can easier differentiate conditions between persons, for example cathepsin D, CD27 antigen, and prolylglycine. Variability over time does not necessarily inhibit translatability, but requires choosing sampling times carefully.


Assuntos
Proteínas Sanguíneas/análise , Ensaios Clínicos como Assunto/métodos , Ensaios Clínicos como Assunto/normas , Proteinúria/metabolismo , Adulto , Biomarcadores , Alimentos , Voluntários Saudáveis , Humanos , Masculino , Modelos Estatísticos , Fatores de Tempo , Adulto Jovem
7.
Res Synth Methods ; 10(1): 134-152, 2019 Mar.
Artigo em Inglês | MEDLINE | ID: mdl-30511514

RESUMO

In meta-analytic studies, there are often multiple moderators available (eg, study characteristics). In such cases, traditional meta-analysis methods often lack sufficient power to investigate interaction effects between moderators, especially high-order interactions. To overcome this problem, meta-CART was proposed: an approach that applies classification and regression trees (CART) to identify interactions, and then subgroup meta-analysis to test the significance of moderator effects. The aim of this study is to improve meta-CART upon two aspects: 1) to integrate the two steps of the approach into one and 2) to consistently take into account the fixed-effect or random-effects assumption in both the the interaction identification and testing process. For fixed effect meta-CART, weights are applied, and subgroup analysis is adapted. For random effects meta-CART, a new algorithm has been developed. The performance of the improved meta-CART was investigated via an extensive simulation study on different types of moderator variables (ie, dichotomous, nominal, ordinal, and continuous variables). The simulation results revealed that the new method can achieve satisfactory performance (power greater than 0.80 and Type I error less than 0.05) if appropriate pruning rule is applied and the number of studies is large enough. The required minimum number of studies ranges from 40 to 120 depending on the complexity and strength of the interaction effects, the within-study sample size, the type of moderators, and the residual heterogeneity.


Assuntos
Modificador do Efeito Epidemiológico , Metanálise como Assunto , Projetos de Pesquisa , Algoritmos , Simulação por Computador , Interpretação Estatística de Dados , Humanos , Modelos Estatísticos , Método de Monte Carlo , Análise de Regressão , Tamanho da Amostra
8.
Br J Math Stat Psychol ; 70(1): 118-136, 2017 Feb.
Artigo em Inglês | MEDLINE | ID: mdl-28130936

RESUMO

In the framework of meta-analysis, moderator analysis is usually performed only univariately. When several study characteristics are available that may account for treatment effect, standard meta-regression has difficulties in identifying interactions between them. To overcome this problem, meta-CART has been proposed: an approach that applies classification and regression trees (CART) to identify interactions, and then subgroup meta-analysis to test the significance of moderator effects. The previous version of meta-CART has its shortcomings: when applying CART, the sample sizes of studies are not taken into account, and the effect sizes are dichotomized around the median value. Therefore, this article proposes new meta-CART extensions, weighting study effect sizes by their accuracy, and using a regression tree to avoid dichotomization. In addition, new pruning rules are proposed. The performance of all versions of meta-CART was evaluated via a Monte Carlo simulation study. The simulation results revealed that meta-regression trees with random-effects weights and a 0.5-standard-error pruning rule perform best. The required sample size for meta-CART to achieve satisfactory performance depends on the number of study characteristics, the magnitude of the interactions, and the residual heterogeneity.


Assuntos
Algoritmos , Metanálise como Assunto , Modelos Estatísticos , Avaliação de Resultados em Cuidados de Saúde/métodos , Análise de Regressão , Software , Simulação por Computador , Interpretação Estatística de Dados , Reprodutibilidade dos Testes , Sensibilidade e Especificidade
9.
Eur J Hum Genet ; 21(1): 95-101, 2013 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-22713803

RESUMO

Twin and family studies are typically used to elucidate the relative contribution of genetic and environmental variation to phenotypic variation. Here, we apply a quantitative genetic method based on hierarchical clustering, to blood plasma lipidomics data obtained in a healthy cohort consisting of 37 monozygotic and 28 dizygotic twin pairs, and 52 of their biological nontwin siblings. Such data are informative of the concentrations of a wide range of lipids in the studied blood samples. An important advantage of hierarchical clustering is that it can be applied to a high-dimensional 'omics' type data, whereas the use of many other quantitative genetic methods for analysis of such data is hampered by the large number of correlated variables. For this study we combined two lipidomics data sets, originating from two different measurement blocks, which we corrected for block effects by 'quantile equating'. In the analysis of the combined data, average similarities of lipidomics profiles were highest between monozygotic (MZ) cotwins, and became progressively lower between dizygotic (DZ) cotwins, among sex-matched nontwin siblings and among sex-matched unrelated participants, respectively. Our results suggest that (1) shared genetic background, shared environment, and similar age contribute to similarities in blood plasma lipidomics profiles among individuals; and (2) that the power of quantitative genetic analyses is enhanced by quantile equating and combination of data sets obtained in different measurement blocks.


Assuntos
Análise por Conglomerados , Lipídeos/sangue , Lipídeos/genética , Gêmeos Dizigóticos/genética , Gêmeos Monozigóticos/genética , Adolescente , Proteína C-Reativa/genética , Proteína C-Reativa/metabolismo , Feminino , Interação Gene-Ambiente , Humanos , Masculino , Modelos Genéticos , Países Baixos , Linhagem
10.
PLoS One ; 7(10): e46267, 2012.
Artigo em Inglês | MEDLINE | ID: mdl-23077503

RESUMO

The common approach to SNP genotyping is to use (model-based) clustering per individual SNP, on a set of arrays. Genotyping all SNPs on a single array is much more attractive, in terms of flexibility, stability and applicability, when developing new chips. A new semi-parametric method, named SCALA, is proposed. It is based on a mixture model using semi-parametric log-concave densities. Instead of using the raw data, the mixture is fitted on a two-dimensional histogram, thereby making computation time almost independent of the number of SNPs. Furthermore, the algorithm is effective in low-MAF situations.Comparisons between SCALA and CRLMM on HapMap genotypes show very reliable calling of single arrays. Some heterozygous genotypes from HapMap are called homozygous by SCALA and to lesser extent by CRLMM too. Furthermore, HapMap's NoCalls (NN) could be genotyped by SCALA, mostly with high probability. The software is available as R scripts from the website www.math.leidenuniv.nl/~rrippe.


Assuntos
Polimorfismo de Nucleotídeo Único , Algoritmos , Genótipo , Humanos
11.
PLoS One ; 7(9): e44331, 2012.
Artigo em Inglês | MEDLINE | ID: mdl-22984493

RESUMO

OBJECTIVE: The aim is to characterize subgroups or phenotypes of rheumatoid arthritis (RA) patients using a systems biology approach. The discovery of subtypes of rheumatoid arthritis patients is an essential research area for the improvement of response to therapy and the development of personalized medicine strategies. METHODS: In this study, 39 RA patients are phenotyped using clinical chemistry measurements, urine and plasma metabolomics analysis and symptom profiles. In addition, a Chinese medicine expert classified each RA patient as a Cold or Heat type according to Chinese medicine theory. Multivariate data analysis techniques are employed to detect and validate biochemical and symptom relationships with the classification. RESULTS: The questionnaire items 'Red joints', 'Swollen joints', 'Warm joints' suggest differences in the level of inflammation between the groups although c-reactive protein (CRP) and rheumatoid factor (RHF) levels were equal. Multivariate analysis of the urine metabolomics data revealed that the levels of 11 acylcarnitines were lower in the Cold RA than in the Heat RA patients, suggesting differences in muscle breakdown. Additionally, higher dehydroepiandrosterone sulfate (DHEAS) levels in Heat patients compared to Cold patients were found suggesting that the Cold RA group has a more suppressed hypothalamic-pituitary-adrenal (HPA) axis function. CONCLUSION: Significant and relevant biochemical differences are found between Cold and Heat RA patients. Differences in immune function, HPA axis involvement and muscle breakdown point towards opportunities to tailor disease management strategies to each of the subgroups RA patient.


Assuntos
Artrite Reumatoide/diagnóstico , Artrite Reumatoide/metabolismo , Metabolômica/métodos , Adulto , Idoso , Artrite Reumatoide/classificação , Proteína C-Reativa/biossíntese , Química Clínica/métodos , Temperatura Baixa , Feminino , Temperatura Alta , Humanos , Sistema Hipotálamo-Hipofisário/fisiopatologia , Medicina Tradicional Chinesa , Pessoa de Meia-Idade , Análise Multivariada , Fenótipo , Sistema Hipófise-Suprarrenal/fisiopatologia , Medicina de Precisão/métodos , Fator Reumatoide/sangue , Reumatologia/métodos , Inquéritos e Questionários
12.
PLoS One ; 7(6): e38230, 2012.
Artigo em Inglês | MEDLINE | ID: mdl-22679492

RESUMO

Copy number variations (CNV) and allelic imbalance in tumor tissue can show strong segmentation. Their graphical presentation can be enhanced by appropriate smoothing. Existing signal and scatterplot smoothers do not respect segmentation well. We present novel algorithms that use a penalty on the L(0) norm of differences of neighboring values. Visualization is our main goal, but we compare classification performance to that of VEGA.


Assuntos
Variações do Número de Cópias de DNA/genética , Algoritmos , Biologia Computacional/métodos , Genoma Humano/genética , Humanos
13.
PLoS One ; 6(9): e24846, 2011.
Artigo em Inglês | MEDLINE | ID: mdl-21949766

RESUMO

BACKGROUND: The future of personalized medicine depends on advanced diagnostic tools to characterize responders and non-responders to treatment. Systems diagnosis is a new approach which aims to capture a large amount of symptom information from patients to characterize relevant sub-groups. METHODOLOGY: 49 patients with a rheumatic disease were characterized using a systems diagnosis questionnaire containing 106 questions based on Chinese and Western medicine symptoms. Categorical principal component analysis (CATPCA) was used to discover differences in symptom patterns between the patients. Two Chinese medicine experts where subsequently asked to rank the Cold and Heat status of all the patients based on the questionnaires. These rankings were used to study the Cold and Heat symptoms used by these practitioners. FINDINGS: The CATPCA analysis results in three dimensions. The first dimension is a general factor (40.2% explained variance). In the second dimension (12.5% explained variance) 'anxious', 'worrying', 'uneasy feeling' and 'distressed' were interpreted as the Internal disease stage, and 'aggravate in wind', 'fear of wind' and 'aversion to cold' as the External disease stage. In the third dimension (10.4% explained variance) 'panting s', 'superficial breathing', 'shortness of breath s', 'shortness of breath f' and 'aversion to cold' were interpreted as Cold and 'restless', 'nervous', 'warm feeling', 'dry mouth s' and 'thirst' as Heat related. 'Aversion to cold', 'fear of wind' and 'pain aggravates with cold' are most related to the experts Cold rankings and 'aversion to heat', 'fullness of chest' and 'dry mouth' to the Heat rankings. CONCLUSIONS: This study shows that the presented systems diagnosis questionnaire is able to identify groups of symptoms that are relevant for sub-typing patients with a rheumatic disease.


Assuntos
Doenças Reumáticas/classificação , Doenças Reumáticas/diagnóstico , Inquéritos e Questionários , Temperatura Baixa , Temperatura Alta , Humanos , Modelos Biológicos
14.
Anal Chem ; 82(3): 1039-46, 2010 Feb 01.
Artigo em Inglês | MEDLINE | ID: mdl-20052990

RESUMO

Combination of data sets from different objects (for example, from two groups of healthy volunteers from the same population) that were measured on a common set of variables (for example, metabolites or peptides) is desirable for statistical analysis in "omics" studies because it increases power. However, this type of combination is not directly possible if nonbiological systematic differences exist among the individual data sets, or "blocks". Such differences can, for example, be due to small analytical changes that are likely to accumulate over large time intervals between blocks of measurements. In this article we present a data transformation method, that we will refer to as "quantile equating", which per variable corrects for linear and nonlinear differences in distribution among blocks of semiquantitative data obtained with the same analytical method. We demonstrate the successful application of the quantile equating method to data obtained on two typical metabolomics platforms, i.e., liquid chromatography-mass spectrometry and nuclear magnetic resonance spectroscopy. We suggest uni- and multivariate methods to evaluate similarities and differences among data blocks before and after quantile equating. In conclusion, we have developed a method to correct for nonbiological systematic differences among semiquantitative data blocks and have demonstrated its successful application to metabolomics data sets.


Assuntos
Cromatografia Líquida de Alta Pressão/métodos , Lipídeos/sangue , Espectroscopia de Ressonância Magnética/métodos , Espectrometria de Massas/métodos , Metabolômica/métodos , Adolescente , Algoritmos , Estudos de Coortes , Feminino , Humanos , Lipídeos/química , Masculino , Análise de Componente Principal , Irmãos , Gêmeos , Adulto Jovem
15.
OMICS ; 12(1): 17-31, 2008 Mar.
Artigo em Inglês | MEDLINE | ID: mdl-18266560

RESUMO

Differences in genetic background and/or environmental exposure among individuals are expected to give rise to differences in measurable characteristics, or phenotypes. Consequently, genetic resemblance and similarities in environment should manifest as similarities in phenotypes. The metabolome reflects many of the system properties, and is therefore an important part of the phenotype. Nevertheless, it has not yet been examined to what extent individuals sharing part of their genome and/or environment indeed have similar metabolomes. Here we present the results of hierarchical clustering of blood plasma lipid profile data obtained by liquid chromatography-mass spectrometry from 23 healthy, 18-year-old twin pairs, of which 21 pairs were monozygotic, and 8 of their siblings. For 13 monozygotic twin pairs, within-pair similarities in relative concentrations of the detected lipids were indeed larger than the similarities with any other study participant. We demonstrate such high coclustering to be unexpected on basis of chance. The similarities between dizygotic twins and between nontwin siblings, as well as between nonfamilial participants, were less pronounced. In a number of twin pairs, within-pair dissimilarity of lipid profiles positively correlated with increased blood plasma concentrations of C-reactive protein in one twin. In conclusion, this study demonstrates that in healthy individuals, the individual genetic background contributes to the blood plasma lipid profile. Furthermore, lipid profiling may prove useful in monitoring health status, for example, in the context of personalized medicine.


Assuntos
Lipídeos/sangue , Gêmeos Monozigóticos/sangue , Gêmeos Monozigóticos/genética , Adolescente , Proteína C-Reativa/metabolismo , Cromatografia Líquida , Feminino , Humanos , Masculino , Espectrometria de Massas por Ionização por Electrospray
16.
Psychol Methods ; 12(3): 336-58, 2007 Sep.
Artigo em Inglês | MEDLINE | ID: mdl-17784798

RESUMO

The authors provide a didactic treatment of nonlinear (categorical) principal components analysis (PCA). This method is the nonlinear equivalent of standard PCA and reduces the observed variables to a number of uncorrelated principal components. The most important advantages of nonlinear over linear PCA are that it incorporates nominal and ordinal variables and that it can handle and discover nonlinear relationships between variables. Also, nonlinear PCA can deal with variables at their appropriate measurement level; for example, it can treat Likert-type scales ordinally instead of numerically. Every observed value of a variable can be referred to as a category. While performing PCA, nonlinear PCA converts every category to a numeric value, in accordance with the variable's analysis level, using optimal quantification. The authors discuss how optimal quantification is carried out, what analysis levels are, which decisions have to be made when applying nonlinear PCA, and how the results can be interpreted. The strengths and limitations of the method are discussed. An example applying nonlinear PCA to empirical data using the program CATPCA (J. J. Meulman, W. J. Heiser, & SPSS, 2004) is provided.


Assuntos
Modelos Psicológicos , Criança , Humanos
17.
Psychol Methods ; 12(3): 359-79, 2007 Sep.
Artigo em Inglês | MEDLINE | ID: mdl-17784799

RESUMO

Principal components analysis (PCA) is used to explore the structure of data sets containing linearly related numeric variables. Alternatively, nonlinear PCA can handle possibly nonlinearly related numeric as well as nonnumeric variables. For linear PCA, the stability of its solution can be established under the assumption of multivariate normality. For nonlinear PCA, however, standard options for establishing stability are not provided. The authors use the nonparametric bootstrap procedure to assess the stability of nonlinear PCA results, applied to empirical data. They use confidence intervals for the variable transformations and confidence ellipses for the eigenvalues, the component loadings, and the person scores. They discuss the balanced version of the bootstrap, bias estimation, and Procrustes rotation. To provide a benchmark, the same bootstrap procedure is applied to linear PCA on the same data. On the basis of the results, the authors advise using at least 1,000 bootstrap samples, using Procrustes rotation on the bootstrap results, examining the bootstrap distributions along with the confidence regions, and merging categories with small marginal frequencies to reduce the variance of the bootstrap results.


Assuntos
Pesquisa Empírica , Modelos Psicológicos , Meio Social , Humanos
18.
Stat Med ; 22(9): 1365-81, 2003 May 15.
Artigo em Inglês | MEDLINE | ID: mdl-12704603

RESUMO

Predicting future outcomes based on knowledge obtained from past observational data is a common application in a wide variety of areas of scientific research. In the present paper, prediction will be focused on various grades of cervical preneoplasia and neoplasia. Statistical tools used for prediction should of course possess predictive accuracy, and preferably meet secondary requirements such as speed, ease of use, and interpretability of the resulting predictive model. A new automated procedure based on an extension (called 'boosting') of regression and classification tree (CART) models is described. The resulting tool is a fast 'off-the-shelf' procedure for classification and regression that is competitive in accuracy with more customized approaches, while being fairly automatic to use (little tuning), and highly robust especially when applied to less than clean data. Additional tools are presented for interpreting and visualizing the results of such multiple additive regression tree (MART) models.


Assuntos
Métodos Epidemiológicos , Modelos Estatísticos , Análise de Regressão , Carcinoma de Células Escamosas/epidemiologia , Feminino , Humanos , Valor Preditivo dos Testes , Neoplasias do Colo do Útero/epidemiologia
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...