Search | VHL Regional Portal

1.

Large trans-ethnic meta-analysis identifies AKR1C4 as a novel gene associated with age at menarche.

Sarnowski, C; Cousminer, D L; Franceschini, N; Raffield, L M; Jia, G; Fernández-Rhodes, L; Grant, S F A; Hakonarson, H; Lange, L A; Long, J; Sofer, T; Tao, R; Wallace, R B; Wong, Q; Zirpoli, G; Boerwinkle, E; Bradfield, J P; Correa, A; Kooperberg, C L; North, K E; Palmer, J R; Zemel, B S; Zheng, W; Murabito, J M; Lunetta, K L.

Hum Reprod ; 36(7): 1999-2010, 2021 06 18.

Article in English | MEDLINE | ID: mdl-34021356

ABSTRACT

STUDY QUESTION: Does the expansion of genome-wide association studies (GWAS) to a broader range of ancestries improve the ability to identify and generalise variants associated with age at menarche (AAM) in European populations to a wider range of world populations? SUMMARY ANSWER: By including women with diverse and predominantly non-European ancestry in a large-scale meta-analysis of AAM with half of the women being of African ancestry, we identified a new locus associated with AAM in African-ancestry participants, and generalised loci from GWAS of European ancestry individuals. WHAT IS KNOWN ALREADY: AAM is a highly polygenic puberty trait associated with various diseases later in life. Both AAM and diseases associated with puberty timing vary by race or ethnicity. The majority of GWAS of AAM have been performed in European ancestry women. STUDY DESIGN, SIZE, DURATION: We analysed a total of 38 546 women who did not have predominantly European ancestry backgrounds: 25 149 women from seven studies from the ReproGen Consortium and 13 397 women from the UK Biobank. In addition, we used an independent sample of 5148 African-ancestry women from the Southern Community Cohort Study (SCCS) for replication. PARTICIPANTS/MATERIALS, SETTING, METHODS: Each AAM GWAS was performed by study and ancestry or ethnic group using linear regression models adjusted for birth year and study-specific covariates. ReproGen and UK Biobank results were meta-analysed using an inverse variance-weighted average method. A trans-ethnic meta-analysis was also carried out to assess heterogeneity due to different ancestry. MAIN RESULTS AND THE ROLE OF CHANCE: We observed consistent direction and effect sizes between our meta-analysis and the largest GWAS conducted in European or Asian ancestry women. We validated four AAM loci (1p31, 6q16, 6q22 and 9q31) with common genetic variants at P < 5 × 10-7. We detected one new association (10p15) at P < 5 × 10-8 with a low-frequency genetic variant lying in AKR1C4, which was replicated in an independent sample. This gene belongs to a family of enzymes that regulate the metabolism of steroid hormones and have been implicated in the pathophysiology of uterine diseases. The genetic variant in the new locus is more frequent in African-ancestry participants, and has a very low frequency in Asian or European-ancestry individuals. LARGE SCALE DATA: N/A. LIMITATIONS, REASONS FOR CAUTION: Extreme AAM (<9 years or >18 years) were excluded from analysis. Women may not fully recall their AAM as most of the studies were conducted many years later. Further studies in women with diverse and predominantly non-European ancestry are needed to confirm and extend these findings, but the availability of such replication samples is limited. WIDER IMPLICATIONS OF THE FINDINGS: Expanding association studies to a broader range of ancestries or ethnicities may improve the identification of new genetic variants associated with complex diseases or traits and the generalisation of variants from European-ancestry studies to a wider range of world populations. STUDY FUNDING/COMPETING INTEREST(S): Funding was provided by CHARGE Consortium grant R01HL105756-07: Gene Discovery For CVD and Aging Phenotypes and by the NIH grant U24AG051129 awarded by the National Institute on Aging (NIA). The authors have no conflict of interest to declare.

Subject(s)

Genome-Wide Association Study , Polymorphism, Single Nucleotide , Adolescent , Cohort Studies , Ethnicity , Female , Humans , Menarche/genetics

2.

Prospective associations of C-reactive protein (CRP) levels and CRP genetic risk scores with risk of total knee and hip replacement for osteoarthritis in a diverse cohort.

Shadyab, A H; Terkeltaub, R; Kooperberg, C; Reiner, A; Eaton, C B; Jackson, R D; Krok-Schoen, J L; Salem, R M; LaCroix, A Z.

Osteoarthritis Cartilage ; 26(8): 1038-1044, 2018 08.

Article in English | MEDLINE | ID: mdl-29758352

ABSTRACT

OBJECTIVE: To examine associations of high-sensitivity C-reactive protein (CRP) levels and polygenic CRP genetic risk scores (GRS) with risk of end-stage hip or knee osteoarthritis (OA), defined as incident total hip (THR) or knee replacement (TKR) for OA. DESIGN: This study included a cohort of postmenopausal white, African American, and Hispanic women from the Women's Health Initiative. Women were followed from baseline to date of THR or TKR, death, or December 31, 2014. Medicare claims data identified THR and TKR. Hs-CRP and genotyping data were collected at baseline. Three CRP GRS were constructed: 1) a 4-SNP GRS comprised of genetic variants representing variation in the CRP gene among European populations; 2) a multilocus 18-SNP GRS of genetic variants significantly associated with CRP levels in a meta-analysis of genome-wide association studies; and 3) a 5-SNP GRS of genetic variants significantly associated with CRP levels among African American women. RESULTS: In analyses conducted separately among each race and ethnic group, there were no significant associations of ln hs-CRP with risk of THR or TKR, after adjusting for age, body mass index, lifestyle characteristics, chronic diseases, hormone therapy use, and non-steroidal anti-inflammatory drug use. CRP GRS were not associated with risk of THR or TKR in any ethnic group. CONCLUSIONS: Serum levels of ln hs-CRP and genetically-predicted CRP levels were not associated with risk of THR or TKR for OA among a diverse cohort of women.

Subject(s)

Arthroplasty, Replacement, Hip/statistics & numerical data , Arthroplasty, Replacement, Knee/statistics & numerical data , C-Reactive Protein/genetics , Osteoarthritis, Hip/genetics , Osteoarthritis, Knee/genetics , C-Reactive Protein/analysis , Female , Genetic Predisposition to Disease/epidemiology , Genetic Predisposition to Disease/genetics , Genome-Wide Association Study , Humans , Osteoarthritis, Hip/blood , Osteoarthritis, Hip/etiology , Osteoarthritis, Hip/surgery , Osteoarthritis, Knee/blood , Osteoarthritis, Knee/etiology , Osteoarthritis, Knee/surgery , Polymorphism, Single Nucleotide/genetics , Racial Groups/genetics , Racial Groups/statistics & numerical data , Risk Factors

3.

Trans-ethnic analysis of metabochip data identifies two new loci associated with BMI.

Gong, J; Nishimura, K K; Fernandez-Rhodes, L; Haessler, J; Bien, S; Graff, M; Lim, U; Lu, Y; Gross, M; Fornage, M; Yoneyama, S; Isasi, C R; Buzkova, P; Daviglus, M; Lin, D-Y; Tao, R; Goodloe, R; Bush, W S; Farber-Eger, E; Boston, J; Dilks, H H; Ehret, G; Gu, C C; Lewis, C E; Nguyen, K-D H; Cooper, R; Leppert, M; Irvin, M R; Bottinger, E P; Wilkens, L R; Haiman, C A; Park, L; Monroe, K R; Cheng, I; Stram, D O; Carlson, C S; Jackson, R; Kuller, L; Houston, D; Kooperberg, C; Buyske, S; Hindorff, L A; Crawford, D C; Loos, R J F; Le Marchand, L; Matise, T C; North, K E; Peters, U.

Int J Obes (Lond) ; 42(3): 384-390, 2018 03.

Article in English | MEDLINE | ID: mdl-29381148

ABSTRACT

OBJECTIVE: Body mass index (BMI) is commonly used to assess obesity, which is associated with numerous diseases and negative health outcomes. BMI has been shown to be a heritable, polygenic trait, with close to 100 loci previously identified and replicated in multiple populations. We aim to replicate known BMI loci and identify novel associations in a trans-ethnic study population. SUBJECTS: Using eligible participants from the Population Architecture using Genomics and Epidemiology consortium, we conducted a trans-ethnic meta-analysis of 102 514 African Americans, Hispanics, Asian/Native Hawaiian, Native Americans and European Americans. Participants were genotyped on over 200 000 SNPs on the Illumina Metabochip custom array, or imputed into the 1000 Genomes Project (Phase I). Linear regression of the natural log of BMI, adjusting for age, sex, study site (if applicable), and ancestry principal components, was conducted for each race/ethnicity within each study cohort. Race/ethnicity-specific, and combined meta-analyses used fixed-effects models. RESULTS: We replicated 15 of 21 BMI loci included on the Metabochip, and identified two novel BMI loci at 1q41 (rs2820436) and 2q31.1 (rs10930502) at the Metabochip-wide significance threshold (P<2.5 × 10-7). Bioinformatic functional investigation of SNPs at these loci suggests a possible impact on pathways that regulate metabolism and adipose tissue. CONCLUSION: Conducting studies in genetically diverse populations continues to be a valuable strategy for replicating known loci and uncovering novel BMI associations.

Subject(s)

Body Mass Index , Racial Groups/genetics , Racial Groups/statistics & numerical data , Genome-Wide Association Study , Genomics , Humans , Polymorphism, Single Nucleotide/genetics

4.

Generalization and fine mapping of European ancestry-based central adiposity variants in African ancestry populations.

Yoneyama, S; Yao, J; Guo, X; Fernandez-Rhodes, L; Lim, U; Boston, J; Buzková, P; Carlson, C S; Cheng, I; Cochran, B; Cooper, R; Ehret, G; Fornage, M; Gong, J; Gross, M; Gu, C C; Haessler, J; Haiman, C A; Henderson, B; Hindorff, L A; Houston, D; Irvin, M R; Jackson, R; Kuller, L; Leppert, M; Lewis, C E; Li, R; Le Marchand, L; Matise, T C; Nguyen, K-Dh; Chakravarti, A; Pankow, J S; Pankratz, N; Pooler, L; Ritchie, M D; Bien, S A; Wassel, C L; Chen, Y-D I; Taylor, K D; Allison, M; Rotter, J I; Schreiner, P J; Schumacher, F; Wilkens, L; Boerwinkle, E; Kooperberg, C; Peters, U; Buyske, S; Graff, M; North, K E.

Int J Obes (Lond) ; 41(2): 324-331, 2017 02.

Article in English | MEDLINE | ID: mdl-27867202

ABSTRACT

BACKGROUND/OBJECTIVES: Central adiposity measures such as waist circumference (WC) and waist-to-hip ratio (WHR) are associated with cardiometabolic disorders independently of body mass index (BMI) and are gaining clinically utility. Several studies report genetic variants associated with central adiposity, but most utilize only European ancestry populations. Understanding whether the genetic associations discovered among mainly European descendants are shared with African ancestry populations will help elucidate the biological underpinnings of abdominal fat deposition. SUBJECTS/METHODS: To identify the underlying functional genetic determinants of body fat distribution, we conducted an array-wide association meta-analysis among persons of African ancestry across seven studies/consortia participating in the Population Architecture using Genomics and Epidemiology (PAGE) consortium. We used the Metabochip array, designed for fine-mapping cardiovascular-associated loci, to explore novel array-wide associations with WC and WHR among 15 945 African descendants using all and sex-stratified groups. We further interrogated 17 known WHR regions for African ancestry-specific variants. RESULTS: Of the 17 WHR loci, eight single-nucleotide polymorphisms (SNPs) located in four loci were replicated in the sex-combined or sex-stratified meta-analyses. Two of these eight independently associated with WHR after conditioning on the known variant in European descendants (rs12096179 in TBX15-WARS2 and rs2059092 in ADAMTS9). In the fine-mapping assessment, the putative functional region was reduced across all four loci but to varying degrees (average 40% drop in number of putative SNPs and 20% drop in genomic region). Similar to previous studies, the significant SNPs in the female-stratified analysis were stronger than the significant SNPs from the sex-combined analysis. No novel associations were detected in the array-wide analyses. CONCLUSIONS: Of 17 previously identified loci, four loci replicated in the African ancestry populations of this study. Utilizing different linkage disequilibrium patterns observed between European and African ancestries, we narrowed the suggestive region containing causative variants for all four loci.

Subject(s)

Adiposity/genetics , Black People/genetics , Genetic Variation , White People/genetics , Adult , Body Fat Distribution , Female , Genetic Predisposition to Disease/ethnology , Genome-Wide Association Study , Genotype , Humans , Male , Obesity, Abdominal/ethnology , Obesity, Abdominal/genetics , Polymorphism, Single Nucleotide/genetics , Waist-Hip Ratio

5.

Generalization of adiposity genetic loci to US Hispanic women.

Graff, M; Fernández-Rhodes, L; Liu, S; Carlson, C; Wassertheil-Smoller, S; Neuhouser, M; Reiner, A; Kooperberg, C; Rampersaud, E; Manson, J E; Kuller, L H; Howard, B V; Ochs-Balcom, H M; Johnson, K C; Vitolins, M Z; Sucheston, L; Monda, K; North, K E.

Nutr Diabetes ; 3: e85, 2013 Aug 26.

Article in English | MEDLINE | ID: mdl-23978819

ABSTRACT

BACKGROUND: Obesity is a public health concern. Yet the identification of adiposity-related genetic variants among United States (US) Hispanics, which is the largest US minority group, remains largely unknown. OBJECTIVE: To interrogate an a priori list of 47 (32 overall body mass and 15 central adiposity) index single-nucleotide polymorphisms (SNPs) previously studied in individuals of European descent among 3494 US Hispanic women in the Women's Health Initiative SNP Health Association Resource (WHI SHARe). DESIGN: Cross-sectional analysis of measured body mass index (BMI), waist circumference (WC) and waist-to-hip ratio (WHR) were inverse normally transformed after adjusting for age, smoking, center and global ancestry. WC and WHR models were also adjusted for BMI. Genotyping was performed using the Affymetrix 6.0 array. In the absence of an a priori selected SNP, a proxy was selected (r(2)î¶0.8 in CEU). RESULTS: Six BMI loci (TMEM18, NUDT3/HMGA1, FAIM2, FTO, MC4R and KCTD15) and two WC/WHR loci (VEGFA and ITPR2-SSPN) were nominally significant (P<0.05) at the index or proxy SNP in the corresponding BMI and WC/WHR models. To account for distinct linkage disequilibrium patterns in Hispanics and further assess generalization of genetic effects at each locus, we interrogated the evidence for association at the 47 surrounding loci within 1 Mb region of the index or proxy SNP. Three additional BMI loci (FANCL, TFAP2B and ETV5) and five WC/WHR loci (DNM3-PIGC, GRB14, ADAMTS9, LY86 and MSRA) displayed Bonferroni-corrected significant associations with BMI and WC/WHR. Conditional analyses of each index SNP (or its proxy) and the most significant SNP within the 1 Mb region supported the possible presence of index-independent signals at each of these eight loci as well as at KCTD15. CONCLUSION: This study provides evidence for the generalization of nine BMI and seven central adiposity loci in Hispanic women. This study expands the current knowledge of common adiposity-related genetic loci to Hispanic women.

6.

Boosting for detection of gene-environment interactions.

Pashova, H; LeBlanc, M; Kooperberg, C.

Stat Med ; 32(2): 255-66, 2013 Jan 30.

Article in English | MEDLINE | ID: mdl-22764060

ABSTRACT

In genetic association studies, it is typically thought that genetic variants and environmental variables jointly will explain more of the inheritance of a phenotype than either of these two components separately. Traditional methods to identify gene-environment interactions typically consider only one measured environmental variable at a time. However, in practice, multiple environmental factors may each be imprecise surrogates for the underlying physiological process that actually interacts with the genetic factors. In this paper, we develop a variant of L(2) boosting that is specifically designed to identify combinations of environmental variables that jointly modify the effect of a gene on a phenotype. Because the effect modifiers might have a small signal compared with the main effects, working in a space that is orthogonal to the main predictors allows us to focus on the interaction space. In a simulation study that investigates some plausible underlying model assumptions, our method outperforms the least absolute shrinkage and selection and Akaike Information Criterion and Bayesian Information Criterion model selection procedures as having the lowest test error. In an example for the Women's Health Initiative-Population Architecture using Genomics and Epidemiology study, the dedicated boosting method was able to pick out two single-nucleotide polymorphisms for which effect modification appears present. The performance was evaluated on an independent test set, and the results are promising.

Subject(s)

Bayes Theorem , Gene-Environment Interaction , Genetic Association Studies , Polymorphism, Single Nucleotide , Women's Health , Clinical Trials as Topic , Demography , Female , Humans , Phenotype , United States/epidemiology

7.

The use of phenome-wide association studies (PheWAS) for exploration of novel genotype-phenotype relationships and pleiotropy discovery.

Pendergrass, S A; Brown-Gentry, K; Dudek, S M; Torstenson, E S; Ambite, J L; Avery, C L; Buyske, S; Cai, C; Fesinmeyer, M D; Haiman, C; Heiss, G; Hindorff, L A; Hsu, C-N; Jackson, R D; Kooperberg, C; Le Marchand, L; Lin, Y; Matise, T C; Moreland, L; Monroe, K; Reiner, A P; Wallace, R; Wilkens, L R; Crawford, D C; Ritchie, M D.

Genet Epidemiol ; 35(5): 410-22, 2011 Jul.

Article in English | MEDLINE | ID: mdl-21594894

ABSTRACT

The field of phenomics has been investigating network structure among large arrays of phenotypes, and genome-wide association studies (GWAS) have been used to investigate the relationship between genetic variation and single diseases/outcomes. A novel approach has emerged combining both the exploration of phenotypic structure and genotypic variation, known as the phenome-wide association study (PheWAS). The Population Architecture using Genomics and Epidemiology (PAGE) network is a National Human Genome Research Institute (NHGRI)-supported collaboration of four groups accessing eight extensively characterized epidemiologic studies. The primary focus of PAGE is deep characterization of well-replicated GWAS variants and their relationships to various phenotypes and traits in diverse epidemiologic studies that include European Americans, African Americans, Mexican Americans/Hispanics, Asians/Pacific Islanders, and Native Americans. The rich phenotypic resources of PAGE studies provide a unique opportunity for PheWAS as each genotyped variant can be tested for an association with the wide array of phenotypic measurements available within the studies of PAGE, including prevalent and incident status for multiple common clinical conditions and risk factors, as well as clinical parameters and intermediate biomarkers. The results of PheWAS can be used to discover novel relationships between SNPs, phenotypes, and networks of interrelated phenotypes; identify pleiotropy; provide novel mechanistic insights; and foster hypothesis generation. The PAGE network has developed infrastructure to support and perform PheWAS in a high-throughput manner. As implementing the PheWAS approach has presented several challenges, the infrastructure and methodology, as well as insights gained in this project, are presented herein to benefit the larger scientific community.

Subject(s)

Genetic Association Studies/statistics & numerical data , Databases, Genetic , Ethnicity/genetics , Genetic Variation , Genome-Wide Association Study/statistics & numerical data , Humans , Models, Genetic , Models, Statistical , Phenotype , Polymorphism, Single Nucleotide , Racial Groups/genetics

8.

Variation in 24 hemostatic genes and associations with non-fatal myocardial infarction and ischemic stroke.

Smith, N L; Bis, J C; Biagiotti, S; Rice, K; Lumley, T; Kooperberg, C; Wiggins, K L; Heckbert, S R; Psaty, B M.

J Thromb Haemost ; 6(1): 45-53, 2008 Jan.

Article in English | MEDLINE | ID: mdl-17927806

ABSTRACT

BACKGROUND: Arterial thrombosis involves platelet aggregation and clot formation, yet little is known about the contribution of genetic variation in fibrin-based hemostatic factors to arterial clotting risk. We hypothesized that common variation in 24 coagulation-fibrinolysis genes would contribute to risk of incident myocardial infarction (MI) or ischemic stroke (IS). METHODS: We conducted a population-based, case-control study. Subjects were hypertensive adults and postmenopausal women 30-79 years of age, who sustained a first MI (n = 856) or IS (n = 368) between 1995 and 2002, and controls matched on age, hypertension status, and calendar year (n = 2,689). We investigated the risk of MI and IS associated with (i) global variation within each gene as measured by common haplotypes and (ii) individual haplotypes and single nucleotide polymorphisms (SNPs). Significance was assessed using a 0.2 threshold of the false discovery rate q-value, which accounts for multiple testing. RESULTS: After accounting for multiple testing, global genetic variation in factor (F) VIII was associated with IS risk. Two haplotypes in FVIII and one in FXIIIa1 were significantly associated with increased IS risk (all q-values < 0.2). A plasminogen gene SNP was associated with MI risk. All are new discoveries not previously reported. Another 24 tests had P-values < 0.05 and q-values > 0.2 in MI and IS analyses, 23 of which are new and hypothesis generating. CONCLUSIONS: Apart from the association of FVIII variation with IS, we found little evidence that common variation in the 24 candidate fibrin-based hemostasis genes strongly influences arterial thrombotic risk, but our results cannot rule out small effects.

Subject(s)

Factor VIII/genetics , Factor XIIIa/genetics , Genetic Variation , Hemostasis/genetics , Myocardial Infarction/genetics , Plasminogen/genetics , Stroke/genetics , Aged , Case-Control Studies , Haplotypes , Humans , Hypertension , Middle Aged , Polymorphism, Single Nucleotide , Postmenopause

9.

Widespread collaboration of Isw2 and Sin3-Rpd3 chromatin remodeling complexes in transcriptional repression.

Fazzio, T G; Kooperberg, C; Goldmark, J P; Neal, C; Basom, R; Delrow, J; Tsukiyama, T.

Mol Cell Biol ; 21(19): 6450-60, 2001 Oct.

Article in English | MEDLINE | ID: mdl-11533234

ABSTRACT

The yeast Isw2 chromatin remodeling complex functions in parallel with the Sin3-Rpd3 histone deacetylase complex to repress early meiotic genes upon recruitment by Ume6p. For many of these genes, the effect of an isw2 mutation is partially masked by a functional Sin3-Rpd3 complex. To identify the full range of genes repressed or activated by these factors and uncover hidden targets of Isw2-dependent regulation, we performed full genome expression analyses using cDNA microarrays. We find that the Isw2 complex functions mainly in repression of transcription in a parallel pathway with the Sin3-Rpd3 complex. In addition to Ume6 target genes, we find that many Ume6-independent genes are derepressed in mutants lacking functional Isw2 and Sin3-Rpd3 complexes. Conversely, we find that ume6 mutants, but not isw2 sin3 or isw2 rpd3 double mutants, have reduced fidelity of mitotic chromosome segregation, suggesting that one or more functions of Ume6p are independent of Sin3-Rpd3 and Isw2 complexes. Chromatin structure analyses of two nonmeiotic genes reveals increased DNase I sensitivity within their regulatory regions in an isw2 mutant, as seen previously for one meiotic locus. These data suggest that the Isw2 complex functions at Ume6-dependent and -independent loci to create DNase I-inaccessible chromatin structure by regulating the positioning or placement of nucleosomes.

Subject(s)

Adenosine Triphosphatases/physiology , Chromatin/physiology , Gene Expression Regulation, Fungal , Saccharomyces cerevisiae Proteins , Transcription Factors/physiology , Yeasts/genetics , Adenosine Triphosphatases/genetics , Cell Division , Chromatin/ultrastructure , Chromosome Segregation , DNA-Binding Proteins/genetics , DNA-Binding Proteins/physiology , Deoxyribonuclease I/chemistry , Histone Deacetylases , Macromolecular Substances , Mutation , Oligonucleotide Array Sequence Analysis , Phenotype , Repressor Proteins/genetics , Repressor Proteins/physiology , Transcription Factors/genetics , Transcription, Genetic , Yeasts/cytology , Yeasts/metabolism

10.

Sequence analysis using logic regression.

Kooperberg, C; Ruczinski, I; LeBlanc, M L; Hsu, L.

Genet Epidemiol ; 21 Suppl 1: S626-31, 2001.

Article in English | MEDLINE | ID: mdl-11793751

ABSTRACT

Logic Regression is a new adaptive regression methodology that attempts to construct predictors as Boolean combinations of (binary) covariates. In this paper we use this algorithm to deal with single-nucleotide polymorphism (SNP) sequence data. The predictors that are found are interpretable as risk factors of the disease. Significance of these risk factors is assessed using techniques like cross-validation, permutation tests, and independent test sets. These model selection techniques remain valid when data is dependent, as is the case for the family data used here. In our analysis of the Genetic Analysis Workshop 12 data we identify the exact locations of mutations on gene 1 and gene 6 and a number of mutations on gene 2 that are associated with the affected status, without selecting any false positives.

Subject(s)

Chromosome Mapping/statistics & numerical data , Genetic Predisposition to Disease/genetics , Logistic Models , Models, Genetic , Polymorphism, Single Nucleotide/genetics , Algorithms , DNA Mutational Analysis , Female , Humans , Male , Quantitative Trait, Heritable

11.

Correlates of serum lycopene in older women.

Casso, D; White, E; Patterson, R E; Agurs-Collins, T; Kooperberg, C; Haines, P S.

Nutr Cancer ; 36(2): 163-9, 2000.

Article in English | MEDLINE | ID: mdl-10890026

ABSTRACT

Experimental and epidemiological evidence suggests that lycopene, a predominant carotenoid found in human serum, may reduce the risk of certain cancers. We examined the association of dietary, physiological, and other factors with serum lycopene concentrations in a subsample of 946 postmenopausal women participating in the Women's Health Initiative. Pearson partial correlation coefficients and linear regression coefficients were calculated after adjustment for age, ethnicity, and serum low-density-lipoprotein (LDL) cholesterol. Serum lycopene was correlated with serum LDL cholesterol (r = 0.23) and dietary lycopene (r = 0.17, both p < 0.001). Individual food items found to be correlated with serum lycopene after adjustment included fresh tomatoes or tomato juice (r = 0.11), cooked tomatoes, tomato sauce, or salsa (r = 0.17), and spaghetti with meat sauce (r = 0.19, all p < 0.01). Age and body mass index were negatively associated with serum lycopene levels (both p < 0.001). Serum lycopene levels were highest in the summer and highest for those living in the northeastern United States. If we postulate that high serum lycopene levels reduce cancer risk, it becomes apparent that we have limited ability to detect this association from studies of lycopene intake. An understanding of factors associated with serum lycopene levels can be useful for the interpretation of studies of dietary lycopene and disease risk.

Subject(s)

Antioxidants/administration & dosage , Carotenoids/blood , Cholesterol, LDL/blood , Lipids/blood , Neoplasms/prevention & control , Aged , Carotenoids/administration & dosage , Cholesterol/blood , Cross-Sectional Studies , Female , Humans , Lycopene , Solanum lycopersicum , Middle Aged , Neoplasms/epidemiology , Postmenopause , Statistics as Topic , Surveys and Questionnaires

12.

Linear regression for bivariate censored data via multiple imputation.

Pan, W; Kooperberg, C.

Stat Med ; 18(22): 3111-21, 1999 Nov 30.

Article in English | MEDLINE | ID: mdl-10544310

ABSTRACT

Bivariate survival data arise, for example, in twin studies and studies of both eyes or ears of the same individual. Often it is of interest to regress the survival times on a set of predictors. In this paper we extend Wei and Tanner's multiple imputation approach for linear regression with univariate censored data to bivariate censored data. We formulate a class of censored bivariate linear regression methods by iterating between the following two steps: 1. the data is augmented by imputing survival times for censored observations; 2. a linear model is fit to the imputed complete data. We consider three different methods to implement these two steps. In particular, the marginal (independence) approach ignores the possible correlation between two survival times when estimating the regression coefficient. To improve the efficiency, we propose two methods that account for the correlation between the survival times. First, we improve the efficiency by using generalized least squares regression in step 2. Second, instead of generating data from an estimate of the marginal distribution we generate data from a bivariate log-spline density estimate in step 1. Through simulation studies we find that the performance of the two methods that take the dependence into account is close and that they are both more efficient than the marginal approach. The methods are applied to a data set from an otitis media clinical trial.

Subject(s)

Computer Simulation , Proportional Hazards Models , Anti-Bacterial Agents/therapeutic use , Child , Ear/surgery , Female , Humans , Male , Monte Carlo Method , Otitis Media/drug therapy , Otitis Media/surgery , Prednisone/therapeutic use

13.

Improved recognition of native-like protein structures using a combination of sequence-dependent and sequence-independent features of proteins.

Simons, K T; Ruczinski, I; Kooperberg, C; Fox, B A; Bystroff, C; Baker, D.

Proteins ; 34(1): 82-95, 1999 Jan 01.

Article in English | MEDLINE | ID: mdl-10336385

ABSTRACT

We describe the development of a scoring function based on the decomposition P(structure/sequence) proportional to P(sequence/structure) *P(structure), which outperforms previous scoring functions in correctly identifying native-like protein structures in large ensembles of compact decoys. The first term captures sequence-dependent features of protein structures, such as the burial of hydrophobic residues in the core, the second term, universal sequence-independent features, such as the assembly of beta-strands into beta-sheets. The efficacies of a wide variety of sequence-dependent and sequence-independent features of protein structures for recognizing native-like structures were systematically evaluated using ensembles of approximately 30,000 compact conformations with fixed secondary structure for each of 17 small protein domains. The best results were obtained using a core scoring function with P(sequence/structure) parameterized similarly to our previous work (Simons et al., J Mol Biol 1997;268:209-225] and P(structure) focused on secondary structure packing preferences; while several additional features had some discriminatory power on their own, they did not provide any additional discriminatory power when combined with the core scoring function. Our results, on both the training set and the independent decoy set of Park and Levitt (J Mol Biol 1996;258:367-392), suggest that this scoring function should contribute to the prediction of tertiary structure from knowledge of sequence and secondary structure.

Subject(s)

Models, Statistical , Protein Structure, Tertiary , Protein Conformation , Protein Structure, Secondary

14.

Assembly of protein tertiary structures from fragments with similar local sequences using simulated annealing and Bayesian scoring functions.

Simons, K T; Kooperberg, C; Huang, E; Baker, D.

J Mol Biol ; 268(1): 209-25, 1997 Apr 25.

Article in English | MEDLINE | ID: mdl-9149153

ABSTRACT

We explore the ability of a simple simulated annealing procedure to assemble native-like structures from fragments of unrelated protein structures with similar local sequences using Bayesian scoring functions. Environment and residue pair specific contributions to the scoring functions appear as the first two terms in a series expansion for the residue probability distributions in the protein database; the decoupling of the distance and environment dependencies of the distributions resolves the major problems with current database-derived scoring functions noted by Thomas and Dill. The simulated annealing procedure rapidly and frequently generates native-like structures for small helical proteins and better than random structures for small beta sheet containing proteins. Most of the simulated structures have native-like solvent accessibility and secondary structure patterns, and thus ensembles of these structures provide a particularly challenging set of decoys for evaluating scoring functions. We investigate the effects of multiple sequence information and different types of conformational constraints on the overall performance of the method, and the ability of a variety of recently developed scoring functions to recognize the native-like conformations in the ensembles of simulated structures.

Subject(s)

Bayes Theorem , Models, Molecular , Protein Structure, Tertiary , Proteins/chemistry , Computer Simulation , Databases, Factual , Models, Statistical , Peptide Fragments/chemistry , Protein Folding , Sequence Homology, Amino Acid

15.

Hazard regression with interval-censored data.

Kooperberg, C; Clarkson, D B.

Biometrics ; 53(4): 1485-94, 1997 Dec.

Article in English | MEDLINE | ID: mdl-9423263

ABSTRACT

In a recent paper, Kooperberg, Stone, and Truong (1995a) introduced hazard regression (HARE), in which linear splines and their tensor products are used to estimate the conditional log-hazard function based on possibly censored, positive response data and one or more covariates. Model selection is carried out in an adaptive fashion using maximum likelihood estimation of the unknown coefficients, Rao and Wald statistics to carry out stepwise addition and deletion of basis functions, and the Bayesian Information Criterion (BIC) to select the final model. In the present paper, the HARE methodology is extended to accommodate interval-censored data, time-dependent covariates, and cubic splines. The presence of interval-censored data means that the log-likelihood function may no longer be concave, presenting additional numerical challenges. The extended methodology is applied to a data set containing both interval-censoring and time-dependent covariates. The new software will be available in a future release of S-Plus.

Subject(s)

Models, Statistical , Proportional Hazards Models , Survival Analysis , Acquired Immunodeficiency Syndrome/prevention & control , Anal Canal/pathology , Bayes Theorem , Biometry/methods , Follow-Up Studies , HIV Seronegativity , HIV Seropositivity/pathology , HIV Seropositivity/virology , Homosexuality, Male , Humans , Likelihood Functions , Male , Papillomaviridae/isolation & purification

16.

Statistical modeling to predict elective surgery time. Comparison with a computer scheduling system and surgeon-provided estimates.

Wright, I H; Kooperberg, C; Bonar, B A; Bashein, G.

Anesthesiology ; 85(6): 1235-45, 1996 Dec.

Article in English | MEDLINE | ID: mdl-8968169

ABSTRACT

BACKGROUND: Accurate estimation of operating times is a prerequisite for the efficient scheduling of the operating suite. The authors, in this study, sought to compare surgeons' time estimates for elective cases with those of commercial scheduling software, and to ascertain whether improvements could be made by regression modeling. METHODS: The study was conducted at the University of Washington Medical Center in three phases. Phase 1 retrospectively reviewed surgeons' time estimates and the scheduling system's estimates throughout 1 yr. In phase 2, data were collected prospectively from participating surgeons by means of a data entry form completed at the time of scheduling elective cases. Data included the procedure code, estimated operating time, estimated case difficulty, and potential factors that might affect the duration. In phase 3, identical data were collected from five selected surgeons by personal interview. RESULTS: In phase 1, 26 of 43 surgeons provided significantly better estimates than did the scheduling system (P < 0.01), and no surgeon was significantly worse, although the absolute errors were large (34% of 157 min average case length). In phase 2, modeling improved the accuracy of the surgeons' estimates by 11.5%, compared with the scheduling system. In phase 3, applying the model from phase 2 improved the accuracy of the surgeons' estimates by 18.2%. CONCLUSIONS: Surgeons provide more accurate time estimates than does the scheduling software as it is used in our institution. Regression modeling effects modest improvements in accuracy. Further improvements would be likely if the hospital information system could provide timely historical data and feedback to the surgeons.

Subject(s)

Appointments and Schedules , General Surgery , Models, Statistical , Operating Room Information Systems , Operating Rooms/organization & administration , Computer Simulation , Humans , Prospective Studies , Retrospective Studies , Time Factors

17.

Trees and splines in survival analysis.

Intrator, O; Kooperberg, C.

Stat Methods Med Res ; 4(3): 237-61, 1995 Sep.

Article in English | MEDLINE | ID: mdl-8548105

ABSTRACT

During the past few years several nonparametric alternatives to the Cox proportional hazards model have appeared in the literature. These methods extend techniques that are well known from regression analysis to the analysis of censored survival data. In this paper we discuss methods based on (partition) trees and (polynomial) splines, analyse two datasets using both Survival Trees and HARE, and compare the strengths and weaknesses of the two methods. One of the strengths of HARE is that its model fitting procedure has an implicit check for proportionality of the underlying hazards model. It also provides an explicit model for the conditional hazards function, which makes it very convenient to obtain graphical summaries. On the other hand, the tree-based methods automatically partition a dataset into groups of cases that are similar in survival history. Results obtained by survival trees and HARE are often complementary. Trees and splines in survival analysis should provide the data analyst with two useful tools when analysing survival data.

Subject(s)

Algorithms , Decision Trees , Models, Statistical , Statistics, Nonparametric , Survival Analysis , Breast Neoplasms/mortality , Coronary Disease/mortality , Data Interpretation, Statistical , Female , Humans , Likelihood Functions , Linear Models , Logistic Models , Male , Multivariate Analysis , Proportional Hazards Models , Regression Analysis , Risk Factors , Software

18.

Using logistic regression to estimate the adjusted attributable risk of low birthweight in an unmatched case-control study.

Kooperberg, C; Petitti, D B.

Epidemiology ; 2(5): 363-6, 1991 Sep.

Article in English | MEDLINE | ID: mdl-1742386

ABSTRACT

Other authors have shown how to estimate attributable risk based on stratification. In this paper, we show how to estimate adjusted attributable risks, standard errors, and confidence intervals from an unmatched case-control study that has population-based controls and uses the logistic regression model to estimate relative risk. We apply the method to data from a case-control study of low birthweight. The method is conceptually simple, has no assumptions beyond those of the logistic model, makes use of computer-intensive statistical techniques (the bootstrap), and extends to interactions. A Fortran computer program to carry out the computations is available from the authors upon request.

Subject(s)

Infant, Low Birth Weight , Logistic Models , Risk , Case-Control Studies , Confidence Intervals , Humans , Infant, Newborn , Mathematics , Software

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

SEND TO:

SELECTION OF CITATIONS

SEARCH DETAIL