Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 28
Filter
1.
Transl Psychiatry ; 10(1): 169, 2020 05 28.
Article in English | MEDLINE | ID: mdl-32467585

ABSTRACT

The complex phenotypic and genetic nature of anxieties hampers progress in unravelling their molecular etiologies. Dogs present extensive natural variation in fear and anxiety behaviour and could advance the understanding of the molecular background of behaviour due to their unique breeding history and genetic architecture. As dogs live as part of human families under constant care and monitoring, information from their behaviour and experiences are easily available. Here we have studied the genetic background of fearfulness in the Great Dane breed. Dogs were scored and categorised into cases and controls based on the results of the validated owner-completed behavioural survey. A genome-wide association study in a cohort of 124 dogs with and without socialisation as a covariate revealed a genome-wide significant locus on chromosome 11. Whole exome sequencing and whole genome sequencing revealed extensive regions of opposite homozygosity in the same locus on chromosome 11 between the cases and controls with interesting neuronal candidate genes such as MAPK9/JNK2, a known hippocampal regulator of anxiety. Further characterisation of the identified locus will pave the way for molecular understanding of fear in dogs and may provide a natural animal model for human anxieties.


Subject(s)
Genome-Wide Association Study , Animals , Chromosomes , Dogs , Fear , Genome , Genomics , Humans , Polymorphism, Single Nucleotide
2.
Transl Psychiatry ; 9(1): 18, 2019 01 17.
Article in English | MEDLINE | ID: mdl-30655508

ABSTRACT

Anxiety disorders are among the leading health issues in human medicine. The complex phenotypic and allelic nature of these traits as well as the challenge of establishing reliable measures of the heritable component of behaviour from the associated environmental factors hampers progress in their molecular aetiology. Dogs exhibit large natural variation in fearful and anxious behaviour and could facilitate progress in the molecular aetiology due to their unique genetic architecture. We have performed a genome-wide association study with a canine high-density SNP array in a cohort of 330 German Shepherds for two phenotypes, fear of loud noises (noise sensitivity) and fear of strangers or in novel situations. Genome-widely significant loci were discovered for the traits on chromosomes 20 and 7, respectively. The regions overlap human neuropsychiatric loci, including 18p11.2, with physiologically relevant candidate genes that contribute to glutamatergic and dopaminergic neurotransmission in the brain. In addition, the noise-sensitivity locus includes hearing-related candidate genes. These results indicate a genetic contribution for canine fear and suggest a shared molecular aetiology of anxiety across species. Further characterisation of the identified loci will pave the way to molecular understanding of the conditions as a prerequisite for improved therapy.


Subject(s)
Anxiety Disorders/genetics , Behavior, Animal , Chromosome Mapping , Fear , Genetic Association Studies , Alleles , Animals , Breeding , Dogs , Genome-Wide Association Study , Genomics , Humans , Polymorphism, Single Nucleotide
3.
Heredity (Edinb) ; 118(5): 413-423, 2017 May.
Article in English | MEDLINE | ID: mdl-27901510

ABSTRACT

Local adaptation is a common feature of plant and animal populations. Adaptive phenotypic traits are genetically differentiated along environmental gradients, but the genetic basis of such adaptation is still poorly known. Genetic association studies of local adaptation combine data over populations. Correcting for population structure in these studies can be problematic since both selection and neutral demographic events can create similar allele frequency differences between populations. Correcting for demography with traditional methods may lead to eliminating some true associations. We developed a new Bayesian approach for identifying the loci underlying an adaptive trait in a multipopulation situation in the presence of possible double confounding due to population stratification and adaptation. With this method we studied the genetic basis of timing of bud set, a surrogate trait for timing of yearly growth cessation that confers local adaptation to the populations of Scots pine (Pinus sylvestris). Population means of timing of bud set were highly correlated with latitude. Most effects at individual loci were small. Interestingly, we found genetic heterogeneity (that is, different sets of loci associated with the trait) between the northern and central European parts of the cline. We also found indications of stronger stabilizing selection toward the northern part of the range. The harsh northern conditions may impose greater selective pressure on timing of growth cessation, and the relative importance of different environmental cues used for tracking the seasons might differ depending on latitude of origin.


Subject(s)
Adaptation, Physiological/genetics , Genetics, Population/methods , Pinus sylvestris/genetics , Polymorphism, Single Nucleotide , Bayes Theorem , Europe , Flowers/physiology , Genotype , Models, Genetic , Phenotype , Pinus sylvestris/physiology
4.
Heredity (Edinb) ; 115(6): 556-64, 2015 Dec.
Article in English | MEDLINE | ID: mdl-26174023

ABSTRACT

Linear regression-based quantitative trait loci/association mapping methods such as least squares commonly assume normality of residuals. In genetics studies of plants or animals, some quantitative traits may not follow normal distribution because the data include outlying observations or data that are collected from multiple sources, and in such cases the normal regression methods may lose some statistical power to detect quantitative trait loci. In this work, we propose a robust multiple-locus regression approach for analyzing multiple quantitative traits without normality assumption. In our method, the objective function is least absolute deviation (LAD), which corresponds to the assumption of multivariate Laplace distributed residual errors. This distribution has heavier tails than the normal distribution. In addition, we adopt a group LASSO penalty to produce shrinkage estimation of the marker effects and to describe the genetic correlation among phenotypes. Our LAD-LASSO approach is less sensitive to the outliers and is more appropriate for the analysis of data with skewedly distributed phenotypes. Another application of our robust approach is on missing phenotype problem in multiple-trait analysis, where the missing phenotype items can simply be filled with some extreme values, and be treated as outliers. The efficiency of the LAD-LASSO approach is illustrated on both simulated and real data sets.


Subject(s)
Models, Genetic , Quantitative Trait Loci , Computer Simulation , Phenotype , Polymorphism, Single Nucleotide , Regression Analysis
5.
Heredity (Edinb) ; 112(3): 351-60, 2014 Mar.
Article in English | MEDLINE | ID: mdl-24253936

ABSTRACT

Quantitative trait loci (QTL) affecting the phenotype of interest can be detected using linkage analysis (LA), linkage disequilibrium (LD) mapping or a combination of both (LDLA). The LA approach uses information from recombination events within the observed pedigree and LD mapping from the historical recombinations within the unobserved pedigree. We propose the Bayesian variable selection approach for combined LDLA analysis for single-nucleotide polymorphism (SNP) data. The novel approach uses both sources of information simultaneously as is commonly done in plant and animal genetics, but it makes fewer assumptions about population demography than previous LDLA methods. This differs from approaches in human genetics, where LDLA methods use LA information conditional on LD information or the other way round. We argue that the multilocus LDLA model is more powerful for the detection of phenotype-genotype associations than single-locus LDLA analysis. To illustrate the performance of the Bayesian multilocus LDLA method, we analyzed simulation replicates based on real SNP genotype data from small three-generational CEPH families and compared the results with commonly used quantitative transmission disequilibrium test (QTDT). This paper is intended to be conceptual in the sense that it is not meant to be a practical method for analyzing high-density SNP data, which is more common. Our aim was to test whether this approach can function in principle.


Subject(s)
Bayes Theorem , Chromosome Mapping/methods , Linkage Disequilibrium , Genotype , Humans , Models, Genetic , Neoplasms/genetics , Pedigree , Phenotype , Polymorphism, Single Nucleotide , Quantitative Trait Loci
6.
J Dairy Sci ; 97(2): 1117-27, 2014 Feb.
Article in English | MEDLINE | ID: mdl-24342683

ABSTRACT

The observed low accuracy of genomic selection in multibreed and admixed populations results from insufficient linkage disequilibrium between markers and trait loci. Failure to remove variation due to the population structure may also hamper the prediction accuracy. We verified if accounting for breed origin of alleles in the calculation of genomic relationships would improve the prediction accuracy in an admixed population. Individual breed proportions derived from the pedigree were used to estimate breed-wise allele frequencies (AF). Breed-wise and across-breed AF were estimated from the currently genotyped population and also in the base population. Genomic relationship matrices (G) were subsequently calculated using across-breed (GAB) and breed-wise (GBW) AF estimated in the currently genotyped and also in the base population. Unified relationship matrices were derived by combining different G with pedigree relationships in the evaluation of genomic estimated breeding values (GEBV) for genotyped and ungenotyped animals. The validation reliabilities and inflation of GEBV were assessed by a linear regression of deregressed breeding value (deregressed proofs) on GEBV, weighted by the reliability of deregressed proofs. The regression coefficients (b1) from GAB ranged from 0.76 for milk to 0.90 for protein. Corresponding b1 terms from GBW ranged from 0.72 to 0.88. The validation reliabilities across 4 evaluations with different G were generally 36, 40, and 46% for milk, protein, and fat, respectively. Unexpectedly, validation reliabilities were generally similar across different evaluations, irrespective of AF used to compute G. Thus, although accounting for the population structure in GBW tends to simplify the blending of genomic- and pedigree-based relationships, it appeared to have little effect on the validation reliabilities.


Subject(s)
Cattle/genetics , Gene Frequency , Genome/genetics , Genomics/methods , Milk , Models, Genetic , Animals , Breeding , Genotype , Linkage Disequilibrium , Pedigree , Phenotype , Reproducibility of Results
7.
J Dairy Sci ; 96(8): 5364-75, 2013 Aug.
Article in English | MEDLINE | ID: mdl-23769355

ABSTRACT

Different approaches of calculating genomic measures of relationship were explored and compared with pedigree relationships (A) within and across base breeds in a crossbreed population, using genotypes for 38,194 loci of 4,106 Nordic Red dairy cattle. Four genomic relationship matrices (G) were calculated using either observed allele frequencies (AF) across breeds or within-breed AF. The G matrices were compared separately when the AF were estimated in the observed and in the base population. Breedwise AF in the current and base population were estimated using linear regression models of individual genotypes on breed composition. Different G matrices were further used to predict direct estimated genomic values using a genomic BLUP model. Higher variability existed in the diagonal elements of G across breeds (standard deviation=0.06, on average) compared with A (0.01). The use of simple observed AF across base breeds to compute G increased coefficients for individuals in distantly related populations. Estimated breedwise AF reduced differences in coefficients similarly within and across populations. The variability of the current adjusted G matrix decreased from 0.055 to 0.035 when breedwise AF were estimated from the base breed population. The direct estimated genomic values and their validation reliabilities were, however, unaffected by AF used to compute G when estimated with a genomic BLUP model, due to inclusion of breed means in the model. In multibreed populations, G adjusted with breedwise AF from the founder population may provide more consistency among relationship coefficients between genotyped and ungenotyped individuals in an across-breed single-step evaluation.


Subject(s)
Cattle/genetics , Gene Frequency/genetics , Animals , Breeding , Genetic Loci/genetics , Genotype , Models, Genetic , Pedigree , Species Specificity
8.
J Anim Breed Genet ; 130(2): 89-97, 2013 Apr.
Article in English | MEDLINE | ID: mdl-23496009

ABSTRACT

Genetic parameters for different claw disorders, overall claw health and feet and leg conformation traits were estimated for Finnish Ayrshire cows. The merged data set with records of claw health and feet and leg conformation traits consisted of 105,000 observations from 52,598 Finnish Ayrshire cows between 2000 and 2010. The binary claw health data and the linearly scored conformation data were analysed using an animal model and restricted maximum likelihood method by applying the statistical package ASReml. Binomial logistic models with mixed effects were used to estimate genetic parameters for sole haemorrhages, chronic laminitis, white-line separation, sole ulcer, interdigital dermatitis, heel horn erosion, digital dermatitis, corkscrew claw and overall claw health. Estimated heritabilities for different claw disorders using a binomial logistic model ranged from 0.01 to 0.20. Estimated heritability for overall claw health using a binomial logistic model was 0.08. Estimated heritabilities for feet and leg conformation traits ranged from 0.07 to 0.39. The genetic correlations between claw health and feet and leg conformation traits ranged from -0.40 to 0.42. All phenotypic correlations were close to zero. The moderate genetic correlation, together with higher heritability of feet and leg conformation traits, showed that RLSV (rear leg side view) is a useful indicator trait to be used together with claw trimming information to increase the accuracy of breeding values for claw health in genetic evaluation.


Subject(s)
Cattle Diseases/genetics , Extremities/anatomy & histology , Foot Diseases/veterinary , Foot/anatomy & histology , Hoof and Claw/pathology , Animal Husbandry , Animals , Cattle , Cattle Diseases/epidemiology , Female , Finland/epidemiology , Foot Diseases/epidemiology , Foot Diseases/genetics
9.
J Anim Breed Genet ; 130(1): 10-9, 2013 Feb.
Article in English | MEDLINE | ID: mdl-23317061

ABSTRACT

The current study evaluates reliability of genomic predictions in selection candidates using multi-trait random regression model, which accounts for interactions between marker effects and breed of origin in the Nordic Red dairy cattle (RDC). The population structure of the RDC is admixed. Data consisted of individual animal breed proportions calculated from the full pedigree, deregressed proofs (DRP) of published estimated breeding values (EBV) for yield traits and genotypic data for 37,595 single nucleotide polymorphic markers. The analysed data included 3330 bulls in the reference population and 812 bulls that were used for validation. Direct genomic breeding values (DGV) were estimated using the model under study, which accounts for breed effects and also with GBLUP, which assume uniform population. Validation reliability was calculated as a coefficient of determination from weighted regression of DRP on DGV (rDRP,DGV 2), scaled by the mean reliability of DRP. Using the breed-specific model increased the reliability of DGV by 2 and 3% for milk and protein, respectively, when compared to homogeneous population GBLUP. The exception was for fat, where there was no gain in reliability. Estimated validation reliabilities were low for milk (0.32) and protein (0.32) and slightly higher (0.42) for fat.


Subject(s)
Breeding , Genetics, Population , Regression Analysis , Selection, Genetic , Animals , Cattle , Genotyping Techniques , High-Throughput Screening Assays , Milk/physiology , Models, Theoretical , Pedigree , Polymorphism, Single Nucleotide/genetics
10.
Heredity (Edinb) ; 109(4): 235-45, 2012 Oct.
Article in English | MEDLINE | ID: mdl-22805656

ABSTRACT

Accurate and fast estimation of genetic parameters that underlie quantitative traits using mixed linear models with additive and dominance effects is of great importance in both natural and breeding populations. Here, we propose a new fast adaptive Markov chain Monte Carlo (MCMC) sampling algorithm for the estimation of genetic parameters in the linear mixed model with several random effects. In the learning phase of our algorithm, we use the hybrid Gibbs sampler to learn the covariance structure of the variance components. In the second phase of the algorithm, we use this covariance structure to formulate an effective proposal distribution for a Metropolis-Hastings algorithm, which uses a likelihood function in which the random effects have been integrated out. Compared with the hybrid Gibbs sampler, the new algorithm had better mixing properties and was approximately twice as fast to run. Our new algorithm was able to detect different modes in the posterior distribution. In addition, the posterior mode estimates from the adaptive MCMC method were close to the REML (residual maximum likelihood) estimates. Moreover, our exponential prior for inverse variance components was vague and enabled the estimated mode of the posterior variance to be practically zero, which was in agreement with the support from the likelihood (in the case of no dominance). The method performance is illustrated using simulated data sets with replicates and field data in barley.


Subject(s)
Algorithms , Breeding/statistics & numerical data , Quantitative Trait, Heritable , Bayes Theorem , Computer Simulation , Likelihood Functions , Linear Models , Markov Chains , Models, Genetic , Monte Carlo Method
11.
Heredity (Edinb) ; 108(2): 134-46, 2012 Feb.
Article in English | MEDLINE | ID: mdl-21792229

ABSTRACT

A novel hierarchical quantitative trait locus (QTL) mapping method using a polynomial growth function and a multiple-QTL model (with no dependence in time) in a multitrait framework is presented. The method considers a population-based sample where individuals have been phenotyped (over time) with respect to some dynamic trait and genotyped at a given set of loci. A specific feature of the proposed approach is that, instead of an average functional curve, each individual has its own functional curve. Moreover, each QTL can modify the dynamic characteristics of the trait value of an individual through its influence on one or more growth curve parameters. Apparent advantages of the approach include: (1) assumption of time-independent QTL and environmental effects, (2) alleviating the necessity for an autoregressive covariance structure for residuals and (3) the flexibility to use variable selection methods. As a by-product of the method, heritabilities and genetic correlations can also be estimated for individual growth curve parameters, which are considered as latent traits. For selecting trait-associated loci in the model, we use a modified version of the well-known Bayesian adaptive shrinkage technique. We illustrate our approach by analysing a sub sample of 500 individuals from the simulated QTLMAS 2009 data set, as well as simulation replicates and a real Scots pine (Pinus sylvestris) data set, using temporal measurements of height as dynamic trait of interest.


Subject(s)
Human Development , Pinus/growth & development , Pinus/genetics , Quantitative Trait Loci , Bayes Theorem , Databases, Nucleic Acid , Genotype , Humans , Models, Genetic
12.
Heredity (Edinb) ; 107(5): 405-12, 2011 Oct.
Article in English | MEDLINE | ID: mdl-21712846

ABSTRACT

The successful implementation of Bayesian shrinkage analysis of high-dimensional regression models, as often encountered in quantitative trait locus (QTL) mapping, is contingent upon the choice of suitable sparsity-inducing priors. In practice, the shape (that is, the rate of tail decay) of such priors is typically preset, with no regard for the range of plausible alternatives and the fact that the most appropriate shape may depend on the data at hand. This study is presumably the first attempt to tackle this oversight through the shape-adaptive shrinkage prior (SASP) approach, with a focus on the mapping of QTLs in experimental crosses. Simulation results showed that the separation between genuine QTL effects and spurious ones can be made clearer using the SASP-based approach as compared with existing competitors. This feature makes our new method a promising approach to QTL mapping, where good separation is the ultimate goal. We also discuss a re-estimation procedure intended to improve the accuracy of the estimated genetic effects of detected QTLs with regard to shrinkage-induced bias, which may be particularly important in large-scale models with collinear predictors. The re-estimation procedure is relevant to any shrinkage method, and is potentially valuable for many scientific disciplines such as bioinformatics and quantitative genetics, where oversaturated models are booming.


Subject(s)
Bayes Theorem , Chromosome Mapping/methods , Models, Genetic , Quantitative Trait Loci , Algorithms , Computer Simulation , Gene Frequency , Genotype , Hordeum/genetics , Phenotype , Regression Analysis
13.
Heredity (Edinb) ; 106(4): 511-9, 2011 Apr.
Article in English | MEDLINE | ID: mdl-20628415

ABSTRACT

Population-based genomic association analyses are more powerful than within-family analyses. However, population stratification (unknown or ignored origin of individuals from multiple source populations) and cryptic relatedness (unknown or ignored covariance between individuals because of their relatedness) are confounding factors in population-based genomic association analyses, which inflate the false-positive rate. As a consequence, false association signals may arise in genomic data association analyses for reasons other than true association between the tested genomic factor (marker genotype, gene or protein expression) and the study phenotype. It is therefore important to correct or account for these confounders in population-based genomic data association analyses. The common correction techniques for population stratification and cryptic relatedness problems are presented here in the phenotype-marker association analysis context, and comments on their suitability for other types of genomic association analyses (for example, phenotype-expression association) are also provided. Even though many of these techniques have originally been developed in the context of human genetics, most of them are also applicable to model organisms and breeding populations.


Subject(s)
Genome-Wide Association Study/methods , Animals , Genetics, Population/methods , Humans , Models, Genetic
14.
Genet Res (Camb) ; 91(3): 193-207, 2009 Jun.
Article in English | MEDLINE | ID: mdl-19589189

ABSTRACT

In self-pollinating populations, individuals are characterized by a high degree of inbreeding. Additionally, phenotypic observations are highly influenced by genotype-by-environment interaction effects. Usually, Bayesian approaches to predict breeding values (in self-pollinating crops) omit genotype-by-environment interactions in the statistical model, which may result in biased estimates. In our study, a Bayesian Gibbs sampling algorithm was developed that is adapted to the high degree of inbreeding in self-pollinated crops and accounts for interaction effects between genotype and environment. As related lines are supposed to show similar genotype-by-environment interaction effects, an extended genetic relationship matrix is included in the Bayesian model. Additionally, since the coefficient matrix C in the mixed model equations can be characterized by rank deficiencies, the pseudoinverse of C was calculated by using the nullspace, which resulted in a faster computation time. In this study, field data of spring barley lines and data of a 'virtual' parental population of self-pollinating crops, generated by computer simulation, were used. For comparison, additional breeding values were predicted by a frequentist approach. In general, standard Bayesian Gibbs sampling and a frequentist approach resulted in similar estimates if heritability of the regarded trait was high. For low heritable traits, the modified Bayesian model, accounting for relatedness between lines in genotype-by-environment interaction, was superior to the standard model.


Subject(s)
Bayes Theorem , Breeding , Crops, Agricultural/physiology , Environment , Inbreeding , Models, Genetic , Computer Simulation , Genotype , Models, Statistical , Pollination
15.
Heredity (Edinb) ; 103(3): 223-37, 2009 Sep.
Article in English | MEDLINE | ID: mdl-19455182

ABSTRACT

For small pedigrees, the issue of correcting for known or estimated relatedness structure in population-based Bayesian multilocus association analysis is considered. Two such relatedness corrections: [1] a random term arising from the infinite polygenic model and [2] a fixed covariate following the class D model of Bonney, are compared with the case of no correction using both simulated and real marker and gene-expression data from lymphoblastoid cell lines from four CEPH families. This comparison is performed with clinical quantitative trait locus (cQTL) models-multilocus association models where marker data and expression levels of gene transcripts as well as possible genotype x expression interaction terms are jointly used to explain quantitative trait variation. We found out that regardless of having a correction term in the model, the cQTL-models fit a few extra small-effect components (similar to finite polygenic models) which itself serves as a relatedness correction. For small data and small heritability one may use the covariate model, which clearly outperforms the infinite polygenic model in small data examples.


Subject(s)
Genomics , Models, Genetic , Bayes Theorem , Cell Line, Tumor , Genome , Humans , Neoplasms/genetics , Quantitative Trait Loci
16.
Theor Appl Genet ; 119(1): 105-23, 2009 Jun.
Article in English | MEDLINE | ID: mdl-19363603

ABSTRACT

A common difficulty in mapping quantitative trait loci (QTLs) is that QTL effects may show environment specificity and thus differ across environments. Furthermore, quantitative traits are likely to be influenced by multiple QTLs or genes having different effect sizes. There is currently a need for efficient mapping strategies to account for both multiple QTLs and marker-by-environment interactions. Thus, the objective of our study was to develop a Bayesian multi-locus multi-environmental method of QTL analysis. This strategy is compared to (1) Bayesian multi-locus mapping, where each environment is analysed separately, (2) Restricted Maximum Likelihood (REML) single-locus method using a mixed hierarchical model, and (3) REML forward selection applying a mixed hierarchical model. For this study, we used data on multi-environmental field trials of 301 BC(2)DH lines derived from a cross between the spring barley elite cultivar Scarlett and the wild donor ISR42-8 from Israel. The lines were genotyped by 98 SSR markers and measured for the agronomic traits "ears per m(2)," "days until heading," "plant height," "thousand grain weight," and "grain yield". Additionally, a simulation study was performed to verify the QTL results obtained in the spring barley population. In general, the results of Bayesian QTL mapping are in accordance with REML methods. In this study, Bayesian multi-locus multi-environmental analysis is a valuable method that is particularly suitable if lines are cultivated in multi-environmental field trials.


Subject(s)
Bayes Theorem , Environment , Hordeum/genetics , Inbreeding , Likelihood Functions , Models, Genetic , Quantitative Trait Loci/genetics , Chromosome Mapping , Chromosomes, Plant , Computer Simulation , Genetic Variation
17.
Heredity (Edinb) ; 101(3): 271-84, 2008 Sep.
Article in English | MEDLINE | ID: mdl-18648392

ABSTRACT

Previous articles have presented clinical quantitative trait locus (cQTL) models, where the information provided by quantitative/qualitative phenotypes, molecular markers and gene expressions (transcription levels) were combined and analyzed simultaneously. Because of financial constraints, marker data may be available for much larger group of individuals than expression data. However, it is desirable to use all the available information. We therefore extend such approaches by presenting a reliable missing data model for the case when marker data is more complete (that is, has many fewer missing entries). In the suggested hierarchical model, an expression QTL (eQTL) model (which is essentially our missing data model) is part of the larger cQTL model and it represents a Bayesian model-based method for estimating cis- and trans-acting regulatory effects for multiple (typically hundreds of) expression phenotypes simultaneously. The modeling dependence between transcripts in the eQTL model is also considered. The method is based on presenting data in the form of marker gene pairs, for which the presence of regulatory effect (link) can be hypothesized. These marker gene pairs can be obtained from oligonucleotide arrays or created using information available on known pathways or previous eQTL/allelic expression studies. The estimation of the model parameters (such as presence/absence of regulation, eQTL/cQTL effects and proportion of eQTLs and cQTLs among the set of marker gene pairs) as well as the handling of missing data is performed using Markov Chain Monte Carlo (MCMC) sampling. The method is illustrated using both simulated and real data.


Subject(s)
Models, Genetic , Quantitative Trait Loci , Bayes Theorem , Computer Simulation , Data Interpretation, Statistical , Gene Expression , Genes, Fungal , Genetic Markers , Genotype , Humans , Markov Chains , Monte Carlo Method , Phenotype , Saccharomyces cerevisiae/genetics
18.
Genomics ; 92(6): 384-92, 2008 Dec.
Article in English | MEDLINE | ID: mdl-18657605

ABSTRACT

We are studying variable selection in multiple regression models in which molecular markers and/or gene-expression measurements as well as intensity measurements from protein spectra serve as predictors for the outcome variable (i.e., trait or disease state). Finding genetic biomarkers and searching genetic-epidemiological factors can be formulated as a statistical problem of variable selection, in which, from a large set of candidates, a small number of trait-associated predictors are identified. We illustrate our approach by analyzing the data available for chronic fatigue syndrome (CFS). CFS is a complex disease from several aspects, e.g., it is difficult to diagnose and difficult to quantify. To identify biomarkers we used microarray data and SELDI-TOF-based proteomics data. We also analyzed genetic marker information for a large number of SNPs for an overlapping set of individuals. The objectives of the analyses were to identify markers specific to fatigue that are also possibly exclusive to CFS. The use of such models can be motivated, for example, by the search for new biomarkers for the diagnosis and prognosis of cancer and measures of response to therapy. Generally, for this we use Bayesian hierarchical modeling and Markov Chain Monte Carlo computation.


Subject(s)
Biomarkers/analysis , Fatigue Syndrome, Chronic/metabolism , Mathematical Computing , Models, Biological , Proteomics/statistics & numerical data , Bayes Theorem , Fatigue Syndrome, Chronic/genetics , Gene Frequency , Genetic Carrier Screening , Genetic Markers , Humans , Markov Chains , Monte Carlo Method , Polymorphism, Single Nucleotide , Regression Analysis
19.
Heredity (Edinb) ; 97(1): 4-18, 2006 Jul.
Article in English | MEDLINE | ID: mdl-16670709

ABSTRACT

A novel Bayesian gene mapping method, which can simultaneously utilize both molecular marker and gene expression data, is introduced. The approach enables a quantitative or qualitative phenotype to be expressed as a linear combination of the marker genotypes, gene expression levels, and possible genotype x gene expression interactions. The interaction data, given as marker-gene pairs, contains possible in cis and in trans effects obtained from earlier allelic expression studies, genetical genomics studies, biological hypotheses, or known pathways. The method is presented for an inbred line cross design and can be easily generalized to handle other types of populations and designs. The model selection is based on the use of effect-specific variance components combined with Jeffreys' non-informative prior--the method operates by adaptively shrinking marker, expression, and interaction effects toward zero so that non-negligible effects are expected to occur only at very few positions. The estimation of the model parameters and the handling of missing genotype or expression data is performed via Markov chain Monte Carlo sampling. The potential of the method including heritability estimation is presented using simulated examples and novel summary statistics. The method is also applied to a real yeast data set with known pathways.


Subject(s)
Bayes Theorem , Chromosome Mapping/methods , Models, Genetic , Quantitative Trait, Heritable , Gene Expression , Genetic Markers , Genotype , Humans , Inbreeding , Markov Chains , Monte Carlo Method , Phenotype , Regression Analysis , Saccharomyces cerevisiae/genetics
20.
Genome ; 48(4): 619-25, 2005 Aug.
Article in English | MEDLINE | ID: mdl-16094429

ABSTRACT

The first genetic linkage map for silver birch, Betula pendula Roth, was constructed by using a pseudo-testcross mapping strategy and integration of linkage data from 3 unrelated 2-generation pedigrees. The map is based on the genetic inheritance and segregation of 82 amplified fragment length polyhmorphisms and 19 microsatellite markers, and was constructed by simultaneously comparing the performance of CRI-MAP and OUTMAP packages. The analysis revealed 16 linkage groups, and the total map coverage is 1561 cM (Kosambi units). Average map distance between adjacent markers is 15.5 cM. Linkage groups range between 6 and 18 loci and from 81.2 to 326.5 cM; the remaining 9 linkage groups consist of 2 or 3 loci ranging from 6.3 to 42.4 cM. The uncertainty of the map is illustrated with sensitivity analysis. This initial map can serve as a basis for developing a more detailed genetic map.


Subject(s)
Chromosome Mapping/methods , Microsatellite Repeats/genetics , Nucleic Acid Amplification Techniques/methods , Algorithms , Chromosomes, Plant/genetics , Genetic Linkage , Genome, Plant
SELECTION OF CITATIONS
SEARCH DETAIL
...