Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 18 de 18
Filtrar
1.
Sci Rep ; 14(1): 14373, 2024 06 22.
Artigo em Inglês | MEDLINE | ID: mdl-38909151

RESUMO

Continued spread of chronic wasting disease (CWD) through wild cervid herds negatively impacts populations, erodes wildlife conservation, drains resource dollars, and challenges wildlife management agencies. Risk factors for CWD have been investigated at state scales, but a regional model to predict locations of new infections can guide increasingly efficient surveillance efforts. We predicted CWD incidence by county using CWD surveillance data depicting white-tailed deer (Odocoileus virginianus) in 16 eastern and midwestern US states. We predicted the binary outcome of CWD-status using four machine learning models, utilized five-fold cross-validation and grid search to pinpoint the best model, then compared model predictions against the subsequent year of surveillance data. Cross validation revealed that the Light Boosting Gradient model was the most reliable predictor given the regional data. The predictive model could be helpful for surveillance planning. Predictions of false positives emphasize areas that warrant targeted CWD surveillance because of similar conditions with counties known to harbor CWD. However, disagreements in positives and negatives between the CWD Prediction Web App predictions and the on-the-ground surveillance data one year later underscore the need for state wildlife agency professionals to use a layered modeling approach to ensure robust surveillance planning. The CWD Prediction Web App is at https://cwd-predict.streamlit.app/ .


Assuntos
Cervos , Aprendizado de Máquina , Doença de Emaciação Crônica , Animais , Doença de Emaciação Crônica/epidemiologia , Doença de Emaciação Crônica/diagnóstico , Animais Selvagens , Estados Unidos/epidemiologia , Incidência
2.
Artigo em Inglês | MEDLINE | ID: mdl-33431415

RESUMO

Multidrug resistance (MDR) surveillance consists of reporting MDR prevalence and MDR phenotypes. Detailed knowledge of the specific associations underlying MDR patterns can allow antimicrobial stewardship programs to accurately identify clinically relevant resistance patterns. We applied machine learning and graphical networks to quantify and visualize associations between resistance traits in a set of 1,091 Staphylococcus aureus isolates collected from one New York hospital between 2008 and 2018. Antimicrobial susceptibility testing was performed using reference broth microdilution. The isolates were analyzed by year, methicillin susceptibility, and infection site. Association mining was used to identify resistance patterns that consisted of two or more individual antimicrobial resistance (AMR) traits and quantify the association among the individual resistance traits in each pattern. The resistance patterns captured the majority of the most common MDR phenotypes and reflected previously identified pairwise relationships between AMR traits in S. aureus Associations between ß-lactams and other antimicrobial classes (macrolides, lincosamides, and fluoroquinolones) were common, although the strength of the association among these antimicrobial classes varied by infection site and by methicillin susceptibility. Association mining identified associations between clinically important AMR traits, which could be further investigated for evidence of resistance coselection. For example, in skin and skin structure infections, clindamycin and tetracycline resistance occurred together 1.5 times more often than would be expected if they were independent from one another. Association mining efficiently discovered and quantified associations among resistance traits, allowing these associations to be compared between relevant subsets of isolates to identify and track clinically relevant MDR.


Assuntos
Infecções Estafilocócicas , Staphylococcus aureus , Antibacterianos/farmacologia , Antibacterianos/uso terapêutico , Farmacorresistência Bacteriana , Resistência a Múltiplos Medicamentos , Humanos , Aprendizado de Máquina , Testes de Sensibilidade Microbiana , New York , Infecções Estafilocócicas/tratamento farmacológico , Staphylococcus aureus/genética
3.
Nat Genet ; 52(10): 1067-1075, 2020 10.
Artigo em Inglês | MEDLINE | ID: mdl-32958950

RESUMO

Distal enhancers play pivotal roles in development and disease yet remain one of the least understood regulatory elements. We used massively parallel reporter assays to perform functional comparisons of two leading enhancer models and find that gene-distal transcription start sites are robust predictors of active enhancers with higher resolution than histone modifications. We show that active enhancer units are precisely delineated by active transcription start sites, validate that these boundaries are sufficient for capturing enhancer function, and confirm that core promoter sequences are necessary for this activity. We assay adjacent enhancers and find that their joint activity is often driven by the stronger unit within the cluster. Finally, we validate these results through functional dissection of a distal enhancer cluster using CRISPR-Cas9 deletions. In summary, definition of high-resolution enhancer boundaries enables deconvolution of complex regulatory loci into modular units.


Assuntos
Elementos Facilitadores Genéticos/genética , Código das Histonas/genética , Sítio de Iniciação de Transcrição , Transcrição Gênica , Linhagem Celular , Repetições Palindrômicas Curtas Agrupadas e Regularmente Espaçadas/genética , Humanos , Regiões Promotoras Genéticas/genética , Processamento de Proteína Pós-Traducional/genética , Iniciação da Transcrição Genética
5.
J Comput Graph Stat ; 29(3): 535-546, 2020.
Artigo em Inglês | MEDLINE | ID: mdl-38919169

RESUMO

A new empirical Bayes approach to variable selection in the context of generalized linear models is developed. The proposed algorithm scales to situations in which the number of putative explanatory variables is very large, possibly much larger than the number of responses. The coefficients in the linear predictor are modeled as a three-component mixture allowing the explanatory variables to have a random positive effect on the response, a random negative effect, or no effect. A key assumption is that only a small (but unknown) fraction of the candidate variables have a non-zero effect. This assumption, in addition to treating the coefficients as random effects facilitates an approach that is computationally efficient. In particular, the number of parameters that have to be estimated is small, and remains constant regardless of the number of explanatory variables. The model parameters are estimated using a Generalized Alternating Maximization algorithm which is scalable, and leads to significantly faster convergence compared with simulation-based fully Bayesian methods.

6.
Cell Host Microbe ; 25(4): 553-564.e7, 2019 Apr 10.
Artigo em Inglês | MEDLINE | ID: mdl-30974084

RESUMO

Host genetic variation influences microbiome composition. While studies have focused on associations between the gut microbiome and specific alleles, gene copy number (CN) also varies. We relate microbiome diversity to CN variation of the AMY1 locus, which encodes salivary amylase, facilitating starch digestion. After imputing AMY1-CN for ∼1,000 subjects, we identified taxa differentiating fecal microbiomes of high and low AMY1-CN hosts. In a month-long diet intervention study, we show that diet standardization drove gut microbiome convergence, and AMY1-CN correlated with oral and gut microbiome composition and function. The microbiomes of low-AMY1-CN subjects had enhanced capacity to break down complex carbohydrates. High-AMY1-CN subjects had higher levels of salivary Porphyromonas; their gut microbiota had increased abundance of resistant starch-degrading microbes, produced higher levels of short-chain fatty acids, and drove higher adiposity when transferred to germ-free mice. This study establishes AMY1-CN as a genetic factor associated with microbiome composition and function.


Assuntos
Amilases/genética , Trato Gastrointestinal/microbiologia , Dosagem de Genes , Microbiota , Boca/microbiologia , Saliva/enzimologia , Animais , Vida Livre de Germes , Humanos , Camundongos
7.
Prev Vet Med ; 167: 137-145, 2019 Jun 01.
Artigo em Inglês | MEDLINE | ID: mdl-30952439

RESUMO

Multidrug resistance is a serious problem raising the specter of infections for which there is no treatment. One of the most important tools in combating multidrug resistance is large scale monitoring programs, because they track resistance over large geographic areas and time scales. This large scope, however, can also introduce variability into the data. The primary monitoring program in the United States is the National Antimicrobial Resistance Monitoring System (NARMS). This study examines the variability of a previously identified resistance pattern in Escherichia coli among ampicillin, gentamicin, sulfisoxazole, and tetracycline using samples isolated from chicken during the years 2004 to 2006 and 2008 to 2012. 2007 is excluded because sulfisozaxole resistance was not measured at slaughter that year. To assess variability in this resistance pattern susceptibility/resistance contingency tables were constructed for each of the 15 combinations of the 4 drugs for each of the years. For each table, variability across the years was assessed at the full table multinomial level as a measure of general variability of the resistance pattern and at the level of the highest order interaction term in a log-linear model of the table as a measure of variability in that particular component of the resistance pattern. A power analysis using the traditional asymptotic normal approximation and one using a Dirichlet-multinomial simulation were carried out to determine the effect of variation on ability to detect nonzero highest order loglinear model terms and the validity of the normal approximation in carrying out such tests. All tables exhibit overdispersion at the multinomial level and in their highest order model parameters. The normal approximation performs well for large sample sizes, low levels of dispersion, and small log-linear model parameters. The approximation breaks down as dispersion or the log linear model parameter grows or sample size shrinks. Taken together these analyses indicate that the level of variability in the NARMS dataset makes it difficult to detect multidrug resistance patterns at the current level of sample collection. In order to better control this dispersion NARMS could collect more variables on each of the samples.


Assuntos
Antibacterianos/farmacologia , Farmacorresistência Bacteriana Múltipla , Escherichia coli/efeitos dos fármacos , Carne/microbiologia , Animais , Galinhas , Microbiologia de Alimentos , Testes de Sensibilidade Microbiana , Estados Unidos
8.
Front Microbiol ; 10: 687, 2019.
Artigo em Inglês | MEDLINE | ID: mdl-31031716

RESUMO

Using multiple antimicrobials in food animals may incubate genetically-linked multidrug-resistance (MDR) in enteric bacteria, which can contaminate meat at slaughter. The U.S. National Antimicrobial Resistance Monitoring System tested 21,243 chicken-associated Escherichia coli between 2004 and 2012 for resistance to 15 antimicrobials, resulting in >32,000 possible MDR patterns. We analyzed MDR patterns in this dataset with association rule mining, also called market-basket analysis. The association rules were pruned with four quality measures resulting in a <1% false-discovery rate. MDR rules were more stable across consecutive years than between slaughter and retail. Rules were decomposed into networks with antimicrobials as nodes and rules as edges. A strong subnetwork of beta-lactam resistance existed in each year and the beta-lactam resistances also had strong associations with sulfisoxazole, gentamicin, streptomycin and tetracycline resistances. The association rules concur with previously identified E. coli resistance patterns but provide significant flexibility for studying MDR in large datasets.

9.
Prev Vet Med ; 152: 81-88, 2018 Apr 01.
Artigo em Inglês | MEDLINE | ID: mdl-29559109

RESUMO

The growth of antimicrobial resistance presents a significant threat to human and animal health. Of particular concern is multi-drug resistance, as this increases the chances an infection will be untreatable by any antibiotic. In order to understand multi-drug resistance, it is essential to understand the association between drug resistances. Pairwise associations characterize the connectivity between resistances and are useful in making decisions about courses of treatment, or the design of drug cocktails. Higher-order associations, interactions, which tie together groups of drugs can suggest commonalities in resistance mechanism and lead to their identification. To capture interactions, we apply log-linear models of contingency tables to analyze publically available data on the resistance of Escheresia coli isolated from chicken and turkey meat by the National Antimicrobial Resistance Monitoring System. Standard large sample and conditional exact testing approaches for assessing significance of parameters in these models breakdown due to structured patterns inherent to antimicrobial resistance. To address this, we adopt a Bayesian approach which reveals that E. coli resistance associations can be broken into two subnetworks. The first subnetwork is characterized by a hierarchy of ß-lactams which is consistent across the chicken and turkey datasets. Tier one in this hierarchy is a near equivalency between amoxicillin-clavulanic acid, ceftriaxone and cefoxitin. Susceptibility to tier one then implies susceptibility to ceftiofur. The second subnetwork is characterized by more complex interactions between a variety of drug classes that vary between the chicken and turkey datasets.


Assuntos
Antibacterianos/farmacologia , Farmacorresistência Bacteriana , Escherichia coli/efeitos dos fármacos , Microbiologia de Alimentos , Carne/microbiologia , Animais , Teorema de Bayes , Galinhas , Modelos Lineares , Perus
10.
PLoS One ; 12(6): e0179530, 2017.
Artigo em Inglês | MEDLINE | ID: mdl-28662051

RESUMO

With the surge of interest in metabolism and the appreciation of its diverse roles in numerous biomedical contexts, the number of metabolomics studies using liquid chromatography coupled to mass spectrometry (LC-MS) approaches has increased dramatically in recent years. However, variation that occurs independently of biological signal and noise (i.e. batch effects) in metabolomics data can be substantial. Standard protocols for data normalization that allow for cross-study comparisons are lacking. Here, we investigate a number of algorithms for batch effect correction and differential abundance analysis, and compare their performance. We show that linear mixed effects models, which account for latent (i.e. not directly measurable) factors, produce satisfactory results in the presence of batch effects without the need for internal controls or prior knowledge about the nature and sources of unwanted variation in metabolomics data. We further introduce an algorithm-RRmix-within the family of latent factor models and illustrate its suitability for differential abundance analysis in the presence of strong batch effects. Together this analysis provides a framework for systematically standardizing metabolomics data.


Assuntos
Cromatografia Líquida/métodos , Espectrometria de Massas/métodos , Metabolômica/métodos , Algoritmos , Linhagem Celular Tumoral , Humanos , Padrões de Referência
11.
PLoS Comput Biol ; 12(11): e1005160, 2016 Nov.
Artigo em Inglês | MEDLINE | ID: mdl-27851767

RESUMO

Surveillance of antimicrobial resistance (AMR) is an important component of public health. Antimicrobial drug use generates selective pressure that may lead to resistance against to the administered drug, and may also select for collateral resistances to other drugs. Analysis of AMR surveillance data has focused on resistance to individual drugs but joint distributions of resistance in bacterial populations are infrequently analyzed and reported. New methods are needed to characterize and communicate joint resistance distributions. Markov networks are a class of graphical models that define connections, or edges, between pairs of variables with non-zero partial correlations and are used here to describe AMR resistance relationships. The graphical least absolute shrinkage and selection operator is used to estimate sparse Markov networks from AMR surveillance data. The method is demonstrated using a subset of Escherichia coli isolates collected by the National Antimicrobial Resistance Monitoring System between 2004 and 2012 which included AMR results for 16 drugs from 14418 isolates. Of the 119 possible unique edges, 33 unique edges were identified at least once during the study period and graphical density ranged from 16.2% to 24.8%. Two frequent dense subgraphs were noted, one containing the five ß-lactam drugs and the other containing both sulfonamides, three aminoglycosides, and tetracycline. Density did not appear to change over time (p = 0.71). Unweighted modularity did not appear to change over time (p = 0.18), but a significant decreasing trend was noted in the modularity of the weighted networks (p < 0.005) indicating relationships between drugs of different classes tended to increase in strength and frequency over time compared to relationships between drugs of the same class. The current method provides a novel method to study the joint resistance distribution, but additional work is required to unite the underlying biological and genetic characteristics of the isolates with the current results derived from phenotypic data.


Assuntos
Antibacterianos/uso terapêutico , Infecções Bacterianas/microbiologia , Farmacorresistência Bacteriana , Escherichia coli/efeitos dos fármacos , Escherichia coli/isolamento & purificação , Vigilância da População/métodos , Infecções Bacterianas/epidemiologia , Simulação por Computador , Humanos , Cadeias de Markov , Modelos Estatísticos , Prevalência , Reprodutibilidade dos Testes , Medição de Risco/métodos , Sensibilidade e Especificidade , Resultado do Tratamento
12.
Antimicrob Agents Chemother ; 60(9): 5302-11, 2016 09.
Artigo em Inglês | MEDLINE | ID: mdl-27324772

RESUMO

In response to concerning increases in antimicrobial resistance (AMR), the Food and Drug Administration (FDA) has decided to increase veterinary oversight requirements for antimicrobials and restrict their use in growth promotion. Given the high stakes of this policy for the food supply, economy, and human and veterinary health, it is important to rigorously assess the effects of this policy. We have undertaken a detailed analysis of data provided by the National Antimicrobial Resistance Monitoring System (NARMS). We examined the trends in both AMR proportion and MIC between 2004 and 2012 at slaughter and retail stages. We investigated the makeup of variation in these data and estimated the sample and effect size requirements necessary to distinguish an effect of the policy change. Finally, we applied our approach to take a detailed look at the 2005 withdrawal of approval for the fluoroquinolone enrofloxacin in poultry water. Slaughter and retail showed similar trends. Both AMR proportion and MIC were valuable in assessing AMR, capturing different information. Most variation was within years, not between years, and accounting for geographic location explained little additional variation. At current rates of data collection, a 1-fold change in MIC should be detectable in 5 years and a 6% decrease in percent resistance could be detected in 6 years following establishment of a new resistance rate. Analysis of the enrofloxacin policy change showed the complexities of the AMR policy with no statistically significant change in resistance of both Campylobacter jejuni and Campylobacter coli to ciprofloxacin, another second-generation fluoroquinolone.


Assuntos
Antibacterianos/farmacologia , Farmacorresistência Bacteriana Múltipla , Microbiologia de Alimentos , Carne/microbiologia , Aves Domésticas/microbiologia , Matadouros/legislação & jurisprudência , Análise de Variância , Animais , Campylobacter coli/efeitos dos fármacos , Campylobacter coli/crescimento & desenvolvimento , Campylobacter jejuni/efeitos dos fármacos , Campylobacter jejuni/crescimento & desenvolvimento , Bovinos , Ciprofloxacina/farmacologia , Enrofloxacina , Escherichia coli/efeitos dos fármacos , Escherichia coli/crescimento & desenvolvimento , Fluoroquinolonas/farmacologia , Manipulação de Alimentos/legislação & jurisprudência , Abastecimento de Alimentos/legislação & jurisprudência , Humanos , Testes de Sensibilidade Microbiana , Salmonella typhimurium/efeitos dos fármacos , Salmonella typhimurium/crescimento & desenvolvimento , Suínos , Estados Unidos , United States Food and Drug Administration/legislação & jurisprudência
13.
PLoS Comput Biol ; 8(12): e1002806, 2012.
Artigo em Inglês | MEDLINE | ID: mdl-23236270

RESUMO

We present an approach for identifying genes under natural selection using polymorphism and divergence data from synonymous and non-synonymous sites within genes. A generalized linear mixed model is used to model the genome-wide variability among categories of mutations and estimate its functional consequence. We demonstrate how the model's estimated fixed and random effects can be used to identify genes under selection. The parameter estimates from our generalized linear model can be transformed to yield population genetic parameter estimates for quantities including the average selection coefficient for new mutations at a locus, the synonymous and non-synynomous mutation rates, and species divergence times. Furthermore, our approach incorporates stochastic variation due to the evolutionary process and can be fit using standard statistical software. The model is fit in both the empirical Bayes and Bayesian settings using the lme4 package in R, and Markov chain Monte Carlo methods in WinBUGS. Using simulated data we compare our method to existing approaches for detecting genes under selection: the McDonald-Kreitman test, and two versions of the Poisson random field based method MKprf. Overall, we find our method universally outperforms existing methods for detecting genes subject to selection using polymorphism and divergence data.


Assuntos
Distribuição de Poisson , Seleção Genética , Simulação por Computador , Humanos , Mutação , Polimorfismo Genético , Processos Estocásticos
14.
Stat Appl Genet Mol Biol ; 11(1): Article 8, 2012 Jan 06.
Artigo em Inglês | MEDLINE | ID: mdl-22499680

RESUMO

Testing for unequal variances is usually performed in order to check the validity of the assumptions that underlie standard tests for differences between means (the t-test and anova). However, existing methods for testing for unequal variances (Levene's test and Bartlett's test) are notoriously non-robust to normality assumptions, especially for small sample sizes. Moreover, although these methods were designed to deal with one hypothesis at a time, modern applications (such as to microarrays and fMRI experiments) often involve parallel testing over a large number of levels (genes or voxels). Moreover, in these settings a shift in variance may be biologically relevant, perhaps even more so than a change in the mean. This paper proposes a parsimonious model for parallel testing of the equal variance hypothesis. It is designed to work well when the number of tests is large; typically much larger than the sample sizes. The tests are implemented using an empirical Bayes estimation procedure which `borrows information' across levels. The method is shown to be quite robust to deviations from normality, and to substantially increase the power to detect differences in variance over the more traditional approaches even when the normality assumption is valid.


Assuntos
Modelos Estatísticos , Análise de Variância , Teorema de Bayes , Perfilação da Expressão Gênica , Análise de Sequência com Séries de Oligonucleotídeos/métodos , Tamanho da Amostra
15.
Mol Cell Proteomics ; 10(8): M110.007203, 2011 Aug.
Artigo em Inglês | MEDLINE | ID: mdl-21602509

RESUMO

Recent developments in mass-spectrometry-based shotgun proteomics, especially methods using spectral counting, have enabled large-scale identification and differential profiling of complex proteomes. Most such proteomic studies are interested in identifying proteins, the abundance of which is different under various conditions. Several quantitative methods have recently been proposed and implemented for this purpose. Building on some techniques that are now widely accepted in the microarray literature, we developed and implemented a new method using a Bayesian model to calculate posterior probabilities of differential abundance for thousands of proteins in a given experiment simultaneously. Our Bayesian model is shown to deliver uniformly superior performance when compared with several existing methods.


Assuntos
Teorema de Bayes , Modelos Biológicos , Proteoma/metabolismo , Proteínas de Saccharomyces cerevisiae/metabolismo , Software , Interpretação Estatística de Dados , Humanos , Funções Verossimilhança , Cadeias de Markov , Método de Monte Carlo , Proteoma/química , Proteômica , Curva ROC , Padrões de Referência , Saccharomyces cerevisiae , Proteínas de Saccharomyces cerevisiae/química , Espectrometria de Massas em Tandem/normas
16.
BMJ ; 337: a568, 2008 Jul 31.
Artigo em Inglês | MEDLINE | ID: mdl-18669565

RESUMO

OBJECTIVE: To measure the effect of free access to the scientific literature on article downloads and citations. DESIGN: Randomised controlled trial. SETTING: 11 journals published by the American Physiological Society. PARTICIPANTS: 1619 research articles and reviews. MAIN OUTCOME MEASURES: Article readership (measured as downloads of full text, PDFs, and abstracts) and number of unique visitors (internet protocol addresses). Citations to articles were gathered from the Institute for Scientific Information after one year. INTERVENTIONS: Random assignment on online publication of articles published in 11 scientific journals to open access (treatment) or subscription access (control). RESULTS: Articles assigned to open access were associated with 89% more full text downloads (95% confidence interval 76% to 103%), 42% more PDF downloads (32% to 52%), and 23% more unique visitors (16% to 30%), but 24% fewer abstract downloads (-29% to -19%) than subscription access articles in the first six months after publication. Open access articles were no more likely to be cited than subscription access articles in the first year after publication. Fifty nine per cent of open access articles (146 of 247) were cited nine to 12 months after publication compared with 63% (859 of 1372) of subscription access articles. Logistic and negative binomial regression analysis of article citation counts confirmed no citation advantage for open access articles. CONCLUSIONS: Open access publishing may reach more readers than subscription access publishing. No evidence was found of a citation advantage for open access articles in the first year after publication. The citation advantage from open access reported widely in the literature may be an artefact of other causes.


Assuntos
Acesso à Informação , Disseminação de Informação , Armazenamento e Recuperação da Informação/estatística & dados numéricos , Publicações Periódicas como Assunto/estatística & dados numéricos , Internet/estatística & dados numéricos , Análise de Regressão
17.
Bioinformatics ; 24(6): 874-5, 2008 Mar 15.
Artigo em Inglês | MEDLINE | ID: mdl-18245126

RESUMO

MOTIVATION: The program MBBC 2.0 clusters time-course microarray data using a Bayesian product partition model. RESULTS: The Bayesian product partition model in Booth et al. (2007) simultaneously searches for the optimal number of clusters, and assigns cluster memberships based on temporal changes of gene expressions. MBBC 2.0 to makes this method easily available for statisticians and scientists, and is built with three free computer language software packages: Ox, R and C++, taking advantage of the strengths of each language. Within MBBC, the search algorithm is implemented with Ox and resulting graphs are drawn with R. A user-friendly graphical interface is built with C++ to run the Ox and R programs internally. Thus, MBBC users are not required to know how to use Ox, R or C++, but they must be pre-installed. AVAILABILITY: A self-extractable zip file, MBBC20zip.exe, is available at the MBBC webpage www.stat.ufl.edu/~casella/mbbc/, which contains MBBC.exe, source files, and all other related files. The current version works only in the Windows operating system. A free installation program and overview for Ox is available at www.doornik.com. A detailed installation guide for Ox is provided by MBBC, and is accessible without installing Ox. R is available at www.r-project.org/.


Assuntos
Algoritmos , Inteligência Artificial , Análise por Conglomerados , Perfilação da Expressão Gênica/métodos , Modelos Biológicos , Análise de Sequência com Séries de Oligonucleotídeos/métodos , Reconhecimento Automatizado de Padrão/métodos , Teorema de Bayes , Simulação por Computador
18.
BMC Genomics ; 9: 31, 2008 Jan 23.
Artigo em Inglês | MEDLINE | ID: mdl-18215294

RESUMO

BACKGROUND: Simple sequence repeats (SSRs) have been successfully used for various genetic and evolutionary studies in eukaryotic systems. The eukaryotic model organism Neurospora crassa is an excellent system to study evolution and biological function of SSRs. RESULTS: We identified and characterized 2749 SSRs of 963 SSR types in the genome of N. crassa. The distribution of tri-nucleotide (nt) SSRs, the most common SSRs in N. crassa, was significantly biased in exons. We further characterized the distribution of 19 abundant SSR types (AST), which account for 71% of total SSRs in the N. crassa genome, using a Poisson log-linear model. We also characterized the size variation of SSRs among natural accessions using Polymorphic Index Content (PIC) and ANOVA analyses and found that there are genome-wide, chromosome-dependent and local-specific variations. Using polymorphic SSRs, we have built linkage maps from three line-cross populations. CONCLUSION: Taking our computational, statistical and experimental data together, we conclude that 1) the distributions of the SSRs in the sequenced N. crassa genome differ systematically between chromosomes as well as between SSR types, 2) the size variation of tri-nt SSRs in exons might be an important mechanism in generating functional variation of proteins in N. crassa, 3) there are different levels of evolutionary forces in variation of amino acid repeats, and 4) SSRs are stable molecular markers for genetic studies in N. crassa.


Assuntos
Evolução Molecular , Repetições de Microssatélites , Neurospora crassa/genética , Polimorfismo Genético , Etiquetas de Sequências Expressas , Marcadores Genéticos , Genoma Fúngico
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...