ABSTRACT
Biologists currently have an assortment of high-throughput sequencing techniques allowing the study of population dynamics in increasing detail. The utility of genetic estimates depends on their ability to recover meaningful approximations while filtering out noise produced by artifacts. In this study, we empirically compared the congruence of two reduced representation approaches (genotyping-by-sequencing, GBS, and whole-exome sequencing, WES) in estimating genetic diversity and population structure using SNP markers typed in a small number of wild jaguar (Panthera onca) samples from South America. Due to its targeted nature, WES allowed for a more straightforward reconstruction of loci compared to GBS, facilitating the identification of true polymorphisms across individuals. We therefore used WES-derived metrics as a benchmark against which GBS-derived indicators were compared, adjusting parameters for locus assembly and SNP filtering in the latter. We observed significant variation in SNP call rates across samples in GBS datasets, leading to a recurrent miscalling of heterozygous sites. This issue was further amplified by small sample sizes, ultimately impacting the consistency of summary statistics between genotyping methods. Recognizing that the genetic markers obtained from GBS and WES are intrinsically different due to varying evolutionary pressures, particularly selection, we consider that our empirical comparison offers valuable insights and highlights critical considerations for estimating population genetic attributes using reduced representation datasets. Our results emphasize the critical need for careful evaluation of missing data and stringent filtering to achieve reliable estimates of genetic diversity and differentiation in elusive wildlife species.
ABSTRACT
Bacterial pustule (BP), caused by Xanthomonas citri pv. glycines, is an important disease that, under favorable conditions, can drastically affect soybean production. We performed a genome-wide association study (GWAS) with a panel containing Brazilian and American cultivars, which were screened qualitatively and quantitatively against two Brazilian X. citri isolates (IBS 333 and IBS 327). The panel was genotyped using a genotyping by sequencing (GBS) approach, and we identified two main new regions in soybeans associated with X. citri resistance on chromosomes 6 (IBS 333) and 18 (IBS 327), different from the traditional rxp gene located on chromosome 17. The region on chromosome 6 was also detected by QTL mapping using a biparental cross between Williams 82 (R) and PI 416937 (S), showing that Williams 82 has another recessive resistance gene besides rxp, which was also detected in nine BP-resistant ancestors of the Brazilian cultivars (including CNS, S-100), based on haplotype analysis. Furthermore, we identified additional SNPs in strong LD (0.8) with peak SNPs by exploring variation available in WGS (whole genome sequencing) data among 31 soybean accessions. In these regions in strong LD, two candidate resistance genes were identified (Glyma.06g311000 and Glyma.18g025100) for chromosomes 6 and 18, respectively. Therefore, our results allowed the identification of new chromosomal regions in soybeans associated with BP disease, which could be useful for marker-assisted selection and will enable a reduction in time and cost for the development of resistant cultivars.
ABSTRACT
BACKGROUND: African mahogany species (Khaya spp.) have been introduced to Brazil and have attracted increasing economic interest in recent years because they produce high-quality wood for industrial applications. To date, however, knowledge of the genetic basis of African mahogany plantations in Brazil is limited. This study therefore examined the extent of genetic diversity and structure of three cultivated species (Khaya grandifoliola, Khaya senegalensis and Khaya ivorensis) and their prospects for forest breeding. RESULTS: In total, 115 individuals were genotyped (48 of K. grandifoliola, 34 of K. senegalensis and 33 of K. ivorensis) for 3,330 filtered neutral loci obtained from genotyping-by-sequencing for the three species. The number of SNPs varied from 2,951 in K. ivorensis to 4,754 in K. senegalensis. Multilocus clustering, principal component analysis, Bayesian structure and network analyses showed a clear genetic separation among the three species. Structure analysis also showed internal structure within each species, highlighting genetic subgroups that could be sampled for selecting distinct genotypes for further breeding, although the genetic distances are moderate to low. CONCLUSION: In our study, SNP markers efficiently assessed the genomic diversity of African mahogany forest plantations in Brazil. Our genetic data clearly separated the three Khaya species. Moreover, pairwise estimates of genetic distances among individuals within each species showed considerable genetic divergence among individuals. Genotyping 115 pre-selected individuals with desirable growth traits allowed us not only to recommend superior genotypes but also to identify genetically distinct individuals for use in breeding crosses.
Subjects
Forests, Genetic Variation, Brazil, Meliaceae/genetics, Single Nucleotide Polymorphism, Plant Breeding, Genotype, Plant Genome
ABSTRACT
BACKGROUND: Phytophthora root rot, a major constraint in chile pepper production worldwide, is caused by the soil-borne oomycete Phytophthora capsici. This study aimed to detect significant regions in the Capsicum genome linked to Phytophthora root rot resistance using a panel of 157 Capsicum spp. genotypes. A multi-locus genome-wide association study (GWAS) was conducted using single nucleotide polymorphism (SNP) markers derived from genotyping-by-sequencing (GBS). Individual plants were separately inoculated with P. capsici isolates 'PWB-185', 'PWB-186', and '6347' at the 4-8 leaf stage and were scored for disease symptoms up to 14 days post-inoculation. Disease scores were used to calculate disease parameters including disease severity index percentage, percentage of resistant plants, area under the disease progress curve, and estimated marginal means for each genotype. RESULTS: Most of the genotypes displayed root rot symptoms, whereas five accessions were completely resistant to all the isolates and displayed no symptoms of infection. A total of 55,117 SNP markers derived from GBS were used to perform multi-locus GWAS, which identified 330 significant SNP markers associated with disease resistance. Of these, 56 SNP markers distributed across all 12 chromosomes were common across the isolates, indicating association with more durable resistance. Candidate genes, including nucleotide-binding site leucine-rich repeat (NBS-LRR), systemic acquired resistance (SAR8.2), and receptor-like kinase (RLK) genes, were identified within 0.5 Mb of the associated markers. CONCLUSIONS: The results will be used to improve resistance to Phytophthora root rot in chile pepper through the development of Kompetitive allele-specific PCR (KASP®) markers for marker validation, genome-wide selection, and marker-assisted breeding.
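As an illustration of one of the disease parameters named above, the area under the disease progress curve is commonly computed with the trapezoidal rule. The sketch below assumes that convention; the abstract does not state the formula used, and the scoring days and scores are hypothetical values chosen only for the example.

```python
def audpc(days, scores):
    """Area under the disease progress curve by the trapezoidal rule.

    days   -- assessment times (e.g. days post-inoculation), ascending
    scores -- disease score recorded at each assessment time
    """
    if len(days) != len(scores) or len(days) < 2:
        raise ValueError("need at least two matching time points and scores")
    total = 0.0
    for i in range(1, len(days)):
        mean_score = (scores[i - 1] + scores[i]) / 2
        total += mean_score * (days[i] - days[i - 1])
    return total


# Hypothetical plant scored at 0, 7 and 14 days post-inoculation.
example_days = [0, 7, 14]
example_scores = [0, 2, 4]
print(audpc(example_days, example_scores))  # 28.0
```

A resistant plant that keeps a score of zero throughout has an AUDPC of zero, so lower values indicate slower disease progress.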
Subjects
Capsicum, Disease Resistance, Genome-Wide Association Study, Phytophthora, Plant Diseases, Plant Roots, Single Nucleotide Polymorphism, Phytophthora/physiology, Phytophthora/pathogenicity, Capsicum/genetics, Capsicum/microbiology, Plant Diseases/microbiology, Plant Diseases/genetics, Disease Resistance/genetics, Plant Roots/microbiology, Plant Roots/genetics, Genotype
ABSTRACT
Platonia insignis is a fruit tree native to Brazil of increasing economic importance, whose pulp commands some of the highest market prices. This study aimed to evaluate the structure and genomic diversity of P. insignis (bacurizeiro) accessions from six locations in the Brazilian states of Roraima, Amazonas, Pará (Amazon biome), and Maranhão (Cerrado biome). A total of 2031 SNP markers were obtained using genotyping-by-sequencing (GBS), of which 625 were identified as outlier SNPs. High genetic structure was observed, with most of the genetic variability (59%) found among locations, mainly between biomes (Amazon and Cerrado). A positive and significant correlation (r = 0.85; p < 0.005) was detected between genetic and geographic distances, indicating isolation by distance. The highest genetic diversity was observed for the location in the Cerrado biome (HE = 0.1746; HO = 0.2078). The locations in the Amazon biome showed low genetic diversity indices with significant levels of inbreeding. Urban expansion, fires, and the expansion of agricultural activities are most probably the main factors behind the reduction in genetic diversity of P. insignis. Functional analysis showed that most of the outlier loci found may be related to genes involved in cellular and metabolic processes.
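A correlation between genetic and geographic distance matrices, as reported above, is often assessed with a Mantel-style permutation test. The abstract does not name the test used, so the following is only a sketch under that assumption; the function names and the toy distance matrices are hypothetical.

```python
import random
from math import sqrt


def pearson(x, y):
    """Pearson correlation between two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation undefined for constant input")
    return sxy / sqrt(sxx * syy)


def upper_triangle(matrix):
    """Flatten the strict upper triangle of a square matrix."""
    n = len(matrix)
    return [matrix[i][j] for i in range(n) for j in range(i + 1, n)]


def mantel(genetic, geographic, permutations=999, seed=0):
    """Return (r, p) for a one-sided Mantel-style permutation test."""
    rng = random.Random(seed)
    n = len(genetic)
    geo_vec = upper_triangle(geographic)
    r_obs = pearson(upper_triangle(genetic), geo_vec)
    order = list(range(n))
    hits = 0
    for _ in range(permutations):
        # Permute rows and columns of the genetic matrix together.
        rng.shuffle(order)
        permuted = [[genetic[order[i]][order[j]] for j in range(n)]
                    for i in range(n)]
        if pearson(upper_triangle(permuted), geo_vec) >= r_obs:
            hits += 1
    return r_obs, (hits + 1) / (permutations + 1)


# Toy data: four sites on a line, genetic distance proportional to geography.
positions = [0.0, 1.0, 3.0, 7.0]
geo = [[abs(a - b) for b in positions] for a in positions]
gen = [[0.01 * d for d in row] for row in geo]
r, p = mantel(gen, geo)
print(round(r, 3), p)
```

With only a handful of sites the number of distinct permutations is small, so the p-value cannot become very low; real analyses rely on more sampling locations.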
ABSTRACT
Eucalyptus dunnii is one of the most important Eucalyptus species for short-fiber pulp production in regions where other species of the genus are affected by poor soil and climatic conditions. In this context, E. dunnii holds promise as a resource to address and adapt to the challenges of climate change. Despite its rapid growth and favorable wood properties for solid wood products, its genetic improvement remains at an early stage. In this work, we evaluated the performance of two single nucleotide polymorphism (SNP) genotyping methods for population genetics analysis and genomic selection (GS) in E. dunnii. Double digest restriction-site associated DNA sequencing (ddRADseq) was compared with the EUChip60K array in 308 individuals from a provenance-progeny trial. The compared SNP sets included 8,011 (ddRADseq) and 19,008 (EUChip60K) informative SNPs distributed along the 11 chromosomes. Although the two datasets differed in the percentage of missing data, genome coverage, minor allele frequency and estimated genetic diversity parameters, they revealed a similar genetic structure, showing two subpopulations with little differentiation between them, and low linkage disequilibrium. GS analyses were performed for eleven traits using Genomic Best Linear Unbiased Prediction (GBLUP) and a conventional pedigree-based model (ABLUP). Regardless of the SNP dataset, the predictive ability (PA) of GBLUP was better than that of ABLUP for six traits (cellulose content, total and ethanolic extractives, total and Klason lignin content, and syringyl and guaiacyl lignin monomer ratio). When contrasting the SNP datasets used to estimate PAs, the GBLUP-EUChip60K model gave higher and significant PA values for six traits, whereas ddRADseq gave higher values for three other traits. The PAs correlated positively with narrow-sense heritabilities, with the highest correlations shown by ABLUP and GBLUP-EUChip60K.
The two genotyping methods, ddRADseq and EUChip60K, are generally comparable for population genetics and genomic prediction, demonstrating the utility of the former when subjected to rigorous SNP filtering. The results of this study provide a basis for future whole-genome studies using ddRADseq in non-model forest species for which SNP arrays have not yet been developed.
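GBLUP replaces the pedigree relationship matrix of ABLUP with a genomic relationship matrix built from SNP genotypes. The abstract does not say how that matrix was computed; the sketch below assumes the widely used VanRaden (method 1) construction, with genotypes coded as 0, 1 or 2 copies of one allele and no missing data. The function name and the toy genotypes are hypothetical.

```python
def genomic_relationship_matrix(genotypes):
    """VanRaden-style G matrix from a list of per-individual genotype rows.

    genotypes -- rows are individuals, columns are SNPs coded 0, 1 or 2
    """
    n = len(genotypes)
    m = len(genotypes[0])
    # Allele frequency of the counted allele at each SNP.
    freqs = [sum(row[j] for row in genotypes) / (2 * n) for j in range(m)]
    scale = 2 * sum(p * (1 - p) for p in freqs)
    if scale == 0:
        raise ValueError("all SNPs are monomorphic")
    # Centre each genotype by twice the allele frequency.
    centred = [[row[j] - 2 * freqs[j] for j in range(m)] for row in genotypes]
    return [[sum(centred[a][j] * centred[b][j] for j in range(m)) / scale
             for b in range(n)]
            for a in range(n)]


# Toy example: three individuals typed at three SNPs.
toy = [[0, 1, 2],
       [1, 1, 0],
       [2, 0, 1]]
for row in genomic_relationship_matrix(toy):
    print([round(value, 3) for value in row])
```

Because the genotypes are centred per SNP, each column of the resulting matrix sums to zero, which is a quick sanity check on an implementation.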
ABSTRACT
An in-depth genotypic characterisation of a diverse collection of Digitaria insularis was undertaken to explore the neutral genetic variation across the natural expansion range of this weed species in Brazil. With the exception of Minas Gerais, populations from all states showed high estimates of expected heterozygosity (HE > 0.60) and genetic diversity. There was a lack of population structure based on geographic origin and low differentiation between populations across the landscape, as evidenced by an average FST value of 0.02. On combining haloxyfop [acetyl CoA carboxylase (ACCase)-inhibiting herbicide] efficacy data with neutral genetic variation, we found evidence of two scenarios of resistance evolution in this weed species. Whilst populations originating from the north-eastern region demonstrated an active role of gene flow, populations from the mid-western region displayed multiple, independent resistance evolution as the major evolutionary mechanism. A target-site mutation (Trp2027Cys) in the ACCase gene, observed in less than 1% of resistant populations, could not explain the reduced sensitivity of 15% of the populations to haloxyfop. The genetic architecture of resistance to ACCase-inhibiting herbicides was dissected using a genome-wide association study (GWAS) approach. GWAS revealed an association of three SNPs with reduced sensitivity to haloxyfop and clethodim. In silico analysis of these SNPs revealed important non-target-site genes belonging to families involved in herbicide detoxification, including UDPGT91C1 and GT2, and genes involved in the vacuolar sequestration-based degradation pathway. Exploration of five genomic prediction models revealed that the highest prediction power (≥0.80) was achieved with the models Bayes A and RKHS, incorporating SNPs with additive effects and epistatic interactions, respectively.
ABSTRACT
Guinea pigs are a major source of animal protein for Peruvian Andean families. Despite the economic and cultural relevance of guinea pigs, their genomic characterization has been scarcely addressed. Genotyping-by-sequencing (GBS) has emerged as an affordable alternative to genotyping of livestock and native animals. Here, we report the use of GBS for single nucleotide polymorphism (SNP) discovery of traditionally raised guinea pigs from six regions of the Peruvian Andes and one group of breeding animals. The paired-end (2 × 150 bp) sequencing of 40 guinea pig DNA samples generated a mean of 6.4 million high-quality sequencing reads per sample. We obtained an average sequencing depth of 10× with an 88.5% mapping rate to the Cavia porcellus reference genome. A total of 279 965 SNPs (102 SNPs/Mbp) were identified after variant calling and quality filtering. Based on this SNP set, we assessed the genetic diversity and distance within our selected guinea pig populations. An overall average minor allele frequency of 0.13, an observed heterozygosity of 0.31, an expected heterozygosity of 0.35, and an F-value of 0.1 were obtained, while the SNP-based neighbor-joining tree suggests a closer genetic relationship between individuals from geographically close locations. We showed that GBS is a cost-effective tool for SNP discovery and genetic characterization of Peruvian guinea pig populations. Therefore, it may be considered as a suitable and affordable tool for genomic characterization of poorly studied native animal species.
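The reported heterozygosities and F-value can be related with a short calculation. The sketch below assumes the F-value is the inbreeding coefficient defined as F = 1 - HO/HE; the abstract does not give the definition, and a study-wide average of per-locus values need not match this ratio exactly.

```python
def inbreeding_coefficient(observed_het, expected_het):
    """Inbreeding coefficient F = 1 - HO / HE (assumed definition)."""
    if expected_het <= 0:
        raise ValueError("expected heterozygosity must be positive")
    return 1.0 - observed_het / expected_het


# Values reported in the abstract: HO = 0.31, HE = 0.35.
f_value = inbreeding_coefficient(0.31, 0.35)
print(round(f_value, 2))  # 0.11
```

Under this assumed definition the result rounds to roughly 0.1, in line with the F-value quoted in the abstract.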
Subjects
Genome, Single Nucleotide Polymorphism, Humans, Animals, Guinea Pigs, Genotype, Peru, Genomics, High-Throughput Nucleotide Sequencing
ABSTRACT
Advances in genomics in recent years have increased the accuracy and efficiency of breeding programs for many crops. Nevertheless, the adoption of genomic enhancement for several other crops essential in developing countries is still limited, especially for those that do not have a reference genome. These crops are often called orphan crops. This is the first report to show how the results provided by different platforms, including the use of a simulated genome, called the mock genome, compare in population structure and genetic diversity studies, especially when the intention is to use this information to support the formation of heterotic groups, the choice of testers, and genomic prediction of single crosses. To this end, we used a method to assemble a reference genome and perform single-nucleotide polymorphism (SNP) calling without needing an external genome. We then compared the results obtained with the mock genome against those of the standard approaches (array and genotyping-by-sequencing (GBS)). GBS-Mock gave results similar to the standard methods for genetic diversity studies, division of heterotic groups, definition of testers, and genomic prediction. These results show that a mock genome constructed from the population's intrinsic polymorphisms to perform SNP calling is an effective alternative for conducting genomic studies of this nature in orphan crops, especially those that do not have a reference genome.
ABSTRACT
Developing sound breeding programs for aquaculture species may be challenging when matings cannot be controlled due to communal spawning. We developed a genotyping-by-sequencing marker panel of 300 SNPs for parentage testing and sex determination by using data from an in-house reference genome as well as a 90 K SNP genotyping array based on different populations of yellowtail kingfish (Seriola lalandi). The minimum and maximum distances between adjacent marker pairs were 0.7 Mb and 13 Mb, respectively, with an average marker spacing of 2 Mb. Only weak evidence of linkage disequilibrium between adjacent marker pairs was found. The results showed high panel performance for parental assignment, with exclusion probability values equal to 1. The rate of false positives when using cross-population data was zero. A skewed distribution of genetic contributions by dominant females was observed, which increases the risk of higher rates of inbreeding in subsequent captive generations when no parentage data are used. All these results are discussed in the context of breeding program design, using this marker panel to increase the sustainability of this aquaculture resource.
ABSTRACT
Double digest restriction-site associated DNA sequencing (ddRADseq) technology combines reduced genome representation, achieved by digestion with two restriction enzymes, with next generation sequencing (NGS) to obtain thousands of markers (SNPs, SSRs, and InDels) and genotype tens to hundreds of samples simultaneously. In this chapter, we describe a 96-plex ddRADseq-derived protocol that can be set up to obtain different depths of coverage per locus and can be applied to model and non-model plant species.
Subjects
Genome, High-Throughput Nucleotide Sequencing, DNA Sequence Analysis/methods, Genotype, Base Sequence, High-Throughput Nucleotide Sequencing/methods, Technology, Single Nucleotide Polymorphism
ABSTRACT
KEY MESSAGE: The article presents an optimization of the key parameters for the identification of SNPs in sugarcane using a GBS protocol on two Illumina platforms, NextSeq and NovaSeq. Sugarcane (Saccharum sp.), a feedstock known worldwide for sugar production, bioethanol, and energy, has an extremely complex genome, being highly polyploid and aneuploid. A double-digestion restriction site-associated DNA sequencing protocol (ddRADseq) was tested in four commercial sugarcane hybrids and one high-fibre biotype for the detection of single nucleotide polymorphisms (SNPs). In this work we tested two Illumina sequencing platforms, two read lengths (70 vs. 150 bp), different sequencing coverage per individual (medium and high coverage), and single-end versus paired-end reads. We also explored different variant calling strategies (with and without a reference genome) and filtering schemes [combining two minor allele frequencies (MAFs) with three depth of coverage thresholds]. For the discovery of a large number of novel SNPs in sugarcane, we recommend longer, paired-end reads, medium sequencing coverage per individual and the Illumina NovaSeq6000 platform for a cost-effective approach, together with filter parameters of lower MAF and higher depth of coverage thresholds. Although the de novo analysis retrieved more SNPs, the reference-based method allows downstream characterization of variants. For the two best performing matrices, the number of SNPs per chromosome correlated positively with chromosome length, demonstrating the presence of variants throughout the genome. Multivariate comparisons, with both matrices, showed closer relationships among commercial hybrids than with the high-fibre biotype. Functional analysis of the SNPs demonstrated that more than half of them landed within regulatory regions, whereas the remainder affected coding, intergenic and intronic regions.
Allelic distance values were lower than 0.07 when analysing two replicated genotypes, confirming the robustness of the protocol.
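The filtering scheme described above crosses two MAF thresholds with three depth thresholds, giving six candidate SNP matrices. A minimal sketch of such a grid follows; the abstract does not report the threshold values, so the numbers, the site records, and the function names here are hypothetical.

```python
from itertools import product


def passes(site, min_maf, min_depth):
    """True if a SNP site meets both the MAF and the depth threshold."""
    return site["maf"] >= min_maf and site["depth"] >= min_depth


def filter_grid(sites, maf_thresholds, depth_thresholds):
    """Count retained SNPs for every (MAF, depth) threshold combination."""
    counts = {}
    for min_maf, min_depth in product(maf_thresholds, depth_thresholds):
        counts[(min_maf, min_depth)] = sum(
            1 for site in sites if passes(site, min_maf, min_depth)
        )
    return counts


# Hypothetical SNP sites and thresholds, for illustration only.
example_sites = [
    {"maf": 0.02, "depth": 8},
    {"maf": 0.10, "depth": 25},
    {"maf": 0.30, "depth": 60},
    {"maf": 0.06, "depth": 12},
]
grid = filter_grid(example_sites, maf_thresholds=(0.05, 0.10),
                   depth_thresholds=(10, 20, 50))
for thresholds, retained in sorted(grid.items()):
    print(thresholds, retained)
```

Each key of the returned dictionary corresponds to one SNP matrix, so the trade-off between the number of SNPs retained and the stringency of the filters can be compared directly.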
Subjects
Saccharum, Saccharum/genetics, DNA Sequence Analysis, Single Nucleotide Polymorphism/genetics, Genotype, Base Sequence
ABSTRACT
Organisms with lower dispersal abilities tend to have more genetically dissimilar populations. The same is true for parasites, whose transmission frequency may depend on the population structure of the host. This should be especially true when hosts and parasites face similar barriers to dispersal. Here, we considered the similarities between host and parasite population structure in a social spider system. In this system, host colonies are typified by rapid growth via internal recruitment followed by budding or fission events when colonies grow too large, with each colony representing a distinct population. Host colonies provide habitat for kleptoparasitic spiders, which steal prey from and may also feed directly on host individuals. We asked whether kleptoparasites exhibit a similar degree of population subdivision as their host. Under the free-mixing hypothesis (i.e., horizontal transmission), kleptoparasites would have well-mixed populations across broader regions than a single host nest, whereas host populations would be strongly genetically structured. Under the host-tracking hypothesis (i.e., vertical transmission), kleptoparasites would have a population structure that parallels that of the host. We conducted a genotype-by-sequencing study to assess the population structure of both hosts and kleptoparasites within three nearby regions in eastern Ecuador. We found strong signatures of population differentiation and bottlenecks in the host species, which is congruent with past studies. However, we found that kleptoparasite populations were well mixed across host nests, with no evidence of recent bottlenecks. These results support our free-mixing hypothesis, suggesting that kleptoparasites follow patterns of horizontal transmission in this social spider system.
Subjects
Parasites, Spiders, Animals, Population Dynamics, Ecosystem, Host Specificity, Ecuador, Spiders/genetics
ABSTRACT
The major challenges that agriculture is facing in the twenty-first century are increasing droughts, water scarcity, flooding, poorer soils, and extreme temperatures due to climate change. However, most crops are not tolerant to extreme climatic environments. The aim in the near future, in a world with hunger and an increasing population, is to breed and/or engineer crops that tolerate abiotic stress while giving a higher yield. Some crop varieties display a certain degree of tolerance, which has been exploited by plant breeders to develop varieties that thrive under stress conditions. Moreover, a long list of genes involved in abiotic stress tolerance has been identified and characterized by molecular techniques and overexpressed individually in plant transformation experiments. Nevertheless, stress tolerance phenotypes are polygenic traits, which current genomic tools are dissecting so that they can be exploited through accelerated genetic introgression using molecular markers or through site-directed mutagenesis such as CRISPR-Cas9. In this review, we describe plant mechanisms to sense and tolerate adverse climate conditions and examine and discuss classic and new molecular tools to select and improve abiotic stress tolerance in major crops.
Subjects
Agricultural Crops, Plant Breeding, Agricultural Crops/genetics, Droughts, Plant Breeding/methods, Soil, Physiological Stress/genetics
ABSTRACT
Due to the recent increase in demand for agave-based beverages, many wild agave populations have experienced rapid decline and fragmentation, whereas cultivated plants are now managed at monocultural plantations, in some cases involving clonal propagation. We examined the relative effect of migration, genetic drift, natural selection and human activities on the genetic repertoire of Agave angustifolia var. pacifica, an agave used for bacanora (an alcoholic spirit similar to tequila) production in northwestern Mexico. We sampled 34 wild and cultivated sites and used over eleven thousand genome-wide SNPs. We found shallow genetic structure among wild samples, although we detected differentiation between coastal and inland sites. Surprisingly, no differentiation was found between cultivated and wild populations. Moreover, we detected moderate inbreeding (FIS ~ 0.13) and similar levels of genomic diversity in wild and cultivated agaves. Nevertheless, the cultivated plants had almost no private alleles and presented evidence of clonality. The overall low genetic structure in A. angustifolia var. pacifica is apparently the result of high dispersibility promoted by pollinators and the possibility of clonal reproduction. Incipient cultivation history and reliance on wild seeds and plants are probably responsible for the observed patterns of high genetic connectivity and considerable diversity in cultivated samples.
ABSTRACT
Although Brazil is currently the largest soybean producer in the world, only a small number of studies have analyzed the genetic diversity of Brazilian soybean. These studies have shown the existence of a narrow genetic base. The objectives of this work were to analyze the population structure and genetic diversity, and to identify selection signatures in the genome of soybean germplasms from different companies in Brazil. A panel consisting of 343 soybean lines from Brazil, North America, and Asia was genotyped using genotyping by sequencing (GBS). Population structure was assessed by Bayesian and multivariate approaches. Genetic diversity was analyzed using metrics such as the fixation index, nucleotide diversity, genetic dissimilarity, and linkage disequilibrium. The software BayeScan was used to detect selection signatures between Brazilian and Asian accessions as well as among Brazilian germplasms. Region of origin, company of origin, and relative maturity group (RMG) all had a significant influence on population structure. Varieties belonging to the same company and especially to the same RMG exhibited a high level of genetic similarity. This result was exacerbated among early maturing accessions. Brazilian soybean showed significantly lower genetic diversity when compared to Asian accessions. This was expected, because the crop's region of origin is its main genetic diversity reserve. We identified 7 genomic regions under selection between the Brazilian and Asian accessions, and 27 among Brazilian varieties developed by different companies. Associated with these genomic regions, we found 96 quantitative trait loci (QTLs) for important soybean breeding traits such as flowering, maturity, plant architecture, productivity components, pathogen resistance, and seed composition. Some of the QTLs associated with the markers under selection have genes of great importance to soybean's regional adaptation. 
The results reported herein expand the knowledge of how genetic variability is organized in the Brazilian soybean germplasm. Furthermore, it was possible to identify genomic regions under selection that are possibly associated with the adaptation of soybean to Brazilian environments.
ABSTRACT
Based on molecular markers, genomic prediction enables us to speed up breeding schemes and increase the response to selection. There are several high-throughput genotyping platforms able to deliver thousands of molecular markers for genomic study purposes. However, even though genomic prediction is widely applied in plant breeding, species without a reference genome cannot fully benefit from genomic tools and modern breeding schemes. We used a method to assemble a population-tailored mock genome to call single-nucleotide polymorphism (SNP) markers without an available reference genome, and for the first time, we compared the results with standard genotyping platforms (array and genotyping-by-sequencing (GBS) using a reference genome) for performance in genomic prediction models. Our results indicate that using a population-tailored mock genome to call SNPs delivers reliable estimates of the genomic relationship between genotypes. Furthermore, genomic prediction estimates were comparable to standard approaches, especially when considering only additive effects. Mock genomes were slightly worse than arrays at predicting traits influenced by dominance effects, but still performed as well as standard GBS methods that use a reference genome. Nevertheless, the array-based SNP marker methods achieved the best predictive ability and reliability in estimating variance components. Overall, mock genomes can be a worthy alternative for genomic selection studies, especially for species for which a reference genome is not available.
Subjects
Computational Biology, Genotyping Techniques, Genetic Models, Animals, Chimera/genetics, Computational Biology/methods, Computational Biology/standards, Datasets as Topic, Genome, Genome-Wide Association Study/methods, Genome-Wide Association Study/standards, Genomics/methods, Genomics/standards, Genotype, Genotyping Techniques/methods, Genotyping Techniques/standards, Phenotype, Reference Standards, Reproducibility of Results, Genetic Selection, Species Specificity, Zea mays/classification, Zea mays/genetics
ABSTRACT
Genotyping-by-sequencing (GBS) is a widely used and cost-effective technique for obtaining large numbers of genetic markers from populations by sequencing regions adjacent to restriction cut sites. Although a standard reference-based pipeline can be followed to analyse GBS reads, a reference genome is still not available for a large number of species. Hence, reference-free approaches are required to generate the genetic variability information that can be obtained from a GBS experiment. Unfortunately, available tools to perform de novo analysis of GBS reads face issues of usability, accuracy and performance. Furthermore, few available tools are suitable for analysing data sets from polyploid species. In this manuscript, we describe a novel algorithm to perform reference-free variant detection and genotyping from GBS reads. Nonexact searches on a dynamic hash table of consensus sequences allow for efficient read clustering and sorting. This algorithm was integrated into the Next Generation Sequencing Experience Platform (NGSEP), where it works together with the state-of-the-art variant detector already implemented in this tool. We performed benchmark experiments with three different empirical data sets of plants and animals with different population structures and ploidies, sequenced with different GBS protocols at different read depths. These experiments show that NGSEP has comparable and in some cases better accuracy, and always better computational efficiency, compared to existing solutions. We expect that this new development will be useful for many research groups conducting population genetic studies in a wide variety of species.
Subjects
Diploidy, Polyploidy, Genomics, Genotype, Humans, Software
ABSTRACT
BACKGROUND: Phytophthora root rot, caused by Phytophthora capsici, is a major disease affecting Capsicum production worldwide. A recombinant inbred line (RIL) population derived from the hybridization between 'Criollo de Morelos-334' (CM-334), a resistant landrace from Mexico, and 'Early Jalapeno', a susceptible cultivar, was genotyped using genotyping-by-sequencing (GBS)-derived single nucleotide polymorphism (SNP) markers. A GBS-SNP-based genetic linkage map for the RIL population was constructed. Quantitative trait loci (QTL) mapping dissected the genetic architecture of P. capsici resistance, and candidate genes linked to resistance to this important disease were identified. RESULTS: Development of a genetic linkage map using 1,973 GBS-derived polymorphic SNP markers identified 12 linkage groups corresponding to the 12 chromosomes of chile pepper, with a total length of 1,277.7 cM and a marker density of 1.5 SNP/cM. The maximum gaps between consecutive SNP markers ranged between 1.9 cM (LG7) and 13.5 cM (LG5). Collinearity between genetic and physical positions of markers reached a maximum of 0.92 for LG8. QTL mapping identified genomic regions associated with P. capsici resistance on chromosomes P5, P8, and P9 that explained between 19.7 and 30.4% of phenotypic variation for resistance. Additive interactions between QTL on chromosomes P5 and P8 were observed. The role of chromosome P5 as a major genomic region containing P. capsici resistance QTL was established. Through candidate gene analysis, biological functions associated with response to pathogen infections, regulation of cyclin-dependent protein serine/threonine kinase activity, and epigenetic mechanisms such as DNA methylation were identified. CONCLUSIONS: Results support the genetic complexity of the P. capsici-Capsicum pathosystem and the possible role of epigenetics in conferring resistance to Phytophthora root rot.
Significant genomic regions and candidate genes associated with disease response and gene regulatory activity were identified, allowing for a deeper understanding of the genomic landscape of Phytophthora root rot resistance in chile pepper.
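The reported marker density follows directly from the marker count and the total map length. The short check below uses only the figures quoted in the abstract; the function name is hypothetical, and the average spacing is a simple ratio that ignores the exact number of intervals per linkage group.

```python
def marker_density(n_markers, map_length_cm):
    """Markers per centimorgan across the whole linkage map."""
    if map_length_cm <= 0:
        raise ValueError("map length must be positive")
    return n_markers / map_length_cm


# Figures from the abstract: 1,973 SNP markers over 1,277.7 cM.
density = marker_density(1973, 1277.7)
print(round(density, 1))      # 1.5 SNP/cM
print(round(1 / density, 2))  # approximate average spacing in cM
```

The average hides considerable variation between linkage groups, which is why the abstract also reports the maximum gap per group.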
Subjects
Capsicum/genetics, Capsicum/microbiology, Disease Resistance/genetics, Phytophthora/physiology, Plant Diseases/genetics, Plant Diseases/microbiology, Chromosome Mapping, Genetic Markers, Plant Genome, Genotyping Techniques, Plant Roots/microbiology, Single Nucleotide Polymorphism, Quantitative Trait Loci
ABSTRACT
The identification of environmentally stable and globally predictable resistance to potato late blight is challenged by the clonal and polyploid nature of the crop and the rapid evolution of the pathogen. A diversity panel of tetraploid potato germplasm bred for multiple resistance and quality traits was genotyped using genotyping-by-sequencing (GBS) and evaluated for late blight resistance in three countries where the International Potato Center (CIP) has established breeding work. Health-indexed in vitro plants of 380 clones and varieties were distributed from CIP headquarters, and tuber seed was produced centrally in Peru, China, and Ethiopia. Phenotypes were recorded following field exposure to local isolates of Phytophthora infestans. QTL explaining resistance in four experiments conducted across the three countries were identified on chromosome IX, and environment-specific QTL were found on chromosomes III, V, and X. Different genetic models were evaluated for their ability to predict the best performing germplasm in each and all environments. The best prediction ability (0.868) was obtained with genomic best linear unbiased predictors (GBLUPs) when using the diploid marker data and QTL-linked markers as fixed effects. Genotypes with high levels of resistance in all environments were identified from the B3, LBHT, and B3-LTVR populations. The results show that many of the advanced clones bred in Peru for high levels of late blight resistance maintain their resistance in Ethiopia and China, suggesting that the centralized selection strategy has been largely successful.