Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 13 de 13
Filter
Add more filters










Publication year range
1.
Nat Commun ; 12(1): 2395, 2021 04 22.
Article in English | MEDLINE | ID: mdl-33888695

ABSTRACT

The infraorder Brachyura (true or short-tailed crabs) represents a successful group of marine invertebrates yet with limited genomic resources. Here we report a chromosome-anchored reference genome and transcriptomes of the Chinese mitten crab Eriocheir sinensis, a catadromous crab and invasive species with wide environmental tolerance, strong osmoregulatory capacity and high fertility. We show the expansion of specific gene families in the crab, including F-ATPase, which enhances our knowledge on the adaptive plasticity of this successful invasive species. Our analysis of spatio-temporal transcriptomes and the genome of E. sinensis and other decapods shows that brachyurization development is associated with down-regulation of Hox genes at the megalopa stage when tail shortening occurs. A better understanding of the molecular mechanism regulating sexual development is achieved by integrated analysis of multiple omics. These genomic resources significantly expand the gene repertoire of Brachyura, and provide insights into the biology of this group, and Crustacea in general.


Subject(s)
Adaptation, Physiological/genetics , Brachyura/physiology , Gene Expression Regulation, Developmental , Genome/genetics , Animals , Aquaculture , Chromosome Mapping , Female , Fertility/genetics , Gene Expression Profiling , Genes, Homeobox/genetics , Genomics , Introduced Species , Life Cycle Stages/genetics , Male , Multigene Family/genetics , Osmoregulation/genetics , Sexual Development/genetics , Spatio-Temporal Analysis , Whole Genome Sequencing
2.
Plant Biotechnol J ; 17(2): 397-409, 2019 02.
Article in English | MEDLINE | ID: mdl-29992702

ABSTRACT

Morella rubra, red bayberry, is an economically important fruit tree in south China. Here, we assembled the first high-quality genome for both a female and a male individual of red bayberry. The genome size was 313-Mb, and 90% sequences were assembled into eight pseudo chromosome molecules, with 32 493 predicted genes. By whole-genome comparison between the female and male and association analysis with sequences of bulked and individual DNA samples from female and male, a 59-Kb region determining female was identified and located on distal end of pseudochromosome 8, which contains abundant transposable element and seven putative genes, four of them are related to sex floral development. This 59-Kb female-specific region was likely to be derived from duplication and rearrangement of paralogous genes and retained non-recombinant in the female-specific region. Sex-specific molecular markers developed from candidate genes co-segregated with sex in a genetically diverse female and male germplasm. We propose sex determination follow the ZW model of female heterogamety. The genome sequence of red bayberry provides a valuable resource for plant sex chromosome evolution and also provides important insights for molecular biology, genetics and modern breeding in Myricaceae family.


Subject(s)
Evolution, Molecular , Genome, Plant/genetics , Myrica/genetics , Chromosome Mapping , Flowers/genetics , Flowers/growth & development , Flowers/physiology , Fruit/genetics , Fruit/growth & development , Fruit/physiology , Genetic Markers/genetics , Molecular Sequence Annotation , Myrica/growth & development , Myrica/physiology , Organ Specificity , Plant Breeding
3.
Nat Commun ; 5: 5315, 2014 Oct 28.
Article in English | MEDLINE | ID: mdl-25350882

ABSTRACT

The jujube (Ziziphus jujuba Mill.), a member of family Rhamnaceae, is a major dry fruit and a traditional herbal medicine for more than one billion people. Here we present a high-quality sequence for the complex jujube genome, the first genome sequence of Rhamnaceae, using an integrated strategy. The final assembly spans 437.65 Mb (98.6% of the estimated) with 321.45 Mb anchored to the 12 pseudo-chromosomes and contains 32,808 genes. The jujube genome has undergone frequent inter-chromosome fusions and segmental duplications, but no recent whole-genome duplication. Further analyses of the jujube-specific genes and transcriptome data from 15 tissues reveal the molecular mechanisms underlying some specific properties of the jujube. Its high vitamin C content can be attributed to a unique high level expression of genes involved in both biosynthesis and regeneration. Our study provides insights into jujube-specific biology and valuable genomic resources for the improvement of Rhamnaceae plants and other fruit trees.


Subject(s)
Fruit/genetics , Genome, Plant/genetics , Trees/genetics , Ziziphus/genetics , Adaptation, Physiological/genetics , Ascorbic Acid/metabolism , Carbohydrate Metabolism/genetics , Chromosomes, Plant/genetics , Gene Duplication/genetics , Gene Expression Regulation, Plant , Genes, Plant , Genetic Variation , Molecular Sequence Annotation , Molecular Sequence Data , Plant Shoots/genetics , Plant Shoots/growth & development , Sequence Alignment , Sequence Analysis, DNA , Sequence Analysis, RNA , Species Specificity , Stress, Physiological/genetics , Synteny/genetics
5.
Nat Commun ; 4: 2445, 2013.
Article in English | MEDLINE | ID: mdl-24048436

ABSTRACT

Human utilization of the mulberry-silkworm interaction started at least 5,000 years ago and greatly influenced world history through the Silk Road. Complementing the silkworm genome sequence, here we describe the genome of a mulberry species Morus notabilis. In the 330-Mb genome assembly, we identify 128 Mb of repetitive sequences and 29,338 genes, 60.8% of which are supported by transcriptome sequencing. Mulberry gene sequences appear to evolve ~3 times faster than other Rosales, perhaps facilitating the species' spread worldwide. The mulberry tree is among a few eudicots but several Rosales that have not preserved genome duplications in more than 100 million years; however, a neopolyploid series found in the mulberry tree and several others suggest that new duplications may confer benefits. Five predicted mulberry miRNAs are found in the haemolymph and silk glands of the silkworm, suggesting interactions at molecular levels in the plant-herbivore relationship. The identification and analyses of mulberry genes involved in diversifying selection, resistance and protease inhibitor expressed in the laticifers will accelerate the improvement of mulberry plants.


Subject(s)
Genome, Plant/genetics , Morus/genetics , Sequence Analysis, DNA , Trees/genetics , Animals , Base Sequence , Bombyx/genetics , Chromosomes, Plant/genetics , Computer Simulation , Disease Resistance/genetics , Evolution, Molecular , Genetic Variation , Humans , MicroRNAs/genetics , MicroRNAs/metabolism , Models, Genetic , Molecular Sequence Annotation , Phylogeny , Plant Diseases/genetics , Protease Inhibitors/metabolism , Repetitive Sequences, Nucleic Acid/genetics , Selection, Genetic
6.
Nat Genet ; 45(7): 776-783, 2013 Jul.
Article in English | MEDLINE | ID: mdl-23749191

ABSTRACT

The duck (Anas platyrhynchos) is one of the principal natural hosts of influenza A viruses. We present the duck genome sequence and perform deep transcriptome analyses to investigate immune-related genes. Our data indicate that the duck possesses a contractive immune gene repertoire, as in chicken and zebra finch, and this repertoire has been shaped through lineage-specific duplications. We identify genes that are responsive to influenza A viruses using the lung transcriptomes of control ducks and ones that were infected with either a highly pathogenic (A/duck/Hubei/49/05) or a weakly pathogenic (A/goose/Hubei/65/05) H5N1 virus. Further, we show how the duck's defense mechanisms against influenza infection have been optimized through the diversification of its ß-defensin and butyrophilin-like repertoires. These analyses, in combination with the genomic and transcriptomic data, provide a resource for characterizing the interaction between host and influenza viruses.


Subject(s)
Disease Reservoirs , Ducks/genetics , Ducks/virology , Genome , Influenza in Birds/genetics , Transcriptome/genetics , Animals , Base Sequence , Chickens/genetics , Disease Vectors , Ducks/immunology , Female , Geese/genetics , Genome/physiology , Host-Pathogen Interactions/genetics , Host-Pathogen Interactions/immunology , Immunity/genetics , Influenza in Birds/immunology , Molecular Sequence Data , Phylogeny , Species Specificity
7.
Nat Commun ; 4: 1858, 2013.
Article in English | MEDLINE | ID: mdl-23673643

ABSTRACT

The Tibetan antelope (Pantholops hodgsonii) is endemic to the extremely inhospitable high-altitude environment of the Qinghai-Tibetan Plateau, a region that has a low partial pressure of oxygen and high ultraviolet radiation. Here we generate a draft genome of this artiodactyl and use it to detect the potential genetic bases of highland adaptation. Compared with other plain-dwelling mammals, the genome of the Tibetan antelope shows signals of adaptive evolution and gene-family expansion in genes associated with energy metabolism and oxygen transmission. Both the highland American pika, and the Tibetan antelope have signals of positive selection for genes involved in DNA repair and the production of ATPase. Genes associated with hypoxia seem to have experienced convergent evolution. Thus, our study suggests that common genetic mechanisms might have been utilized to enable high-altitude adaptation.


Subject(s)
Antelopes/genetics , Genome/genetics , Adaptation, Physiological/genetics , Altitude , Animals , Base Sequence , Evolution, Molecular , Gene Ontology , Heterozygote , Molecular Sequence Data , Multigene Family/genetics , Polymorphism, Single Nucleotide/genetics , Selection, Genetic , Sequence Analysis, DNA , Tibet , Ursidae/genetics
8.
Nat Commun ; 4: 1595, 2013.
Article in English | MEDLINE | ID: mdl-23481403

ABSTRACT

The wild species of the genus Oryza contain a largely untapped reservoir of agronomically important genes for rice improvement. Here we report the 261-Mb de novo assembled genome sequence of Oryza brachyantha. Low activity of long-terminal repeat retrotransposons and massive internal deletions of ancient long-terminal repeat elements lead to the compact genome of Oryza brachyantha. We model 32,038 protein-coding genes in the Oryza brachyantha genome, of which only 70% are located in collinear positions in comparison with the rice genome. Analysing breakpoints of non-collinear genes suggests that double-strand break repair through non-homologous end joining has an important role in gene movement and erosion of collinearity in the Oryza genomes. Transition of euchromatin to heterochromatin in the rice genome is accompanied by segmental and tandem duplications, further expanded by transposable element insertions. The high-quality reference genome sequence of Oryza brachyantha provides an important resource for functional and evolutionary studies in the genus Oryza.


Subject(s)
Evolution, Molecular , Genome, Plant/genetics , Oryza/genetics , Sequence Analysis, DNA , Base Sequence , Chromatin/genetics , Chromosomes, Plant/genetics , Conserved Sequence , Gene Duplication/genetics , Gene Rearrangement/genetics , Genetic Loci/genetics , Genome Size/genetics , Molecular Sequence Data , Multigene Family/genetics , Mutagenesis, Insertional/genetics , Repetitive Sequences, Nucleic Acid/genetics , Retroelements/genetics , Segmental Duplications, Genomic/genetics , Terminal Repeat Sequences/genetics
9.
Genome Biol ; 14(3): R29, 2013 Mar 28.
Article in English | MEDLINE | ID: mdl-23537097

ABSTRACT

BACKGROUND: The mechanism of high-altitude adaptation has been studied in certain mammals. However, in avian species like the ground tit Pseudopodoces humilis, the adaptation mechanism remains unclear. The phylogeny of the ground tit is also controversial. RESULTS: Using next generation sequencing technology, we generated and assembled a draft genome sequence of the ground tit. The assembly contained 1.04 Gb of sequence that covered 95.4% of the whole genome and had higher N50 values, at the level of both scaffolds and contigs, than other sequenced avian genomes. About 1.7 million SNPs were detected, 16,998 protein-coding genes were predicted and 7% of the genome was identified as repeat sequences. Comparisons between the ground tit genome and other avian genomes revealed a conserved genome structure and confirmed the phylogeny of ground tit as not belonging to the Corvidae family. Gene family expansion and positively selected gene analysis revealed genes that were related to cardiac function. Our findings contribute to our understanding of the adaptation of this species to extreme environmental living conditions. CONCLUSIONS: Our data and analysis contribute to the study of avian evolutionary history and provide new insights into the adaptation mechanisms to extreme conditions in animals.


Subject(s)
Adaptation, Physiological/genetics , Altitude , Genome/genetics , Passeriformes/genetics , Animals , Base Sequence , Evolution, Molecular , Molecular Sequence Annotation , Open Reading Frames/genetics , Phylogeny , Selection, Genetic , Sequence Analysis, DNA , Species Specificity , Synteny/genetics
10.
Proc Natl Acad Sci U S A ; 110(5): 1785-90, 2013 Jan 29.
Article in English | MEDLINE | ID: mdl-23307812

ABSTRACT

One of the most difficult problems in modern genomics is the assembly of full-length chromosomes using next generation sequencing (NGS) data. To address this problem, we developed "reference-assisted chromosome assembly" (RACA), an algorithm to reliably order and orient sequence scaffolds generated by NGS and assemblers into longer chromosomal fragments using comparative genome information and paired-end reads. Evaluation of results using simulated and real genome assemblies indicates that our approach can substantially improve genomes generated by a wide variety of de novo assemblers if a good reference assembly of a closely related species and outgroup genomes are available. We used RACA to reconstruct 60 Tibetan antelope (Pantholops hodgsonii) chromosome fragments from 1,434 SOAPdenovo sequence scaffolds, of which 16 chromosome fragments were homologous to complete cattle chromosomes. Experimental validation by PCR showed that predictions made by RACA are highly accurate. Our results indicate that RACA will significantly facilitate the study of chromosome evolution and genome rearrangements for the large number of genomes being sequenced by NGS that do not have a genetic or physical map.


Subject(s)
Algorithms , Chromosomes/genetics , Genome/genetics , Genomics/methods , Animals , Antelopes/genetics , Cattle , Chromosome Mapping/methods , Evolution, Molecular , Gene Rearrangement/genetics , Humans , Reproducibility of Results , Sequence Analysis, DNA/methods
11.
PLoS One ; 7(9): e46211, 2012.
Article in English | MEDLINE | ID: mdl-23029438

ABSTRACT

BACKGROUND: The relatively short read lengths from next generation sequencing (NGS) technologies still pose a challenge for de novo assembly of complex mammal genomes. One important solution is to use paired-end (PE) sequence information experimentally obtained from long-range DNA fragments (>1 kb). Here, we characterize and extend a long-range PE library construction method based on direct intra-molecule ligation (or molecular linker-free circularization) for NGS. RESULTS: We found that the method performs stably for PE sequencing of 2- to 5- kb DNA fragments, and can be extended to 10-20 kb (and even in extremes, up to ∼35 kb). We also characterized the impact of low quality input DNA on the method, and develop a whole-genome amplification (WGA) based protocol using limited input DNA (<1 µg). Using this PE dataset, we accurately assembled the YanHuang (YH) genome, the first sequenced Asian genome, into a scaffold N50 size of >2 Mb, which is over 100-times greater than the initial size produced with only small insert PE reads(17 kb). In addition, we mapped two 7- to 8- kb insertions in the YH genome using the larger insert sizes of the long-range PE data. CONCLUSIONS: In conclusion, we demonstrate here the effectiveness of this long-range PE sequencing method and its use for the de novo assembly of a large, complex genome using NGS short reads.


Subject(s)
Genome, Human , Mammals/genetics , Mutagenesis, Insertional , Open Reading Frames , Sequence Analysis, DNA/methods , Algorithms , Animals , Chromosome Mapping , Genomic Library , Humans
12.
Nature ; 463(7279): 311-7, 2010 Jan 21.
Article in English | MEDLINE | ID: mdl-20010809

ABSTRACT

Using next-generation sequencing technology alone, we have successfully generated and assembled a draft sequence of the giant panda genome. The assembled contigs (2.25 gigabases (Gb)) cover approximately 94% of the whole genome, and the remaining gaps (0.05 Gb) seem to contain carnivore-specific repeats and tandem repeats. Comparisons with the dog and human showed that the panda genome has a lower divergence rate. The assessment of panda genes potentially underlying some of its unique traits indicated that its bamboo diet might be more dependent on its gut microbiome than its own genetic composition. We also identified more than 2.7 million heterozygous single nucleotide polymorphisms in the diploid genome. Our data and analyses provide a foundation for promoting mammalian genetic research, and demonstrate the feasibility for using next-generation sequencing technologies for accurate, cost-effective and rapid de novo assembly of large eukaryotic genomes.


Subject(s)
Genome/genetics , Genomics , Ursidae/genetics , Algorithms , Animals , China , Conserved Sequence/genetics , Contig Mapping , Diet/veterinary , Dogs , Evolution, Molecular , Female , Fertility/genetics , Fertility/physiology , Heterozygote , Humans , Multigene Family/genetics , Polymorphism, Single Nucleotide/genetics , Receptors, G-Protein-Coupled/genetics , Sequence Alignment , Sequence Analysis, DNA , Synteny/genetics , Ursidae/classification , Ursidae/physiology
13.
Nat Genet ; 41(12): 1275-81, 2009 Dec.
Article in English | MEDLINE | ID: mdl-19881527

ABSTRACT

Cucumber is an economically important crop as well as a model system for sex determination studies and plant vascular biology. Here we report the draft genome sequence of Cucumis sativus var. sativus L., assembled using a novel combination of traditional Sanger and next-generation Illumina GA sequencing technologies to obtain 72.2-fold genome coverage. The absence of recent whole-genome duplication, along with the presence of few tandem duplications, explains the small number of genes in the cucumber. Our study establishes that five of the cucumber's seven chromosomes arose from fusions of ten ancestral chromosomes after divergence from Cucumis melo. The sequenced cucumber genome affords insight into traits such as its sex expression, disease resistance, biosynthesis of cucurbitacin and 'fresh green' odor. We also identify 686 gene clusters related to phloem function. The cucumber genome provides a valuable resource for developing elite cultivars and for studying the evolution and function of the plant vascular system.


Subject(s)
Cucumis sativus/genetics , Genome, Plant , DNA Transposable Elements/genetics , DNA, Plant/chemistry , Gene Duplication , Genes, Plant , Immunity, Innate/genetics , Molecular Sequence Data , Plant Diseases/genetics , Repetitive Sequences, Nucleic Acid , Synteny
SELECTION OF CITATIONS
SEARCH DETAIL
...