Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 13 de 13
Filter
Add more filters










Publication year range
2.
Nat Commun ; 13(1): 2365, 2022 05 02.
Article in English | MEDLINE | ID: mdl-35501320

ABSTRACT

The 4D Nucleome (4DN) Network aims to elucidate the complex structure and organization of chromosomes in the nucleus and the impact of their disruption in disease biology. We present the 4DN Data Portal ( https://data.4dnucleome.org/ ), a repository for datasets generated in the 4DN network and relevant external datasets. Datasets were generated with a wide range of experiments, including chromosome conformation capture assays such as Hi-C and other innovative sequencing and microscopy-based assays probing chromosome architecture. All together, the 4DN data portal hosts more than 1800 experiment sets and 36000 files. Results of sequencing-based assays from different laboratories are uniformly processed and quality-controlled. The portal interface allows easy browsing, filtering, and bulk downloads, and the integrated HiGlass genome browser allows interactive visualization and comparison of multiple datasets. The 4DN data portal represents a primary resource for chromosome contact and other nuclear architecture data for the scientific community.


Subject(s)
Chromosomes , Software , Cell Nucleus/genetics , Chromosomes/genetics , Genome
3.
Nucleic Acids Res ; 45(D1): D663-D671, 2017 01 04.
Article in English | MEDLINE | ID: mdl-27799470

ABSTRACT

Since 1992, FlyBase (flybase.org) has been an essential online resource for the Drosophila research community. Concentrating on the most extensively studied species, Drosophila melanogaster, FlyBase includes information on genes (molecular and genetic), transgenic constructs, phenotypes, genetic and physical interactions, and reagents such as stocks and cDNAs. Access to data is provided through a number of tools, reports, and bulk-data downloads. Looking to the future, FlyBase is expanding its focus to serve a broader scientific community. In this update, we describe new features, datasets, reagent collections, and data presentations that address this goal, including enhanced orthology data, Human Disease Model Reports, protein domain search and visualization, concise gene summaries, a portal for external resources, video tutorials and the FlyBase Community Advisory Group.


Subject(s)
Computational Biology/methods , Databases, Genetic , Drosophila/genetics , Genomics/methods , Animals , Disease Models, Animal , Genetic Association Studies , Humans , Web Browser
4.
G3 (Bethesda) ; 5(8): 1737-49, 2015 Jun 24.
Article in English | MEDLINE | ID: mdl-26109356

ABSTRACT

In the context of the FlyBase annotated gene models in Drosophila melanogaster, we describe the many exceptional cases we have curated from the literature or identified in the course of FlyBase analysis. These range from atypical but common examples such as dicistronic and polycistronic transcripts, noncanonical splices, trans-spliced transcripts, noncanonical translation starts, and stop-codon readthroughs, to single exceptional cases such as ribosomal frameshifting and HAC1-type intron processing. In FlyBase, exceptional genes and transcripts are flagged with Sequence Ontology terms and/or standardized comments. Because some of the rule-benders create problems for handlers of high-throughput data, we discuss plans for flagging these cases in bulk data downloads.


Subject(s)
Drosophila melanogaster/genetics , Molecular Sequence Annotation , Animals , Base Sequence , Codon, Terminator , Databases, Genetic , Mitochondria/genetics , Mitochondria/metabolism , Models, Genetic , Protein Biosynthesis , RNA Editing , RNA Splice Sites
5.
G3 (Bethesda) ; 5(8): 1721-36, 2015 Jun 24.
Article in English | MEDLINE | ID: mdl-26109357

ABSTRACT

We report the current status of the FlyBase annotated gene set for Drosophila melanogaster and highlight improvements based on high-throughput data. The FlyBase annotated gene set consists entirely of manually annotated gene models, with the exception of some classes of small non-coding RNAs. All gene models have been reviewed using evidence from high-throughput datasets, primarily from the modENCODE project. These datasets include RNA-Seq coverage data, RNA-Seq junction data, transcription start site profiles, and translation stop-codon read-through predictions. New annotation guidelines were developed to take into account the use of the high-throughput data. We describe how this flood of new data was incorporated into thousands of new and revised annotations. FlyBase has adopted a philosophy of excluding low-confidence and low-frequency data from gene model annotations; we also do not attempt to represent all possible permutations for complex and modularly organized genes. This has allowed us to produce a high-confidence, manageable gene annotation dataset that is available at FlyBase (http://flybase.org). Interesting aspects of new annotations include new genes (coding, non-coding, and antisense), many genes with alternative transcripts with very long 3' UTRs (up to 15-18 kb), and a stunning mismatch in the number of male-specific genes (approximately 13% of all annotated gene models) vs. female-specific genes (less than 1%). The number of identified pseudogenes and mutations in the sequenced strain also increased significantly. We discuss remaining challenges, for instance, identification of functional small polypeptides and detection of alternative translation starts.


Subject(s)
Drosophila melanogaster/genetics , Molecular Sequence Annotation , 3' Untranslated Regions , Animals , Databases, Genetic , Exons , Female , Male , Models, Genetic , RNA, Small Untranslated/chemistry , RNA, Small Untranslated/metabolism , Sequence Analysis, RNA , Transcription Initiation Site , Transcriptome
6.
Nucleic Acids Res ; 43(Database issue): D690-7, 2015 Jan.
Article in English | MEDLINE | ID: mdl-25398896

ABSTRACT

Release 6, the latest reference genome assembly of the fruit fly Drosophila melanogaster, was released by the Berkeley Drosophila Genome Project in 2014; it replaces their previous Release 5 genome assembly, which had been the reference genome assembly for over 7 years. With the enormous amount of information now attached to the D. melanogaster genome in public repositories and individual laboratories, the replacement of the previous assembly by the new one is a major event requiring careful migration of annotations and genome-anchored data to the new, improved assembly. In this report, we describe the attributes of the new Release 6 reference genome assembly, the migration of FlyBase genome annotations to this new assembly, how genome features on this new assembly can be viewed in FlyBase (http://flybase.org) and how users can convert coordinates for their own data to the corresponding Release 6 coordinates.


Subject(s)
Databases, Genetic , Drosophila melanogaster/genetics , Genome, Insect , Molecular Sequence Annotation , Animals , Genomics/standards , High-Throughput Nucleotide Sequencing , Internet , Models, Genetic , Molecular Sequence Data , Reference Standards , Sequence Alignment , Software
7.
Genome Res ; 17(12): 1823-36, 2007 Dec.
Article in English | MEDLINE | ID: mdl-17989253

ABSTRACT

The availability of sequenced genomes from 12 Drosophila species has enabled the use of comparative genomics for the systematic discovery of functional elements conserved within this genus. We have developed quantitative metrics for the evolutionary signatures specific to protein-coding regions and applied them genome-wide, resulting in 1193 candidate new protein-coding exons in the D. melanogaster genome. We have reviewed these predictions by manual curation and validated a subset by directed cDNA screening and sequencing, revealing both new genes and new alternative splice forms of known genes. We also used these evolutionary signatures to evaluate existing gene annotations, resulting in the validation of 87% of genes lacking descriptive names and identifying 414 poorly conserved genes that are likely to be spurious predictions, noncoding, or species-specific genes. Furthermore, our methods suggest a variety of refinements to hundreds of existing gene models, such as modifications to translation start codons and exon splice boundaries. Finally, we performed directed genome-wide searches for unusual protein-coding structures, discovering 149 possible examples of stop codon readthrough, 125 new candidate ORFs of polycistronic mRNAs, and several candidate translational frameshifts. These results affect >10% of annotated fly genes and demonstrate the power of comparative genomics to enhance our understanding of genome organization, even in a model organism as intensively studied as Drosophila melanogaster.


Subject(s)
Drosophila Proteins/genetics , Drosophila melanogaster/genetics , Genes, Insect , Genome, Insect , Animals , Base Sequence , Codon/genetics , Conserved Sequence , Drosophila Proteins/chemistry , Evolution, Molecular , Molecular Sequence Data , Reading Frames , Sequence Alignment
8.
Genome Res ; 15(1): 1-18, 2005 Jan.
Article in English | MEDLINE | ID: mdl-15632085

ABSTRACT

We have sequenced the genome of a second Drosophila species, Drosophila pseudoobscura, and compared this to the genome sequence of Drosophila melanogaster, a primary model organism. Throughout evolution the vast majority of Drosophila genes have remained on the same chromosome arm, but within each arm gene order has been extensively reshuffled, leading to a minimum of 921 syntenic blocks shared between the species. A repetitive sequence is found in the D. pseudoobscura genome at many junctions between adjacent syntenic blocks. Analysis of this novel repetitive element family suggests that recombination between offset elements may have given rise to many paracentric inversions, thereby contributing to the shuffling of gene order in the D. pseudoobscura lineage. Based on sequence similarity and synteny, 10,516 putative orthologs have been identified as a core gene set conserved over 25-55 million years (Myr) since the pseudoobscura/melanogaster divergence. Genes expressed in the testes had higher amino acid sequence divergence than the genome-wide average, consistent with the rapid evolution of sex-specific proteins. Cis-regulatory sequences are more conserved than random and nearby sequences between the species--but the difference is slight, suggesting that the evolution of cis-regulatory elements is flexible. Overall, a pattern of repeat-mediated chromosomal rearrangement, and high coadaptation of both male genes and cis-regulatory sequences emerges as important themes of genome divergence between these species of Drosophila.


Subject(s)
Chromosomes/genetics , Drosophila/genetics , Evolution, Molecular , Genes, Insect/genetics , Genome , Sequence Analysis, DNA/methods , Animals , Chromosome Breakage/genetics , Chromosome Inversion/genetics , Chromosome Mapping/methods , Conserved Sequence/genetics , Drosophila melanogaster/genetics , Enhancer Elements, Genetic , Gene Rearrangement/genetics , Genetic Variation/genetics , Molecular Sequence Data , Predictive Value of Tests , Repetitive Sequences, Nucleic Acid/genetics
9.
J Neurogenet ; 17(2-3): 139-69, 2003.
Article in English | MEDLINE | ID: mdl-14668198

ABSTRACT

Past studies have implicated the Drosophila LARK protein in the circadian control of adult eclosion behavior. LARK has a broad tissue pattern of distribution, and is pan-neuronal in the differentiated brain. In certain peptidergic neurons, LARK abundance changes in a circadian manner. However, the precise cellular requirement for LARK, with respect to circadian behavior, is still not known. To explore this issue, we employed the GAL4/UAS binary expression system to increase LARK abundance in defined neuronal cell types. Interestingly, LARK expression in Crustacean Cardioactive Peptide (CCAP) neurons caused an early-eclosion phenotype, whereas a similar perturbation in the Eclosion Hormone (EH) cells resulted in abnormally late peaks of eclosion. Surprisingly, LARK expression in Pigment Dispersing Factor (PDF)- or TIMELESS (TIM)-containing clock neurons caused behavioral arrhythmicity, even though clock protein cycling was found to be normal in these flies. Although the observed effects of LARK expression mirrored those seen with genetic ablation of the relevant peptidergic populations, there was no evidence of defective cell development or morphology. This suggests that an alteration of cell function rather than cell death is the cause of the aberrant phenotypes. Diminished PDF immunoreactivity in flies expressing LARK in the PDF neurons suggests that an effect on neuropeptide synthesis, transport, or release may contribute to the observed arrhythmicity. Importantly, the expression of LARK in several other cell populations did not have detectable effects on development, viability or behavior, indicating a specificity of action within certain cell types.


Subject(s)
Behavior, Animal/physiology , Brain/metabolism , Circadian Rhythm , Drosophila Proteins/genetics , Drosophila melanogaster/genetics , Genes, Insect , RNA-Binding Proteins/genetics , Animals , Animals, Genetically Modified , Drosophila melanogaster/metabolism , Female , Gene Expression , Locomotion/genetics , Male , Neurons/metabolism
10.
Development ; 130(12): 2645-56, 2003 Jun.
Article in English | MEDLINE | ID: mdl-12736209

ABSTRACT

Insect growth and metamorphosis is punctuated by molts, during which a new cuticle is produced. Every molt culminates in ecdysis, the shedding of the remains of the old cuticle. Both the timing of ecdysis relative to the molt and the actual execution of this vital insect behavior are under peptidergic neuronal control. Based on studies in the moth, Manduca sexta, it has been postulated that the neuropeptide Crustacean cardioactive peptide (CCAP) plays a key role in the initiation of the ecdysis motor program. We have used Drosophila bearing targeted ablations of CCAP neurons (CCAP KO animals) to investigate the role of CCAP in the execution and circadian regulation of ecdysis. CCAP KO animals showed specific defects at ecdysis, yet the severity and nature of the defects varied at different developmental stages. The majority of CCAP KO animals died at the pupal stage from the failure of pupal ecdysis, whereas larval ecdysis and adult eclosion behaviors showed only subtle defects. Interestingly, the most severe failure seen at eclosion appeared to be in a function required for abdominal inflation, which could be cardioactive in nature. Although CCAP KO populations exhibited circadian eclosion rhythms, the daily distribution of eclosion events (i.e., gating) was abnormal. Effects on the execution of ecdysis and its circadian regulation indicate that CCAP is a key regulator of the behavior. Nevertheless, an unexpected finding of this work is that the primary functions of CCAP as well as its importance in the control of ecdysis behaviors may change during the postembryonic development of Drosophila.


Subject(s)
Drosophila/physiology , Molting/physiology , Neurons/physiology , Neuropeptides/physiology , Amino Acid Sequence , Animals , Base Sequence , Circadian Rhythm/physiology , Gene Expression Regulation, Developmental , Genes, Regulator , Molecular Sequence Data , Neuropeptides/genetics , Pupa/physiology
12.
Neuron ; 34(6): 961-72, 2002 Jun 13.
Article in English | MEDLINE | ID: mdl-12086643

ABSTRACT

Mental retardation is a pervasive societal problem, 25 times more common than blindness for example. Fragile X syndrome, the most common form of inherited mental retardation, is caused by mutations in the FMR1 gene. Fragile X patients display neurite morphology defects in the brain, suggesting that this may be the basis of their mental retardation. Drosophila contains a single homolog of FMR1, dfxr (also called dfmr1). We analyzed the role of dfxr in neurite development in three distinct neuronal classes. We find that DFXR is required for normal neurite extension, guidance, and branching. dfxr mutants also display strong eclosion failure and circadian rhythm defects. Interestingly, distinct neuronal cell types show different phenotypes, suggesting that dfxr differentially regulates diverse targets in the brain.


Subject(s)
Brain/physiology , Drosophila Proteins/physiology , Fragile X Syndrome/genetics , Nerve Tissue Proteins/physiology , Neurons/cytology , RNA-Binding Proteins , Amino Acid Sequence , Animals , Brain/pathology , Circadian Rhythm/genetics , Circadian Rhythm/physiology , Fragile X Mental Retardation Protein , Molecular Sequence Data , Motor Activity/genetics , Motor Activity/physiology , Mutation , Nerve Tissue Proteins/deficiency , Nerve Tissue Proteins/genetics , Neuroglia/metabolism , Neuroglia/pathology , Neuroglia/physiology , Neurons/pathology , Neurons/physiology , Sequence Homology, Amino Acid
13.
Genome Biol ; 3(12): RESEARCH0083, 2002.
Article in English | MEDLINE | ID: mdl-12537572

ABSTRACT

BACKGROUND: The recent completion of the Drosophila melanogaster genomic sequence to high quality and the availability of a greatly expanded set of Drosophila cDNA sequences, aligning to 78% of the predicted euchromatic genes, afforded FlyBase the opportunity to significantly improve genomic annotations. We made the annotation process more rigorous by inspecting each gene visually, utilizing a comprehensive set of curation rules, requiring traceable evidence for each gene model, and comparing each predicted peptide to SWISS-PROT and TrEMBL sequences. RESULTS: Although the number of predicted protein-coding genes in Drosophila remains essentially unchanged, the revised annotation significantly improves gene models, resulting in structural changes to 85% of the transcripts and 45% of the predicted proteins. We annotated transposable elements and non-protein-coding RNAs as new features, and extended the annotation of untranslated (UTR) sequences and alternative transcripts to include more than 70% and 20% of genes, respectively. Finally, cDNA sequence provided evidence for dicistronic transcripts, neighboring genes with overlapping UTRs on the same DNA sequence strand, alternatively spliced genes that encode distinct, non-overlapping peptides, and numerous nested genes. CONCLUSIONS: Identification of so many unusual gene models not only suggests that some mechanisms for gene regulation are more prevalent than previously believed, but also underscores the complex challenges of eukaryotic gene prediction. At present, experimental data and human curation remain essential to generate high-quality genome annotations.


Subject(s)
Computational Biology/methods , Drosophila melanogaster/genetics , Euchromatin/genetics , Genes, Insect , Genome , Animals , Databases, Genetic , Databases, Protein , Drosophila Proteins/genetics , Humans
SELECTION OF CITATIONS
SEARCH DETAIL
...