Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 17 de 17
Filter
1.
Am J Clin Pathol ; 148(6): 545-554, 2017 Nov 20.
Article in English | MEDLINE | ID: mdl-29165569

ABSTRACT

OBJECTIVES: Chronic lymphocytic leukemia (CLL) is the most common type of leukemia in Western populations, being rarer in Asian and African people. It has been suggested that patients with CLL from Africa might have a more aggressive disease compared with white patients. In this study, we aimed to identify genetic factors that may account for this difference. METHODS: We analyzed immunoglobulin heavy chain (IGH) genes' mutational status by performing next-generation sequencing in 25 Senegalese and 50 Italian patients with CLL. RESULTS: We found that Senegalese patients more frequently had adverse prognostic factors and an unmutated profile. Furthermore, we documented that IGHV1 (IGHV1-69), IGHD3, and IGHJ6 were significantly more frequent in Senegalese patients, whereas IGHV3-30 was common and limited to the Italian cohort. Stereotyped receptors commonly detected in the white population were not recorded in our Senegalese series. CONCLUSIONS: The different IGH repertoire we observed in the Senegalese cohort may reflect the diverse genetic and microenvironmental (ie, polymicrobial stimulation) background.


Subject(s)
Genes, Immunoglobulin Heavy Chain/genetics , Immunoglobulin Heavy Chains/genetics , Immunoglobulin Variable Region/genetics , Leukemia, Lymphocytic, Chronic, B-Cell/genetics , Amino Acid Sequence , Cohort Studies , Female , Humans , Male , Multigene Family/genetics , Senegal , Tumor Microenvironment/genetics
2.
Genome Announc ; 4(2)2016 Apr 07.
Article in English | MEDLINE | ID: mdl-27056222

ABSTRACT

Klebsiella pneumoniaesequence type (ST) 307, carryingblaKPC-3,blaCTX-M-15,blaOXA-1,aac(6')-Ib-cr, andqnrB1 genes, is replacing the predominant hyperepidemic ST258 clone in Italy. Whole-genome and complete plasmid sequencing of one ST307 strain was performed and new features were identified.

3.
Am J Clin Pathol ; 145(1): 116-27, 2016 Jan.
Article in English | MEDLINE | ID: mdl-26712879

ABSTRACT

OBJECTIVES: Recent studies using next-generation sequencing (NGS) analysis disclosed the importance of the intrinsic activation of the B-cell receptor (BCR) pathway in the pathogenesis of sporadic Burkitt lymphoma (sBL) due to mutations of TCF3/ID3 genes. Since no definitive data are available on the genetic landscape of endemic Burkitt (eBL), we first assessed the mutation frequency of TCF3/ID3 in eBL compared with sBL and subsequently the somatic hypermutation status of the BCR to answer whether an extrinsic activation of BCR signaling could also be demonstrated in Burkitt lymphoma. METHODS: We assessed the mutations of TCF3/ID3 by RNAseq and the BCR status by NGS analysis of the immunoglobulin genes (IGs). RESULTS: We detected mutations of TCF3/ID3 in about 30% of the eBL cases. This rate is significantly lower than that detected in sBL (64%). The NGS analysis of IGs revealed intraclonal diversity, suggesting an active targeted somatic hypermutation process in eBL compared with sBL. CONCLUSIONS: These findings support the view that the antigenic pressure plays a key role in the pathogenetic pathways of eBL, which may be partially distinct from those driving sBL development.


Subject(s)
Burkitt Lymphoma/genetics , Genes, Immunoglobulin , Adult , Basic Helix-Loop-Helix Transcription Factors/genetics , Burkitt Lymphoma/immunology , Child , Child, Preschool , Female , High-Throughput Nucleotide Sequencing , Humans , Infant , Male , Middle Aged , Mutation , Young Adult
4.
BMC Res Notes ; 7: 884, 2014 Dec 08.
Article in English | MEDLINE | ID: mdl-25486898

ABSTRACT

BACKGROUND: Plant receptor-like kinase (RLK/Pelle) family regulates growth and developmental processes and interaction with pathogens and symbionts.Platanaceae is one of the earliest branches of Eudicots temporally located before the split which gave rise to Rosids and Asterids. Thus investigations into the RLK family in Platanus can provide information on the evolution of this gene family in the land plants.Moreover RLKs are good candidates for finding genes that are able to confer resistance to Platanus pathogens. RESULTS: Degenerate oligonucleotide primers targeting the kinase domain of stress-related RLKs were used to isolate for the first time 111 RLK gene fragments in Platanus×acerifolia. Sequences were classified as candidates of the following subfamilies: CrRLK1L, LRR XII, WAK-like, and LRR X-BRI1 group. All the structural features typical of the RLK kinase domain were identified, including the non-RD motif which marks potential pathogen recognition receptors (PRRs). The LRR XII candidates, whose counterpart in Arabidopsis and rice comprises non-RD PRRs, were mostly non-RD kinases, suggesting a group of PRRs. Region-specific signatures of a relaxed purifying selection in the LRR XII candidates were also found, which is novel for plant RLK kinase domain and further supports the role of LRR XII candidates as PRRs. As we obtained CrRLK1L candidates using primers designed on Pto of tomato, we analysed the phylogenetic relationship between CrRLK1L and Pto-like of plant species. We thus classified all non-solanaceous Pto-like genes as CrRLK1L and highlighted for the first time the close phylogenetic vicinity between CrRLK1L and Pto group. The origins of Pto from CrRLK1L is proposed as an evolutionary mechanism. CONCLUSIONS: The signatures of relaxed purifying selection highlight that a group of RLKs might have been involved in the expression of phenotypic plasticity and is thus a good candidate for investigations into pathogen resistance.Search of Pto-like genes in Platanus highlighted the close relationship between CrRLK1L and Pto group. It will be exciting to verify if sensu strictu Pto are present in taxonomic groups other than Solanaceae, in order to further clarify the evolutionary link with CrRLK1L.We obtained a first valuable resource useful for an in-depth study on stress perception systems.


Subject(s)
Phylogeny , Plant Immunity/genetics , Plant Proteins/genetics , Protein Kinases/genetics , Amino Acid Sequence , Arabidopsis/classification , Arabidopsis/genetics , DNA Primers/chemistry , Solanum lycopersicum/classification , Solanum lycopersicum/genetics , Molecular Sequence Annotation , Molecular Sequence Data , Oryza/classification , Oryza/genetics , Plant Proteins/chemistry , Plant Proteins/immunology , Polymerase Chain Reaction , Proteaceae/classification , Proteaceae/enzymology , Proteaceae/genetics , Proteaceae/immunology , Protein Kinases/chemistry , Protein Kinases/immunology , Protein Structure, Tertiary , Selection, Genetic , Sequence Alignment , Sequence Homology, Amino Acid
5.
J Genomics ; 2: 89-93, 2014.
Article in English | MEDLINE | ID: mdl-25031660

ABSTRACT

The reconstruction of the complete genome sequence of an organism is an important point for comparative, functional and evolutionary genomics. Nevertheless, overcoming the problems encountered while completing the sequence of an entire genome can still be demanding in terms of time and resources. We have developed Enly, a simple tool based on the iterative mapping of sequence reads at contig edges, capable to extend the genomic contigs deriving from high-throughput sequencing, especially those deriving by Newbler-like assemblies. Testing it on a set of de novo draft genomes led to the closure of up to 20% of the gaps originally present. Enly is cross-platform and most of the steps of its pipeline are parallelizable, making easy and fast to improve a draft genome resulting from a de novo assembly.

6.
Blood ; 122(9): 1634-48, 2013 Aug 29.
Article in English | MEDLINE | ID: mdl-23794064

ABSTRACT

In chronic myeloid leukemia and Philadelphia chromosome-positive acute lymphoblastic leukemia, tyrosine kinase inhibitor (TKI) therapy may select for drug-resistant BCR-ABL mutants. We used an ultra-deep sequencing (UDS) approach to resolve qualitatively and quantitatively the complexity of mutated populations surviving TKIs and to investigate their clonal structure and evolution over time in relation to therapeutic intervention. To this purpose, we performed a longitudinal analysis of 106 samples from 33 patients who had received sequential treatment with multiple TKIs and had experienced sequential relapses accompanied by selection of 1 or more TKI-resistant mutations. We found that conventional Sanger sequencing had misclassified or underestimated BCR-ABL mutation status in 55% of the samples, where mutations with 1% to 15% abundance were detected. A complex clonal texture was uncovered by clonal analysis of samples harboring multiple mutations and up to 13 different mutated populations were identified. The landscape of these mutated populations was found to be highly dynamic. The high degree of complexity uncovered by UDS indicates that conventional Sanger sequencing might be an inadequate tool to assess BCR-ABL kinase domain mutation status, which currently represents an important component of the therapeutic decision algorithms. Further evaluation of the clinical usefulness of UDS-based approaches is warranted.


Subject(s)
DNA Mutational Analysis/methods , Drug Resistance, Neoplasm/genetics , Fusion Proteins, bcr-abl/genetics , High-Throughput Nucleotide Sequencing , Protein Kinase Inhibitors/therapeutic use , Adolescent , Adult , Aged , Catalytic Domain/genetics , Female , Fusion Proteins, bcr-abl/chemistry , Humans , Male , Middle Aged , Protein-Tyrosine Kinases/antagonists & inhibitors , Protein-Tyrosine Kinases/chemistry , Protein-Tyrosine Kinases/genetics , Retrospective Studies , Young Adult
7.
Appl Environ Microbiol ; 78(22): 8142-5, 2012 Nov.
Article in English | MEDLINE | ID: mdl-22941080

ABSTRACT

Intermediates of production of two batches of traditional mozzarella cheese were analyzed by culture-independent pyrosequencing. The quantitative distribution of taxa within the samples suggested that thermophilic lactic acid bacteria from the natural starter were mainly responsible for the fermentation, while microorganisms found in raw milk did not develop during fermentation.


Subject(s)
Bacteria/classification , Bacteria/genetics , Biota , Cheese/microbiology , Metagenome , Animals , Buffaloes , Cluster Analysis , DNA, Bacterial/chemistry , DNA, Bacterial/genetics , DNA, Ribosomal/chemistry , DNA, Ribosomal/genetics , High-Throughput Nucleotide Sequencing , Milk/microbiology , Phylogeny , RNA, Ribosomal, 16S/genetics
8.
PLoS One ; 7(7): e42164, 2012.
Article in English | MEDLINE | ID: mdl-22848739

ABSTRACT

Microdeletions at exon 19 are the most frequent genetic alterations affecting the Epidermal Growth Factor Receptor (EGFR) gene in non-small cell lung cancer (NSCLC) and they are strongly associated with response to treatment with tyrosine kinase inhibitors. A series of 116 NSCLC DNA samples investigated by Sanger Sequencing (SS), including 106 samples carrying exon 19 EGFR deletions and 10 without deletions (control samples), were subjected to deep next generation sequencing (NGS). All samples with deletions at SS showed deletions with NGS. No deletions were seen in control cases. In 93 (88%) cases, deletions detected by NGS were exactly corresponding to those identified by SS. In 13 cases (12%) NGS resolved deletions not accurately characterized by SS. In 21 (20%) cases the NGS showed presence of complex (double/multiple) frameshift deletions producing a net in-frame change. In 5 of these cases the SS could not define the exact sequence of mutant alleles, in the other 16 cases the results obtained by SS were conventionally considered as deletions plus insertions. Different interpretative hypotheses for complex mutations are discussed. In 46 (43%) tumors deep NGS showed, for the first time to our knowledge, subpopulations of DNA molecules carrying EGFR deletions different from the main one. Each of these subpopulations accounted for 0.1% to 17% of the genomic DNA in the different tumors investigated. Our findings suggest that a region in exon 19 is highly unstable in a large proportion of patients carrying EGFR deletions. As a corollary to this study, NGS data were compared with those obtained by immunohistochemistry using the 6B6 anti-mutant EGFR antibody. The immunoreaction was E746-A750del specific. In conclusion, NGS analysis of EGFR exon 19 in NSCLCs allowed us to formulate a new interpretative hypothesis for complex mutations and revealed the presence of subpopulations of deletions with potential pathogenetic and clinical impact.


Subject(s)
Carcinoma, Non-Small-Cell Lung/genetics , Exons/genetics , Gene Deletion , Genes, erbB-1/genetics , High-Throughput Nucleotide Sequencing , Lung Neoplasms/genetics , Base Sequence , Humans , Molecular Sequence Data
9.
IUBMB Life ; 63(12): 1068-74, 2011 Dec.
Article in English | MEDLINE | ID: mdl-22034231

ABSTRACT

The genome sequences of a number of Acinetobacter baumannii strains, including representatives of the main epidemic international lineages, have now been determined, and several others are in progress. The study of A. baumannii genomics has provided an expanded view of the adaptation and virulence capacities of this bacterial species, whilst also presenting novel insights into its intraspecies diversity and genome evolution. Genomic analyses have revealed that the current A. baumannii clinical population consists of low-grade pathogens, whose pathogenicity relies mainly on an ability to persist in the hospital setting and survive antibiotic treatment. A. baumannii has a high capacity to acquire new genetic determinants and displays an open pan genome; this feature may have played a crucial role in the evolution of this human opportunistic pathogen towards clinical success.


Subject(s)
Acinetobacter baumannii/genetics , Acinetobacter baumannii/pathogenicity , Adaptation, Biological/genetics , Drug Resistance, Multiple, Bacterial/genetics , Genes, MDR/genetics , Genome, Bacterial/genetics , Virulence/genetics , Acinetobacter Infections/epidemiology , Acinetobacter Infections/genetics , Acinetobacter Infections/microbiology , Acinetobacter baumannii/drug effects , Adaptation, Biological/drug effects , Anti-Bacterial Agents/pharmacology , Anti-Bacterial Agents/therapeutic use , Biological Evolution , Drug Resistance, Multiple, Bacterial/drug effects , Genes, MDR/drug effects , Genome, Bacterial/drug effects , Humans , Models, Genetic , Open Reading Frames , Virulence/drug effects
10.
BMC Microbiol ; 11: 25, 2011 Feb 01.
Article in English | MEDLINE | ID: mdl-21284853

ABSTRACT

BACKGROUND: Streptococcus pneumoniae is an important human pathogen representing a major cause of morbidity and mortality worldwide. We sequenced the genome of a serotype 11A, ST62 S. pneumoniae invasive isolate (AP200), that was erythromycin-resistant due to the presence of the erm(TR) determinant, and carried out analysis of the genome organization and comparison with other pneumococcal genomes. RESULTS: The genome sequence of S. pneumoniae AP200 is 2,130,580 base pair in length. The genome carries 2216 coding sequences (CDS), 56 tRNA, and 12 rRNA genes. Of the CDSs, 72.9% have a predicted biological known function. AP200 contains the pilus islet 2 and, although its phenotype corresponds to serotype 11A, it contains an 11D capsular locus. Chromosomal rearrangements resulting from a large inversion across the replication axis, and horizontal gene transfer events were observed. The chromosomal inversion is likely implicated in the rebalance of the chromosomal architecture affected by the insertions of two large exogenous elements, the erm(TR)-carrying Tn1806 and a functional prophage designated φSpn_200. Tn1806 is 52,457 bp in size and comprises 49 ORFs. Comparative analysis of Tn1806 revealed the presence of a similar genetic element or part of it in related species such as Streptococcus pyogenes and also in the anaerobic species Finegoldia magna, Anaerococcus prevotii and Clostridium difficile. The genome of φSpn_200 is 35,989 bp in size and is organized in 47 ORFs grouped into five functional modules. Prophages similar to φSpn_200 were found in pneumococci and in other streptococcal species, showing a high degree of exchange of functional modules. φSpn_200 viral particles have morphologic characteristics typical of the Siphoviridae family and are capable of infecting a pneumococcal recipient strain. CONCLUSIONS: The sequence of S. pneumoniae AP200 chromosome revealed a dynamic genome, characterized by chromosomal rearrangements and horizontal gene transfers. The overall diversity of AP200 is driven mainly by the presence of the exogenous elements Tn1806 and φSpn_200 that show large gene exchanges with other genetic elements of different bacterial species. These genetic elements likely provide AP200 with additional genes, such as those conferring antibiotic-resistance, promoting its adaptation to the environment.


Subject(s)
Genome, Bacterial , Streptococcus pneumoniae/genetics , Chromosomes, Bacterial/genetics , DNA Transposable Elements , DNA, Bacterial/genetics , Molecular Sequence Annotation , Molecular Sequence Data , Prophages/genetics , Sequence Analysis, DNA , Streptococcus pneumoniae/classification , Streptococcus pneumoniae/isolation & purification
11.
Anal Biochem ; 406(2): 176-84, 2010 Nov 15.
Article in English | MEDLINE | ID: mdl-20670611

ABSTRACT

The X-linked dystrophin gene is well known for its involvement in Duchenne/Becker muscular dystrophies and for its exceptional megabase size. This locus at Xp21 is prone to frequent random molecular changes, including large deletions and duplications, but also smaller variations. To cope with such huge sequence analysis requirements in forthcoming diagnostic applications, we employed the power of the parallel 454 GS-FLX pyrosequencer to the dystrophin locus. We enriched the genomic region of interest by the robust amplification of 62 fragments under universal conditions by the long-PCR protocol yielding 244,707 bp of sequence. Pooled PCR products were fragmented and used for library preparation and DNA sequencing. To evaluate the entire procedure we analyzed four male DNA samples for sequence coverage and accuracy in DNA sequence variation and for any potential bias. We identified 562 known variations and 55 additional variants not yet reported, among which we detected a causative Arg1844Stop mutation in one sample. Sanger sequencing confirmed all changes. Unexpectedly, only 3 x coverage was sufficient for 99.9993% accuracy. Our results show that long PCR combined to massive pyrosequencing is very reliable for the analysis of the biggest gene of the human genome and open the doors to other demanding applications in molecular diagnostics.


Subject(s)
Dystrophin/genetics , Genetic Loci/genetics , Polymerase Chain Reaction/methods , Sequence Analysis, DNA/methods , Temperature , Base Sequence , Biopolymers/genetics , Humans , Male , Mutation/genetics , Reproducibility of Results
12.
Nucleic Acids Res ; 38(9): e110, 2010 May.
Article in English | MEDLINE | ID: mdl-20144949

ABSTRACT

We have developed a high-throughput protein expression and interaction analysis platform that combines cDNA phage display library selection and massive gene sequencing using the 454 platform. A phage display library of open reading frame (ORF) fragments was created from mRNA derived from different tissues. This was used to study the interaction network of the enzyme transglutaminase 2 (TG2), a multifunctional enzyme involved in the regulation of cell growth, differentiation and apoptosis, associated with many different pathologies. After two rounds of panning with TG2 we assayed the frequency of ORFs within the selected phage population using 454 sequencing. Ranking and analysis of more than 120,000 sequences allowed us to identify several potential interactors, which were subsequently confirmed in functional assays. Within the identified clones, three had been previously described as interacting proteins (fibronectin, SMOC1 and GSTO2), while all the others were new. When compared with standard systems, such as microtiter enzyme-linked immunosorbant assay, the method described here is dramatically faster and yields far more information about the interaction under study, allowing better characterization of complex systems. For example, in the case of fibronectin, it was possible to identify the specific domains involved in the interaction.


Subject(s)
Protein Interaction Mapping/methods , Sequence Analysis, DNA/methods , DNA, Complementary/chemistry , GTP-Binding Proteins/metabolism , Humans , Open Reading Frames , Peptide Library , Protein Glutamine gamma Glutamyltransferase 2 , Protein Interaction Domains and Motifs , Transglutaminases/metabolism
13.
BMC Genomics ; 10: 163, 2009 Apr 20.
Article in English | MEDLINE | ID: mdl-19379481

ABSTRACT

BACKGROUND: The cancer transcriptome is difficult to explore due to the heterogeneity of quantitative and qualitative changes in gene expression linked to the disease status. An increasing number of "unconventional" transcripts, such as novel isoforms, non-coding RNAs, somatic gene fusions and deletions have been associated with the tumoral state. Massively parallel sequencing techniques provide a framework for exploring the transcriptional complexity inherent to cancer with a limited laboratory and financial effort. We developed a deep sequencing and bioinformatics analysis protocol to investigate the molecular composition of a breast cancer poly(A)+ transcriptome. This method utilizes a cDNA library normalization step to diminish the representation of highly expressed transcripts and biology-oriented bioinformatic analyses to facilitate detection of rare and novel transcripts. RESULTS: We analyzed over 132,000 Roche 454 high-confidence deep sequencing reads from a primary human lobular breast cancer tissue specimen, and detected a range of unusual transcriptional events that were subsequently validated by RT-PCR in additional eight primary human breast cancer samples. We identified and validated one deletion, two novel ncRNAs (one intergenic and one intragenic), ten previously unknown or rare transcript isoforms and a novel gene fusion specific to a single primary tissue sample. We also explored the non-protein-coding portion of the breast cancer transcriptome, identifying thousands of novel non-coding transcripts and more than three hundred reads corresponding to the non-coding RNA MALAT1, which is highly expressed in many human carcinomas. CONCLUSION: Our results demonstrate that combining 454 deep sequencing with a normalization step and careful bioinformatic analysis facilitates the discovery and quantification of rare transcripts or ncRNAs, and can be used as a qualitative tool to characterize transcriptome complexity, revealing many hitherto unknown transcripts, splice isoforms, gene fusion events and ncRNAs, even at a relatively low sequence sampling.


Subject(s)
Breast Neoplasms/genetics , Gene Expression Profiling , Sequence Analysis, DNA , Transcription, Genetic , Amino Acid Sequence , Base Sequence , Breast Neoplasms/metabolism , Calmodulin-Binding Proteins/genetics , Computational Biology , Cytoskeletal Proteins/genetics , DNA, Complementary/chemistry , Databases, Genetic , Female , Gene Expression Regulation, Neoplastic , Histone-Lysine N-Methyltransferase/genetics , Humans , Molecular Sequence Data , Nuclear Proteins/genetics , RNA, Untranslated/metabolism , Reverse Transcriptase Polymerase Chain Reaction , Sequence Alignment , Sequence Analysis, RNA , Ubiquitin-Protein Ligases
14.
Antimicrob Agents Chemother ; 52(7): 2616-25, 2008 Jul.
Article in English | MEDLINE | ID: mdl-18411315

ABSTRACT

The whole-genome sequence of an epidemic, multidrug-resistant Acinetobacter baumannii strain (strain ACICU) belonging to the European clone II group and carrying the plasmid-mediated bla(OXA)(-)(58) carbapenem resistance gene was determined. The A. baumannii ACICU genome was compared with the genomes of A. baumannii ATCC 17978 and Acinetobacter baylyi ADP1, with the aim of identifying novel genes related to virulence and drug resistance. A. baumannii ACICU has a single chromosome of 3,904,116 bp (which is predicted to contain 3,758 genes) and two plasmids, pACICU1 and pACICU2, of 28,279 and 64,366 bp, respectively. Genome comparison showed 86.4% synteny with A. baumannii ATCC 17978 and 14.8% synteny with A. baylyi ADP1. A conspicuous number of transporters belonging to different superfamilies was predicted for A. baumannii ACICU. The relative number of transporters was much higher in ACICU than in ATCC 17978 and ADP1 (76.2, 57.2, and 62.5 transporters per Mb of genome, respectively). An antibiotic resistance island, AbaR2, was identified in ACICU and had plausibly evolved by reductive evolution from the AbaR1 island previously described in multiresistant strain A. baumannii AYE. Moreover, 36 putative alien islands (pAs) were detected in the ACICU genome; 24 of these had previously been described in the ATCC 17978 genome, 4 are proposed here for the first time and are present in both ATCC 17978 and ACICU, and 8 are unique to the ACICU genome. Fifteen of the pAs in the ACICU genome encode genes related to drug resistance, including membrane transporters and ex novo acquired resistance genes. These findings provide novel insight into the genetic basis of A. baumannii resistance.


Subject(s)
Acinetobacter baumannii/drug effects , Acinetobacter baumannii/genetics , Drug Resistance, Multiple, Bacterial/genetics , Genome, Bacterial , Acinetobacter/genetics , Acinetobacter Infections/drug therapy , Acinetobacter Infections/epidemiology , Acinetobacter Infections/microbiology , Acinetobacter baumannii/classification , Acinetobacter baumannii/pathogenicity , Bacterial Proteins/genetics , Cross Infection/drug therapy , Cross Infection/epidemiology , Cross Infection/microbiology , DNA, Bacterial/genetics , Disease Outbreaks , Europe/epidemiology , Humans , Membrane Transport Proteins/genetics , Molecular Sequence Data , Plasmids/genetics , Sequence Analysis, DNA , Species Specificity , Virulence/genetics
15.
Gene ; 365: 35-40, 2006 Jan 03.
Article in English | MEDLINE | ID: mdl-16343814

ABSTRACT

We present here a novel methodology for the identification of genome regions potentially spanning one or more protein coding genes. It is based on the detection of clusters of conserved sequence tags whose evolutionary dynamics, based on the observation of an excess bias of synonymous substitutions at nucleotide level and of conservative replacements at protein level, suggests a likely protein coding role. A benchmark test carried out on a 236 Mbp of human-mouse syntenic regions from human chromosomes 15, 21 and 22 identified 25 CST clusters potentially containing unannotated genes. A further annotation update of the human genome assembly revealed that 11/25 clusters actually contained a total of 20 validated genes and 10 of the remaining 14 clusters had several experimental evidence in support of the presence of protein coding genes. These findings demonstrate the effectiveness and high prediction reliability of the proposed methodology which could specifically be applied to the annotation of novel genome sequences.


Subject(s)
Chromosomes, Human, Pair 15 , Chromosomes, Human, Pair 21 , Chromosomes, Human, Pair 22 , Genome, Human , Animals , Base Sequence , Cluster Analysis , Computational Biology , Exons , Expressed Sequence Tags , Humans , Introns , Mice , Predictive Value of Tests , Software , Synteny
16.
Gene ; 349: 97-105, 2005 Apr 11.
Article in English | MEDLINE | ID: mdl-15777708

ABSTRACT

The control of translation is a fundamental mechanism in the regulation of gene expression. Among the cis-acting elements that play a role in translation regulation are upstream open reading frames (uORFs) and upstream AUG (uAUGs) located in the 5'UTR of mRNAs. We present here a genome-wide analysis of uAUGs and uORFs in a curated set of human and rodent mRNAs. Our study shows that the occurrence of uAUGs is suppressed more strongly than that of uORFs and that in-frame uAUGs are more strongly suppressed than out-of-frame uAUGs. A very similar pattern of uAUG/uORF frequency was also observed in mouse mRNAs. The analysis of orthologous 5'UTR sequences revealed a remarkable degree of evolutionary conservation only of those uORFs which acquired some functional activity. Our data suggest that besides leaky scanning and reinitiation, which likely occur with variable and gene-specific efficiency, the ribosome-shunt mechanism, eventually coupled to reinitiation after uORF translation, may be a widespread mode of translation regulation in eukaryotes.


Subject(s)
5' Untranslated Regions , Open Reading Frames , Animals , Base Sequence , Codon , Conserved Sequence , Evolution, Molecular , Gene Expression Regulation , Humans , Mice , Molecular Sequence Data , Protein Processing, Post-Translational , RNA, Messenger/genetics , Rats
17.
Nucleic Acids Res ; 33(Database issue): D141-6, 2005 Jan 01.
Article in English | MEDLINE | ID: mdl-15608165

ABSTRACT

The 5' and 3' untranslated regions of eukaryotic mRNAs play crucial roles in the post-transcriptional regulation of gene expression through the modulation of nucleo-cytoplasmic mRNA transport, translation efficiency, subcellular localization and message stability. UTRdb is a curated database of 5' and 3' untranslated sequences of eukaryotic mRNAs, derived from several sources of primary data. Experimentally validated functional motifs are annotated (and also collated as the UTRsite database) and cross-links to genomic and protein data are provided. The integration of UTRdb with genomic and protein data has allowed the implementation of a powerful retrieval resource for the selection and extraction of UTR subsets based on their genomic coordinates and/or features of the protein encoded by the relevant mRNA (e.g. GO term, PFAM domain, etc.). All internet resources implemented for retrieval and functional analysis of 5' and 3' untranslated regions of eukaryotic mRNAs are accessible at http://www.ba.itb.cnr.it/UTR/.


Subject(s)
3' Untranslated Regions/chemistry , 5' Untranslated Regions/chemistry , Databases, Nucleic Acid , Regulatory Sequences, Ribonucleic Acid , Base Sequence , Eukaryotic Cells/metabolism , Systems Integration
SELECTION OF CITATIONS
SEARCH DETAIL
...