Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 162
Filter
1.
EMBO J ; 2024 Aug 29.
Article in English | MEDLINE | ID: mdl-39210146

ABSTRACT

DNA i-motif structures are formed in the nuclei of human cells and are believed to provide critical genomic regulation. While the existence, abundance, and distribution of i-motif structures in human cells has been demonstrated and studied by immunofluorescent staining, and more recently NMR and CUT&Tag, the abundance and distribution of such structures in human genomic DNA have remained unclear. Here we utilise high-affinity i-motif immunoprecipitation followed by sequencing to map i-motifs in the purified genomic DNA of human MCF7, U2OS and HEK293T cells. Validated by biolayer interferometry and circular dichroism spectroscopy, our approach aimed to identify DNA sequences capable of i-motif formation on a genome-wide scale, revealing that such sequences are widely distributed throughout the human genome and are common in genes upregulated in G0/G1 cell cycle phases. Our findings provide experimental evidence for the widespread formation of i-motif structures in human genomic DNA and a foundational resource for future studies of their genomic, structural, and molecular roles.

2.
Bioinformatics ; 39(12)2023 12 01.
Article in English | MEDLINE | ID: mdl-38001031

ABSTRACT

MOTIVATION: Methods for concept recognition (CR) in clinical texts have largely been tested on abstracts or articles from the medical literature. However, texts from electronic health records (EHRs) frequently contain spelling errors, abbreviations, and other nonstandard ways of representing clinical concepts. RESULTS: Here, we present a method inspired by the BLAST algorithm for biosequence alignment that screens texts for potential matches on the basis of matching k-mer counts and scores candidates based on conformance to typical patterns of spelling errors derived from 2.9 million clinical notes. Our method, the Term-BLAST-like alignment tool (TBLAT) leverages a gold standard corpus for typographical errors to implement a sequence alignment-inspired method for efficient entity linkage. We present a comprehensive experimental comparison of TBLAT with five widely used tools. Experimental results show an increase of 10% in recall on scientific publications and 20% increase in recall on EHR records (when compared against the next best method), hence supporting a significant enhancement of the entity linking task. The method can be used stand-alone or as a complement to existing approaches. AVAILABILITY AND IMPLEMENTATION: Fenominal is a Java library that implements TBLAT for named CR of Human Phenotype Ontology terms and is available at https://github.com/monarch-initiative/fenominal under the GNU General Public License v3.0.


Subject(s)
Algorithms , Language , Humans , Sequence Alignment , Electronic Health Records , Publications
3.
STAR Protoc ; 4(4): 102729, 2023 Dec 15.
Article in English | MEDLINE | ID: mdl-37995194

ABSTRACT

Intercalated motifs or i-Motifs (iMs) are nucleic acid structures formed by cytosine-rich sequences, which may regulate cellular processes and have broad applications in nanotechnology due to their pH-dependent nature. We have developed an iM-specific nanobody (iMbody) that can recognize iM DNA structures regardless of their sequences, making it a versatile research tool for studying iMs in various contexts. Here, we provide a protocol for the bacterial expression and His-tag purification of iMbody. We then describe procedures for performing ELISA and immunostaining using iMbody.


Subject(s)
DNA , Nanotechnology , Nucleotide Motifs , Nanotechnology/methods , DNA/metabolism
4.
Nucleic Acids Res ; 51(18): 9658-9671, 2023 Oct 13.
Article in English | MEDLINE | ID: mdl-37615576

ABSTRACT

Methylation of cytosines in the CG context (mCG) is the most abundant DNA modification in vertebrates that plays crucial roles in cellular differentiation and identity. After fertilization, DNA methylation patterns inherited from parental gametes are remodelled into a state compatible with embryogenesis. In mammals, this is achieved through the global erasure and re-establishment of DNA methylation patterns. However, in non-mammalian vertebrates like zebrafish, no global erasure has been observed. To investigate the evolutionary conservation and divergence of DNA methylation remodelling in teleosts, we generated base resolution DNA methylome datasets of developing medaka and medaka-zebrafish hybrid embryos. In contrast to previous reports, we show that medaka display comparable DNA methylome dynamics to zebrafish with high gametic mCG levels (sperm: ∼90%; egg: ∼75%), and adoption of a paternal-like methylome during early embryogenesis, with no signs of prior DNA methylation erasure. We also demonstrate that non-canonical DNA methylation (mCH) reprogramming at TGCT tandem repeats is a conserved feature of teleost embryogenesis. Lastly, we find remarkable evolutionary conservation of DNA methylation remodelling patterns in medaka-zebrafish hybrids, indicative of compatible DNA methylation maintenance machinery in far-related teleost species. Overall, these results suggest strong evolutionary conservation of DNA methylation remodelling pathways in teleosts, which is distinct from the global DNA methylome erasure and reestablishment observed in mammals.

5.
Genome Biol ; 24(1): 118, 2023 05 17.
Article in English | MEDLINE | ID: mdl-37198692

ABSTRACT

Predicting the impact of coding and noncoding variants on splicing is challenging, particularly in non-canonical splice sites, leading to missed diagnoses in patients. Existing splice prediction tools are complementary but knowing which to use for each splicing context remains difficult. Here, we describe Introme, which uses machine learning to integrate predictions from several splice detection tools, additional splicing rules, and gene architecture features to comprehensively evaluate the likelihood of a variant impacting splicing. Through extensive benchmarking across 21,000 splice-altering variants, Introme outperformed all tools (auPRC: 0.98) for the detection of clinically significant splice variants. Introme is available at https://github.com/CCICB/introme .


Subject(s)
RNA Splice Sites , RNA Splicing , Humans , Introns , Machine Learning , Mutation
6.
Am J Hum Genet ; 110(3): 419-426, 2023 03 02.
Article in English | MEDLINE | ID: mdl-36868206

ABSTRACT

Australian Genomics is a national collaborative partnership of more than 100 organizations piloting a whole-of-system approach to integrating genomics into healthcare, based on federation principles. In the first five years of operation, Australian Genomics has evaluated the outcomes of genomic testing in more than 5,200 individuals across 19 rare disease and cancer flagship studies. Comprehensive analyses of the health economic, policy, ethical, legal, implementation and workforce implications of incorporating genomics in the Australian context have informed evidence-based change in policy and practice, resulting in national government funding and equity of access for a range of genomic tests. Simultaneously, Australian Genomics has built national skills, infrastructure, policy, and data resources to enable effective data sharing to drive discovery research and support improvements in clinical genomic delivery.


Subject(s)
Genomics , Health Policy , Humans , Australia , Rare Diseases , Delivery of Health Care
7.
Nat Rev Mol Cell Biol ; 24(6): 430-447, 2023 06.
Article in English | MEDLINE | ID: mdl-36596869

ABSTRACT

Genes specifying long non-coding RNAs (lncRNAs) occupy a large fraction of the genomes of complex organisms. The term 'lncRNAs' encompasses RNA polymerase I (Pol I), Pol II and Pol III transcribed RNAs, and RNAs from processed introns. The various functions of lncRNAs and their many isoforms and interleaved relationships with other genes make lncRNA classification and annotation difficult. Most lncRNAs evolve more rapidly than protein-coding sequences, are cell type specific and regulate many aspects of cell differentiation and development and other physiological processes. Many lncRNAs associate with chromatin-modifying complexes, are transcribed from enhancers and nucleate phase separation of nuclear condensates and domains, indicating an intimate link between lncRNA expression and the spatial control of gene expression during development. lncRNAs also have important roles in the cytoplasm and beyond, including in the regulation of translation, metabolism and signalling. lncRNAs often have a modular structure and are rich in repeats, which are increasingly being shown to be relevant to their function. In this Consensus Statement, we address the definition and nomenclature of lncRNAs and their conservation, expression, phenotypic visibility, structure and functions. We also discuss research challenges and provide recommendations to advance the understanding of the roles of lncRNAs in development, cell biology and disease.


Subject(s)
RNA, Long Noncoding , RNA, Long Noncoding/genetics , Cell Nucleus/genetics , Chromatin/genetics , Regulatory Sequences, Nucleic Acid , RNA Polymerase II/genetics
8.
Am J Hum Genet ; 109(11): 1960-1973, 2022 11 03.
Article in English | MEDLINE | ID: mdl-36332611

ABSTRACT

Sharing genomic variant interpretations across laboratories promotes consistency in variant assertions. A landscape analysis of Australian clinical genetic-testing laboratories in 2017 identified that, despite the national-accreditation-body recommendations encouraging laboratories to submit genotypic data to clinical databases, fewer than 300 variants had been shared to the ClinVar public database. Consultations with Australian laboratories identified resource constraints limiting routine application of manual processes, consent issues, and differences in interpretation systems as barriers to sharing. This information was used to define key needs and solutions required to enable national sharing of variant interpretations. The Shariant platform, using both the GRCh37 and GRCh38 genome builds, was developed to enable ongoing sharing of variant interpretations and associated evidence between Australian clinical genetic-testing laboratories. Where possible, two-way automated sharing was implemented so that disruption to laboratory workflows would be minimized. Terms of use were developed through consultation and currently restrict access to Australian clinical genetic-testing laboratories. Shariant was designed to store and compare structured evidence, to promote and record resolution of inter-laboratory classification discrepancies, and to streamline the submission of variant assertions to ClinVar. As of December 2021, more than 14,000 largely prospectively curated variant records from 11 participating laboratories have been shared. Discrepant classifications have been identified for 11% (28/260) of variants submitted by more than one laboratory. We have demonstrated that co-design with clinical laboratories is vital to developing and implementing a national variant-interpretation sharing effort. This approach has improved inter-laboratory concordance and enabled opportunities to standardize interpretation practices.


Subject(s)
Databases, Genetic , Laboratories , Humans , Genetic Variation , Australia , Genetic Testing
9.
Nature ; 608(7924): 757-765, 2022 08.
Article in English | MEDLINE | ID: mdl-35948641

ABSTRACT

The notion that mobile units of nucleic acid known as transposable elements can operate as genomic controlling elements was put forward over six decades ago1,2. However, it was not until the advancement of genomic sequencing technologies that the abundance and repertoire of transposable elements were revealed, and they are now known to constitute up to two-thirds of mammalian genomes3,4. The presence of DNA regulatory regions including promoters, enhancers and transcription-factor-binding sites within transposable elements5-8 has led to the hypothesis that transposable elements have been co-opted to regulate mammalian gene expression and cell phenotype8-14. Mammalian transposable elements include recent acquisitions and ancient transposable elements that have been maintained in the genome over evolutionary time. The presence of ancient conserved transposable elements correlates positively with the likelihood of a regulatory function, but functional validation remains an essential step to identify transposable element insertions that have a positive effect on fitness. Here we show that CRISPR-Cas9-mediated deletion of a transposable element-namely the LINE-1 retrotransposon Lx9c11-in mice results in an exaggerated and lethal immune response to virus infection. Lx9c11 is critical for the neogenesis of a non-coding RNA (Lx9c11-RegoS) that regulates genes of the Schlafen family, reduces the hyperinflammatory phenotype and rescues lethality in virus-infected Lx9c11-/- mice. These findings provide evidence that a transposable element can control the immune system to favour host survival during virus infection.


Subject(s)
DNA Transposable Elements , Host Microbial Interactions , Immunity , Retroelements , Virus Diseases , Animals , CRISPR-Cas Systems/genetics , DNA Transposable Elements/genetics , DNA Transposable Elements/immunology , Evolution, Molecular , Host Microbial Interactions/genetics , Host Microbial Interactions/immunology , Immunity/genetics , Mice , RNA, Untranslated/genetics , Regulatory Sequences, Nucleic Acid/genetics , Retroelements/genetics , Retroelements/immunology , Virus Diseases/genetics , Virus Diseases/immunology
10.
Eur J Hum Genet ; 30(10): 1121-1131, 2022 10.
Article in English | MEDLINE | ID: mdl-35970915

ABSTRACT

Whole genome sequencing (WGS) improves Mendelian disorder diagnosis over whole exome sequencing (WES); however, additional diagnostic yields and costs remain undefined. We investigated differences between diagnostic and cost outcomes of WGS and WES in a cohort with suspected Mendelian disorders. WGS was performed in 38 WES-negative families derived from a 64 family Mendelian cohort that previously underwent WES. For new WGS diagnoses, contemporary WES reanalysis determined whether variants were diagnosable by original WES or unique to WGS. Diagnostic rates were estimated for WES and WGS to simulate outcomes if both had been applied to the 64 families. Diagnostic costs were calculated for various genomic testing scenarios. WGS diagnosed 34% (13/38) of WES-negative families. However, contemporary WES reanalysis on average 2 years later would have diagnosed 18% (7/38 families) resulting in a WGS-specific diagnostic yield of 19% (6/31 remaining families). In WES-negative families, the incremental cost per additional diagnosis using WGS following WES reanalysis was AU$36,710 (£19,407;US$23,727) and WGS alone was AU$41,916 (£22,159;US$27,093) compared to WES-reanalysis. When we simulated the use of WGS alone as an initial genomic test, the incremental cost for each additional diagnosis was AU$29,708 (£15,705;US$19,201) whereas contemporary WES followed by WGS was AU$36,710 (£19,407;US$23,727) compared to contemporary WES. Our findings confirm that WGS is the optimal genomic test choice for maximal diagnosis in Mendelian disorders. However, accepting a small reduction in diagnostic yield, WES with subsequent reanalysis confers the lowest costs. Whether WES or WGS is utilised will depend on clinical scenario and local resourcing and availability.


Subject(s)
Exome , Base Sequence , Chromosome Mapping , Humans , Exome Sequencing , Whole Genome Sequencing
SELECTION OF CITATIONS
SEARCH DETAIL