Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 25.529
Filtrar
1.
Artigo em Inglês | MEDLINE | ID: mdl-38862426

RESUMO

The high-fidelity (HiFi) long-read sequencing technology developed by PacBio has greatly improved the base-level accuracy of genome assemblies. However, these assemblies still contain base-level errors, particularly within the error-prone regions of HiFi long reads. Existing genome polishing tools usually introduce overcorrections and haplotype switch errors when correcting errors in genomes assembled from HiFi long reads. Here, we describe an upgraded genome polishing tool - NextPolish2, which can fix base errors remaining in those "highly accurate" genomes assembled from HiFi long reads without introducing excessive overcorrections and haplotype switch errors. We believe that NextPolish2 has a great significance to further improve the accuracy of telomere-to-telomere (T2T) genomes. NextPolish2 is freely available at https://github.com/Nextomics/NextPolish2.


Assuntos
Software , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Análise de Sequência de DNA/métodos , Humanos , Genômica/métodos , Sequências Repetitivas de Ácido Nucleico/genética , Genoma/genética
2.
Croat Med J ; 65(3): 209-219, 2024 Jun 13.
Artigo em Inglês | MEDLINE | ID: mdl-38868967

RESUMO

AIM: To precisely identify and analyze alpha-satellite higher-order repeats (HORs) in T2T-CHM13 assembly of human chromosome 3. METHODS: From the recently sequenced complete T2T-CHM13 assembly of human chromosome 3, the precise alpha satellite HOR structure was computed by using the novel high-precision GRM2023 algorithm with global repeat map (GRM) and monomer distance (MD) diagrams. RESULTS: The major alpha satellite HOR array in chromosome 3 revealed a novel cascading HOR, housing 17mer HOR copies with subfragments of periods 15 and 2. Within each row in the cascading HOR, the monomers were of different types, but different rows within the same cascading 17mer HOR contained more than one monomer of the same type. Each canonical 17mer HOR copy comprised 17 monomers belonging to 16 different monomer types. Another pronounced 10mer HOR array was of the regular Willard's type. CONCLUSION: Our findings emphasize the complexity within the chromosome 3 centromere as well as deviations from expected highly regular patterns.


Assuntos
Cromossomos Humanos Par 3 , DNA Satélite , Humanos , DNA Satélite/genética , Cromossomos Humanos Par 3/genética , Centrômero/genética , Algoritmos , Sequências Repetitivas de Ácido Nucleico/genética
3.
Nat Plants ; 10(5): 691, 2024 May.
Artigo em Inglês | MEDLINE | ID: mdl-38783123
4.
BMC Bioinformatics ; 25(1): 194, 2024 May 17.
Artigo em Inglês | MEDLINE | ID: mdl-38755561

RESUMO

Telomeres are regions of repetitive DNA at the ends of linear chromosomes which protect chromosome ends from degradation. Telomere lengths have been extensively studied in the context of aging and disease, though most studies use average telomere lengths which are of limited utility. We present a method for identifying all 92 telomere alleles from long read sequencing data. Individual telomeres are identified using variant repeats proximal to telomere regions, which are unique across alleles. This high-throughput and high-resolution characterization of telomeres could be foundational to future studies investigating the roles of specific telomeres in aging and disease.


Assuntos
Alelos , Telômero , Telômero/genética , Humanos , Análise de Sequência de DNA/métodos , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Sequências Repetitivas de Ácido Nucleico/genética
5.
BMC Ecol Evol ; 24(1): 72, 2024 May 30.
Artigo em Inglês | MEDLINE | ID: mdl-38816840

RESUMO

Ctenoluciidae is a Neotropical freshwater fish family composed of two genera, Ctenolucius (C. beani and C. hujeta) and Boulengerella (B. cuvieri, B. lateristriga, B. lucius, B. maculata, and B. xyrekes), which present diploid number conservation of 36 chromosomes and a strong association of telomeric sequences with ribosomal DNAs. In the present study, we performed chromosomal mapping of microsatellites and transposable elements (TEs) in Boulengerella species and Ctenolucius hujeta. We aim to understand how those sequences are distributed in these organisms' genomes and their influence on the chromosomal evolution of the group. Our results indicate that repetitive sequences may had an active role in the karyotypic diversification of this family, especially in the formation of chromosomal hotspots that are traceable in the diversification processes of Ctenoluciidae karyotypes. We demonstrate that (GATA)n sequences also accumulate in the secondary constriction formed by the 18 S rDNA site, which shows consistent size heteromorphism between males and females in all Boulengerella species, suggesting an initial process of sex chromosome differentiation.


Assuntos
Caraciformes , Mapeamento Cromossômico , Sequências Repetitivas de Ácido Nucleico , Retroelementos , Animais , Caraciformes/genética , Masculino , Feminino , Retroelementos/genética , Sequências Repetitivas de Ácido Nucleico/genética , Evolução Molecular , Repetições de Microssatélites/genética , Cariótipo , Cromossomos/genética
6.
PLoS Pathog ; 20(5): e1012261, 2024 May.
Artigo em Inglês | MEDLINE | ID: mdl-38805555

RESUMO

Marek's disease virus (MDV) vaccines were the first vaccines that protected against cancer. The avirulent turkey herpesvirus (HVT) was widely employed and protected billions of chickens from a deadly MDV infection. It is also among the most common vaccine vectors providing protection against a plethora of pathogens. HVT establishes latency in T-cells, allowing the vaccine virus to persist in the host for life. Intriguingly, the HVT genome contains telomeric repeat arrays (TMRs) at both ends; however, their role in the HVT life cycle remains elusive. We have previously shown that similar TMRs in the MDV genome facilitate its integration into host telomeres, which ensures efficient maintenance of the virus genome during latency and tumorigenesis. In this study, we investigated the role of the TMRs in HVT genome integration, latency, and reactivation in vitro and in vivo. Additionally, we examined HVT infection of feather follicles. We generated an HVT mutant lacking both TMRs (vΔTMR) that efficiently replicated in cell culture. We could demonstrate that wild type HVT integrates at the ends of chromosomes containing the telomeres in T-cells, while integration was severely impaired in the absence of the TMRs. To assess the role of TMRs in vivo, we infected one-day-old chickens with HVT or vΔTMR. vΔTMR loads were significantly reduced in the blood and hardly any virus was transported to the feather follicle epithelium where the virus is commonly shed. Strikingly, latency in the spleen and reactivation of the virus were severely impaired in the absence of the TMRs, indicating that the TMRs are crucial for the establishment of latency and reactivation of HVT. Our findings revealed that the TMRs facilitate integration of the HVT genome into host chromosomes, which ensures efficient persistence in the host, reactivation, and transport of the virus to the skin.


Assuntos
Galinhas , Doença de Marek , Telômero , Integração Viral , Latência Viral , Animais , Galinhas/virologia , Telômero/genética , Telômero/virologia , Doença de Marek/virologia , Doença de Marek/imunologia , Doença de Marek/prevenção & controle , Vetores Genéticos , Herpesvirus Meleagrídeo 1/genética , Herpesvirus Meleagrídeo 1/imunologia , Vacinas contra Doença de Marek/imunologia , Vacinas contra Doença de Marek/genética , Genoma Viral , Herpesvirus Galináceo 2/genética , Herpesvirus Galináceo 2/imunologia , Sequências Repetitivas de Ácido Nucleico , Doenças das Aves Domésticas/virologia , Doenças das Aves Domésticas/imunologia , Doenças das Aves Domésticas/prevenção & controle
7.
Elife ; 122024 Apr 24.
Artigo em Inglês | MEDLINE | ID: mdl-38656297

RESUMO

Telomeres, which are chromosomal end structures, play a crucial role in maintaining genome stability and integrity in eukaryotes. In the baker's yeast Saccharomyces cerevisiae, the X- and Y'-elements are subtelomeric repetitive sequences found in all 32 and 17 telomeres, respectively. While the Y'-elements serve as a backup for telomere functions in cells lacking telomerase, the function of the X-elements remains unclear. This study utilized the S. cerevisiae strain SY12, which has three chromosomes and six telomeres, to investigate the role of X-elements (as well as Y'-elements) in telomere maintenance. Deletion of Y'-elements (SY12YΔ), X-elements (SY12XYΔ+Y), or both X- and Y'-elements (SY12XYΔ) did not impact the length of the terminal TG1-3 tracks or telomere silencing. However, inactivation of telomerase in SY12YΔ, SY12XYΔ+Y, and SY12XYΔ cells resulted in cellular senescence and the generation of survivors. These survivors either maintained their telomeres through homologous recombination-dependent TG1-3 track elongation or underwent microhomology-mediated intra-chromosomal end-to-end joining. Our findings indicate the non-essential role of subtelomeric X- and Y'-elements in telomere regulation in both telomerase-proficient and telomerase-null cells and suggest that these elements may represent remnants of S. cerevisiae genome evolution. Furthermore, strains with fewer or no subtelomeric elements exhibit more concise telomere structures and offer potential models for future studies in telomere biology.


Assuntos
Sequências Repetitivas de Ácido Nucleico , Saccharomyces cerevisiae , Telomerase , Telômero , Saccharomyces cerevisiae/genética , Telômero/metabolismo , Telômero/genética , Sequências Repetitivas de Ácido Nucleico/genética , Telomerase/genética , Telomerase/metabolismo , Homeostase do Telômero , Proteínas de Saccharomyces cerevisiae/genética , Proteínas de Saccharomyces cerevisiae/metabolismo , Deleção de Sequência
8.
Int J Mol Sci ; 25(8)2024 Apr 18.
Artigo em Inglês | MEDLINE | ID: mdl-38674025

RESUMO

In this study, we applied the iterative procedure (IP) method to search for families of highly diverged dispersed repeats in the genome of Cyanidioschyzon merolae, which contains over 16 million bases. The algorithm included the construction of position weight matrices (PWMs) for repeat families and the identification of more dispersed repeats based on the PWMs using dynamic programming. The results showed that the C. merolae genome contained 20 repeat families comprising a total of 33,938 dispersed repeats, which is significantly more than has been previously found using other methods. The repeats varied in length from 108 to 600 bp (522.54 bp in average) and occupied more than 72% of the C. merolae genome, whereas previously identified repeats, including tandem repeats, have been shown to constitute only about 28%. The high genomic content of dispersed repeats and their location in the coding regions suggest a significant role in the regulation of the functional activity of the genome.


Assuntos
Sequências Repetitivas de Ácido Nucleico , Rodófitas , Rodófitas/genética , Sequências Repetitivas de Ácido Nucleico/genética , Genoma , Algoritmos , Genômica/métodos
9.
Sci Data ; 11(1): 351, 2024 Apr 08.
Artigo em Inglês | MEDLINE | ID: mdl-38589366

RESUMO

Acanthacorydalis orientalis (McLachlan, 1899) (Megaloptera: Corydalidae) is an important freshwater-benthic invertebrate species that serves as an indicator for water-quality biomonitoring and is valuable for conservation from East Asia. Here, a high-quality reference genome for A. orientalis was constructed using Oxford Nanopore sequencing and High throughput Chromosome Conformation Capture (Hi-C) technology. The final genome size is 547.98 Mb, with the N50 values of contig and scaffold being 7.77 Mb and 50.53 Mb, respectively. The longest contig and scaffold are 20.57 Mb and 62.26 Mb in length, respectively. There are 99.75% contigs anchored onto 13 pseudo-chromosomes. Benchmarking Universal Single-Copy Orthologs (BUSCO) analysis showed that the completeness of the genome assembly is 99.01%. There are 10,977 protein-coding genes identified, of which 84.00% are functionally annotated. The genome contains 44.86% repeat sequences. This high-quality genome provides substantial data for future studies on population genetics, aquatic adaptation, and evolution of Megaloptera and other related insect groups.


Assuntos
Genoma de Inseto , Neópteros , Sequências Repetitivas de Ácido Nucleico , Cromossomos/genética , Anotação de Sequência Molecular , Filogenia , Neópteros/genética
10.
Sci Data ; 11(1): 340, 2024 Apr 05.
Artigo em Inglês | MEDLINE | ID: mdl-38580722

RESUMO

Despite the rapid advances in sequencing technology, limited genomic resources are currently available for phytophagous spider mites, which include many important agricultural pests. One of these pests is Tetranychus piercei (McGregor), a serious banana pest in East Asia exhibiting remarkable tolerance to high temperature. In this study, we assembled a high-quality genome of T. piercei using a combination of PacBio long reads and Illumina short reads sequencing. With the assistance of chromatin conformation capture technology, 99.9% of the contigs were anchored into three pseudochromosomes with a total size of 86.02 Mb. Repetitive elements, accounting for 14.16% of this genome (12.20 Mb), are predominantly composed of long-terminal repeats (30.7%). By combining evidence of ab initio prediction, transcripts, and homologous proteins, we annotated 11,881 protein-coding genes. Both the genome and proteins have high BUSCO completeness scores (>94%). This high-quality genome, along with reliable annotation, provides a valuable resource for investigating the high-temperature tolerance of this species and exploring the genomic basis that underlies the host range evolution of spider mites.


Assuntos
Tetranychidae , Animais , Cromossomos , Genoma , Genômica , Anotação de Sequência Molecular , Filogenia , Sequências Repetitivas de Ácido Nucleico , Tetranychidae/genética
11.
Sci Rep ; 14(1): 7840, 2024 04 03.
Artigo em Inglês | MEDLINE | ID: mdl-38570596

RESUMO

Using a combination of short- and long-reads sequencing, we were able to sequence the complete mitochondrial genome of the invasive 'New Zealand flatworm' Arthurdendyus triangulatus (Geoplanidae, Rhynchodeminae, Caenoplanini) and its two complete paralogous nuclear rRNA gene clusters. The mitogenome has a total length of 20,309 bp and contains repetitions that includes two types of tandem-repeats that could not be solved by short-reads sequencing. We also sequenced for the first time the mitogenomes of four species of Caenoplana (Caenoplanini). A maximum likelihood phylogeny associated A. triangulatus with the other Caenoplanini but Parakontikia ventrolineata and Australopacifica atrata were rejected from the Caenoplanini and associated instead with the Rhynchodemini, with Platydemus manokwari. It was found that the mitogenomes of all species of the subfamily Rhynchodeminae share several unusual structural features, including a very long cox2 gene. This is the first time that the complete paralogous rRNA clusters, which differ in length, sequence and seemingly number of copies, were obtained for a Geoplanidae.


Assuntos
Genoma Mitocondrial , Platelmintos , Animais , Platelmintos/genética , Genoma Mitocondrial/genética , Sequências Repetitivas de Ácido Nucleico , Filogenia , Análise de Sequência de DNA , RNA Ribossômico/genética
12.
Int J Mol Sci ; 25(8)2024 Apr 16.
Artigo em Inglês | MEDLINE | ID: mdl-38673983

RESUMO

Unraveling the intricate centromere structure of human chromosomes holds profound implications, illuminating fundamental genetic mechanisms and potentially advancing our comprehension of genetic disorders and therapeutic interventions. This study rigorously identified and structurally analyzed alpha satellite higher-order repeats (HORs) within the centromere of human chromosome 15 in the complete T2T-CHM13 assembly using the high-precision GRM2023 algorithm. The most extensive alpha satellite HOR array in chromosome 15 reveals a novel cascading HOR, housing 429 15mer HOR copies, containing 4-, 7- and 11-monomer subfragments. Within each row of cascading HORs, all alpha satellite monomers are of distinct types, as in regular Willard's HORs. However, different HOR copies within the same cascading 15mer HOR contain more than one monomer of the same type. Each canonical 15mer HOR copy comprises 15 monomers belonging to only 9 different monomer types. Notably, 65% of the 429 15mer cascading HOR copies exhibit canonical structures, while 35% display variant configurations. Identified as the second most extensive alpha satellite HOR, another novel cascading HOR within human chromosome 15 encompasses 164 20mer HOR copies, each featuring two subfragments. Moreover, a distinct pattern emerges as interspersed 25mer/26mer structures differing from regular Willard's HORs and giving rise to a 34-monomer subfragment. Only a minor 18mer HOR array of 12 HOR copies is of the regular Willard's type. These revelations highlight the complexity within the chromosome 15 centromeric region, accentuating deviations from anticipated highly regular patterns and hinting at profound information encoding and functional potential within the human centromere.


Assuntos
Centrômero , Cromossomos Humanos Par 15 , DNA Satélite , Humanos , DNA Satélite/genética , Centrômero/genética , Cromossomos Humanos Par 15/genética , Sequências Repetitivas de Ácido Nucleico
13.
Semin Cell Dev Biol ; 163: 2-13, 2024.
Artigo em Inglês | MEDLINE | ID: mdl-38664119

RESUMO

Homing genetic elements are a form of selfish DNA that inserts into a specific target site in the genome and spreads through the population by a process of biased inheritance. Two well-known types of homing element, called inteins and homing introns, were discovered decades ago. In this review we describe WHO elements, a newly discovered type of homing element that constitutes a distinct third category but is rare, having been found only in a few yeast species so far. WHO elements are inferred to spread using the same molecular homing mechanism as inteins and introns: they encode a site-specific endonuclease that cleaves the genome at the target site, making a DNA break that is subsequently repaired by copying the element. For most WHO elements, the target site is in the glycolytic gene FBA1. WHO elements differ from inteins and homing introns in two fundamental ways: they do not interrupt their host gene (FBA1), and they occur in clusters. The clusters were formed by successive integrations of different WHO elements into the FBA1 locus, the result of an 'arms race' between the endonuclease and its target site. We also describe one family of WHO elements (WHO10) that is no longer specifically associated with the FBA1 locus and instead appears to have become transposable, inserting at random genomic sites in Torulaspora globosa with up to 26 copies per strain. The WHO family of elements is therefore at the borderline between homing genetic elements and transposable elements.


Assuntos
Elementos de DNA Transponíveis , Elementos de DNA Transponíveis/genética , Íntrons/genética , Sequências Repetitivas de Ácido Nucleico/genética
14.
Bioessays ; 46(6): e2400013, 2024 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-38593286

RESUMO

In addition to monocentric eukaryotes, which have a single localized centromere on each chromosome, there are holocentric species, with extended repeat-based or repeat-less centromeres distributed over the entire chromosome length. At least two types of repeat-based holocentromeres exist, one composed of many small repeat-based centromere units (small unit-type), and another one characterized by a few large centromere units (large unit-type). We hypothesize that the transposable element-mediated dispersal of hundreds of short satellite arrays formed the small centromere unit-type holocentromere in Rhynchospora pubera. The large centromere unit-type of the plant Chionographis japonica is likely a product of simultaneous DNA double-strand breaks (DSBs), which initiated the de novo formation of repeat-based holocentromeres via insertion of satellite DNA, derived from extra-chromosomal circular DNAs (eccDNAs). The number of initial DSBs along the chromosomes must be higher than the number of centromere units since only a portion of the breaks will have incorporated eccDNA at an appropriate position to serve as future centromere unit sites. Subsequently, preferential incorporation of the centromeric histone H3 variant at these positions is assumed. The identification of repeat-based holocentromeres across lineages will unveil the centromere plasticity and elucidate the mechanisms underlying the diverse formation of holocentromeres.


Assuntos
Centrômero , DNA Satélite , Centrômero/genética , DNA Satélite/genética , Quebras de DNA de Cadeia Dupla , Evolução Molecular , Sequências Repetitivas de Ácido Nucleico/genética , Elementos de DNA Transponíveis/genética , Cromossomos de Plantas/genética
15.
PLoS Comput Biol ; 20(4): e1012027, 2024 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-38598558

RESUMO

Although the length and constituting sequences for pericentromeric repeats are highly variable across eukaryotes, the presence of multiple pericentromeric repeats is one of the conserved features of the eukaryotic chromosomes. Pericentromeric heterochromatin is often misregulated in human diseases, with the expansion of pericentromeric repeats in human solid cancers. In this article, we have developed a mathematical model of the RNAi-dependent methylation of H3K9 in the pericentromeric region of fission yeast. Our model, which takes copy number as an explicit parameter, predicts that the pericentromere is silenced only if there are many copies of repeats. It becomes bistable or desilenced if the copy number of repeats is reduced. This suggests that the copy number of pericentromeric repeats alone can determine the fate of heterochromatin silencing in fission yeast. Through sensitivity analysis, we identified parameters that favor bistability and desilencing. Stochastic simulation shows that faster cell division and noise favor the desilenced state. These results show the unexpected role of pericentromeric repeat copy number in gene silencing and provide a quantitative basis for how the copy number allows or protects repetitive and unique parts of the genome from heterochromatin silencing, respectively.


Assuntos
Centrômero , Heterocromatina , Schizosaccharomyces , Heterocromatina/metabolismo , Heterocromatina/genética , Schizosaccharomyces/genética , Schizosaccharomyces/metabolismo , Centrômero/metabolismo , Centrômero/genética , Modelos Genéticos , Biologia Computacional , Inativação Gênica , Sequências Repetitivas de Ácido Nucleico/genética , Humanos , Histonas/metabolismo , Histonas/genética
16.
Microb Genom ; 10(3)2024 Mar.
Artigo em Inglês | MEDLINE | ID: mdl-38529901

RESUMO

Genome assembly and annotation using short-paired reads is challenging for eukaryotic organisms due to their large size, variable ploidy and large number of repetitive elements. However, the use of single-molecule long reads improves assembly quality (completeness and contiguity), but haplotype duplications still pose assembly challenges. To address the effect of read length on genome assembly quality, gene prediction and annotation, we compared genome assemblers and sequencing technologies with four strains of the ectomycorrhizal fungus Laccaria trichodermophora. By analysing the predicted repertoire of carbohydrate enzymes, we investigated the effects of assembly quality on functional inferences. Libraries were generated using three different sequencing platforms (Illumina Next-Seq, Mi-Seq and PacBio Sequel), and genomes were assembled using single and hybrid assemblies/libraries. Long reads or hybrid assemby resolved the collapsing of repeated regions, but the nuclear heterozygous versions remained unresolved. In dikaryotic fungi, each cell includes two nuclei and each nucleus has differences not only in allelic gene version but also in gene composition and synteny. These heterokaryotic cells produce fragmentation and size overestimation of the genome assembly of each nucleus. Hybrid assembly revealed a wider functional diversity of genomes. Here, several predicted oxidizing activities on glycosyl residues of oligosaccharides and several chitooligosaccharide acetylase activities would have passed unnoticed in short-read assemblies. Also, the size and fragmentation of the genome assembly, in combination with heterozygosity analysis, allowed us to distinguish homokaryotic and heterokaryotic strains isolated from L. trichodermophora fruit bodies.


Assuntos
Genoma , Laccaria , Sequências Repetitivas de Ácido Nucleico , Análise de Sequência de DNA , Haplótipos
17.
BMC Genomics ; 25(1): 278, 2024 Mar 14.
Artigo em Inglês | MEDLINE | ID: mdl-38486136

RESUMO

There is an ongoing process in which mitochondrial sequences are being integrated into the nuclear genome. The importance of these sequences has already been revealed in cancer biology, forensic, phylogenetic studies and in the evolution of the eukaryotic genetic information. Human and numerous model organisms' genomes were described from those sequences point of view. Furthermore, recent studies were published on the patterns of these nuclear localised mitochondrial sequences in different taxa.However, the results of the previously released studies are difficult to compare due to the lack of standardised methods and/or using few numbers of genomes. Therefore, in this paper our primary goal is to establish a uniform mining pipeline to explore these nuclear localised mitochondrial sequences.Our results show that the frequency of several repetitive elements is higher in the flanking regions of these sequences than expected. A machine learning model reveals that the flanking regions' repetitive elements and different structural characteristics are highly influential during the integration process.In this paper, we introduce a general mining pipeline for all mammalian genomes. The workflow is publicly available and is believed to serve as a validated baseline for future research in this field. We confirm the widespread opinion, on - as to our current knowledge - the largest dataset, that structural circumstances and events corresponding to repetitive elements are highly significant. An accurate model has also been trained to predict these sequences and their corresponding flanking regions.


Assuntos
Genoma Mitocondrial , Animais , Humanos , Filogenia , DNA Mitocondrial/genética , Mamíferos/genética , Sequências Repetitivas de Ácido Nucleico
18.
ACS Synth Biol ; 13(3): 963-968, 2024 Mar 15.
Artigo em Inglês | MEDLINE | ID: mdl-38437525

RESUMO

Gene synthesis efficiency has greatly improved in recent years but is limited when it comes to repetitive sequences, which results in synthesis failure or delays by DNA synthesis vendors. This represents a major obstacle for the development of synthetic biology since repetitive elements are increasingly being used in the design of genetic circuits and design of biomolecular nanostructures. Here, we describe a method for the assembly of small synthetic genes with repetitive elements: First, a gene of interest is split in silico into small synthons of up to 80 base pairs flanked by Golden-Gate-compatible overhangs. Then, synthons are made by oligo extension and finally assembled into a synthetic gene by Golden Gate Assembly. We demonstrate the method by constructing eight challenging genes with repetitive elements, e.g., multiple repeats of RNA aptamers and RNA origami scaffolds with multiple identical aptamers. The genes range in size from 133 to 456 base pairs and are assembled with fidelities of up to 87.5%. The method was developed to facilitate our own specific research but may be of general use for constructing challenging and repetitive genes and, thus, a valuable addition to the molecular cloning toolbox.


Assuntos
Genes Sintéticos , Nanoestruturas , Sequências Repetitivas de Ácido Nucleico/genética , Clonagem Molecular , RNA/química , Nanoestruturas/química , Biologia Sintética/métodos
19.
Genome Biol Evol ; 16(3)2024 Mar 02.
Artigo em Inglês | MEDLINE | ID: mdl-38505885

RESUMO

We report a high-quality genome draft assembly of the dark-branded bushbrown, Mycalesis mineus, a member of the Satyrinae subfamily of nymphalid butterflies. This species is emerging as a promising model organism for investigating the evolution and development of phenotypic plasticity. Using 45.99 Gb of long-read data (N50 = 11.11 kb), we assembled a genome size of 497.4 Mb for M. mineus. The assembly is highly contiguous and nearly complete (96.8% of Benchmarking Universal Single-Copy Orthologs lepidopteran genes were complete and single copy). The genome comprises 38.71% of repetitive elements and includes 20,967 predicted protein-coding genes. The assembled genome was super-scaffolded into 28 pseudo-chromosomes using a closely related species, Bicyclus anynana, with a chromosomal-level genome as a template. This valuable genomic tool will advance both ongoing and future research focused on this model organism.


Assuntos
Borboletas , Animais , Borboletas/genética , Anotação de Sequência Molecular , Genômica , Sequências Repetitivas de Ácido Nucleico , Tamanho do Genoma , Cromossomos
20.
BMC Genomics ; 25(1): 285, 2024 Mar 18.
Artigo em Inglês | MEDLINE | ID: mdl-38500026

RESUMO

BACKGROUND: 'Taishuu' has a crisp texture, abundant juice, and sweet flavor with hints of cantaloupe. The availability of mitochondrial genome data of Diospyros species is far from the known number of species. RESULTS: The sequencing data were assembled into a closed circular mitochondrial chromosome with a 421,308 bp length and a 45.79% GC content. The mitochondrial genome comprised 40 protein-coding, 24 tRNA, and three rRNA genes. The most common codons for arginine (Arg), proline (Pro), glycine (Gly), tryptophan (Trp), valine (Val), alanine (Ala), and leucine (Leu) were AGA, CCA, GGA, UGG, GUA, GCA, and CUA, respectively. The start codon for cox1 and nad4L protein-coding genes was ACG (ATG), whereas the remaining protein-coding genes started with ATG. There are four types of stop codons: CGA, TAA, TAG, and TGA, with TAA being the most frequently used stop codon (45.24%). In the D. kaki Thunb. 'Taishuu' mitochondrial genome, a total of 645 repeat sequences were identified, including 125 SSRs, 7 tandem repeats, and 513 dispersed repeats. Collinearity analysis revealed a close relationship between D. kaki Thunb. 'Taishuu' and Diospyros oleifera, with conserved homologous gene fragments shared among these species in large regions of the mitochondrial genome. The protein-coding genes ccmB and nad4L were observed to undergo positive selection. Analysis of homologous sequences between chloroplasts and mitochondria identified 28 homologous segments, with a total length of 24,075 bp, accounting for 5.71% of the mitochondrial genome. These homologous segments contain 8 annotated genes, including 6 tRNA genes and 2 protein-coding genes (rrn18 and ccmC). There are 23 homologous genes between chloroplasts and nuclei. Mitochondria, chloroplasts, and nuclei share two homologous genes, which are trnV-GAC and trnW-CCA. CONCLUSION: In conclusion, a high-quality chromosome-level draft genome for D. kaki was generated in this study, which will contribute to further studies of major economic traits in the genus Diospyros.


Assuntos
Diospyros , Genoma Mitocondrial , Diospyros/genética , Sequências Repetitivas de Ácido Nucleico , Códon de Terminação , RNA de Transferência/genética , Filogenia
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...