Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 3 de 3
Filtrar
Mais filtros










Base de dados
Intervalo de ano de publicação
1.
bioRxiv ; 2024 Jun 18.
Artigo em Inglês | MEDLINE | ID: mdl-38106141

RESUMO

Although several recent studies have characterized structural variants (SVs) in germline and cancer genomes, the features of SVs in these different contexts have not been directly compared. We examined similarities and differences between 2 million germline and 115 thousand tumor SVs from a cohort of 963 patients from The Cancer Genome Atlas (TCGA). We found significant differences in features related to their genomic sequences and localization that suggest differences between SV-generating processes and selective pressures. For example, we found that transposon-mediated processes shape germline much more than somatic SVs, while somatic SVs more frequently show features characteristic of chromoanagenesis. These differences were extensive enough to enable us to develop a classifier - "the great GaTSV" - that accurately distinguishes between germline and cancer SVs in tumor samples that lack a matched normal sample.

2.
Nucleic Acids Res ; 51(8): e46, 2023 05 08.
Artigo em Inglês | MEDLINE | ID: mdl-36912074

RESUMO

16S rRNA gene sequence clustering is an important tool in characterizing the diversity of microbial communities. As 16S rRNA gene data sets are growing in size, existing sequence clustering algorithms increasingly become an analytical bottleneck. Part of this bottleneck is due to the substantial computational cost expended on small clusters and singleton sequences. We propose an iterative sampling-based 16S rRNA gene sequence clustering approach that targets the largest clusters in the data set, allowing users to stop the clustering process when sufficient clusters are available for the specific analysis being targeted. We describe a probabilistic analysis of the iterative clustering process that supports the intuition that the clustering process identifies the larger clusters in the data set first. Using real data sets of 16S rRNA gene sequences, we show that the iterative algorithm, coupled with an adaptive sampling process and a mode-shifting strategy for identifying cluster representatives, substantially speeds up the clustering process while being effective at capturing the large clusters in the data set. The experiments also show that SCRAPT (Sample, Cluster, Recruit, AdaPt and iTerate) is able to produce operational taxonomic units that are less fragmented than popular tools: UCLUST, CD-HIT and DNACLUST. The algorithm is implemented in the open-source package SCRAPT. The source code used to generate the results presented in this paper is available at https://github.com/hsmurali/SCRAPT.


Assuntos
Algoritmos , Software , RNA Ribossômico 16S/genética , Genes de RNAr , Análise por Conglomerados
3.
PLoS Comput Biol ; 17(9): e1009380, 2021 09.
Artigo em Inglês | MEDLINE | ID: mdl-34491988

RESUMO

The SARS-CoV-2 pandemic highlights the need for a detailed molecular understanding of protective antibody responses. This is underscored by the emergence and spread of SARS-CoV-2 variants, including Alpha (B.1.1.7) and Delta (B.1.617.2), some of which appear to be less effectively targeted by current monoclonal antibodies and vaccines. Here we report a high resolution and comprehensive map of antibody recognition of the SARS-CoV-2 spike receptor binding domain (RBD), which is the target of most neutralizing antibodies, using computational structural analysis. With a dataset of nonredundant experimentally determined antibody-RBD structures, we classified antibodies by RBD residue binding determinants using unsupervised clustering. We also identified the energetic and conservation features of epitope residues and assessed the capacity of viral variant mutations to disrupt antibody recognition, revealing sets of antibodies predicted to effectively target recently described viral variants. This detailed structure-based reference of antibody RBD recognition signatures can inform therapeutic and vaccine design strategies.


Assuntos
Anticorpos Antivirais , COVID-19/virologia , SARS-CoV-2/genética , Glicoproteína da Espícula de Coronavírus , Anticorpos Antivirais/química , Anticorpos Antivirais/metabolismo , Sítios de Ligação , Análise por Conglomerados , Biologia Computacional , Humanos , Modelos Moleculares , Ligação Proteica , Glicoproteína da Espícula de Coronavírus/química , Glicoproteína da Espícula de Coronavírus/genética , Glicoproteína da Espícula de Coronavírus/metabolismo
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...