Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 5 de 5
Filtrar
Mais filtros










Base de dados
Intervalo de ano de publicação
2.
mBio ; 12(1)2021 02 02.
Artigo em Inglês | MEDLINE | ID: mdl-33531401

RESUMO

We demonstrate that an assembly-independent and spike-in facilitated metagenomic quantification approach can be used to screen and quantify over 2,000 genes simultaneously, while delivering absolute gene concentrations comparable to those for quantitative PCR (qPCR). DNA extracted from dairy manure slurry, digestate, and compost was spiked with genomic DNA from a marine bacterium and sequenced using the Illumina HiSeq4000. We compared gene copy concentrations, in gene copies per mass of sample, of five antimicrobial resistance genes (ARGs) generated with (i) our quantitative metagenomic approach, (ii) targeted qPCR, and (iii) a hybrid quantification approach involving metagenomics and qPCR-based 16S rRNA gene quantification. Although qPCR achieved lower quantification limits, the metagenomic method avoided biases caused by primer specificity inherent to qPCR-based methods and was able to detect orders of magnitude more genes than is possible with qPCR assays. We used the approach to simultaneously quantify ARGs in the Comprehensive Antimicrobial Resistance Database (CARD). We observed that the total abundance of tetracycline resistance genes was consistent across different stages of manure treatment on three farms, but different samples were dominated by different tetracycline resistance gene families.IMPORTANCE qPCR and metagenomics are central molecular techniques that have offered insights into biological processes for decades, from monitoring spatial and temporal gene dynamics to tracking ARGs or pathogens. Still needed is a tool that can quantify thousands of relevant genes in a sample as gene copies per sample mass or volume. We compare a quantitative metagenomic approach with traditional qPCR approaches in the quantification of ARG targets in dairy manure samples. By leveraging the benefits of nontargeted community genomics, we demonstrate high-throughput absolute gene quantification of all known ARG sequences in environmental samples.


Assuntos
Resistência Microbiana a Medicamentos/genética , Metagenômica , Bases de Dados de Ácidos Nucleicos , Dosagem de Genes , Reação em Cadeia da Polimerase , Resistência a Tetraciclina/genética
3.
Cancer Inform ; 16: 1176935117711940, 2017.
Artigo em Inglês | MEDLINE | ID: mdl-28690394

RESUMO

ClinicalTrials.org is a popular portal which physicians use to find clinical trials for their patients. However, the current setup of ClinicalTrials.org makes it difficult for oncologists to locate clinical trials for patients based on mutational status. We present CTMine, a system that mines ClinicalTrials.org for clinical trials per cancer mutation and displays the trials in a user-friendly Web application. The system currently lists clinical trials for 6 common genes (ALK, BRAF, ERBB2, EGFR, KIT, and KRAS). The current machine learning model used to identify relevant clinical trials focusing on the above gene mutations had an average 88% precision/recall. As part of this analysis, we compared human versus machine and found that oncologists were unable to reach a consensus on whether a clinical trial mined by CTMine was "relevant" per gene mutation, a finding that highlights an important topic which deems future exploration.

4.
PLoS One ; 12(4): e0175860, 2017.
Artigo em Inglês | MEDLINE | ID: mdl-28437440

RESUMO

Scientists have unprecedented access to a wide variety of high-quality datasets. These datasets, which are often independently curated, commonly use unstructured spreadsheets to store their data. Standardized annotations are essential to perform synthesis studies across investigators, but are often not used in practice. Therefore, accurately combining records in spreadsheets from differing studies requires tedious and error-prone human curation. These efforts result in a significant time and cost barrier to synthesis research. We propose an information retrieval inspired algorithm, Synthesize, that merges unstructured data automatically based on both column labels and values. Application of the Synthesize algorithm to cancer and ecological datasets had high accuracy (on the order of 85-100%). We further implement Synthesize in an open source web application, Synthesizer (https://github.com/lisagandy/synthesizer). The software accepts input as spreadsheets in comma separated value (CSV) format, visualizes the merged data, and outputs the results as a new spreadsheet. Synthesizer includes an easy to use graphical user interface, which enables the user to finish combining data and obtain perfect accuracy. Future work will allow detection of units to automatically merge continuous data and application of the algorithm to other data formats, including databases.


Assuntos
Sistemas de Gerenciamento de Base de Dados , Armazenamento e Recuperação da Informação/métodos , Software , Algoritmos , Bases de Dados Factuais
5.
Int J Mol Sci ; 16(7): 15384-404, 2015 Jul 07.
Artigo em Inglês | MEDLINE | ID: mdl-26198229

RESUMO

Protein disordered regions are segments of a protein chain that do not adopt a stable structure. Thus far, a variety of protein disorder prediction methods have been developed and have been widely used, not only in traditional bioinformatics domains, including protein structure prediction, protein structure determination and function annotation, but also in many other biomedical fields. The relationship between intrinsically-disordered proteins and some human diseases has played a significant role in disorder prediction in disease identification and epidemiological investigations. Disordered proteins can also serve as potential targets for drug discovery with an emphasis on the disordered-to-ordered transition in the disordered binding regions, and this has led to substantial research in drug discovery or design based on protein disordered region prediction. Furthermore, protein disorder prediction has also been applied to healthcare by predicting the disease risk of mutations in patients and studying the mechanistic basis of diseases. As the applications of disorder prediction increase, so too does the need to make quick and accurate predictions. To fill this need, we also present a new approach to predict protein residue disorder using wide sequence windows that is applicable on the genomic scale.


Assuntos
Biologia Computacional/métodos , Proteínas/química , Área Sob a Curva , Bases de Dados de Proteínas , Desenho de Fármacos , Descoberta de Drogas , Proteínas Intrinsicamente Desordenadas/química , Redes Neurais de Computação
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...