RESUMO
Sequencing costs are falling, but the cost of data analysis remains high, often because unforeseen problems arise, such as insufficient depth of sequencing or batch effects. Experimenting with data analysis methods during the planning phase of an experiment can reveal unanticipated problems and build valuable bioinformatics expertise in the organism or process being studied. This protocol describes using R Markdown and RStudio, user-friendly tools for statistical analysis and reproducible research in bioinformatics, to analyze and document the analysis of an example RNA-Seq data set from tomato pollen undergoing chronic heat stress. Also, we show how to use Integrated Genome Browser to visualize read coverage graphs for differentially expressed genes. Applying the protocol described here and using the provided data sets represent a useful first step toward building RNA-Seq data analysis expertise in a research group.
Assuntos
Sequenciamento de Nucleotídeos em Larga Escala , RNA , Software , Navegador , Biologia Computacional/métodos , Genômica/métodos , Solanum lycopersicum/genéticaRESUMO
Pyrazinamide (PZA) has been in use for almost 50 years as a first-line drug for short-course chemotherapy against Mycobacterium tuberculosis. In this study, PCR mediated automated DNA sequencing is used to check the prevalence of PZA resistance among treatment failure cases of pulmonary tuberculosis. Out of 50 clinical isolates examined, 39 had mutations in the pncA gene that encodes Pyrazinamidase, an enzyme required to activate PZA. Of these, 31 (79.5%) were localized to three regions of pncA. We found two isolates with hitherto unreported mutation at amino acid 26 (Ala-->Gly) of pncA.