Search | BVS Regional Portal (test)

Genome-wide characterization of human minisatellite VNTRs: population-specific alleles and gene expression differences.

Eslami Rasekh, Marzieh; Hernández, Yözen; Drinan, Samantha D; Fuxman Bass, Juan I; Benson, Gary.

Nucleic Acids Res; 49(8): 4308-4324, 2021-05-07.

Article in English | MEDLINE | ID: mdl-33849068

ABSTRACT

Variable Number Tandem Repeats (VNTRs) are tandem repeat (TR) loci that vary in copy number across a population. Using our program, VNTRseek, we analyzed human whole genome sequencing datasets from 2770 individuals in order to detect minisatellite VNTRs, i.e., those with pattern sizes ≥7 bp. We detected 35 638 VNTR loci and classified 5676 as commonly polymorphic (i.e., with non-reference alleles occurring in >5% of the population). Commonly polymorphic VNTR loci were found to be enriched in genomic regions with regulatory function, i.e., transcription start sites and enhancers. Investigation of the commonly polymorphic VNTRs in the context of population ancestry revealed that 1096 loci contained population-specific alleles and that those could be used to classify individuals into super-populations with near-perfect accuracy. A search for expression quantitative trait loci (eQTLs) among the VNTRs proximal to genes indicated that, in 187 genes, expression differences correlated with VNTR genotype. We validated our predictions in several ways, including experimentally, through the identification of predicted alleles in long reads, and by comparisons showing consistency between sequencing platforms. This study is the most comprehensive analysis of minisatellite VNTRs in the human population to date.
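The ">5% of the population" criterion in the abstract can be illustrated with a minimal sketch. This is not VNTRseek code: the function names, the input layout, and the reading of the criterion as "fraction of individuals carrying at least one non-reference allele" are all assumptions made for illustration, and the paper's exact definition may differ.

```python
# Hypothetical sketch: flag loci as "commonly polymorphic".
# Assumption: each genotype is a tuple of alleles encoded as copy-number
# change relative to the reference, so 0 denotes the reference allele.

def fraction_non_reference(calls):
    """Fraction of individuals carrying at least one non-reference allele."""
    if not calls:
        return 0.0
    carriers = sum(1 for alleles in calls if any(a != 0 for a in alleles))
    return carriers / len(calls)


def commonly_polymorphic(genotypes, threshold=0.05):
    """Return locus IDs whose non-reference carrier fraction exceeds threshold."""
    return sorted(
        locus
        for locus, calls in genotypes.items()
        if fraction_non_reference(calls) > threshold
    )


# Toy data for 20 individuals (invented, not from the study).
toy = {
    "locus_A": [(0, 1), (0, -1)] + [(0, 0)] * 18,  # 2 of 20 carriers = 10%
    "locus_B": [(0, 2)] + [(0, 0)] * 19,           # 1 of 20 carriers = 5%
}
selected = commonly_polymorphic(toy)
```

With this toy input only `locus_A` is selected, because the comparison is strict and `locus_B` sits exactly at the 5% threshold.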

Subjects

Gene Expression Regulation, Human Genome, Minisatellite Repeats, Genetic Polymorphism, Alleles, Datasets as Topic, Genetic Enhancer Elements, Humans, Population/genetics, Transcription Initiation Site, Whole Genome Sequencing

Discovery of large genomic inversions using long range information.

Eslami Rasekh, Marzieh; Chiatante, Giorgia; Miroballo, Mattia; Tang, Joyce; Ventura, Mario; Amemiya, Chris T; Eichler, Evan E; Antonacci, Francesca; Alkan, Can.

BMC Genomics; 18(1): 65, 2017-01-10.

Article in English | MEDLINE | ID: mdl-28073353

ABSTRACT

BACKGROUND: Although many algorithms are now available that aim to characterize different classes of structural variation, discovery of balanced rearrangements such as inversions remains an open problem. This is mainly due to the fact that breakpoints of such events typically lie within segmental duplications or common repeats, which reduces the mappability of short reads. The algorithms developed within the 1000 Genomes Project to identify inversions are limited to relatively short inversions, and there are currently no available algorithms to discover large inversions using high throughput sequencing technologies.

RESULTS: Here we propose a novel algorithm, VALOR, to discover large inversions using new sequencing methods that provide long range information such as 10X Genomics linked-read sequencing, pooled clone sequencing, or other similar technologies that we commonly refer to as long range sequencing. We demonstrate the utility of VALOR using both pooled clone sequencing and 10X Genomics linked-read sequencing generated from the genome of an individual from the HapMap project (NA12878). We also provide a comprehensive comparison of VALOR against several state-of-the-art structural variation discovery algorithms that use whole genome shotgun sequencing data.

CONCLUSIONS: In this paper, we show that VALOR is able to accurately discover all previously identified and experimentally validated large inversions in the same genome with a low false discovery rate. Using VALOR, we also predicted a novel inversion, which we validated using fluorescent in situ hybridization. VALOR is available at https://github.com/BilkentCompGen/VALOR.

Subjects

Genomics/methods, Sequence Inversion/genetics, Algorithms, Human Genome/genetics, High-Throughput Nucleotide Sequencing, Humans, Whole Genome Sequencing
