ABSTRACT
As whole-genome sequencing (WGS) becomes the gold standard tool for studying population genomics and medical applications, data on diverse non-European and admixed individuals are still scarce. Here, we present a high-coverage WGS dataset of 1,171 highly admixed elderly Brazilians from a census-based cohort, providing over 76 million variants, of which ~2 million are absent from large public databases. WGS enables identification of ~2,000 previously undescribed mobile element insertions, nearly 5 Mb of genomic segments absent from the human genome reference, and over 140 alleles from HLA genes absent from public resources. We reclassify and curate pathogenicity assertions for nearly four hundred variants in genes associated with dominantly inherited Mendelian disorders and calculate the incidence of selected recessive disorders, demonstrating the clinical usefulness of the present study. Finally, we observe that whole-genome and HLA imputation could be significantly improved compared to available datasets, since rare variation represents the largest proportion of input from WGS. These results demonstrate that even smaller sample sizes of underrepresented populations bring relevant data for genomic studies, especially when exploring analyses that only WGS allows.
Subjects
Genomics , Metagenomics , Aged , Brazil/epidemiology , Human Genome/genetics , Genomics/methods , Humans , Single Nucleotide Polymorphism , Whole Genome Sequencing
ABSTRACT
C1q domain-containing (C1qDC) proteins are a group of biopolymers involved in immune response as pattern recognition receptors (PRRs) in a lectin-like manner. A new protein, MkC1qDC, was purified from the hemolymph plasma of the bivalve mollusk Modiolus kurilensis, which is widespread in the Northwest Pacific. The isolation procedure included ammonium sulfate precipitation followed by affinity chromatography on pectin-Sepharose. The full-length MkC1qDC sequence was assembled using de novo mass-spectrometry peptide sequencing complemented with N-terminal Edman degradation, and included 176 amino acid residues with a molecular mass of 19 kDa, displaying high homology to bivalve C1qDC proteins. MkC1qDC demonstrated antibacterial properties against Gram-negative and Gram-positive strains. MkC1qDC binds to a number of saccharides in a Ca2+-dependent manner; these saccharides are characterized by structural meta-similarity in acidic group enrichment of galactose and mannose derivatives incorporated in diversified molecular species of glycans. Alginate, κ-carrageenan, fucoidan, and pectin were found to be highly effective inhibitors of MkC1qDC activity. Yeast mannan, lipopolysaccharide (LPS), peptidoglycan (PGN), and mucin showed an inhibitory effect at concentrations three orders of magnitude greater than for the most effective saccharides. MkC1qDC was localized to the mussel hemal system and interstitial compartment. Intriguingly, MkC1qDC was found to suppress proliferation of human adenocarcinoma HeLa cells in a dose-dependent manner, indicating the biomedical potential of the MkC1qDC protein.