Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 14 de 14
Filter
1.
Circ Genom Precis Med ; 11(9): e002228, 2018 09.
Article in English | MEDLINE | ID: mdl-30354330

ABSTRACT

BACKGROUND: Outcomes of tailoring statin-type based on solute carrier organic anion transporterfamily member 1B1 ( SLCO1B1)pharmacogenetic toxicity information on patient, provider, and pharmacological outcomes are unknown. METHODS: The trial randomized 159 patients not taking statins because of prior statin myalgia 1:1 to receiving SLCO1B1 GIST (Genotype Informed Statin Therapy) versus usual care (UC) and followed for up to 8 months. The UC arm received their SLCO1B1 results post-trial. The primary outcome was statin adherence using the Morisky Medication Adherence Scale, which was assessed in those patients who reinitiated statins. Secondary outcomes assessed in all participants included statin reinitiation and LDLc (low-density lipoprotein cholesterol), within and post-trial. Using commercial laboratory data, serial LDLc were compared between 1907 patients receiving SLCO1B1 testing and propensity-matched, untested controls. RESULTS: Trial participants were 25% SLCO1B1*5 carriers. Statin adherence was similar between arms (Morisky Medication Adherence Scale in GIST versus UC, 6.8±1.5 versus 6.9±1.6, P=0.96). GIST led to more new statin prescriptions (55.4% versus 38.0%, P=0.04) and lower LDLc at 3 months (131.9±42.0 versus 144.4±43.0 mg/dL; P=0.048) with similar magnitude at 8 months (128.6±37.9 versus 141.0±44.4; P=0.12). SLCO1B1*5 carriers exhibited a greater drop in LDLc with GIST versus UC (interaction P=0.048). Post-trial, LDLc decreased in UC participants who crossed over to GIST compared with those allocated to GIST (-14.9±37.8 versus +9.0±37.3 mg/dL, P=0.03). Patients tested for SLCO1B1 though a commercial laboratory had a greater LDLc decrease ( P=0.04) compared with controls. CONCLUSIONS: Delivery of SLCO1B1 pharmacogenetic testing that addresses statin myalgia improved statin reinitiation and LDLc but did not improve self-reported statin adherence. CLINICAL TRIAL REGISTRATION: URL: https://www.clinicaltrials.gov . Unique identifier: NCT01894230.


Subject(s)
Cardiovascular Diseases/drug therapy , Hydroxymethylglutaryl-CoA Reductase Inhibitors/administration & dosage , Liver-Specific Organic Anion Transporter 1/genetics , Pharmacogenomic Testing/methods , Adult , Aged , Aged, 80 and over , Cardiovascular Diseases/blood , Cardiovascular Diseases/genetics , Cholesterol, LDL/blood , Female , Genotype , Humans , Male , Medication Adherence/statistics & numerical data , Middle Aged , Outcome Assessment, Health Care/methods , Outcome Assessment, Health Care/statistics & numerical data , Precision Medicine/methods , Young Adult
2.
J Lipid Res ; 58(6): 1238-1246, 2017 06.
Article in English | MEDLINE | ID: mdl-28420704

ABSTRACT

It has been reported that low cell-cholesterol efflux capacity (CEC) of HDL is an independent risk factor for CVD. To better understand CEC regulation, we measured ABCA1- and scavenger receptor class B type I (SR-BI)-dependent cell-cholesterol efflux, HDL anti-oxidative capacity, HDL particles, lipids, and inflammatory- and oxidative-stress markers in 122 subjects with elevated plasma levels of triglyceride (TG), serum amyloid A (SAA), fibrinogen, myeloperoxidase (MPO), or ß-sitosterol and in 146 controls. In controls, there were strong positive correlations between ABCA1-dependent cholesterol efflux and small preß-1 concentrations (R2 = 0.317) and SR-BI-dependent cholesterol efflux and large (α-1 + α-2) HDL particle concentrations (R2 = 0.774). In high-TG patients, both the concentration and the functionality (preß-1 concentration-normalized ABCA1 efflux) of preß-1 particles were significantly elevated compared with controls; however, though the concentration of large particles was significantly decreased, their functionality (large HDL concentration-normalized SR-BI efflux) was significantly elevated. High levels of SAA or MPO were not associated with decreased functionality of either the small (preß-1) or the large (α-1 + α-2) HDL particles. HDL anti-oxidative capacity was negatively influenced by high plasma ß-sitosterol levels, but not by the concentrations of HDL particles, TG, SAA, fibrinogen, or MPO. Our data demonstrate that under certain conditions CEC is influenced not only by quantitative (concentration), but also by qualitative (functional) properties of HDL particles.


Subject(s)
Cholesterol, HDL/metabolism , ATP Binding Cassette Transporter 1/metabolism , Antioxidants/metabolism , Biological Transport , CD36 Antigens/metabolism , Female , Fibrinogen/metabolism , Humans , Male , Middle Aged , Peroxidase/blood , Serum Amyloid A Protein/metabolism , Sitosterols/blood , Triglycerides/blood
3.
Clin Proteomics ; 11(1): 32, 2014.
Article in English | MEDLINE | ID: mdl-25114662

ABSTRACT

BACKGROUND: CT screening for lung cancer is effective in reducing mortality, but there are areas of concern, including a positive predictive value of 4% and development of interval cancers. A blood test that could manage these limitations would be useful, but development of such tests has been impaired by variations in blood collection that may lead to poor reproducibility across populations. RESULTS: Blood-based proteomic profiles were generated with SOMAscan technology, which measured 1033 proteins. First, preanalytic variability was evaluated with Sample Mapping Vectors (SMV), which are panels of proteins that detect confounders in protein levels related to sample collection. A subset of well collected serum samples not influenced by preanalytic variability was selected for discovery of lung cancer biomarkers. The impact of sample collection variation on these candidate markers was tested in the subset of samples with higher SMV scores so that the most robust markers could be used to create disease classifiers. The discovery sample set (n = 363) was from a multi-center study of 94 non-small cell lung cancer (NSCLC) cases and 269 long-term smokers and benign pulmonary nodule controls. The analysis resulted in a 7-marker panel with an AUC of 0.85 for all cases (68% adenocarcinoma, 32% squamous) and an AUC of 0.93 for squamous cell carcinoma in particular. This panel was validated by making blinded predictions in two independent cohorts (n = 138 in the first validation and n = 135 in the second). The model was recalibrated for a panel format prior to unblinding the second cohort. The AUCs overall were 0.81 and 0.77, and for squamous cell tumors alone were 0.89 and 0.87. The estimated negative predictive value for a 15% disease prevalence was 93% overall and 99% for squamous lung tumors. The proteins in the classifier function in destruction of the extracellular matrix, metabolic homeostasis and inflammation. CONCLUSIONS: Selecting biomarkers resistant to sample processing variation led to robust lung cancer biomarkers that performed consistently in independent validations. They form a sensitive signature for detection of lung cancer, especially squamous cell histology. This non-invasive test could be used to improve the positive predictive value of CT screening, with the potential to avoid invasive evaluation of nonmalignant pulmonary nodules.

4.
Adv Exp Med Biol ; 735: 283-300, 2013.
Article in English | MEDLINE | ID: mdl-23402035

ABSTRACT

Progression from health to disease is accompanied by complex changes in protein expression in both the circulation and affected tissues. Large-scale comparative interrogation of the human proteome can offer insights into disease biology as well as lead to the discovery of new biomarkers for diagnostics, new targets for therapeutics, and can identify patients most likely to benefit from treatment. Although genomic studies provide an increasingly sharper understanding of basic biological and pathobiological processes, they ultimately only offer a prediction of relative disease risk, whereas proteins offer an immediate assessment of "real-time" health and disease status. We have recently developed a new proteomic technology, based on modified aptamers, for biomarker discovery that is capable of simultaneously measuring more than a thousand proteins from small volumes of biological samples such as plasma, tissues, or cells. Our technology is enabled by SOMAmers (Slow Off-rate Modified Aptamers), a new class of protein binding reagents that contain chemically modified nucleotides that greatly expand the physicochemical diversity of nucleic acid-based ligands. Such modifications introduce functional groups that are absent in natural nucleic acids but are often found in protein-protein, small molecule-protein, and antibody-antigen interactions. The use of these modifications expands the range of possible targets for SELEX (Systematic Evolution of Ligands by EXponential Enrichment), results in improved binding properties, and facilitates selection of SOMAmers with slow dissociation rates. Our assay works by transforming protein concentrations in a mixture into a corresponding DNA signature, which is then quantified on current commercial DNA microarray platforms. In essence, we take advantage of the dual nature of SOMAmers as both folded binding entities with defined shapes and unique nucleic acid sequences recognizable by specific hybridization probes. Currently, our assay is capable of simultaneously measuring 1,030 proteins, extending to sub-pM detection limits, an average dynamic range of each analyte in the assay of > 3 logs, an overall dynamic range of at least 7 logs, and a throughput of one million analytes per week. Our collection includes SOMAmers that specifically recognize most of the complement cascade proteins. We have used this assay to identify potential biomarkers in a range of diseases such as malignancies, cardiovascular disorders, and inflammatory conditions. In this chapter, we describe the application of our technology to discovering large-scale protein expression changes associated with chronic kidney disease and non-small cell lung cancer. With this new proteomics technology-which is fast, economical, highly scalable, and flexible--we now have a powerful tool that enables whole-proteome proteomics, biomarker discovery, and advancing the next generation of evidence-based, "personalized" diagnostics and therapeutics.


Subject(s)
Biomarkers/analysis , Diagnosis , Drug Therapy/methods , Proteomics/methods , Animals , Blood Proteins/chemistry , Complement Inactivating Agents/pharmacology , Complement System Proteins/physiology , Humans , Proteins/chemistry
5.
PLoS One ; 7(10): e46091, 2012.
Article in English | MEDLINE | ID: mdl-23056237

ABSTRACT

BACKGROUND: Malignant pleural mesothelioma (MM) is an aggressive, asbestos-related pulmonary cancer that is increasing in incidence. Because diagnosis is difficult and the disease is relatively rare, most patients present at a clinically advanced stage where possibility of cure is minimal. To improve surveillance and detection of MM in the high-risk population, we completed a series of clinical studies to develop a noninvasive test for early detection. METHODOLOGY/PRINCIPAL FINDINGS: We conducted multi-center case-control studies in serum from 117 MM cases and 142 asbestos-exposed control individuals. Biomarker discovery, verification, and validation were performed using SOMAmer proteomic technology, which simultaneously measures over 1000 proteins in unfractionated biologic samples. Using univariate and multivariate approaches we discovered 64 candidate protein biomarkers and derived a 13-marker random forest classifier with an AUC of 0.99±0.01 in training, 0.98±0.04 in independent blinded verification and 0.95±0.04 in blinded validation studies. Sensitivity and specificity at our pre-specified decision threshold were 97%/92% in training and 90%/95% in blinded verification. This classifier accuracy was maintained in a second blinded validation set with a sensitivity/specificity of 90%/89% and combined accuracy of 92%. Sensitivity correlated with pathologic stage; 77% of Stage I, 93% of Stage II, 96% of Stage III and 96% of Stage IV cases were detected. An alternative decision threshold in the validation study yielding 98% specificity would still detect 60% of MM cases. In a paired sample set the classifier AUC of 0.99 and 91%/94% sensitivity/specificity was superior to that of mesothelin with an AUC of 0.82 and 66%/88% sensitivity/specificity. The candidate biomarker panel consists of both inflammatory and proliferative proteins, processes strongly associated with asbestos-induced malignancy. SIGNIFICANCE: The SOMAmer biomarker panel discovered and validated in these studies provides a solid foundation for surveillance and diagnosis of MM in those at highest risk for this disease.


Subject(s)
Mesothelioma/diagnosis , Pleural Neoplasms/diagnosis , Proteomics/methods , Public Health Surveillance/methods , Adult , Aged , Aged, 80 and over , Asbestos , Biomarkers, Tumor/blood , Carcinogens , Case-Control Studies , Cohort Studies , Enzyme-Linked Immunosorbent Assay , Female , Humans , Lectins/blood , Male , Mesothelioma/chemically induced , Mesothelioma/metabolism , Middle Aged , Pleural Neoplasms/chemically induced , Pleural Neoplasms/metabolism , Principal Component Analysis , Reproducibility of Results , Sensitivity and Specificity , Young Adult , Ficolins
6.
PLoS One ; 7(4): e35157, 2012.
Article in English | MEDLINE | ID: mdl-22509397

ABSTRACT

Lung cancer remains the most common cause of cancer-related mortality. We applied a highly multiplexed proteomic technology (SOMAscan) to compare protein expression signatures of non small-cell lung cancer (NSCLC) tissues with healthy adjacent and distant tissues from surgical resections. In this first report of SOMAscan applied to tissues, we highlight 36 proteins that exhibit the largest expression differences between matched tumor and non-tumor tissues. The concentrations of twenty proteins increased and sixteen decreased in tumor tissue, thirteen of which are novel for NSCLC. NSCLC tissue biomarkers identified here overlap with a core set identified in a large serum-based NSCLC study with SOMAscan. We show that large-scale comparative analysis of protein expression can be used to develop novel histochemical probes. As expected, relative differences in protein expression are greater in tissues than in serum. The combined results from tissue and serum present the most extensive view to date of the complex changes in NSCLC protein expression and provide important implications for diagnosis and treatment.


Subject(s)
Carcinoma, Non-Small-Cell Lung/metabolism , Gene Expression Regulation, Neoplastic , Lung Neoplasms/metabolism , Proteome/analysis , Aged , Apoptosis/genetics , Biomarkers, Tumor/blood , Biomarkers, Tumor/genetics , Biomarkers, Tumor/metabolism , Carcinoma, Non-Small-Cell Lung/blood , Carcinoma, Non-Small-Cell Lung/genetics , Female , Humans , Inflammation/genetics , Lung Neoplasms/blood , Lung Neoplasms/genetics , Male , Middle Aged , Neoplasm Invasiveness/genetics , Neoplasm Metastasis , Neovascularization, Pathologic/genetics
7.
BMC Bioinformatics ; 11 Suppl 1: S62, 2010 Jan 18.
Article in English | MEDLINE | ID: mdl-20122238

ABSTRACT

BACKGROUND: Complex human diseases are often caused by multiple mutations, each of which contributes only a minor effect to the disease phenotype. To study the basis for these complex phenotypes, we developed a network-based approach to identify coexpression modules specifically activated in particular phenotypes. We integrated these modules, protein-protein interaction data, Gene Ontology annotations, and our database of gene-phenotype associations derived from literature to predict novel human gene-phenotype associations. Our systematic predictions provide us with the opportunity to perform a global analysis of human gene pleiotropy and its underlying regulatory mechanisms. RESULTS: We applied this method to 338 microarray datasets, covering 178 phenotype classes, and identified 193,145 phenotype-specific coexpression modules. We trained random forest classifiers for each phenotype and predicted a total of 6,558 gene-phenotype associations. We showed that 40.9% genes are pleiotropic, highlighting that pleiotropy is more prevalent than previously expected. We collected 77 ChIP-chip datasets studying 69 transcription factors binding over 16,000 targets under various phenotypic conditions. Utilizing this unique data source, we confirmed that dynamic transcriptional regulation is an important force driving the formation of phenotype specific gene modules. CONCLUSION: We created a genome-wide gene to phenotype mapping that has many potential implications, including providing potential new drug targets and uncovering the basis for human disease phenotypes. Our analysis of these phenotype-specific coexpression modules reveals a high prevalence of gene pleiotropy, and suggests that phenotype-specific transcription factor binding may contribute to phenotypic diversity. All resources from our study are made freely available on our online Phenotype Prediction Database.


Subject(s)
Computational Biology/methods , Genome , Phenotype , Databases, Genetic , Gene Expression Profiling
8.
PLoS Genet ; 5(11): e1000711, 2009 Nov.
Article in English | MEDLINE | ID: mdl-19936062

ABSTRACT

About 85% of the maize genome consists of highly repetitive sequences that are interspersed by low-copy, gene-coding sequences. The maize community has dealt with this genomic complexity by the construction of an integrated genetic and physical map (iMap), but this resource alone was not sufficient for ensuring the quality of the current sequence build. For this purpose, we constructed a genome-wide, high-resolution optical map of the maize inbred line B73 genome containing >91,000 restriction sites (averaging 1 site/ approximately 23 kb) accrued from mapping genomic DNA molecules. Our optical map comprises 66 contigs, averaging 31.88 Mb in size and spanning 91.5% (2,103.93 Mb/ approximately 2,300 Mb) of the maize genome. A new algorithm was created that considered both optical map and unfinished BAC sequence data for placing 60/66 (2,032.42 Mb) optical map contigs onto the maize iMap. The alignment of optical maps against numerous data sources yielded comprehensive results that proved revealing and productive. For example, gaps were uncovered and characterized within the iMap, the FPC (fingerprinted contigs) map, and the chromosome-wide pseudomolecules. Such alignments also suggested amended placements of FPC contigs on the maize genetic map and proactively guided the assembly of chromosome-wide pseudomolecules, especially within complex genomic regions. Lastly, we think that the full integration of B73 optical maps with the maize iMap would greatly facilitate maize sequence finishing efforts that would make it a valuable reference for comparative studies among cereals, or other maize inbred lines and cultivars.


Subject(s)
Genome, Plant/genetics , Zea mays/genetics , Algorithms , Base Sequence , Chromosomes, Artificial, Bacterial/genetics , Contig Mapping , Molecular Sequence Data , Optical Phenomena , Physical Chromosome Mapping , Sequence Alignment
9.
J Comput Biol ; 16(8): 1023-34, 2009 Aug.
Article in English | MEDLINE | ID: mdl-19630539

ABSTRACT

Although many studies have been successful in the discovery of cooperating groups of genes, mapping these groups to phenotypes has proved a much more challenging task. In this article, we present the first genome-wide mapping of gene coexpression modules onto the phenome. We annotated coexpression networks from 136 microarray datasets with phenotypes from the Unified Medical Language System (UMLS). We then designed an efficient graph-based simulated annealing approach to identify coexpression modules frequently and specifically occurring in datasets related to individual phenotypes. By requiring phenotype-specific recurrence, we ensure the robustness of our findings. We discovered 118,772 modules specific to 42 phenotypes, and developed validation tests combining Gene Ontology, GeneRIF and UMLS. Our method is generally applicable to any kind of abundant network data with defined phenotype association, and thus paves the way for genome-wide, gene network-phenotype maps.


Subject(s)
Algorithms , Gene Expression Profiling , Genomics/methods , Phenotype , Artificial Intelligence , Humans
10.
Proc Natl Acad Sci U S A ; 106(30): 12323-8, 2009 Jul 28.
Article in English | MEDLINE | ID: mdl-19590007

ABSTRACT

Phenotypes are complex, and difficult to quantify in a high-throughput fashion. The lack of comprehensive phenotype data can prevent or distort genotype-phenotype mapping. Here, we describe "PhenoProfiler," a computational method that enables in silico phenotype profiling. Drawing on the principle that similar gene expression patterns are likely to be associated with similar phenotype patterns, PhenoProfiler supplements the missing quantitative phenotype information for a given microarray dataset based on other well-characterized microarray datasets. We applied our method to 587 human microarray datasets covering >14,000 samples, and confirmed that the predicted phenotype profiles are highly consistent with true phenotype descriptions. PhenoProfiler offers several unique capabilities: (i) automated, multidimensional phenotype profiling, facilitating the analysis and treatment design of complex diseases; (ii) the extrapolation of phenotype profiles beyond provided classes; and (iii) the detection of confounding phenotype factors that could otherwise bias biological inferences. Finally, because no direct comparisons are made between gene expression values from different datasets, the method can use the entire body of cross-platform microarray data. This work has produced a compendium of phenotype profiles for the National Center for Biotechnology Information GEO datasets, which can facilitate an unbiased understanding of the transcriptome-phenome mapping. The continued accumulation of microarray data will further increase the power of PhenoProfiler, by increasing the variety and the quality of phenotypes to be profiled.


Subject(s)
Algorithms , Gene Expression Profiling/statistics & numerical data , Oligonucleotide Array Sequence Analysis/statistics & numerical data , Animals , Gene Expression Profiling/methods , Genotype , Humans , Oligonucleotide Array Sequence Analysis/methods , Phenotype
11.
Bioinformatics ; 23(13): i577-86, 2007 Jul 01.
Article in English | MEDLINE | ID: mdl-17646346

ABSTRACT

MOTIVATION: A major challenge in studying gene regulation is to systematically reconstruct transcription regulatory modules, which are defined as sets of genes that are regulated by a common set of transcription factors. A commonly used approach for transcription module reconstruction is to derive coexpression clusters from a microarray dataset. However, such results often contain false positives because genes from many transcription modules may be simultaneously perturbed upon a given type of conditions. In this study, we propose and validate that genes, which form a coexpression cluster in multiple microarray datasets across diverse conditions, are more likely to form a transcription module. However, identifying genes coexpressed in a subset of many microarray datasets is not a trivial computational problem. RESULTS: We propose a graph-based data-mining approach to efficiently and systematically identify frequent coexpression clusters. Given m microarray datasets, we model each microarray dataset as a coexpression graph, and search for vertex sets which are frequently densely connected across [theta m] datasets (0 < or = theta < or = 1). For this novel graph-mining problem, we designed two techniques to narrow down the search space: (1) partition the input graphs into (overlapping) groups sharing common properties; (2) summarize the vertex neighbor information from the partitioned datasets onto the 'Neighbor Association Summary Graph's for effective mining. We applied our method to 105 human microarray datasets, and identified a large number of potential transcription modules, activated under different subsets of conditions. Validation by ChIP-chip data demonstrated that the likelihood of a coexpression cluster being a transcription module increases significantly with its recurrence. Our method opens a new way to exploit the vast amount of existing microarray data accumulation for gene regulation study. Furthermore, the algorithm is applicable to other biological networks for approximate network module mining. AVAILABILITY: http://zhoulab.usc.edu/NeMo/.


Subject(s)
Chromosome Mapping/methods , Genome, Human/genetics , Regulatory Elements, Transcriptional/genetics , Sequence Analysis, DNA/methods , Transcription Factors/genetics , Transcription, Genetic/genetics , Algorithms , Base Sequence , Binding Sites , Computer Graphics , Humans , Molecular Sequence Data , Protein Binding
12.
Nucleic Acids Res ; 35(Database issue): D756-9, 2007 Jan.
Article in English | MEDLINE | ID: mdl-17090592

ABSTRACT

The recent development of microarray technology provided unprecedented opportunities to understand the genetic basis of aging. So far, many microarray studies have addressed aging-related expression patterns in multiple organisms and under different conditions. The number of relevant studies continues to increase rapidly. However, efficient exploitation of these vast data is frustrated by the lack of an integrated data mining platform or other unifying bioinformatic resource to enable convenient cross-laboratory searches of array signals. To facilitate the integrative analysis of microarray data on aging, we developed a web database and analysis platform 'Gene Aging Nexus' (GAN) that is freely accessible to the research community to query/analyze/visualize cross-platform and cross-species microarray data on aging. By providing the possibility of integrative microarray analysis, GAN should be useful in building the systems-biology understanding of aging. GAN is accessible at http://gan.usc.edu.


Subject(s)
Aging/genetics , Databases, Genetic , Gene Expression Profiling , Oligonucleotide Array Sequence Analysis , Aging/metabolism , Animals , Humans , Internet , Mice , Rats , Software , User-Computer Interface
13.
Hum Genet ; 121(1): 93-100, 2007 Mar.
Article in English | MEDLINE | ID: mdl-17091282

ABSTRACT

About 5% of the human genome consists of large-scale duplicated segments of almost identical sequences. Segmental duplications (SDs) have been proposed to be involved in non-allelic homologous recombination leading to recurrent genomic variation and disease. It has also been suggested that these SDs are associated with syntenic rearrangements that have shaped the human genome. We have analyzed 14 members of a single family of closely related SDs in the human genome, some of which are associated with common inversion polymorphisms at chromosomes 8p23 and 4p16. Comparative analysis with the mouse genome revealed syntenic inversions for these two human polymorphic loci. In addition, 12 of the 14 SDs, while absent in the mouse genome, occur at the breaks of synteny; suggesting a non-random involvement of these sequences in genome evolution. Furthermore, we observed a syntenic familial relationship between 8 and 12 breakpoint-loci, where broken synteny that ends at one family member resumes at another, even across different chromosomes. Subsequent genome-wide assessment revealed that this relationship, which we named continuation-of-synteny, is not limited to the 8p23 family and occurs 46 times in the human genome with high frequency at specific chromosomes. Our analysis supports a non-random breakage model of genomic evolution with an active involvement of segmental duplications for specific regions of the human genome.


Subject(s)
Gene Duplication , Genome , Synteny/genetics , Animals , Chromosomes, Human, Pair 8/genetics , Evolution, Molecular , Genome, Human , Humans , Mice , Multigene Family
14.
Hum Genomics ; 1(5): 335-44, 2004 Aug.
Article in English | MEDLINE | ID: mdl-15588494

ABSTRACT

Recent studies have identified a small number of genomic rearrangements that occur frequently in the general population. Bioinformatics tools are now available for systematic genome-wide surveys of higher-order structures predisposing to such common variations in genomic architecture. Segmental duplications (SDs) constitute up to 5 per cent of the genome and play an important role in generating additional rearrangements and in disease aetiology. We conducted a genome-wide database search for a form of SD, palindromic segmental duplications (PSDs), which consist of paired, inverted duplications, and which predispose to inversions, duplications and deletions. The survey was complemented by a search for SDs in tandem orientation (TSDs) that can mediate duplications and deletions but not inversions. We found more than 230 distinct loci with higher-order genomic structure that can mediate genomic variation, of these about 180 contained a PSD. A number of these sites were previously identified as harbouring common inversions or as being associated with specific genomic diseases characterised by duplication, deletions or inversions. Most of the regions, however, were previously unidentified; their characterisation should identify further common rearrangements and may indicate localisations for additional genomic disorders. The widespread distribution of complex chromosomal architecture suggests a potentially high degree of plasticity of the human genome and could uncover another level of genetic variation within human populations.


Subject(s)
Chromosomes, Human/genetics , Gene Duplication , Genetic Variation , Genome, Human , Computational Biology , Computer Simulation , Gene Rearrangement , Humans
SELECTION OF CITATIONS
SEARCH DETAIL
...