ABSTRACT
Host genetics is a key determinant of COVID-19 outcomes. Previously, the COVID-19 Host Genetics Initiative genome-wide association study used common variants to identify multiple loci associated with COVID-19 outcomes. However, variants with the largest impact on COVID-19 outcomes are expected to be rare in the population. Hence, studying rare variants may provide additional insights into disease susceptibility and pathogenesis, thereby informing therapeutics development. Here, we combined whole-exome and whole-genome sequencing from 21 cohorts across 12 countries and performed rare variant exome-wide burden analyses for COVID-19 outcomes. In an analysis of 5,085 severe disease cases and 571,737 controls, we observed that carrying a rare deleterious variant in the SARS-CoV-2 sensor toll-like receptor TLR7 (on chromosome X) was associated with a 5.3-fold increase in severe disease (95% CI: 2.75-10.05, p=5.41x10-7). This association was consistent across sexes. These results further support TLR7 as a genetic determinant of severe disease and suggest that larger studies on rare variants influencing COVID-19 outcomes could provide additional insights. Author SummaryCOVID-19 clinical outcomes vary immensely, but a patients genetic make-up is an important determinant of how they will fare against the virus. While many genetic variants commonly found in the populations were previously found to be contributing to more severe disease by the COVID-19 Host Genetics Initiative, it isnt clear if more rare variants found in less individuals could also play a role. This is important because genetic variants with the largest impact on COVID-19 severity are expected to be rarely found in the population, and these rare variants require different technologies to be studies (usually whole-exome or whole-genome sequencing). Here, we combined sequencing results from 21 cohorts across 12 countries to perform a rare variant association study. In an analysis comprising 5,085 participants with severe COVID-19 and 571,737 controls, we found that the gene for toll-like receptor 7 (TLR7) on chromosome X was an important determinant of severe COVID-19. Importantly, despite being found on a sex chromosome, this observation was consistent across both sexes.
ABSTRACT
The combined impact of common and rare exonic variants in COVID-19 host genetics is currently insufficiently understood. Here, common and rare variants from whole exome sequencing data of about 4,000 SARS-CoV-2-positive individuals were used to define an interpretable machine learning model for predicting COVID-19 severity. Firstly, variants were converted into separate sets of Boolean features, depending on the absence or the presence of variants in each gene. An ensemble of LASSO logistic regression models was used to identify the most informative Boolean features with respect to the genetic bases of severity. The Boolean features selected by these logistic models were combined into an Integrated PolyGenic Score that offers a synthetic and interpretable index for describing the contribution of host genetics in COVID-19 severity, as demonstrated through testing in several independent cohorts. Selected features belong to ultra-rare, rare, low-frequency, and common variants, including those in linkage disequilibrium with known GWAS loci. Noteworthly, around one quarter of the selected genes are sex-specific. Pathway analysis of the selected genes associated with COVID-19 severity reflected the multi-organ nature of the disease. The proposed model might provide useful information for developing diagnostics and therapeutics, while also being able to guide bedside disease management.
ABSTRACT
Given the highly variable clinical phenotype of Coronavirus disease 2019 (COVID-19), a deeper analysis of the host genetic contribution to severe COVID-19 is important to improve our understanding of underlying disease mechanisms. Here, we describe an extended GWAS meta-analysis of a well-characterized cohort of 3,260 COVID-19 patients with respiratory failure and 12,483 population controls from Italy, Spain, Norway and Germany/Austria, including stratified analyses based on age, sex and disease severity, as well as targeted analyses of chromosome Y haplotypes, the human leukocyte antigen (HLA) region and the SARS-CoV-2 peptidome. By inversion imputation, we traced a reported association at 17q21.31 to a highly pleiotropic [~]0.9-Mb inversion polymorphism and characterized the potential effects of the inversion in detail. Our data, together with the 5th release of summary statistics from the COVID-19 Host Genetics Initiative, also identified a new locus at 19q13.33, including NAPSA, a gene which is expressed primarily in alveolar cells responsible for gas exchange in the lung.
ABSTRACT
BackgroundThere is considerable variability in COVID-19 outcomes amongst younger adults--and some of this variation may be due to genetic predisposition. We characterized the clinical implications of the major genetic risk factor for COVID-19 severity, and its age-dependent effect, using individual-level data in a large international multi-centre consortium. MethodThe major common COVID-19 genetic risk factor is a chromosome 3 locus, tagged by the marker rs10490770. We combined individual level data for 13,424 COVID-19 positive patients (N=6,689 hospitalized) from 17 cohorts in nine countries to assess the association of this genetic marker with mortality, COVID-19-related complications and laboratory values. We next examined if the magnitude of these associations varied by age and were independent from known clinical COVID-19 risk factors. FindingsWe found that rs10490770 risk allele carriers experienced an increased risk of all-cause mortality (hazard ratio [HR] 1{middle dot}4, 95% confidence interval [CI] 1{middle dot}2-1{middle dot}6) and COVID-19 related mortality (HR 1{middle dot}5, 95%CI 1{middle dot}3-1{middle dot}8). Risk allele carriers had increased odds of several COVID-19 complications: severe respiratory failure (odds ratio [OR] 2{middle dot}0, 95%CI 1{middle dot}6-2{middle dot}6), venous thromboembolism (OR 1{middle dot}7, 95%CI 1{middle dot}2-2{middle dot}4), and hepatic injury (OR 1{middle dot}6, 95%CI 1{middle dot}2-2{middle dot}0). Risk allele carriers [≤] 60 years had higher odds of death or severe respiratory failure (OR 2{middle dot}6, 95%CI 1{middle dot}8-3{middle dot}9) compared to those > 60 years OR 1{middle dot}5 (95%CI 1{middle dot}3-1{middle dot}9, interaction p-value=0{middle dot}04). Amongst individuals [≤] 60 years who died or experienced severe respiratory COVID-19 outcome, we found that 31{middle dot}8% (95%CI 27{middle dot}6-36{middle dot}2) were risk variant carriers, compared to 13{middle dot}9% (95%CI 12{middle dot}6-15{middle dot}2%) of those not experiencing these outcomes. Prediction of death or severe respiratory failure among those [≤] 60 years improved when including the risk allele (AUC 0{middle dot}82 vs 0{middle dot}84, p=0{middle dot}016) and the prediction ability of rs10490770 risk allele was similar to, or better than, most established clinical risk factors. InterpretationThe major common COVID-19 risk locus on chromosome 3 is associated with increased risks of morbidity and mortality--and these are more pronounced amongst individuals [≤] 60 years. The effect on COVID-19 severity was similar to, or larger than most established risk factors, suggesting potential implications for clinical risk management. FundingFunding was obtained by each of the participating cohorts individually.
ABSTRACT
The ongoing SARS-CoV-2 pandemic has already caused devastating losses. Exponential spread can be slowed by social distancing and population-wide isolation measures, but those place a tremendous burden on society, and, once lifted, exponential spread can re-emerge. Regular population-scale testing, combined with contact tracing and case isolation, should help break the cycle of transmission, but current detection strategies are not capable of such large-scale processing. Here we present a protocol for LAMP-Seq, a barcoded Reverse-Transcription Loop-mediated Isothermal Amplification (RT-LAMP) method that is highly scalable. Individual samples are stabilized, inactivated, and amplified in three isothermal heat steps, generating barcoded amplicons that can be pooled and analyzed en masse by sequencing. Using unique barcode combinations per sample from a compressed barcode space enables extensive pooling, potentially further reducing cost and simplifying logistics. We validated LAMP-Seq on 28 clinical samples, empirically optimized the protocol and barcode design, and performed initial safety evaluation. Relying on world-wide infrastructure for next-generation sequencing, and in the context of population-wide sample collection, LAMP-Seq could be scaled to analyze millions of samples per day.