ABSTRACT
Host genetics is a key determinant of COVID-19 outcomes. Previously, the COVID-19 Host Genetics Initiative genome-wide association study used common variants to identify multiple loci associated with COVID-19 outcomes. However, variants with the largest impact on COVID-19 outcomes are expected to be rare in the population. Hence, studying rare variants may provide additional insights into disease susceptibility and pathogenesis, thereby informing therapeutics development. Here, we combined whole-exome and whole-genome sequencing from 21 cohorts across 12 countries and performed rare variant exome-wide burden analyses for COVID-19 outcomes. In an analysis of 5,085 severe disease cases and 571,737 controls, we observed that carrying a rare deleterious variant in the SARS-CoV-2 sensor toll-like receptor TLR7 (on chromosome X) was associated with a 5.3-fold increase in severe disease (95% CI: 2.75-10.05, p=5.41x10-7). This association was consistent across sexes. These results further support TLR7 as a genetic determinant of severe disease and suggest that larger studies on rare variants influencing COVID-19 outcomes could provide additional insights. Author SummaryCOVID-19 clinical outcomes vary immensely, but a patients genetic make-up is an important determinant of how they will fare against the virus. While many genetic variants commonly found in the populations were previously found to be contributing to more severe disease by the COVID-19 Host Genetics Initiative, it isnt clear if more rare variants found in less individuals could also play a role. This is important because genetic variants with the largest impact on COVID-19 severity are expected to be rarely found in the population, and these rare variants require different technologies to be studies (usually whole-exome or whole-genome sequencing). Here, we combined sequencing results from 21 cohorts across 12 countries to perform a rare variant association study. In an analysis comprising 5,085 participants with severe COVID-19 and 571,737 controls, we found that the gene for toll-like receptor 7 (TLR7) on chromosome X was an important determinant of severe COVID-19. Importantly, despite being found on a sex chromosome, this observation was consistent across both sexes.
ABSTRACT
To elucidate the host genetic loci affecting severity of SARS-CoV-2 infection, or Coronavirus disease 2019 (COVID-19), is an emerging issue in the face of the current devastating pandemic. Here, we report a genome-wide association study (GWAS) of COVID-19 in a Japanese population led by the Japan COVID-19 Task Force, as one of the initial discovery GWAS studies performed on a non-European population. Enrolling a total of 2,393 cases and 3,289 controls, we not only replicated previously reported COVID-19 risk variants (e.g., LZTFL1, FOXP4, ABO, and IFNAR2), but also found a variant on 5q35 (rs60200309-A at DOCK2) that was associated with severe COVID-19 in younger (<65 years of age) patients with a genome-wide significant p-value of 1.2 x 10-8 (odds ratio = 2.01, 95% confidence interval = 1.58-2.55). This risk allele was prevalent in East Asians, including Japanese (minor allele frequency [MAF] = 0.097), but rarely found in Europeans. Cross-population Mendelian randomization analysis made a causal inference of a number of complex human traits on COVID-19. In particular, obesity had a significant impact on severe COVID-19. The presence of the population-specific risk allele underscores the need of non-European studies of COVID-19 host genetics.
ABSTRACT
Dynamic simulation promises a deeper understanding of complex molecular mechanisms of biological pathways. How to determine the reaction kinetic parameters which govern the simulation results is still an open question in the field of systems biology. (1) Background: To execute simulation experiments, it is an essential first step to search effective values of model parameters. The complexity of biological systems and the experimental measurement technology severely limit the acquirement of accurate kinetic parameters. Previously proposed genomic data assimilation (GDA) approach enables users to handle parameter estimation using time-course information. However, it highly depends on successive time points and costs massive computational resource; (2) Methods: To address this problem, we present a new high-speed parameter search method for estimating the kinetic parameters of quantitative biological pathways using time-course transcriptomic profiles. The key idea of our method is to interactively prune the search space by introducing Probabilistic Linear-time Temporal Logic (PLTL) based model checking into GDA. (3) Results and conclusion: We demonstrated the effectiveness of our method by comparing with GDA on Mus musculus transcription circuits modelled by hybrid functional Petri net with extension. As a result, our method works faster and more accurate than GDA for both time-course datasets with dense and sparse observed values.
Subject(s)
Algorithms , Computational Biology/methods , Computer Simulation , Gene Regulatory Networks/genetics , Models, Genetic , Transcriptome/genetics , Animals , Genomics/methods , Kinetics , Mice , Signal Transduction/genetics , Systems Biology/methodsABSTRACT
We have previously reported on a predictive model for deficiency-excess pattern diagnosis that was unable to predict the medium pattern. In this study, we aimed to develop predictive models for deficiency, medium,and excess pattern diagnosis, and to confirm whether cutoff values for diagnosis differed between the clinics. We collected data from patients' first visit to one of six Kampo clinics in Japan from January 2012 to February 2015. Exclusion criteria included unwillingness to participate in the study, missing data, duplicate data, under 20 years old, 20 or less subjective symptoms, and irrelevant patterns. In total, 1,068 participants were included. Participants were surveyed using a 153-item questionnaire. We constructed a predictive model for deficiency, medium, and excess pattern diagnosis using a random forest algorithm from training data, and extracted the most important items. We calculated predictive values for each participant by applying their data to the predictive model, and created receiver operating characteristic (ROC) curves with excess-medium and medium-deficiency patterns. Furthermore, we calculated the cutoff value for these patterns in each clinic using ROC curves, and compared them. Body mass index and blood pressure were the most important items. In all clinics, the cutoff values for diagnosis of excess-medium and medium-deficiency patterns was > 0.5 and < 0.5, respectively. We created a predictive model for deficiency, medium, and excess pattern diagnosis from the data of six Kampo clinics in Japan. The cutoff values for these patterns fell within a narrow range in the six clinics.