RESUMO
Genome-scale data have revealed daily rhythms in various species and tissues. However, current methods to assess rhythmicity largely restrict their focus to quantifying statistical significance, which may not reflect biological relevance. To address this limitation, we developed a method called LimoRhyde2 (the successor to our method LimoRhyde), which focuses instead on rhythm-related effect sizes and their uncertainty. For each genomic feature, LimoRhyde2 fits a curve using a series of linear models based on periodic splines, moderates the fits using an Empirical Bayes approach called multivariate adaptive shrinkage (Mash), then uses the moderated fits to calculate rhythm statistics such as peak-to-trough amplitude. The periodic splines capture non-sinusoidal rhythmicity, while Mash uses patterns in the data to account for different fits having different levels of noise. To demonstrate LimoRhyde2's utility, we applied it to multiple circadian transcriptome datasets. Overall, LimoRhyde2 prioritized genes having high-amplitude rhythms in expression, whereas a prior method (BooteJTK) prioritized "statistically significant" genes whose amplitudes could be relatively small. Thus, quantifying effect sizes using approaches such as LimoRhyde2 has the potential to transform interpretation of genomic data related to biological rhythms.
Assuntos
Ritmo Circadiano , Genômica , Ritmo Circadiano/genética , Teorema de Bayes , Transcriptoma , GenomaRESUMO
Genome-scale data have revealed daily rhythms in various species and tissues. However, current methods to assess rhythmicity largely restrict their focus to quantifying statistical significance, which may not reflect biological relevance. To address this limitation, we developed a method called LimoRhyde2 (the successor to our method LimoRhyde), which focuses instead on rhythm-related effect sizes and their uncertainty. For each genomic feature, LimoRhyde2 fits a curve using a series of linear models based on periodic splines, moderates the fits using an Empirical Bayes approach called multivariate adaptive shrinkage (Mash), then uses the moderated fits to calculate rhythm statistics such as peak-to-trough amplitude. The periodic splines capture non-sinusoidal rhythmicity, while Mash uses patterns in the data to account for different fits having different levels of noise. To demonstrate LimoRhyde2's utility, we applied it to multiple circadian transcriptome datasets. Overall, LimoRhyde2 prioritized genes having high-amplitude rhythms in expression, whereas a prior method (BooteJTK) prioritized "statistically significant" genes whose amplitudes could be relatively small. Thus, quantifying effect sizes using approaches such as LimoRhyde2 has the potential to transform interpretation of genomic data related to biological rhythms.
RESUMO
Biomedical research on mammals has traditionally neglected females, raising the concern that some scientific findings may generalize poorly to half the population. Although this lack of sex inclusion has been broadly documented, its extent within circadian genomics remains undescribed. To address this gap, we examined sex inclusion practices in a comprehensive collection of publicly available transcriptome studies on daily rhythms. Among 148 studies having samples from mammals in vivo, we found strong underrepresentation of females across organisms and tissues. Overall, only 23 of 123 studies in mice, 0 of 10 studies in rats, and 9 of 15 studies in humans included samples from females. In addition, studies having samples from both sexes tended to have more samples from males than from females. These trends appear to have changed little over time, including since 2016, when the US National Institutes of Health began requiring investigators to consider sex as a biological variable. Our findings highlight an opportunity to dramatically improve representation of females in circadian research and to explore sex differences in daily rhythms at the genome level.