Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 29
Filter
1.
Genome Biol Evol ; 16(7)2024 Jul 03.
Article in English | MEDLINE | ID: mdl-38935434

ABSTRACT

Runs of homozygosity (ROHs) are indicative of elevated homozygosity and inbreeding due to mating of closely related individuals. Self-fertilization can be a major source of inbreeding which elevates genome-wide homozygosity and thus should also create long ROHs. While ROHs are frequently used to understand inbreeding in the context of conservation and selective breeding, as well as for consanguinity of populations and their demographic history, it remains unclear how ROH characteristics are altered by selfing and if this confounds expected signatures of inbreeding due to demographic change. Using simulations, we study the impact of the mode of reproduction and demographic history on ROHs. We apply random forests to identify unique characteristics of ROHs, indicative of different sources of inbreeding. We pinpoint distinct features of ROHs that can be used to better characterize the type of inbreeding the population was subjected to and to predict outcrossing rates and complex demographic histories. Using additional simulations and four empirical datasets, two from highly selfing species and two from mixed-maters, we predict the selfing rate and validate our estimations. We find that self-fertilization rates are successfully identified even with complex demography. Population genetic summary statistics improve algorithm accuracy particularly in the presence of additional inbreeding, e.g. from population bottlenecks. Our findings highlight the importance of ROHs in disentangling confounding factors related to various sources of inbreeding and demonstrate situations where such sources cannot be differentiated. Additionally, our random forest models provide a novel tool to the community for inferring selfing rates using genomic data.


Subject(s)
Homozygote , Inbreeding , Machine Learning , Self-Fertilization , Animals , Models, Genetic , Genetics, Population
2.
J Evol Biol ; 36(12): 1761-1782, 2023 Dec.
Article in English | MEDLINE | ID: mdl-37942504

ABSTRACT

Inversions are structural mutations that reverse the sequence of a chromosome segment and reduce the effective rate of recombination in the heterozygous state. They play a major role in adaptation, as well as in other evolutionary processes such as speciation. Although inversions have been studied since the 1920s, they remain difficult to investigate because the reduced recombination conferred by them strengthens the effects of drift and hitchhiking, which in turn can obscure signatures of selection. Nonetheless, numerous inversions have been found to be under selection. Given recent advances in population genetic theory and empirical study, here we review how different mechanisms of selection affect the evolution of inversions. A key difference between inversions and other mutations, such as single nucleotide variants, is that the fitness of an inversion may be affected by a larger number of frequently interacting processes. This considerably complicates the analysis of the causes underlying the evolution of inversions. We discuss the extent to which these mechanisms can be disentangled, and by which approach.


Subject(s)
Chromosome Inversion , Chromosomes , Humans , Heterozygote , Evolution, Molecular
3.
PLoS Genet ; 19(9): e1010883, 2023 09.
Article in English | MEDLINE | ID: mdl-37656747

ABSTRACT

As species expand their geographic ranges, colonizing populations face novel ecological conditions, such as new environments and limited mates, and suffer from evolutionary consequences of demographic change through bottlenecks and mutation load accumulation. Self-fertilization is often observed at species range edges and, in addition to countering the lack of mates, is hypothesized as an evolutionary advantage against load accumulation through increased homozygosity and purging. We study how selfing impacts the accumulation of genetic load during range expansion via purging and/or speed of colonization. Using simulations, we disentangle inbreeding effects due to demography versus due to selfing and find that selfers expand faster, but still accumulate load, regardless of mating system. The severity of variants contributing to this load, however, differs across mating system: higher selfing rates purge large-effect recessive variants leaving a burden of smaller-effect alleles. We compare these predictions to the mixed-mating plant Arabis alpina, using whole-genome sequences from refugial outcrossing populations versus expanded selfing populations. Empirical results indicate accumulation of expansion load along with evidence of purging in selfing populations, concordant with our simulations, suggesting that while purging is a benefit of selfing evolving during range expansions, it is not sufficient to prevent load accumulation due to range expansion.


Subject(s)
Inbreeding , Self-Fertilization , Self-Fertilization/genetics , Alleles , Biological Evolution , Cell Communication
4.
Mol Ecol ; 31(17): 4435-4439, 2022 09.
Article in English | MEDLINE | ID: mdl-35810344

ABSTRACT

Chromosomal inversions are often thought to facilitate local adaptation and population divergence because they can link multiple adaptive alleles into non-recombining genomic blocks. Selection should thus be more efficient in driving inversion-linked adaptive alleles to high frequency in a population, particularly in the face of maladaptive gene flow. But what if ecological conditions and hence selection on inversion-linked alleles change? Reduced recombination within inversions could then constrain the formation of optimal combinations of pre-existing alleles under these new ecological conditions. Here, we outline this idea of inversions limiting adaptation and divergence when ecological conditions change across time or space. We reason and use simulations to illustrate that the benefit of inversions for local adaptation and divergence under one set of ecological conditions can come with a concomitant constraint for adaptation to novel sets of ecological conditions. This limitation of inversions to adaptation may contribute to the maintenance of polymorphism within species.


Subject(s)
Adaptation, Physiological , Chromosome Inversion , Acclimatization , Adaptation, Physiological/genetics , Alleles , Chromosome Inversion/genetics , Humans , Polymorphism, Genetic
5.
Philos Trans R Soc Lond B Biol Sci ; 377(1846): 20210006, 2022 03 14.
Article in English | MEDLINE | ID: mdl-35067089

ABSTRACT

The distribution of genetic diversity over geographical space has long been investigated in population genetics and serves as a useful tool to understand evolution and history of populations. Within some species or across regions of contact between two species, there are instances where there is no apparent ecological determinant of sharp changes in allele frequencies or divergence. To further understand these patterns of spatial genetic structure and potential species divergence, we model the establishment of clines that occur due to the surfing of underdominant alleles during range expansions. We provide analytical approximations for the fixation probability of underdominant alleles at expansion fronts and demonstrate that gene surfing can lead to clines in one-dimensional range expansions. We extend these results to multiple loci via a mixture of analytical theory and individual-based simulations. We study the interaction between the strength of selection against heterozygotes, migration rates, and local recombination rates on the formation of stable hybrid zones. Clines created by surfing at different loci can attract each other and align after expansion, if they are sufficiently close in space and in terms of recombination distance. Our findings suggest that range expansions can set the stage for parapatric speciation due to the alignment of multiple selective clines, even in the absence of ecologically divergent selection. This article is part of the theme issue 'Species' ranges in the face of changing environments (part I)'.


Subject(s)
Genetics, Population , Models, Genetic , Alleles , Gene Frequency , Heterozygote
6.
Genetics ; 220(1)2022 01 04.
Article in English | MEDLINE | ID: mdl-34791202

ABSTRACT

The distribution of fitness effects (DFE) for new mutations is one of the most theoretically important but difficult to estimate properties in population genetics. A crucial challenge to inferring the DFE from natural genetic variation is the sensitivity of the site frequency spectrum to factors like population size change, population substructure, genome structure, and nonrandom mating. Although inference methods aim to control for population size changes, the influence of nonrandom mating remains incompletely understood, despite being a common feature of many species. We report the DFE estimated from 326 genomes of Caenorhabditis elegans, a nematode roundworm with a high rate of self-fertilization. We evaluate the robustness of DFE inferences using simulated data that mimics the genomic structure and reproductive life history of C. elegans. Our observations demonstrate how the combined influence of self-fertilization, genome structure, and natural selection on linked sites can conspire to compromise estimates of the DFE from extant polymorphisms with existing methods. These factors together tend to bias inferences toward weakly deleterious mutations, making it challenging to have full confidence in the inferred DFE of new mutations as deduced from standing genetic variation in species like C. elegans. Improved methods for inferring the DFE are needed to appropriately handle strong linked selection and selfing. These results highlight the importance of understanding the combined effects of processes that can bias our interpretations of evolution in natural populations.


Subject(s)
Caenorhabditis elegans , Animals
7.
Nucleic Acids Res ; 49(D1): D373-D379, 2021 01 08.
Article in English | MEDLINE | ID: mdl-33174605

ABSTRACT

OMA is an established resource to elucidate evolutionary relationships among genes from currently 2326 genomes covering all domains of life. OMA provides pairwise and groupwise orthologs, functional annotations, local and global gene order conservation (synteny) information, among many other functions. This update paper describes the reorganisation of the database into gene-, group- and genome-centric pages. Other new and improved features are detailed, such as reporting of the evolutionarily best conserved isoforms of alternatively spliced genes, the inferred local order of ancestral genes, phylogenetic profiling, better cross-references, fast genome mapping, semantic data sharing via RDF, as well as a special coronavirus OMA with 119 viruses from the Nidovirales order, including SARS-CoV-2, the agent of the COVID-19 pandemic. We conclude with improvements to the documentation of the resource through primers, tutorials and short videos. OMA is accessible at https://omabrowser.org.


Subject(s)
Algorithms , Databases, Genetic , Gene Order/genetics , Genome/genetics , Animals , COVID-19/epidemiology , COVID-19/prevention & control , COVID-19/virology , Chromosome Mapping , Evolution, Molecular , Gene Ontology , Humans , Internet , Pandemics , Phylogeny , SARS-CoV-2/genetics , SARS-CoV-2/physiology , Species Specificity , Synteny
9.
Am Nat ; 195(2): 349-360, 2020 02.
Article in English | MEDLINE | ID: mdl-32017619

ABSTRACT

Understanding the causes and consequences of range expansions or range shifts has a long history in evolutionary biology. Recent theoretical, experimental, and empirical work has identified two particularly interesting phenomena in the context of species range expansions: (i) gene surfing and the relaxation of natural selection and (ii) spatial sorting. The former can lead to an accumulation of deleterious mutations at range edges, causing an expansion load and slowing down expansion. The latter can create gradients in dispersal-related traits along the expansion axis and cause an acceleration of expansion. We present a theoretical framework that treats spatial sorting and gene surfing as spatial versions of natural selection and genetic drift, respectively. This model allows us to analytically study how gene surfing and spatial sorting interact and derive the probability of fixation of pleiotropic mutations at the expansion front. We use our results to predict the coevolution of mean fitness and dispersal rates, taking into account the effects of random genetic drift, natural selection, and spatial sorting, as well as correlations between fitness- and dispersal-related traits. We identify a "rescue effect" of spatial sorting, where the evolution of higher dispersal rates at the leading edge rescues the population from incurring expansion load.


Subject(s)
Genetic Drift , Selection, Genetic , Animal Distribution , Animals , Biological Evolution , Computer Simulation , Genetic Fitness , Models, Genetic , Mutation
10.
Curr Biol ; 30(1): 101-107.e3, 2020 01 06.
Article in English | MEDLINE | ID: mdl-31866368

ABSTRACT

Linked selection is a major driver of genetic diversity. Selection against deleterious mutations removes linked neutral diversity (background selection [BGS]) [1], creating a positive correlation between recombination rates and genetic diversity. Purifying selection against recessive variants, however, can also lead to associative overdominance (AOD) [2, 3], due to an apparent heterozygote advantage at linked neutral loci that opposes the loss of neutral diversity by BGS. Zhao and Charlesworth [3] identified the conditions under which AOD should dominate over BGS in a single-locus model and suggested that the effect of AOD could become stronger if multiple linked deleterious variants co-segregate. We present a model describing how and under which conditions multi-locus dynamics can amplify the effects of AOD. We derive the conditions for a transition from BGS to AOD due to pseudo-overdominance [4], i.e., a form of balancing selection that maintains complementary deleterious haplotypes that mask the effect of recessive deleterious mutations. Simulations confirm these findings and show that multi-locus AOD can increase diversity in low-recombination regions much more strongly than previously appreciated. While BGS is known to drive genome-wide diversity in humans [5], the observation of a resurgence of genetic diversity in regions of very low recombination is indicative of AOD. We identify 22 such regions in the human genome consistent with multi-locus AOD. Our results demonstrate that AOD may play an important role in the evolution of low-recombination regions of many species.


Subject(s)
Genetic Variation , Genome, Human , Recombination, Genetic , Selection, Genetic , Humans , Models, Genetic
11.
PLoS Genet ; 14(9): e1007450, 2018 09.
Article in English | MEDLINE | ID: mdl-30265675

ABSTRACT

The fitness of spatially expanding species has been shown to decrease over time and space, but specialist species tracking their changing environment and shifting their range accordingly have been little studied. We use individual-based simulations and analytical modeling to compare the impact of range expansions and range shifts on genetic diversity and fitness loss, as well as the ability to recover fitness after either a shift or expansion. We find that the speed of a shift has a strong impact on fitness evolution. Fastest shifts show the strongest fitness loss per generation, but intermediate shift speeds lead to the strongest fitness loss per geographic distance. Range shifting species lose fitness more slowly through time than expanding species, however, their fitness measured at equal geographic distances from the source of expansion can be considerably lower. These counter-intuitive results arise from the combination of time over which selection acts and mutations enter the system. Range shifts also exhibit reduced fitness recovery after a geographic shift and may result in extinction, whereas range expansions can persist from the core of the species range. The complexity of range expansions and range shifts highlights the potential for severe consequences of environmental change on species survival.


Subject(s)
Adaptation, Biological/genetics , Genetic Fitness , Models, Genetic , Mutation Rate , Climate Change , Computer Simulation , Genetic Variation , Genetics, Population , Mutation , Selection, Genetic
12.
Am J Bot ; 105(6): 1088-1095, 2018 06.
Article in English | MEDLINE | ID: mdl-29995339

ABSTRACT

PREMISE OF THE STUDY: Plant pathogens that form persistent systemic infections within plants have the potential to affect multiple plant life history traits, yet we tend to focus only on visible symptoms. Anther smut of Silene latifolia caused by the fungus Microbotryum lychnidis-dioicae induces the anthers of its host to support fungal spore production instead of pollen, and the pathogen is primarily transmitted among flowering plants by pollinators. Nevertheless, most of its life cycle is spent in the asymptomatic vegetative phase, and spores falling on seedlings or nonflowering plants can also infect the host. The purpose of this study was to ask whether the fungus also had an effect on its host plant in the juvenile vegetative phase before flowering as this is important for the disease dynamics in species where infection of seedlings is commonplace. METHODS: Leaf length and leaf number of inoculated and uninoculated juvenile plants were compared in greenhouse experiments, and in one experiment, disease status of the plants at flowering was determined. KEY RESULTS: Inoculated plants had shorter but more leaves, and reduced root mass at the early juvenile (preflowering) stage. Some of these effects were detectable in plants that were inoculated but showed no disease symptoms at flowering. CONCLUSIONS: These results show that pathogenic fungi can have endophyte-like effects even in the total absence of their typical and more charismatic symptoms, and conversely that the assessment of endophyte effects on the fitness of their hosts should include all stages of the host life cycle.


Subject(s)
Host-Pathogen Interactions , Silene/microbiology , Ustilago/physiology , Plant Roots/growth & development , Plant Shoots/growth & development , Silene/growth & development
13.
Am Nat ; 189(4): 368-380, 2017 Apr.
Article in English | MEDLINE | ID: mdl-28350500

ABSTRACT

The biotic and abiotic factors that facilitate or hinder species range expansions are many and complex. We examine the impact of two genetic processes and their interaction on fitness at expanding range edges: local maladaptation resulting from the presence of an environmental gradient and expansion load resulting from increased genetic drift at the range edge. Results from spatially explicit simulations indicate that the presence of an environmental gradient during range expansion reduces expansion load; conversely, increasing expansion load allows only locally adapted populations to persist at the range edge. Increased maladaptation reduces the speed of range expansion, resulting in less genetic drift at the expanding front and more immigration from the range center, therefore reducing expansion load at the range edge. These results may have ramifications for species being forced to shift their ranges because of climate change or other anthropogenic changes. If rapidly changing climate leads to faster expansion as populations track their shifting climatic optima, populations may suffer increased expansion load beyond previous expectations.


Subject(s)
Adaptation, Physiological , Climate Change , Genetic Drift , Acclimatization
14.
Mol Ecol Resour ; 16(3): 601-3, 2016 May.
Article in English | MEDLINE | ID: mdl-27062588

ABSTRACT

The program structure has been used extensively to understand and visualize population genetic structure. It is one of the most commonly used clustering algorithms, cited over 11,500 times in Web of Science since its introduction in 2000. The method estimates ancestry proportions to assign individuals to clusters, and post hoc analyses of results may indicate the most likely number of clusters, or populations, on the landscape. However, as has been shown in this issue of Molecular Ecology Resources by Puechmaille (), when sampling is uneven across populations or across hierarchical levels of population structure, these post hoc analyses can be inaccurate and identify an incorrect number of population clusters. To solve this problem, Puechmaille () presents strategies for subsampling and new analysis methods that are robust to uneven sampling to improve inferences of the number of population clusters.


Subject(s)
Cluster Analysis , Computational Biology/methods , Genetics, Population/methods , Sequence Analysis, DNA/methods
15.
Evolution ; 69(8): 2154-66, 2015 08.
Article in English | MEDLINE | ID: mdl-26118738

ABSTRACT

Effective population size is a fundamental parameter in population genetics, evolutionary biology, and conservation biology, yet its estimation can be fraught with difficulties. Several methods to estimate Ne from genetic data have been developed that take advantage of various approaches for inferring Ne . The ability of these methods to accurately estimate Ne , however, has not been comprehensively examined. In this study, we employ seven of the most cited methods for estimating Ne from genetic data (Colony2, CoNe, Estim, MLNe, ONeSAMP, TMVP, and NeEstimator including LDNe) across simulated datasets with populations experiencing migration or no migration. The simulated population demographies are an isolated population with no immigration, an island model metapopulation with a sink population receiving immigrants, and an isolation by distance stepping stone model of populations. We find considerable variance in performance of these methods, both within and across demographic scenarios, with some methods performing very poorly. The most accurate estimates of Ne can be obtained by using LDNe, MLNe, or TMVP; however each of these approaches is outperformed by another in a differing demographic scenario. Knowledge of the approximate demography of population as well as the availability of temporal data largely improves Ne estimates.


Subject(s)
Animal Migration , Animals , Biological Evolution , Computer Simulation , Conservation of Natural Resources/methods , Genetics, Population , Islands , Population Density , Population Dynamics
16.
BJU Int ; 116(5): 815-22, 2015 Nov.
Article in English | MEDLINE | ID: mdl-25818264

ABSTRACT

OBJECTIVE: To examine the safety of intralesional injection of collagenase Clostridium histolyticum (CCH) for the treatment of Peyronie's disease (PD), using a pooled safety analysis of patients who received at least one dose of CCH in any of six clinical studies. PATIENTS AND METHODS: Patients from six clinical studies, including three randomised, double-blind, placebo-controlled studies and three open-label safety and efficacy studies, were included if they had received at least one dose of 0.58 mg CCH. Adverse events (AEs), including treatment-emergent AEs, treatment-related AEs, and serious AEs (SAEs), were characterised. Potential immunogenicity-related AEs were evaluated through examination of increased anti-AUX-I and anti-AUX-II antibody levels, AEs, and reported terms possibly associated with immunological or hypersensitivity events. RESULTS: Overall, 85.8% of 1 044 pooled patients reported at least one treatment-related AE. The most frequently reported (≥25.0% of patients) treatment-related AEs included penile haematoma (82.7% had the verbatim 'penile bruising'), penile pain, and penile swelling. Most patients (75.2%) had mild- or moderate-severity treatment-related AEs, and 14.2% had no treatment-related AEs. Nine patients (0.9%) had treatment-related SAEs: five with penile haematoma and four with corporal rupture. There was no association between AEs and anti-AUX-I or anti-AUX-II antibody levels across treatment cycles, and no systemic hypersensitivity reactions occurred. CONCLUSIONS: This pooled safety analysis shows that although non-serious and serious treatment-related AEs can occur after CCH treatment for PD, most were non-serious and the SAEs were manageable. Providers should be prepared to manage possible SAEs.


Subject(s)
Microbial Collagenase/administration & dosage , Penile Induration/drug therapy , Penis/pathology , Adult , Aged , Aged, 80 and over , Clinical Trials as Topic , Dose-Response Relationship, Drug , Drug Administration Schedule , Follow-Up Studies , Humans , Injections, Intralesional , Male , Middle Aged , Penile Induration/physiopathology , Penis/drug effects , Randomized Controlled Trials as Topic , Treatment Outcome
17.
Mol Ecol Resour ; 15(2): 262-7, 2015 Mar.
Article in English | MEDLINE | ID: mdl-25042150

ABSTRACT

QST , a measure of quantitative genetic differentiation among populations, is an index that can suggest local adaptation if QST for a trait is sufficiently larger than the mean FST of neutral genetic markers. A previous method by Whitlock and Guillaume derived a simulation resampling approach to statistically test for a difference between QST and FST , but that method is limited to balanced data sets with offspring related as half-sibs through shared fathers. We extend this approach (i) to allow for a model more suitable for some plant populations or breeding designs in which offspring are related through mothers (assuming independent fathers for each offspring; half-sibs by dam); and (ii) by explicitly allowing for unbalanced data sets. The resulting approach is made available through the R package QstFstComp.


Subject(s)
Genetic Variation , Genotyping Techniques/methods , Quantitative Trait, Heritable , Statistics as Topic
18.
Proc Biol Sci ; 281(1796): 20141798, 2014 12 07.
Article in English | MEDLINE | ID: mdl-25320176

ABSTRACT

Both landscape structure and population size fluctuations influence population genetics. While independent effects of these factors on genetic patterns and processes are well studied, a key challenge is to understand their interaction, as populations are simultaneously exposed to habitat fragmentation and climatic changes that increase variability in population size. In a population network of an alpine butterfly, abundance declined 60-100% in 2003 because of low over-winter survival. Across the network, mean microsatellite genetic diversity did not change. However, patch connectivity and local severity of the collapse interacted to determine allelic richness change within populations, indicating that patch connectivity can mediate genetic response to a demographic collapse. The collapse strongly affected spatial genetic structure, leading to a breakdown of isolation-by-distance and loss of landscape genetic pattern. Our study reveals important interactions between landscape structure and temporal demographic variability on the genetic diversity and genetic differentiation of populations. Projected future changes to both landscape and climate may lead to loss of genetic variability from the studied populations, and selection acting on adaptive variation will likely occur within the context of an increasing influence of genetic drift.


Subject(s)
Butterflies/genetics , Environment , Animals , Butterflies/physiology , Climate Change , Gene Flow , Genetic Variation , Genetics, Population , Microsatellite Repeats , Population Density , Population Dynamics
19.
Prostaglandins Other Lipid Mediat ; 113-115: 38-44, 2014 Oct.
Article in English | MEDLINE | ID: mdl-25173047

ABSTRACT

Epoxyeicosatrienoic acids (EETs) protect against the development of insulin resistance in rodents. EETs are hydrolyzed to less biologically active diols by soluble epoxide hydrolase (encoded for by EPHX2). Functional variants of EPHX2 encode for enzymes with increased (Lys55Arg) or decreased (Arg287Gln) hydrolase activity. This study tested the hypothesis that variants of EPHX2 are associated with insulin sensitivity or secretion in humans. Subjects participating in metabolic phenotyping studies were genotyped. Eighty-five subjects underwent hyperglycemic clamps. There was no relationship between the Lys55Arg genotype and insulin sensitivity or secretion. In contrast, the EPHX2 287Gln variant was associated with higher insulin sensitivity index (p=0.019 controlling for body mass index and metabolic syndrome). Also, there was an interactive effect of EPHX2 Arg287Gln genotype and body mass index on insulin sensitivity index (p=0.029). There was no relationship between EPHX2 Arg287Gln genotype and acute or late-phase glucose-stimulated insulin secretion, but disposition index was higher in 287Gln carriers compared with Arg/Arg (p=0.022). Plasma EETs correlated with insulin sensitivity index (r=0.64, p=0.015 for total EETs) and were decreased in the metabolic syndrome. A genetic variant that results in decreased soluble epoxide hydrolase activity is associated with increased insulin sensitivity, as are higher EETs.


Subject(s)
Arachidonic Acids/metabolism , Epoxide Hydrolases/genetics , Insulin/physiology , Adult , Arachidonic Acids/blood , Body Mass Index , DNA/chemistry , DNA/genetics , Epoxide Hydrolases/metabolism , Female , Genetic Variation , Genotype , Glucose Clamp Technique , Humans , Insulin/metabolism , Insulin Resistance , Insulin Secretion , Male , Middle Aged , Polymerase Chain Reaction , Statistics, Nonparametric , Young Adult
20.
Curr Biol ; 24(1): 94-97, 2014 Jan 06.
Article in English | MEDLINE | ID: mdl-24361065

ABSTRACT

Policies ensuring that research data are available on public archives are increasingly being implemented at the government [1], funding agency [2-4], and journal [5, 6] level. These policies are predicated on the idea that authors are poor stewards of their data, particularly over the long term [7], and indeed many studies have found that authors are often unable or unwilling to share their data [8-11]. However, there are no systematic estimates of how the availability of research data changes with time since publication. We therefore requested data sets from a relatively homogenous set of 516 articles published between 2 and 22 years ago, and found that availability of the data was strongly affected by article age. For papers where the authors gave the status of their data, the odds of a data set being extant fell by 17% per year. In addition, the odds that we could find a working e-mail address for the first, last, or corresponding author fell by 7% per year. Our results reinforce the notion that, in the long term, research data cannot be reliably preserved by individual researchers, and further demonstrate the urgent need for policies mandating data sharing via public archives.


Subject(s)
Databases, Factual , Publishing/statistics & numerical data , Biomedical Research , Internet , Time Factors
SELECTION OF CITATIONS
SEARCH DETAIL
...