Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 13 de 13
Filtrar
Mais filtros










Intervalo de ano de publicação
1.
PLoS One ; 19(6): e0304889, 2024.
Artigo em Inglês | MEDLINE | ID: mdl-38905208

RESUMO

We develop a simulation framework for studying misinformation spread within online social networks that blends agent-based modeling and natural language processing techniques. While many other agent-based simulations exist in this space, questions over their fidelity and generalization to existing networks in part hinder their ability to drive policy-relevant decision making. To partially address these concerns, we create a 'digital clone' of a known misinformation sharing network by downloading social media histories for over ten thousand of its users. We parse these histories to both extract the structure of the network and model the nuanced ways in which information is shared and spread among its members. Unlike many other agent-based methods in this space, information sharing between users in our framework is sensitive to topic of discussion, user preferences, and online community dynamics. To evaluate the fidelity of our method, we seed our cloned network with a set of posts recorded in the base network and compare propagation dynamics between the two, observing reasonable agreement across the twin networks over a variety of metrics. Lastly, we explore how the cloned network may serve as a flexible, low-cost testbed for misinformation countermeasure evaluation and red teaming analysis. We hope the tools explored here augment existing efforts in the space and unlock new opportunities for misinformation countermeasure evaluation, a field that may become increasingly important to consider with the anticipated rise of misinformation campaigns fueled by generative artificial intelligence.


Assuntos
Comunicação , Idioma , Humanos , Mídias Sociais , Redes Sociais Online , Processamento de Linguagem Natural , Rede Social , Modelos Teóricos , Simulação por Computador , Disseminação de Informação/métodos
2.
bioRxiv ; 2024 Jun 06.
Artigo em Inglês | MEDLINE | ID: mdl-38895258

RESUMO

Accurate estimation of the dispersal velocity or speed of evolving organisms is no mean feat. In fact, existing probabilistic models in phylogeography or spatial population genetics generally do not provide an adequate framework to define velocity in a relevant manner. For instance, the very concept of instantaneous speed simply does not exist under one of the most popular approaches that models the evolution of spatial coordinates as Brownian trajectories running along a phylogeny [30]. Here, we introduce a new family of models - the so-called "Phylogenetic Integrated Velocity" (PIV) models - that use Gaussian processes to explicitly model the velocity of evolving lineages instead of focusing on the fluctuation of spatial coordinates over time. We describe the properties of these models and show an increased accuracy of velocity estimates compared to previous approaches. Analyses of West Nile virus data in the U.S.A. indicate that PIV models provide sensible predictions of the dispersal of evolving pathogens at a one-year time horizon. These results demonstrate the feasibility and relevance of predictive phylogeography in monitoring epidemics in time and space.

3.
PLoS Pathog ; 20(6): e1012288, 2024 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-38900824

RESUMO

Socio-economic disparities were associated with disproportionate viral incidence between neighborhoods of New York City (NYC) during the first wave of SARS-CoV-2. We investigated how these disparities affected the co-circulation of SARS-CoV-2 variants during the second wave in NYC. We tested for correlation between the prevalence, in late 2020/early 2021, of Alpha, Iota, Iota with E484K mutation (Iota-E484K), and B.1-like genomes and pre-existing immunity (seropositivity) in NYC neighborhoods. In the context of varying seroprevalence we described socio-economic profiles of neighborhoods and performed migration and lineage persistence analyses using a Bayesian phylogeographical framework. Seropositivity was greater in areas with high poverty and a larger proportion of Black and Hispanic or Latino residents. Seropositivity was positively correlated with the proportion of Iota-E484K and Iota genomes, and negatively correlated with the proportion of Alpha and B.1-like genomes. The proportion of persisting Alpha lineages declined over time in locations with high seroprevalence, whereas the proportion of persisting Iota-E484K lineages remained the same in high seroprevalence areas. During the second wave, the geographic variation of standing immunity, due to disproportionate disease burden during the first wave of SARS-CoV-2 in NYC, allowed for the immune evasive Iota-E484K variant, but not the more transmissible Alpha variant, to circulate in locations with high pre-existing immunity.


Assuntos
COVID-19 , SARS-CoV-2 , Humanos , Cidade de Nova Iorque/epidemiologia , SARS-CoV-2/imunologia , SARS-CoV-2/genética , COVID-19/epidemiologia , COVID-19/virologia , Estudos Soroepidemiológicos , Fatores Socioeconômicos , Masculino , Feminino , Adulto , Pessoa de Meia-Idade , Mutação
4.
Artigo em Inglês | MEDLINE | ID: mdl-38659338

RESUMO

BACKGROUND: Young children involved in the child welfare system (CWS) are at high risk for suicidal ideation (SI) at a time when overall rates of suicide death in this age group are rising. Yet risk factors for and changes in SI over time are poorly understood in this population. METHOD: We combined data from two large representative longitudinal studies of children involved in the United States CWS. We examined patterns of SI among children who were between ages 7 and 12 years at the initial survey wave (N = 2,186), assessed at three waves using a measure of SI in the past 2 weeks. We conducted a multinomial regression to understand the baseline demographic, child maltreatment, and mental health characteristics that distinguish the trajectories. RESULTS: There were eight different subgroups (Non-Ideators, Late Ideators, Boomerang Ideators, Delayed Ideators, Desisters, Boomerang Non-Ideators, Late Desisters, and Persisters). Differences in race, type of maltreatment, sex, and mental health symptoms were identified when comparing Persisters (SI at all three waves) to other groups. CONCLUSIONS: These findings can help researchers and practitioners to develop strategies for better identifying CWS-involved children who are in greatest need of suicide risk monitoring and intervention.

5.
Cell ; 186(26): 5690-5704.e20, 2023 12 21.
Artigo em Inglês | MEDLINE | ID: mdl-38101407

RESUMO

The maturation of genomic surveillance in the past decade has enabled tracking of the emergence and spread of epidemics at an unprecedented level. During the COVID-19 pandemic, for example, genomic data revealed that local epidemics varied considerably in the frequency of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) lineage importation and persistence, likely due to a combination of COVID-19 restrictions and changing connectivity. Here, we show that local COVID-19 epidemics are driven by regional transmission, including across international boundaries, but can become increasingly connected to distant locations following the relaxation of public health interventions. By integrating genomic, mobility, and epidemiological data, we find abundant transmission occurring between both adjacent and distant locations, supported by dynamic mobility patterns. We find that changing connectivity significantly influences local COVID-19 incidence. Our findings demonstrate a complex meaning of "local" when investigating connected epidemics and emphasize the importance of collaborative interventions for pandemic prevention and mitigation.


Assuntos
COVID-19 , Humanos , COVID-19/epidemiologia , COVID-19/transmissão , COVID-19/virologia , Genômica , Pandemias/prevenção & controle , Saúde Pública , SARS-CoV-2/genética , Controle de Infecções , Geografia
6.
Annu Rev Stat Appl ; 10: 353-377, 2023.
Artigo em Inglês | MEDLINE | ID: mdl-38774036

RESUMO

Researchers studying the evolution of viral pathogens and other organisms increasingly encounter and use large and complex data sets from multiple different sources. Statistical research in Bayesian phylogenetics has risen to this challenge. Researchers use phylogenetics not only to reconstruct the evolutionary history of a group of organisms, but also to understand the processes that guide its evolution and spread through space and time. To this end, it is now the norm to integrate numerous sources of data. For example, epidemiologists studying the spread of a virus through a region incorporate data including genetic sequences (e.g. DNA), time, location (both continuous and discrete) and environmental covariates (e.g. social connectivity between regions) into a coherent statistical model. Evolutionary biologists routinely do the same with genetic sequences, location, time, fossil and modern phenotypes, and ecological covariates. These complex, hierarchical models readily accommodate both discrete and continuous data and have enormous combined discrete/continuous parameter spaces including, at a minimum, phylogenetic tree topologies and branch lengths. The increased size and complexity of these statistical models have spurred advances in computational methods to make them tractable. We discuss both the modeling and computational advances below, as well as unsolved problems and areas of active research.

7.
Nat Commun ; 13(1): 7003, 2022 11 16.
Artigo em Inglês | MEDLINE | ID: mdl-36385137

RESUMO

Genomic sequencing is essential to track the evolution and spread of SARS-CoV-2, optimize molecular tests, treatments, vaccines, and guide public health responses. To investigate the global SARS-CoV-2 genomic surveillance, we used sequences shared via GISAID to estimate the impact of sequencing intensity and turnaround times on variant detection in 189 countries. In the first two years of the pandemic, 78% of high-income countries sequenced >0.5% of their COVID-19 cases, while 42% of low- and middle-income countries reached that mark. Around 25% of the genomes from high income countries were submitted within 21 days, a pattern observed in 5% of the genomes from low- and middle-income countries. We found that sequencing around 0.5% of the cases, with a turnaround time <21 days, could provide a benchmark for SARS-CoV-2 genomic surveillance. Socioeconomic inequalities undermine the global pandemic preparedness, and efforts must be made to support low- and middle-income countries improve their local sequencing capacity.


Assuntos
COVID-19 , SARS-CoV-2 , Humanos , SARS-CoV-2/genética , Genoma Viral/genética , COVID-19/epidemiologia , Pandemias , Genômica
8.
J Am Stat Assoc ; 117(538): 678-692, 2022.
Artigo em Inglês | MEDLINE | ID: mdl-36060555

RESUMO

Comparative biologists are often interested in inferring covariation between multiple biological traits sampled across numerous related taxa. To properly study these relationships, we must control for the shared evolutionary history of the taxa to avoid spurious inference. An additional challenge arises as obtaining a full suite of measurements becomes increasingly difficult with increasing taxa. This generally necessitates data imputation or integration, and existing control techniques typically scale poorly as the number of taxa increases. We propose an inference technique that integrates out missing measurements analytically and scales linearly with the number of taxa by using a post-order traversal algorithm under a multivariate Brownian diffusion (MBD) model to characterize trait evolution. We further exploit this technique to extend the MBD model to account for sampling error or non-heritable residual variance. We test these methods to examine mammalian life history traits, prokaryotic genomic and phenotypic traits, and HIV infection traits. We find computational efficiency increases that top two orders-of-magnitude over current best practices. While we focus on the utility of this algorithm in phylogenetic comparative methods, our approach generalizes to solve long-standing challenges in computing the likelihood for matrix-normal and multivariate normal distributions with missing data at scale.

9.
Philos Trans R Soc Lond B Biol Sci ; 377(1861): 20210242, 2022 10 10.
Artigo em Inglês | MEDLINE | ID: mdl-35989603

RESUMO

Recent advances in Bayesian phylogenetics offer substantial computational savings to accommodate increased genomic sampling that challenges traditional inference methods. In this review, we begin with a brief summary of the Bayesian phylogenetic framework, and then conceptualize a variety of methods to improve posterior approximations via Markov chain Monte Carlo (MCMC) sampling. Specifically, we discuss methods to improve the speed of likelihood calculations, reduce MCMC burn-in, and generate better MCMC proposals. We apply several of these techniques to study the evolution of HIV virulence along a 1536-tip phylogeny and estimate the internal node heights of a 1000-tip SARS-CoV-2 phylogenetic tree in order to illustrate the speed-up of such analyses using current state-of-the-art approaches. We conclude our review with a discussion of promising alternatives to MCMC that approximate the phylogenetic posterior. This article is part of a discussion meeting issue 'Genomic population structures of microbial pathogens'.


Assuntos
COVID-19 , Software , Algoritmos , Teorema de Bayes , Humanos , Cadeias de Markov , Método de Monte Carlo , Filogenia , SARS-CoV-2/genética
10.
J Infect Dis ; 226(12): 2142-2149, 2022 12 13.
Artigo em Inglês | MEDLINE | ID: mdl-35771664

RESUMO

BACKGROUND: Monitoring the emergence and spread of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) variants is an important public health objective. We investigated how the Gamma variant was established in New York City (NYC) in early 2021 in the presence of travel restrictions that aimed to prevent viral spread from Brazil, the country where the variant was first identified. METHODS: We performed phylogeographic analysis on 15 967 Gamma sequences sampled between 10 March and 1 May 2021, to identify geographic sources of Gamma lineages introduced into NYC. We identified locally circulating Gamma transmission clusters and inferred the timing of their establishment in NYC. RESULTS: We identified 16 phylogenetically distinct Gamma clusters established in NYC (cluster sizes ranged 2-108 genomes); most of them were introduced from Florida and Illinois and only 1 directly from Brazil. By the time the first Gamma case was reported by genomic surveillance in NYC on 10 March, the majority (57%) of circulating Gamma lineages had already been established in the city for at least 2 weeks. CONCLUSIONS: Although travel from Brazil to the United States was restricted from May 2020 through the end of the study period, this restriction did not prevent Gamma from becoming established in NYC as most introductions occurred from domestic locations.


Assuntos
COVID-19 , SARS-CoV-2 , Humanos , SARS-CoV-2/genética , Cidade de Nova Iorque/epidemiologia , COVID-19/epidemiologia , Filogenia
11.
Methods Ecol Evol ; 13(10): 2181-2197, 2022 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-36908682

RESUMO

Biological phenotypes are products of complex evolutionary processes in which selective forces influence multiple biological trait measurements in unknown ways. Phylogenetic comparative methods seek to disentangle these relationships across the evolutionary history of a group of organisms. Unfortunately, most existing methods fail to accommodate high-dimensional data with dozens or even thousands of observations per taxon. Phylogenetic factor analysis offers a solution to the challenge of dimensionality. However, scientists seeking to employ this modeling framework confront numerous modeling and implementation decisions, the details of which pose computational and replicability challenges.We develop new inference techniques that increase both the computational efficiency and modeling flexibility of phylogenetic factor analysis. To facilitate adoption of these new methods, we present a practical analysis plan that guides researchers through the web of complex modeling decisions. We codify this analysis plan in an automated pipeline that distills the potentially overwhelming array of decisions into a small handful of (typically binary) choices.We demonstrate the utility of these methods and analysis plan in four real-world problems of varying scales. Specifically, we study floral phenotype and pollination in columbines, domestication in industrial yeast, life history in mammals, and brain morphology in New World monkeys.General and impactful community employment of these methods requires a data scientific analysis plan that balances flexibility, speed and ease of use, while minimizing model and algorithm tuning. Even in the presence of non-trivial phylogenetic model constraints, we show that one may analytically address latent factor uncertainty in a way that (a) aids model flexibility, (b) accelerates computation (by as much as 500-fold) and (c) decreases required tuning. These efforts coalesce to create an accessible Bayesian approach to high-dimensional phylogenetic comparative methods on large trees.

12.
medRxiv ; 2021 Dec 09.
Artigo em Inglês | MEDLINE | ID: mdl-34462754

RESUMO

Genomic sequencing provides critical information to track the evolution and spread of SARS-CoV-2, optimize molecular tests, treatments and vaccines, and guide public health responses. To investigate the spatiotemporal heterogeneity in the global SARS-CoV-2 genomic surveillance, we estimated the impact of sequencing intensity and turnaround times (TAT) on variant detection in 167 countries. Most countries submit genomes >21 days after sample collection, and 77% of low and middle income countries sequenced <0.5% of their cases. We found that sequencing at least 0.5% of the cases, with a TAT <21 days, could be a benchmark for SARS-CoV-2 genomic surveillance efforts. Socioeconomic inequalities substantially impact our ability to quickly detect SARS-CoV-2 variants, and undermine the global pandemic preparedness.

SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...