Your browser doesn't support javascript.
loading
Tree sequences as a general-purpose tool for population genetic inference.
Whitehouse, Logan S; Ray, Dylan; Schrider, Daniel R.
Afiliación
  • Whitehouse LS; Department of Genetics, University of North Carolina, Chapel Hill, North Carolina, USA.
  • Ray D; Department of Genetics, University of North Carolina, Chapel Hill, North Carolina, USA.
  • Schrider DR; Department of Genetics, University of North Carolina, Chapel Hill, North Carolina, USA.
bioRxiv ; 2024 Aug 17.
Article en En | MEDLINE | ID: mdl-39185244
ABSTRACT
As population genetics data increases in size new methods have been developed to store genetic information in efficient ways, such as tree sequences. These data structures are computationally and storage efficient, but are not interchangeable with existing data structures used for many population genetic inference methodologies such as the use of convolutional neural networks (CNNs) applied to population genetic alignments. To better utilize these new data structures we propose and implement a graph convolutional network (GCN) to directly learn from tree sequence topology and node data, allowing for the use of neural network applications without an intermediate step of converting tree sequences to population genetic alignment format. We then compare our approach to standard CNN approaches on a set of previously defined benchmarking tasks including recombination rate estimation, positive selection detection, introgression detection, and demographic model parameter inference. We show that tree sequences can be directly learned from using a GCN approach and can be used to perform well on these common population genetics inference tasks with accuracies roughly matching or even exceeding that of a CNN-based method. As tree sequences become more widely used in population genetics research we foresee developments and optimizations of this work to provide a foundation for population genetics inference moving forward.
Palabras clave

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Idioma: En Revista: BioRxiv Año: 2024 Tipo del documento: Article País de afiliación: Estados Unidos

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Idioma: En Revista: BioRxiv Año: 2024 Tipo del documento: Article País de afiliación: Estados Unidos