Learning dynamics of deep linear networks with multiple pathways.

Shi, Jianghong; Shea-Brown, Eric; Buice, Michael A

Shi, Jianghong; Shea-Brown, Eric; Buice, Michael A.

Afiliación

Shi J; Department of Applied Mathematics, University of Washington, Seattle, WA 98195.
Shea-Brown E; Department of Applied Mathematics, University of Washington, Seattle, WA 98195.
Buice MA; Allen Institute MindScope Program, Seattle, WA 98109.

Adv Neural Inf Process Syst ; 35: 34064-34076, 2022 Dec.

Article en En | MEDLINE | ID: mdl-38288081

ABSTRACT

ABSTRACT

Not only have deep networks become standard in machine learning, they are increasingly of interest in neuroscience as models of cortical computation that capture relationships between structural and functional properties. In addition they are a useful target of theoretical research into the properties of network computation. Deep networks typically have a serial or approximately serial organization across layers, and this is often mirrored in models that purport to represent computation in mammalian brains. There are, however, multiple examples of parallel pathways in mammalian brains. In some cases, such as the mouse, the entire visual system appears arranged in a largely parallel, rather than serial fashion. While these pathways may be formed by differing cost functions that drive different computations, here we present a new mathematical analysis of learning dynamics in networks that have parallel computational pathways driven by the same cost function. We use the approximation of deep linear networks with large hidden layer sizes to show that, as the depth of the parallel pathways increases, different features of the training set (defined by the singular values of the input-output correlation) will typically concentrate in one of the pathways. This result is derived analytically and demonstrated with numerical simulation with both linear and non-linear networks. Thus, rather than sharing stimulus and task features across multiple pathways, parallel network architectures learn to produce sharply diversified representations with specialized and specific pathways, a mechanism which may hold important consequences for codes in both biological and artificial systems.

Texto completo

Añadir a Mi BVS

Imprimir

XML

PubMed Links

Buscar en Google

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Idioma: En Revista: Adv Neural Inf Process Syst Año: 2022 Tipo del documento: Article Pais de publicación: Estados Unidos

Texto completo

Añadir a Mi BVS

Imprimir

XML

PubMed Links

Buscar en Google