Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 32
Filter
Add more filters










Publication year range
1.
Nat Neurosci ; 27(3): 403-408, 2024 Mar.
Article in English | MEDLINE | ID: mdl-38200183

ABSTRACT

The prefrontal cortex is crucial for learning and decision-making. Classic reinforcement learning (RL) theories center on learning the expectation of potential rewarding outcomes and explain a wealth of neural data in the prefrontal cortex. Distributional RL, on the other hand, learns the full distribution of rewarding outcomes and better explains dopamine responses. In the present study, we show that distributional RL also better explains macaque anterior cingulate cortex neuronal responses, suggesting that it is a common mechanism for reward-guided learning.


Subject(s)
Learning , Reinforcement, Psychology , Animals , Learning/physiology , Reward , Prefrontal Cortex/physiology , Neurons , Macaca , Decision Making/physiology
2.
bioRxiv ; 2023 Dec 16.
Article in English | MEDLINE | ID: mdl-38168410

ABSTRACT

The prefrontal cortex is crucial for economic decision-making and representing the value of options. However, how such representations facilitate flexible decisions remains unknown. We reframe economic decision-making in prefrontal cortex in line with representations of structure within the medial temporal lobe because such cognitive map representations are known to facilitate flexible behaviour. Specifically, we framed choice between different options as a navigation process in value space. Here we show that choices in a 2D value space defined by reward magnitude and probability were represented with a grid-like code, analogous to that found in spatial navigation. The grid-like code was present in ventromedial prefrontal cortex (vmPFC) local field potential theta frequency and the result replicated in an independent dataset. Neurons in vmPFC similarly contained a grid-like code, in addition to encoding the linear value of the chosen option. Importantly, both signals were modulated by theta frequency - occurring at theta troughs but on separate theta cycles. Furthermore, we found sharp-wave ripples - a key neural signature of planning and flexible behaviour - in vmPFC, which were modulated by accuracy and reward. These results demonstrate that multiple cognitive map-like computations are deployed in vmPFC during economic decision-making, suggesting a new framework for the implementation of choice in prefrontal cortex.

3.
PLoS Comput Biol ; 16(6): e1007944, 2020 06.
Article in English | MEDLINE | ID: mdl-32569311

ABSTRACT

Contemporary reinforcement learning (RL) theory suggests that potential choices can be evaluated by strategies that may or may not be sensitive to the computational structure of tasks. A paradigmatic model-free (MF) strategy simply repeats actions that have been rewarded in the past; by contrast, model-sensitive (MS) strategies exploit richer information associated with knowledge of task dynamics. MF and MS strategies should typically be combined, because they have complementary statistical and computational strengths; however, this tradeoff between MF/MS RL has mostly only been demonstrated in humans, often with only modest numbers of trials. We trained rhesus monkeys to perform a two-stage decision task designed to elicit and discriminate the use of MF and MS methods. A descriptive analysis of choice behaviour revealed directly that the structure of the task (of MS importance) and the reward history (of MF and MS importance) significantly influenced both choice and response vigour. A detailed, trial-by-trial computational analysis confirmed that choices were made according to a combination of strategies, with a dominant influence of a particular form of model sensitivity that persisted over weeks of testing. The residuals from this model necessitated development of a new combined RL model which incorporates a particular credit assignment weighting procedure. Finally, response vigor exhibited a subtly different collection of MF and MS influences. These results provide new illumination onto RL behavioural processes in non-human primates.


Subject(s)
Models, Theoretical , Primates/physiology , Animals , Computational Biology , Decision Making , Humans
4.
Front Neural Circuits ; 14: 615626, 2020.
Article in English | MEDLINE | ID: mdl-33408616

ABSTRACT

Neural processing occurs across a range of temporal scales. To facilitate this, the brain uses fast-changing representations reflecting momentary sensory input alongside more temporally extended representations, which integrate across both short and long temporal windows. The temporal flexibility of these representations allows animals to behave adaptively. Short temporal windows facilitate adaptive responding in dynamic environments, while longer temporal windows promote the gradual integration of information across time. In the cognitive and motor domains, the brain sets overarching goals to be achieved within a long temporal window, which must be broken down into sequences of actions and precise movement control processed across much shorter temporal windows. Previous human neuroimaging studies and large-scale artificial network models have ascribed different processing timescales to different cortical regions, linking this to each region's position in an anatomical hierarchy determined by patterns of inter-regional connectivity. However, even within cortical regions, there is variability in responses when studied with single-neuron electrophysiology. Here, we review a series of recent electrophysiology experiments that demonstrate the heterogeneity of temporal receptive fields at the level of single neurons within a cortical region. This heterogeneity appears functionally relevant for the computations that neurons perform during decision-making and working memory. We consider anatomical and biophysical mechanisms that may give rise to a heterogeneity of timescales, including recurrent connectivity, cortical layer distribution, and neurotransmitter receptor expression. Finally, we reflect on the computational relevance of each brain region possessing a heterogeneity of neuronal timescales. We argue that this architecture is of particular importance for sensory, motor, and cognitive computations.


Subject(s)
Brain/physiology , Memory, Short-Term/physiology , Nerve Net/physiology , Neural Pathways/physiology , Animals , Cerebral Cortex/physiology , Humans , Neurons/physiology
5.
Proc Natl Acad Sci U S A ; 116(45): 22795-22801, 2019 11 05.
Article in English | MEDLINE | ID: mdl-31636178

ABSTRACT

Visual fixations play a vital role in decision making. Recent studies have demonstrated that the longer subjects fixate an option, the more likely they are to choose it. However, the role of evaluating stimuli covertly (i.e., without fixating them), and how covert evaluations determine where to subsequently fixate, remains relatively unexplored. Here, we trained monkeys to perform a decision-making task where they made binary choices between reward-predictive stimuli which were well-learned ("overtrained"), recently learned ("novel"), or a combination of both ("mixed"). Subjects were free to saccade around the screen and make a choice (via joystick response) at any time. Subjects rarely fixated both options, yet choice behavior was better explained by assuming the values of both stimuli governed choices. The first fixation latency was fast (∼150 ms) but, surprisingly, its direction was value-driven. This suggests covert evaluation of stimulus values prior to first saccade. This was particularly evident for overtrained stimuli. For novel stimuli, first fixations became increasingly value-driven throughout a behavioral session. However, this improvement lagged behind learning of accurate economic choices, suggesting separate processes governed their learning. Finally, mixed trials revealed a strong bias toward fixating the novel stimulus first but no bias toward choosing it. Our results suggest that the primate brain contains fast covert evaluation mechanisms for guiding fixations toward highly valuable and novel information. By employing such covert mechanisms, fixation behavior becomes dissociable from the value comparison processes that drive final choice. This implies that primates use separable decision systems for value-guided fixations and value-guided choice.


Subject(s)
Choice Behavior , Fixation, Ocular , Learning , Animals , Macaca , Photic Stimulation
6.
Nat Neurosci ; 21(10): 1471-1481, 2018 10.
Article in English | MEDLINE | ID: mdl-30258238

ABSTRACT

Naturalistic decision-making typically involves sequential deployment of attention to choice alternatives to gather information before a decision is made. Attention filters how information enters decision circuits, thus implying that attentional control may shape how decision computations unfold. We recorded neuronal activity from three subregions of the prefrontal cortex (PFC) while monkeys performed an attention-guided decision-making task. From the first saccade to decision-relevant information, a triple dissociation of decision- and attention-related computations emerged in parallel across PFC subregions. During subsequent saccades, orbitofrontal cortex activity reflected the value comparison between currently and previously attended information. In contrast, the anterior cingulate cortex carried several signals reflecting belief updating in light of newly attended information, the integration of evidence to a decision bound and an emerging plan for what action to choose. Our findings show how anatomically dissociable PFC representations evolve during attention-guided information search, supporting computations critical for value-guided choice.


Subject(s)
Attention/physiology , Brain Mapping , Decision Making/physiology , Neurons/physiology , Prefrontal Cortex/physiology , Action Potentials/physiology , Animals , Cues , Macaca mulatta , Male , Models, Neurological , Patch-Clamp Techniques , Reinforcement, Psychology , Saccades/physiology
7.
Nat Commun ; 9(1): 3498, 2018 08 29.
Article in English | MEDLINE | ID: mdl-30158519

ABSTRACT

Competing accounts propose that working memory (WM) is subserved either by persistent activity in single neurons or by dynamic (time-varying) activity across a neural population. Here, we compare these hypotheses across four regions of prefrontal cortex (PFC) in an oculomotor-delayed-response task, where an intervening cue indicated the reward available for a correct saccade. WM representations were strongest in ventrolateral PFC neurons with higher intrinsic temporal stability (time-constant). At the population-level, although a stable mnemonic state was reached during the delay, this tuning geometry was reversed relative to cue-period selectivity, and was disrupted by the reward cue. Single-neuron analysis revealed many neurons switched to coding reward, rather than maintaining task-relevant spatial selectivity until saccade. These results imply WM is fulfilled by dynamic, population-level activity within high time-constant neurons. Rather than persistent activity supporting stable mnemonic representations that bridge subsequent salient stimuli, PFC neurons may stabilise a dynamic population-level process supporting WM.


Subject(s)
Memory, Short-Term/physiology , Prefrontal Cortex/physiology , Animals , Macaca mulatta , Male
8.
PLoS Biol ; 15(11): e1002618, 2017 11.
Article in English | MEDLINE | ID: mdl-29190275

ABSTRACT

[This corrects the article DOI: 10.1371/journal.pbio.2000638.].

9.
PLoS Biol ; 14(11): e2000638, 2016 Nov.
Article in English | MEDLINE | ID: mdl-27832071

ABSTRACT

Information sampling is often biased towards seeking evidence that confirms one's prior beliefs. Despite such biases being a pervasive feature of human behavior, their underlying causes remain unclear. Many accounts of these biases appeal to limitations of human hypothesis testing and cognition, de facto evoking notions of bounded rationality, but neglect more basic aspects of behavioral control. Here, we investigated a potential role for Pavlovian approach in biasing which information humans will choose to sample. We collected a large novel dataset from 32,445 human subjects, making over 3 million decisions, who played a gambling task designed to measure the latent causes and extent of information-sampling biases. We identified three novel approach-related biases, formalized by comparing subject behavior to a dynamic programming model of optimal information gathering. These biases reflected the amount of information sampled ("positive evidence approach"), the selection of which information to sample ("sampling the favorite"), and the interaction between information sampling and subsequent choices ("rejecting unsampled options"). The prevalence of all three biases was related to a Pavlovian approach-avoid parameter quantified within an entirely independent economic decision task. Our large dataset also revealed that individual differences in the amount of information gathered are a stable trait across multiple gameplays and can be related to demographic measures, including age and educational attainment. As well as revealing limitations in cognitive processing, our findings suggest information sampling biases reflect the expression of primitive, yet potentially ecologically adaptive, behavioral repertoires. One such behavior is sampling from options that will eventually be chosen, even when other sources of information are more pertinent for guiding future action.


Subject(s)
Bias , Choice Behavior , Decision Making , Humans , Information Services
10.
Elife ; 52016 10 05.
Article in English | MEDLINE | ID: mdl-27705742

ABSTRACT

Correlates of value are routinely observed in the prefrontal cortex (PFC) during reward-guided decision making. In previous work (Hunt et al., 2015), we argued that PFC correlates of chosen value are a consequence of varying rates of a dynamical evidence accumulation process. Yet within PFC, there is substantial variability in chosen value correlates across individual neurons. Here we show that this variability is explained by neurons having different temporal receptive fields of integration, indexed by examining neuronal spike rate autocorrelation structure whilst at rest. We find that neurons with protracted resting temporal receptive fields exhibit stronger chosen value correlates during choice. Within orbitofrontal cortex, these neurons also sustain coding of chosen value from choice through the delivery of reward, providing a potential neural mechanism for maintaining predictions and updating stored values during learning. These findings reveal that within PFC, variability in temporal specialisation across neurons predicts involvement in specific decision-making computations.


Subject(s)
Choice Behavior , Decision Making , Neurons/physiology , Prefrontal Cortex/physiology , Rest , Reward , Animals , Macaca mulatta , Neurophysiological Monitoring
11.
J Neurosci ; 36(39): 10002-15, 2016 09 28.
Article in English | MEDLINE | ID: mdl-27683898

ABSTRACT

UNLABELLED: Integrating costs and benefits is crucial for optimal decision-making. Although much is known about decisions that involve outcome-related costs (e.g., delay, risk), many of our choices are attached to actions and require an evaluation of the associated motor costs. Yet how the brain incorporates motor costs into choices remains largely unclear. We used human fMRI during choices involving monetary reward and physical effort to identify brain regions that serve as a choice comparator for effort-reward trade-offs. By independently varying both options' effort and reward levels, we were able to identify the neural signature of a comparator mechanism. A network involving supplementary motor area and the caudal portion of dorsal anterior cingulate cortex encoded the difference in reward (positively) and effort levels (negatively) between chosen and unchosen choice options. We next modeled effort-discounted subjective values using a novel behavioral model. This revealed that the same network of regions involving dorsal anterior cingulate cortex and supplementary motor area encoded the difference between the chosen and unchosen options' subjective values, and that activity was best described using a concave model of effort-discounting. In addition, this signal reflected how precisely value determined participants' choices. By contrast, separate signals in supplementary motor area and ventromedial prefrontal cortex correlated with participants' tendency to avoid effort and seek reward, respectively. This suggests that the critical neural signature of decision-making for choices involving motor costs is found in human cingulate cortex and not ventromedial prefrontal cortex as typically reported for outcome-based choice. Furthermore, distinct frontal circuits seem to drive behavior toward reward maximization and effort minimization. SIGNIFICANCE STATEMENT: The neural processes that govern the trade-off between expected benefits and motor costs remain largely unknown. This is striking because energetic requirements play an integral role in our day-to-day choices and instrumental behavior, and a diminished willingness to exert effort is a characteristic feature of a range of neurological disorders. We use a new behavioral characterization of how humans trade off reward maximization with effort minimization to examine the neural signatures that underpin such choices, using BOLD MRI neuroimaging data. We find the critical neural signature of decision-making, a signal that reflects the comparison of value between choice options, in human cingulate cortex, whereas two distinct brain circuits drive behavior toward reward maximization or effort minimization.


Subject(s)
Choice Behavior/physiology , Cognition/physiology , Gyrus Cinguli/physiology , Nerve Net/physiology , Physical Exertion/physiology , Reward , Adult , Discrimination Learning/physiology , Female , Humans , Male
12.
Elife ; 42015 Dec 11.
Article in English | MEDLINE | ID: mdl-26653139

ABSTRACT

Activity in prefrontal cortex (PFC) has been richly described using economic models of choice. Yet such descriptions fail to capture the dynamics of decision formation. Describing dynamic neural processes has proven challenging due to the problem of indexing the internal state of PFC and its trial-by-trial variation. Using primate neurophysiology and human magnetoencephalography, we here recover a single-trial index of PFC internal states from multiple simultaneously recorded PFC subregions. This index can explain the origins of neural representations of economic variables in PFC. It describes the relationship between neural dynamics and behaviour in both human and monkey PFC, directly bridging between human neuroimaging data and underlying neuronal activity. Moreover, it reveals a functionally dissociable interaction between orbitofrontal cortex, anterior cingulate cortex and dorsolateral PFC in guiding cost-benefit decisions. We cast our observations in terms of a recurrent neural network model of choice, providing formal links to mechanistic dynamical accounts of decision-making.


Subject(s)
Decision Making , Prefrontal Cortex/physiology , Animals , Haplorhini , Humans , Magnetoencephalography , Models, Neurological , Neural Pathways/physiology , Neurophysiology
13.
PLoS Comput Biol ; 11(3): e1004116, 2015 Mar.
Article in English | MEDLINE | ID: mdl-25816114

ABSTRACT

There has been considerable interest from the fields of biology, economics, psychology, and ecology about how decision costs decrease the value of rewarding outcomes. For example, formal descriptions of how reward value changes with increasing temporal delays allow for quantifying individual decision preferences, as in animal species populating different habitats, or normal and clinical human populations. Strikingly, it remains largely unclear how humans evaluate rewards when these are tied to energetic costs, despite the surge of interest in the neural basis of effort-guided decision-making and the prevalence of disorders showing a diminished willingness to exert effort (e.g., depression). One common assumption is that effort discounts reward in a similar way to delay. Here we challenge this assumption by formally comparing competing hypotheses about effort and delay discounting. We used a design specifically optimized to compare discounting behavior for both effort and delay over a wide range of decision costs (Experiment 1). We then additionally characterized the profile of effort discounting free of model assumptions (Experiment 2). Contrary to previous reports, in both experiments effort costs devalued reward in a manner opposite to delay, with small devaluations for lower efforts, and progressively larger devaluations for higher effort-levels (concave shape). Bayesian model comparison confirmed that delay-choices were best predicted by a hyperbolic model, with the largest reward devaluations occurring at shorter delays. In contrast, an altogether different relationship was observed for effort-choices, which were best described by a model of inverse sigmoidal shape that is initially concave. Our results provide a novel characterization of human effort discounting behavior and its first dissociation from delay discounting. This enables accurate modelling of cost-benefit decisions, a prerequisite for the investigation of the neural underpinnings of effort-guided choice and for understanding the deficits in clinical disorders characterized by behavioral inactivity.


Subject(s)
Choice Behavior/physiology , Models, Biological , Reward , Adult , Computational Biology , Female , Hand Strength/physiology , Humans , Male , Task Performance and Analysis , Young Adult
14.
Curr Opin Behav Sci ; 1: 78-85, 2015 Feb 01.
Article in English | MEDLINE | ID: mdl-26937446

ABSTRACT

Our environment and internal states are frequently complex, ambiguous and dynamic, meaning we need to have selection mechanisms to ensure we are basing our decisions on currently relevant information. Here, we review evidence that orbitofrontal (OFC) and ventromedial prefrontal cortex (VMPFC) play conserved, critical but distinct roles in this process. While OFC may use specific sensory associations to enhance task-relevant information, particularly in the context of learning, VMPFC plays a role in ensuring irrelevant information does not impinge on the decision in hand.

16.
J Neurosci ; 33(44): 17385-97, 2013 Oct 30.
Article in English | MEDLINE | ID: mdl-24174671

ABSTRACT

Effective decision-making requires consideration of costs and benefits. Previous studies have implicated orbitofrontal cortex (OFC), dorsolateral prefrontal cortex (DLPFC), and anterior cingulate cortex (ACC) in cost-benefit decision-making. Yet controversy remains about whether different decision costs are encoded by different brain areas, and whether single neurons integrate costs and benefits to derive a subjective value estimate for each choice alternative. To address these issues, we trained four subjects to perform delay- and effort-based cost-benefit decisions and recorded neuronal activity in OFC, ACC, DLPFC, and the cingulate motor area (CMA). Although some neurons, mainly in ACC, did exhibit integrated value signals as if performing cost-benefit computations, they were relatively few in number. Instead, the majority of neurons in all areas encoded the decision type; that is whether the subject was required to perform a delay- or effort-based decision. OFC and DLPFC neurons tended to show the largest changes in firing rate for delay- but not effort-based decisions; whereas, the reverse was true for CMA neurons. Only ACC contained neurons modulated by both effort- and delay-based decisions. These findings challenge the idea that OFC calculates an abstract value signal to guide decision-making. Instead, our results suggest that an important function of single PFC neurons is to categorize sensory stimuli based on the consequences predicted by those stimuli.


Subject(s)
Choice Behavior/physiology , Frontal Lobe/physiology , Neurons/physiology , Psychomotor Performance/physiology , Animals , Brain Mapping/methods , Cost-Benefit Analysis , Frontal Lobe/cytology , Macaca mulatta , Male , Photic Stimulation/methods
18.
Ann N Y Acad Sci ; 1239: 33-42, 2011 Dec.
Article in English | MEDLINE | ID: mdl-22145873

ABSTRACT

Damage to the orbitofrontal cortex (OFC) and anterior cingulate cortex (ACC) impairs decision making, but the underlying value computations that cause such impairments remain unclear. Both the OFC and ACC encode a wide variety of signals correlated with decision making. The current challenge is to determine how these two different areas support decision-making processes. Here, we review a series of experiments that have helped define these roles. A special population of neurons in the ACC, but not the OFC, multiplex value information across decision parameters using a unified encoding scheme, and encode reward prediction errors. In contrast, neurons in the OFC, but not the ACC, encode the value of a choice relative to the recent history of choice values. Together, these results suggest complementary valuation processes: OFC neurons dynamically evaluate current choices relative to the value contexts recently experienced, while ACC neurons encode choice predictions and prediction errors using a common valuation currency reflecting the integration of multiple decision parameters.


Subject(s)
Brain Mapping , Decision Making , Frontal Lobe/physiology , Gyrus Cinguli/physiology , Neurons/physiology , Animals , Behavior, Animal , Humans , Models, Psychological , Physiology, Comparative/methods , Reward
19.
Nat Neurosci ; 14(12): 1581-9, 2011 Oct 30.
Article in English | MEDLINE | ID: mdl-22037498

ABSTRACT

Damage to prefrontal cortex (PFC) impairs decision-making, but the underlying value computations that might cause such impairments remain unclear. Here we report that value computations are doubly dissociable among PFC neurons. Although many PFC neurons encoded chosen value, they used opponent encoding schemes such that averaging the neuronal population extinguished value coding. However, a special population of neurons in anterior cingulate cortex (ACC), but not in orbitofrontal cortex (OFC), multiplexed chosen value across decision parameters using a unified encoding scheme and encoded reward prediction errors. In contrast, neurons in OFC, but not ACC, encoded chosen value relative to the recent history of choice values. Together, these results suggest complementary valuation processes across PFC areas: OFC neurons dynamically evaluate current choices relative to recent choice values, whereas ACC neurons encode choice predictions and prediction errors using a common valuation currency reflecting the integration of multiple decision parameters.


Subject(s)
Brain Mapping , Decision Making/physiology , Gyrus Cinguli/cytology , Neurons/physiology , Prefrontal Cortex/cytology , Animals , Computer Simulation , Macaca mulatta , Male , Models, Neurological , Photic Stimulation , Predictive Value of Tests , Probability , Reaction Time/physiology , Reward
20.
Behav Neurosci ; 125(3): 297-317, 2011 Jun.
Article in English | MEDLINE | ID: mdl-21534649

ABSTRACT

Patients with damage to the prefrontal cortex (PFC)--especially the ventral and medial parts of PFC--often show a marked inability to make choices that meet their needs and goals. These decision-making impairments often reflect both a deficit in learning concerning the consequences of a choice, as well as deficits in the ability to adapt future choices based on experienced value of the current choice. Thus, areas of PFC must support some value computations that are necessary for optimal choice. However, recent frameworks of decision making have highlighted that optimal and adaptive decision making does not simply rest on a single computation, but a number of different value computations may be necessary. Using this framework as a guide, we summarize evidence from both lesion studies and single-neuron physiology for the representation of different value computations across PFC areas.


Subject(s)
Decision Making/physiology , Frontal Lobe/physiology , Neurons/physiology , Prefrontal Cortex/physiology , Reward , Animals , Brain Mapping/psychology , Choice Behavior/physiology , Cognition/physiology , Learning/physiology
SELECTION OF CITATIONS
SEARCH DETAIL
...