A computational model of a single CBG loop (motor loop)

6.9 A computational model of simple pavlovian conditioning in the basolateral amygdala (BLA)

6.10.1 State space and Task space abstraction

6.11.5 External bias from Medial to Lateral OFC

1.1 2-Arm Bandit Task and Probabilistic Reward Learning

1.3 Proximity of Values and Decision Making

Better rewarding vs. closer choice

3 2-stage markov task

B. Abler, H. Walter, and S. Erk, Neural correlates of frustration, Neuroreport 16.7, pp.669-672, 2005.

G. K. Adams, Neuroethology of decision-making, 2012.

G. Alexander, Parallel Organization of Functionally Segregated Circuits Linking Basal Ganglia and Cortex, Annu. Rev. Neurosci. issn: 0147006X, 1986.

W. H. Alexander and J. W. Brown, Medial prefrontal cortex as an action-outcome predictor, Nat. Neurosci. issn, p.10976256, 2011.

F. Alexandre, A behavioral framework for a systemic view of brain modeling, Comput. Model. Brain Behav, 2016.
URL : https://hal.archives-ouvertes.fr/hal-01246653

M. C. Anderson and C. Weaver, Inhibitory Control over Action and Memory, In: Encycl. Neurosci. isbn, p.9780080450469, 2009.

F. S. Arana, Dissociable Contributions of the Human Amygdala and Orbitofrontal Cortex to Incentive Motivation and Goal Selection, J. Neurosci. issn, p.2706474, 2003.

G. Aston-Jones, J. Rajkowski, and P. Kubiak, Conditioned responses of monkey locus coeruleus neurons anticipate acquisition of discriminative behavior in a vigilance task, Neuroscience. issn, p.3064522, 1997.

A. Baddeley, Dementia and Working Memory, Q. J. Exp. Psychol. Sect. A. issn, p.14640740, 1986.

D. Badre, Cognitive control, hierarchy, and the rostro-caudal organization of the frontal lobes, 2008.

P. P. Balasubramani, An extended Reinforcement Learning model of basal ganglia to understand the contributions of serotonin and dopamine in risk-based decision making, reward prediction, and punishment learning, Front. Comput. Neurosci. 8.1 APR, p.16625188, 2014.

B. W. Balleine and A. Dickinson, Balleine and Dickinson, pp.1-13, 1998.

B. W. Balleine and S. Killcross, Parallel incentive processing: an integrated view of amygdala function, Trends Neurosci. 29, vol.5, pp.272-279, 2006.

A. Bandura, The anatomy of stages of change, In: Am. J. Health Promot. issn, p.8901171, 1997.

N. Bault, Dissociation between private and social counterfactual value signals following ventromedial prefrontal cortex damage, J. Cogn. Neurosci. issn, p.15308898, 2019.

M. Baxter and E. Murray, Reinterpreting the behavioural effects of amygdala lesions in non-human primates, In: The amygdala: a Funct. Anal, 2000.

A. Bechara and A. R. Damasio, The somatic marker hypothesis: A neural theory of economic decision, Games Econ. Behav. issn, p.10902473, 2005.

A. Bechara, Deciding advantageously before knowing the advantageous strategy, pp.1293-1295, 1997.

A. Bechara, Different contributions of the human amygdala and ventromedial prefrontal cortex to decision-making, J. Neurosci, 1999.

T. E. J. Behrens, Learning the value of information in an uncertain world, Nat. Neurosci. issn, p.10976256, 2007.

K. C. Berridge and T. E. Robinson, The Mind of an Addicted Brain: Neural Sensitization of Wanting Versus Liking, Curr. Dir. Psychol. Sci. issn, p.14678721, 1995.

G. B. Bissonette, Separate Populations of Neurons in Ventral Striatum Encode Value and Motivation, PLoS One, vol.8, issue.5, p.19326203, 2013.

K. Blair, Choosing the lesser of two evils, the better of two goods: Specifying the roles of ventromedial prefrontal cortex and dorsal anterior cingulate in object choice, J. Neurosci. issn, p.2706474, 2006.

R. Bogacz, The physics of optimal decision making: A formal analysis of models of performance in two-alternative forced-choice tasks, Psychol. Rev, vol.113, pp.700-765, 2006.

E. D. Boorman, How Green Is the Grass on the Other Side? Frontopolar Cortex and the Evidence in Favor of Alternative Courses of Action, Neuron 62.5, p.8966273, 2009.

E. D. Boorman, Two Anatomically and Computationally Distinct Learning Signals Predict Changes to Stimulus-Outcome Associations in Hippocampus, Neuron. issn, p.10974199, 2016.

T. Boraud, A. Leblois, and N. P. Rougier, A natural history of skills, Prog. Neurobiol. issn, p.18735118, 2018.
URL : https://hal.archives-ouvertes.fr/hal-01874690

S. Bouret and B. J. Richmond, Ventromedial and Orbital Prefrontal Neurons Differentially Encode Internally and Externally Driven Motivational Values in Monkeys, J. Neurosci, vol.30, pp.270-6474, 2010.

S. Bouret and S. J. Sara, Network reset: A simplified overarching theory of locus coeruleus noradrenaline function, Trends Neurosci. issn, p.1662236, 2005.
URL : https://hal.archives-ouvertes.fr/hal-00088131

L. A. Bradfield, Medial Orbitofrontal Cortex Mediates Outcome Retrieval in Partially Observable Task Situations, Neuron 88, vol.6, p.8966273, 2015.

K. Brodmann, Brodmann's: Localisation in the cerebral cortex, 2007.

D. W. Bryden, Attention for learning signals in anterior cingulate cortex, J. Neurosci. issn, p.2706474, 2011.

M. Cabanac, Pleasure: the common currency, J. Theor. Biol. issn, pp.80594-80600, 1992.

L. Calandreau, Extracellular hippocampal acetylcholine level controls amygdala function and promotes adaptive conditioned emotional response, J. Neurosci. issn, p.2706474, 2006.
URL : https://hal.archives-ouvertes.fr/hal-02669014

D. Cañamero, Modeling motivations and emotions as a basis for intelligent behavior, Proc. First Int. Conf. Auton. Agents, pp.148-155, 1997.

S. T. Carmichael and J. L. Price, Architectonic subdivision of the orbital and medial prefrontal cortex in the macaque monkey, J. Comp. Neurol, vol.346, issue.3, pp.366-402, 1994.

S. T. Carmichael and J. L. Price, Limbic connections of the orbital and medial prefrontal cortex in macaque monkeys, J. Comp. Neurol. issn, p.10969861, 1995.

S. T. Carmichael and J. L. Price, Connectional networks within the orbital and medial prefrontal cortex of macaque monkeys, J. Comp. Neurol. issn, vol.371, p.219967, 1996.

S. T. Carmichael and J. L. Price, Sensory and premotor connections of the orbital and medial prefrontal cortex of macaque monkeys, J. Comp. Neurol, vol.363, p.10969861, 1995.

M. Carrere and F. Alexandre, A pavlovian model of the amygdala and its influence within the medial temporal lobe, Front. Syst. Neurosci. 9.March. issn, pp.1662-5137, 2015.
URL : https://hal.archives-ouvertes.fr/hal-01145790

E. Cartoni, S. Puglisi-Allegra, and G. Baldassarre, The three principles of action: A Pavlovian-instrumental transfer hypothesis, Front. Behav. Neurosci. issn, p.16625153, 2013.

T. C. Cassey, Adaptive Sampling of Information in Perceptual Decision-Making, In: PLoS One 8.11, 2013.

C. Cavada, The Anatomical Connections of the Macaque Monkey Orbitofrontal Cortex. A Review, Cereb. Cortex, 2000.

C. Cavada and W. Schultz, The Mysterious Orbitofrontal Cortex. Foreword, In: Cereb. Cortex, 2000.

V. S. Chakravarthy, D. Joseph, and R. S. Bapi, What do the basal ganglia do? A modeling perspective, 2010.

S. C. Chan, Y. Niv, and K. A. Norman, A probability distribution over latent causes, in the orbitofrontal cortex, J. Neurosci. issn, p.15292401, 2016.

J.-S. Choi, Left anterior subregion of orbitofrontal cortex volume reduction and impaired organizational strategies in obsessive-compulsive disorder, J. Psychiatr. Res, vol.38, p.223956, 2004.

G. I. Christopoulos, Neural correlates of value, risk, and risk aversion contributing to decision making under risk, J. Neurosci, vol.29, pp.12574-12583, 2009.

Y. Chudasama and T. W. Robbins, Dissociable contributions of the orbitofrontal and infralimbic cortex to pavlovian autoshaping and discrimination reversal learning: Further evidence for the functional heterogeneity of the rodent frontal cortex, J. Neurosci, vol.23, p.2706474, 2003.

N. Chuhma, Functional connectome of the striatal medium spiny neuron, J. Neurosci. issn, p.2706474, 2011.

P. Cisek, Cortical mechanisms of action selection: the affordance competition hypothesis, Model. Nat. Action Sel. April, pp.208-238, 2011.

H. F. Clarke, T. W. Robbins, and A. C. Roberts, Lesions of the medial striatum in monkeys produce perseverative impairments during reversal learning similar to those produced by lesions of the orbitofrontal cortex, J. Neurosci. 28, vol.43, pp.10972-10982, 2008.

J. Y. Cohen, Neuron-type-specific signals for reward and punishment in the ventral tegmental area, 2012.

K. E. Conen and C. Padoa-Schioppa, Partial Adaptation to the Value Range in the Macaque Orbitofrontal Cortex, J. Neurosci. Pp, pp.270-6474, 2019.

S. M. Constantino and N. D. Daw, Learning the opportunity cost of time in a patch-foraging task, Cogn. Affect. Behav. Neurosci, vol.15, p.15307026, 2015.

R. Cools, K. Nakamura, and N. D. Daw, Serotonin and dopamine: Unifying affective, activational, and decision functions, Neuropsychopharmacology 36.1, pp.98-113, 2011.

L. H. Corbit, S. C. Fischbach, and P. H. Janak, Nucleus accumbens core and shell are differentially involved in general and outcome-specific forms of Pavlovian-instrumental transfer with alcohol and sucrose rewards, Eur. J. Neurosci, vol.43, p.14609568, 2016.

G. Coricelli, Regret and its avoidance: a neuroimaging study of choice behavior, Nat. Neurosci, vol.8, issue.9, pp.1097-6256, 2005.

I. Cos, L. Cañamero, and G. M. Hayes, Learning affordances of consummatory behaviors: Motivation-driven adaptive perception, Adapt. Behav, vol.18, issue.3, p.10597123, 2010.

P. L. Croxson, Quantitative investigation of connections of the prefrontal cortex in the human and macaque using probabilistic diffusion tractography, J. Neurosci. issn, p.2706474, 2005.

N. D. Daw, S. Kakade, and P. Dayan, Opponent interactions between serotonin and dopamine, Neural Networks, vol.15, pp.603-616, 2002.

N. D. Daw, Y. Niv, and P. Dayan, Uncertainty-based competition between prefrontal and dorsolateral striatal systems for behavioral control, Nat. Neurosci. issn, p.10976256, 2005.

N. D. Daw, Cortical substrates for exploratory decisions in humans, Nature. issn, p.14764687, 2006.

N. D. Daw, Model-Based Influences on Humans' Choices and Striatal Prediction Errors, Neuron 69, vol.6, pp.1204-1215, 2011.

P. Dayan, Goal-directed control and its antipodes, Neural Networks, 2009.

P. Dayan, Twenty-Five Lessons from Computational Neuromodulation, Neuron 76.1, pp.240-256, 2012.

P. Dayan and Y. Niv, Reinforcement learning: The Good, The Bad and The Ugly, Curr. Opin. Neurobiol, vol.18, pp.185-196, 2008.

N. Denoyelle, From biological to numerical experiments in systemic neuroscience: A simulation platform, In: Biosyst. Biorobotics, vol.12, p.21953570, 2016.
URL : https://hal.archives-ouvertes.fr/hal-01227968

B. E. Depue, T. Curran, and M. T. Banich, Prefrontal regions orchestrate suppression of emotional memories via a two-phase process, 2007.

A. Dezfouli and B. W. Balleine, Learning the structure of the world: The adaptive nature of state-space and action representations in multi-stage decision-making, PLOS Comput. Biol, vol.15, p.1007334, 2019.

R. J. Dolan and P. Dayan, Goals and habits in the brain, pp.312-337, 2013.

J. F. Domínguez D., Lateral orbitofrontal cortex activity is modulated by group membership in situations of justified and unjustified violence, Soc. Neurosci. issn, p.17470927, 2018.

K. Doya, What are the computations of the cerebellum, the basal ganglia and the cerebral cortex?, In: Neural Networks. issn, issue.99, pp.46-51, 1999.

K. Doya, Multiple model-based reinforcement learning, Neural Comput. 14.6, pp.1347-1369, 2002.

R. P. Dum and P. L. Strick, In: Neurobiol. cingulate cortex limbic thalamus A Compr. handbook. Cambridge, MA, US: Birkhäuser, pp.3-7643, 1993.

R. Elliott, Dissociable Functions in the Medial and Lateral Orbitofrontal Cortex: Evidence from Human Neuroimaging Studies, Cereb. Cortex, 2000.

R. Elliott, Z. Agnew, and J. F. Deakin, Medial orbitofrontal cortex codes relative rather than absolute value of financial rewards in humans, Eur. J. Neurosci, vol.27, pp.2213-2218, 2008.

N. Eshel, Arithmetic and local circuitry underlying dopamine prediction errors, Nature. issn, p.14764687, 2015.

F. Fearing, I. P. Pavlov, and G. V. Anrep, Conditioned Reflexes. An Investigation of the Physiological Activity of the Cerebral Cortex, J. Am. Inst. Crim. Law Criminol. issn, p.8854173, 1929.

L. K. Fellows, Ventromedial frontal cortex mediates affective shifting in humans: evidence from a reversal learning paradigm, Brain 126, vol.8, pp.1460-2156, 2003.

L. K. Fellows, Orbitofrontal contributions to value-based decision making: Evidence from humans with frontal lobe damage, Ann. N. Y. Acad. Sci. 1239, vol.1, p.17496632, 2011.

L. K. Fellows and M. J. Farah, Different underlying impairments in decision-making following ventromedial and dorsolateral frontal lobe damage in humans, Cereb. Cortex. issn, p.10473211, 2005.

L. K. Fellows and M. J. Farah, The role of ventromedial prefrontal cortex in decision making: Judgment under uncertainty or judgment per se?, In: Cereb. Cortex, vol.17, pp.2669-2674, 2007.

C. D. Fiorillo, P. N. Tobler, and W. Schultz, Discrete coding of reward probability and uncertainty by dopamine neurons, 2003.

T. H. Fitzgerald, B. Seymour, and R. J. Dolan, The Role of Human Orbitofrontal Cortex in Value Comparison for Incommensurable Objects, pp.270-6474, 2009.

S. B. Floresco, The Nucleus Accumbens: An Interface Between Cognition, Emotion, and Action, Annu. Rev. Psychol. issn, pp.66-4308, 2015.

C. Frith and R. J. Dolan, Brain mechanisms associated with top-down processes in perception, 1997.

B. Fu, N. F. Noy, and M.-A. Storey, Eye tracking the user experience - An evaluation of ontology visualization techniques, In: Semant. Web. issn, p.22104968, 2017.

S. Funahashi, C. J. Bruce, and P. S. Goldman-Rakic, Mnemonic coding of visual space in the monkey's dorsolateral prefrontal cortex, J. Neurophysiol, vol.61, issue.2, pp.331-349, 1989.

D. Gaffan, Memory, action and the corpus striatum: Current developments in the memory-habit distinction, Semin. Neurosci. issn: 10445765, 1996.

M. P. Gardner, Lateral Orbitofrontal Inactivation Dissociates Devaluation-Sensitive Behavior and Economic Choice, Neuron 96, vol.5, 2017.

A. Garenne, Basal Ganglia Preferentially Encode Context Dependent Choice in a Two-Armed Bandit Task, Front. Syst. Neurosci, vol.5, pp.1662-5137, 2011.
URL : https://hal.archives-ouvertes.fr/hal-01155573

J. Gläscher, A. N. Hampton, and J. P. O'Doherty, Determining a role for ventromedial prefrontal cortex in encoding action-based value signals during reward-related decision making, Cereb. Cortex. issn, p.14602199, 2009.

J. Gläscher, States versus rewards: dissociable neural prediction error signals underlying model-based and model-free reinforcement learning, vol.4, pp.585-595, 2010.

M. F. Glasser, A multi-modal parcellation of human cerebral cortex, Nature. issn, p.14764687, 2016.

S. Gluth, M. S. Spektor, and J. Rieskamp, Value-based attentional capture affects multi-alternative decision making, vol.7, pp.1-36, 2018.

P. S. Goldman-Rakic, Circuitry of Primate Prefrontal Cortex and Regulation of Behavior by Representational Memory, Compr. Physiol, 2011.

P. S. Goldman-Rakic, Architecture of the Prefrontal Cortex and the Central Executive, Ann. N. Y. Acad. Sci. issn, p.17496632, 1995.

J. A. Gottfried, J. O'Doherty, and R. Dolan, Value in Human Amygdala and Orbitofrontal Cortex, p.10959203, 2003.

F. Grabenhorst and E. T. Rolls, Different representations of relative and absolute subjective value in the human brain, Neuroimage 48.1, p.10538119, 2009.

F. Grabenhorst and E. T. Rolls, Value, pleasure and choice in the ventral prefrontal cortex, Trends Cogn. Sci. issn, p.13646613, 2011.

F. Grabenhorst, E. T. Rolls, and B. A. Parris, From affective value to decision-making in the prefrontal cortex, Eur. J. Neurosci. issn: 0953816X, 2008.

F. Grabenhorst, A common neural scale for the subjective pleasantness of different primary rewards, Neuroimage 51.3, pp.1265-1274, 2010.

A. M. Graybiel, Building action repertoires: memory and learning functions of the basal ganglia, Curr. Opin. Neurobiol. issn, issue.95, pp.80100-80106, 1995.

S. M. Groman, Orbitofrontal Circuits Control Multiple Reinforcement-Learning Processes, Neuron, p.8966273, 2019.

K. Gurney, T. J. Prescott, and P. Redgrave, A computational model of action selection in the basal ganglia. II. Analysis and simulation of behaviour, Biol. Cybern. 84, vol.6, pp.411-423, 2001.

K. Gurney, T. Prescott, and P. Redgrave, Gurney Et Al, pp.1-10, 2001.

M. Guthrie, Interaction between cognitive and motor cortico-basal ganglia loops during decision making: a computational study, J. Neurophysiol. 109, vol.12, pp.22-3077, 2013.
URL : https://hal.archives-ouvertes.fr/hal-00828004

D. Ha and J. Schmidhuber, Recurrent World Models Facilitate Policy Evolution, 2018.

S. N. Haber, The primate basal ganglia: Parallel and integrative networks, J. Chem. Neuroanat, 2003.

S. N. Haber and B. Knutson, The reward circuit: Linking primate anatomy and human imaging, 2010.

A. N. Hampton, P. Bossaerts, and J. P. O'Doherty, The role of the ventromedial prefrontal cortex in abstract state-based inference during decision making in humans, J. Neurosci. issn, p.2706474, 2006.

T. Hatfield, Neurotoxic lesions of basolateral, but not central, amygdala interfere with pavlovian second-order conditioning and reinforcer devaluation effects, J. Neurosci. issn, p.2706474, 1996.

M. Hebscher, Memory, decision-making, and the ventromedial prefrontal cortex (vmpfc): The roles of subcallosal and posterior orbitofrontal cortices in monitoring and control processes, Cereb. Cortex, vol.26, p.14602199, 2016.

J. Hornak, Reward-related reversal learning after surgical excisions in orbito-frontal or dorsolateral prefrontal cortex in humans, J. Cogn. Neurosci, vol.16, issue.3, pp.463-478, 2004.

J. D. Howard and T. Kahnt, Identity-specific reward representations in orbitofrontal cortex are modulated by selective devaluation, J. Neurosci. issn, p.15292401, 2017.

O. J. Hulme, T. Morville, and B. Gutkin, Neurocomputational theories of homeostatic control, 2019.
URL : https://hal.archives-ouvertes.fr/hal-02355206

M. D. Humphries, M. Khamassi, and K. Gurney, Dopaminergic control of the exploration-exploitation trade-off via the basal ganglia, Front. Neurosci. issn, p.16624548, 2012.
URL : https://hal.archives-ouvertes.fr/hal-00688928

L. T. Hunt and B. Y. Hayden, A distributed, hierarchical and recurrent framework for reward-based choice, Nat. Rev. Neurosci, vol.18, issue.3, p.14710048, 2017.

L. T. Hunt, Mechanisms underlying cortical activity during value-guided choice, Nat. Neurosci, vol.15, issue.3, p.10976256, 2012.

S. D. Iversen and M. Mishkin, Perseverative interference in monkeys following selective lesions of the inferior prefrontal convexity, Exp. Brain Res. issn, p.144819, 1970.

A. Izquierdo and J. Jentsch, Reversal learning as a measure of impulsive and compulsive behavior in addictions, 2012.

A. Izquierdo, R. K. Suda, and E. A. Murray, Bilateral orbital prefrontal cortex lesions in rhesus monkeys disrupt choices guided by both reward value and reward contingency, J. Neurosci. 24, vol.34, pp.7540-7548, 2004.

D. Joel, Y. Niv, and E. Ruppin, Actor-critic models of the basal ganglia: new anatomical and computational perspectives, Neural Networks, vol.15, pp.535-547, 2002.

M. Johnson, The malmo platform for artificial intelligence experimentation, IJCAI Int. Jt. Conf. Artif. Intell, 2016.

B. Jones and M. Mishkin, Limbic lesions and the problem of stimulus-Reinforcement associations, Exp. Neurol. issn, pp.90030-90031, 1972.

J. L. Jones, Orbitofrontal Cortex Supports Behavior and Learning Using Inferred But Not Cached Values, pp.36-8075, 2012.

J. W. Kable and P. W. Glimcher, The neural correlates of subjective value during intertemporal choice, Nat. Neurosci. 10, vol.12, pp.1097-6256, 2007.

J. W. Kable and P. W. Glimcher, The Neurobiology of Decision: Consensus and Controversy, Neuron 63, vol.6, pp.733-745, 2009.

D. Kahneman and A. Tversky, Choices, values, and frames, Am. Psychol. issn, p.3066, 1984.

T. Kahnt, Connectivity-based parcellation of the human orbitofrontal cortex, J. Neurosci. issn, p.2706474, 2012.

P. S. Kaushik, A biologically inspired neuronal model of reward prediction error computation, Proc. Int. Jt. Conf. Neural Networks. isbn: 9781509061815, 2017.
URL : https://hal.archives-ouvertes.fr/hal-01528658

R. Kawagoe, Y. Takikawa, and O. Hikosaka, Expectation of reward modulates cognitive signals in the basal ganglia, Nat. Neurosci. issn, p.10976256, 1998.

A. Kazama and J. Bachevalier, Selective aspiration or neurotoxic lesions of orbital frontal areas 11 and 13 spared monkeys' performance on the object discrimination reversal task, J. Neurosci. issn, p.2706474, 2009.

S. W. Kennerley and J. D. Wallis, Evaluating Choices By Single Neurons in the Frontal Lobe, Neuroscience 29, vol.10, pp.2061-2073, 2009.

S. W. Kennerley, T. E. J. Behrens, and J. D. Wallis, Double dissociation of value computations in orbitofrontal and anterior cingulate neurons, Nat. Neurosci. 14, vol.12, p.10976256, 2011.

M. Keramati and B. Gutkin, A reinforcement learning theory for homeostatic regulation, Annu. Conf. Neural Inf. Process. Syst, vol.24, pp.1-9, 2011.

M. Keramati and B. Gutkin, Homeostatic reinforcement learning for integrating reward collection and physiological stability, vol.3, pp.1-26, 2014.

M. Khamassi and M. D. Humphries, Integrating cortico-limbic-basal ganglia architectures for learning model-based and model-free navigation strategies, 2012.
URL : https://hal.archives-ouvertes.fr/hal-01219958

S. Killcross and E. Coutureau, Coordination of actions and habits in the medial prefrontal cortex of rats, Cereb. Cortex, vol.13, pp.400-408, 2003.

J. Kim and M. E. Ragozzino, The involvement of the orbitofrontal cortex in learning under changing task contingencies, Neurobiol. Learn. Mem. issn, p.10747427, 2005.

M. Kim, Anatomical correlates of the functional organization in the human occipitotemporal cortex, Magn. Reson. Imaging. issn: 0730725X, 2006.

B. Knutson, Distributed neural representation of expected value, J. Neurosci. issn, p.2706474, 2005.

S. Kobayashi, O. Pinto de Carvalho, and W. Schultz, Adaptation of Reward Sensitivity in Orbitofrontal Neurons, J. Neurosci, vol.30, pp.270-6474, 2010.

E. Koechlin, C. Ody, and F. Kouneiher, The Architecture of Cognitive Control in the Human Prefrontal Cortex, 2003.

B. Kolb, Do all mammals have a prefrontal cortex?, In: Evol. Nerv. Syst. isbn, p.9780123708786, 2007.

N. Kolling, Neural Mechanisms of Foraging, pp.36-8075, 2012.

Y. Kondo, Changes in brain activation associated with use of a memory strategy: A functional MRI study, Neuroimage. issn, p.10538119, 2005.

G. Konidaris and A. Barto, An Adaptive Robot Motivational System, pp.346-356, 2006.

P. Krack, Deep brain stimulation: From neurology to psychiatry?, In: Trends Neurosci. 33, vol.10, pp.474-484, 2010.
URL : https://hal.archives-ouvertes.fr/inserm-00593713

I. Krajbich and A. Rangel, Multialternative drift-diffusion model predicts the relationship between visual fixations and choice in value-based decisions, Proc. Natl. Acad. Sci. issn, p.10916490, 2011.

J. E. Krettek and J. L. Price, The cortical projections of the mediodorsal nucleus and adjacent thalamic nuclei in the rat, J. Comp. Neurol, vol.171, pp.157-191, 1977.

M. L. Kringelbach, Activation of the human orbitofrontal cortex to a liquid food stimulus is correlated with its subjective pleasantness, Cereb. Cortex. issn, p.10473211, 2003.

M. L. Kringelbach, The human orbitofrontal cortex: linking reward to hedonic experience, Nat. Rev. Neurosci, vol.6, issue.9, pp.691-702, 2005.

M. L. Kringelbach and E. T. Rolls, The functional neuroanatomy of the human orbitofrontal cortex: Evidence from neuroimaging and neuropsychology, 2004.

A. Leblois, Competition between Feedback Loops Underlies Normal and Pathological Dynamics in the Basal Ganglia, J. Neurosci, vol.26, pp.3567-3583, 2006.
URL : https://hal.archives-ouvertes.fr/hal-00094738

M. Lebreton, An Automatic Valuation System in the Human Brain: Evidence from Functional Neuroimaging, 2009.
URL : https://hal.archives-ouvertes.fr/hal-00504100

S. W. Lee, S. Shimojo, and J. P. O'Doherty, Neural Computations Underlying Arbitration between Model-Based and Model-free Learning, Neuron 81.3, pp.687-699, 2014.

M. Lewis and L. Cañamero, Hedonic quality or reward? A study of basic pleasure in homeostasis and decision making of a motivated autonomous robot, Adapt. Behav. 24, vol.5, p.17412633, 2016.

Y. Li, Local Morphology Predicts Functional Organization of Experienced Value Signals in the Human Orbitofrontal Cortex, J. Neurosci. issn, p.15292401, 2015.
URL : https://hal.archives-ouvertes.fr/hal-02341041

Y. Li, The neural dynamics of reward value and risk coding in the human orbitofrontal cortex, p.14602156, 2016.
URL : https://hal.archives-ouvertes.fr/hal-02341038

S. Lim, J. P. O'Doherty, and A. Rangel, The Decision Value Computations in the vmPFC and Striatum Use a Relative Value Code That is Guided by Visual Attention, J. Neurosci. 31, vol.37, pp.270-6474, 2011.

C.-H. Luk and J. D. Wallis, Dynamic encoding of responses and outcomes by neurons in medial prefrontal cortex, J. Neurosci. issn, p.2706474, 2009.

A. R. Luria, Higher cortical functions in man, 2012.

S. Mackey and M. Petrides, Quantitative demonstration of comparable architectonic areas within the ventromedial and lateral orbital frontal cortex in the human and the macaque monkey brains, Eur. J. Neurosci. issn, p.953816, 2010.

L. Málková, D. Gaffan, and E. A. Murray, Excitotoxic lesions of the amygdala fail to produce impairment in visual learning for auditory secondary reinforcement but interfere with reinforcer devaluation effects in rhesus monkeys, J. Neurosci. issn, p.2706474, 1997.

M. Malvaez, Distinct cortical-amygdala projections drive reward value encoding and retrieval, Nat. Neurosci. issn, p.15461726, 2019.

F. Mannella, K. Gurney, and G. Baldassarre, The nucleus accumbens as a nexus between values and goals in goal-directed behavior: a review and a new hypothesis, Front. Behav. Neurosci. 7.October, pp.1-29, 2013.

C. D. Marsden, The mysterious motor function of the basal ganglia: The Robert Wartenberg Lecture, Neurology. issn: 1526632X, 1982.

C. D. Marsden and J. A. Obeso, The functions of the basal ganglia and the paradox of stereotaxic surgery in parkinson's disease, 1994.

T. Matiisen, Teacher-Student Curriculum Learning, 2017.

M. Matsumoto and O. Hikosaka, Lateral habenula as a source of negative reward signals in dopamine neurons, Nature 447, vol.7148, p.14764687, 2007.

H. Maturana and F. Varela, Autopoiesis and Cognition: The Realization of the Living (Boston Studies in the Philosophy of Science). isbn, p.9027710163, 1991.

M. A. McDannald, Ventral Striatum and Orbitofrontal Cortex Are Both Required for Model-Based, But Not Model-Free, Reinforcement Learning, J. Neurosci. 31, vol.7, pp.270-6474, 2011.

M. Meunier, J. Bachevalier, and M. Mishkin, Effects of orbital frontal and anterior cingulate lesions on object and spatial memory in rhesus monkeys, Neuropsychologia 35, vol.7, pp.27-31, 1997.

E. K. Miller, Neural correlates of categories and concepts, 2003.

K. J. Miller, Value Representations in Orbitofrontal Cortex Drive Learning, but not Choice, In: bioRxiv Prepr., pp.1-25, 2018.

M. Milosavljevic, The Drift Diffusion Model can account for the accuracy and reaction time of value-based choices under high and low time pressure, Judgm. Decis. Mak, pp.437-449, 2010.

J. W. Mink, The basal ganglia: Focused selection and inhibition of competing motor programs, Prog. Neurobiol, vol.50, pp.381-425, 1996.

M. L. Minsky, Theory of neural-analog reinforcement systems and its application to the brain model problem, 1954.

P. R. Montague and G. S. Berns, Neural economics and the biological substrates of valuation, 2002.

P. R. Montague, P. Dayan, and T. J. Sejnowski, A framework for mesencephalic dopamine systems based on predictive Hebbian learning, J. Neurosci. issn, p.2706474, 1996.

R. J. Morecraft, Amygdala interconnections with the cingulate motor cortex in the rhesus monkey, J. Comp. Neurol. issn, p.219967, 2007.

S. E. Morrison, Different Time Courses for Learning-Related Changes in Amygdala and Orbitofrontal Cortex, Neuron 71, vol.6, 2011.

E. A. Murray and A. Izquierdo, Orbitofrontal cortex and amygdala contributions to affect and action in primates, Ann. N. Y. Acad. Sci, vol.1121, p.17496632, 2007.

E. A. Murray and P. H. Rudebeck, Specializations for reward-guided decision-making in the primate ventral prefrontal cortex, Nat. Rev. Neurosci, vol.19, p.14710048, 2018.

B. T. Nallapu and N. P. Rougier, Dynamics of reward based decision making: A computational study, Lect. Notes Comput. Sci. (including Subser. Lect. Notes Artif. Intell. Lect. Notes Bioinformatics), p.16113349, 2016.
URL : https://hal.archives-ouvertes.fr/hal-01333210

E. S. Nisenbaum and C. J. Wilson, Potassium currents responsible for inward and outward rectification in rat neostriatal spiny projection neurons, J. Neurosci. issn, p.2706474, 1995.

Y. Niv, Tonic dopamine: Opportunity costs and the control of response vigor, Psychopharmacology (Berl). issn, p.333158, 2007.

R. Nogueira, Lateral orbitofrontal cortex anticipates choices and integrates prior with current information, Nat. Commun. 8. issn, p.20411723, 2017.

M. P. Noonan, Separate value comparison and learning mechanisms in macaque medial and lateral orbitofrontal cortex, Proc. Natl. Acad. Sci. 107.47, pp.27-8424, 2010.

M. P. Noonan, R. B. Mars, and M. F. Rushworth, Distinct Roles of Three Frontal Cortical Areas in Reward-Guided Behavior, J. Neurosci. 31, vol.40, pp.270-6474, 2011.

M. P. Noonan, Re-evaluating the role of the orbitofrontal cortex in reward and reinforcement, Eur. J. Neurosci, vol.35, pp.997-1010, 2012.

J. O'Doherty, Abstract reward and punishment representations in the human orbitofrontal cortex, Nat. Neurosci. issn, p.10976256, 2001.

J. O'Doherty, Beauty in a smile: The role of medial orbitofrontal cortex in facial attractiveness, Neuropsychologia 41, vol.2, pp.147-155, 2003.

J. P. O'Doherty, Contributions of the ventromedial prefrontal cortex to goal-directed action selection, Ann. N. Y. Acad. Sci. 1239, vol.1, p.17496632, 2011.

J. P. O'Doherty, Predictive neural coding of reward preference involves dissociable responses in human ventral midbrain and ventral striatum, Neuron. issn, p.8966273, 2006.

J. Olds and P. Milner, Positive reinforcement produced by electrical stimulation of septal area and other regions of rat brain, J. Comp. Physiol. Psychol. issn, p.219940, 1954.

M. O'Neill and W. Schultz, Coding of reward risk by orbitofrontal neurons is mostly distinct from coding of reward value, Neuron. issn, p.8966273, 2010.

D. Öngür, A. T. Ferry, and J. L. Price, Architectonic subdivision of the human orbital and medial prefrontal cortex, J. Comp. Neurol, vol.460, issue.3, pp.425-449, 2003.

R. C. O'Reilly, S. A. Herd, and W. M. Pauli, Computational models of cognitive control, Curr. Opin. Neurobiol, vol.20, pp.257-261, 2010.

C. Padoa-Schioppa, Range-Adapting Representation of Economic Value in the Orbitofrontal Cortex, J. Neurosci, vol.29, pp.14004-14014, 2009.

C. Padoa-Schioppa and J. A. Assad, Neurons in the orbitofrontal cortex encode economic value, Nature 441, vol.7090, p.14764687, 2006.

C. Padoa-Schioppa and J. A. Assad, The representation of economic value in the orbitofrontal cortex is invariant for changes of menu, Nat. Neurosci, vol.11, issue.1, p.10976256, 2008.

C. Padoa-Schioppa and K. E. Conen, Orbitofrontal Cortex: A Neural Circuit for Economic Decisions, Neuron 96, vol.4, p.10974199, 2017.

C. Padoa-Schioppa, L. Jandolo, and E. Visalberghi, Multi-stage mental process for economic choice in capuchins, Cognition 99.1, B1-B13. issn: 00100277, 2006.

J. A. Parkinson, The role of the primate amygdala in conditioned reinforcement, J. Neurosci. issn, p.2706474, 2001.

B. Pasquereau, Shaping of Motor Responses by Incentive Values through the Basal Ganglia, J. Neurosci, vol.27, pp.270-6474, 2007.
URL : https://hal.archives-ouvertes.fr/hal-02658533

R. E. Passingham, The frontal lobes and voluntary action, p.299, 1993.

J. J. Paton, The primate amygdala represents the positive and negative value of visual stimuli during learning, p.14764687, 2006.

V. Pawlak and J. N. Kerr, Dopamine receptor activation is required for corticostriatal spike-timing-dependent plasticity, J. Neurosci, vol.28, pp.2435-2446, 2008.

A. Pears, Lesions of the Orbitofrontal but not Medial Prefrontal Cortex Disrupt Conditioned Reinforcement in Primates, J. Neurosci. issn, p.2706474, 2003.

M. Petrides, Specialized systems for the processing of mnemonic information within the primate frontal cortex, Philos. Trans. R. Soc. B Biol. Sci. issn, p.9628436, 1996.

M. Petrides and D. N. Pandya, Comparative cytoarchitectonic analysis of the human and the macaque ventrolateral prefrontal cortex and corticocortical connection patterns in the monkey, Eur. J. Neurosci, vol.16, pp.291-310, 2002.

G. Pezzulo, The mechanics of embodiment: A dialog on embodiment and computational modeling, Front. Psychol. 2.JAN, p.16641078, 2011.

C. L. Pickens, Different Roles for Orbitofrontal Cortex and Basolateral Amygdala in a Reinforcer Devaluation Task, J. Neurosci. issn, p.2706474, 2003.

M. L. Platt and P. W. Glimcher, Neural correlates of decision variables in parietal cortex, Nature. issn, p.280836, 1999.

L. J. Porrino, A. M. Crane, and P. S. Goldman-Rakic, Direct and indirect pathways from the amygdala to the frontal lobe in rhesus monkeys, J. Comp. Neurol. 198, vol.1, p.10969861, 1981.

T. M. Preuss, Do rats have prefrontal cortex? The Rose-Woolsey-Akert program reconsidered, Journal of cognitive neuroscience, issue.1, pp.1-24, 1995.

J. L. Price, Definition of the orbital cortex in relation to specific connections with limbic and visceral structures and other cortical regions, Ann. N. Y. Acad. Sci. isbn, p.9781573316835, 2007.

R. Quilodran, M. Rothé, and E. Procyk, Behavioral Shifts and Action Valuation in the Anterior Cingulate Cortex, 2008.
URL : https://hal.archives-ouvertes.fr/inserm-00906686

A. Rangel, C. Camerer, and P. Montague, A framework for studying the neurobiology of value-based decision making, 2008.

E. C. Ravlin and B. M. Meglino, Effect of values on perception and decision making: A study of alternative work values measures, J. Appl. Psychol, pp.21-9010, 1987.

P. Redgrave, T. J. Prescott, and K. Gurney, The basal ganglia: A vertebrate solution to the selection problem?, pp.1009-1023, 1999.

R. A. Rescorla and A. R. Wagner, A Theory of Pavlovian Conditioning: Variations in the Effectiveness of Reinforcement and Nonreinforcement, In: Classical Conditioning II: Current Research and Theory, ed. by A. Black and W. Prokasy, pp.64-99, 1972.

J. S. Riceberg and M. L. Shapiro, Reward stability determines the contribution of orbitofrontal cortex to adaptive behavior, J. Neurosci, vol.32, pp.16402-16409, 2012.

E. L. Rich, F. M. Stoll, and P. H. Rudebeck, Linking dynamic patterns of neural activity in orbitofrontal cortex with decision making, Curr. Opin. Neurobiol, vol.49, p.18736882, 2018.

T. W. Robbins and V. J. Brown, The Role of the Striatum in the Mental Chronometry of Action: A Theoretical Review, Rev. Neurosci. issn, p.21910200, 1990.

A. C. Roberts, Primate orbitofrontal cortex and adaptive behaviour, Trends Cogn. Sci, vol.10, pp.83-90, 2006.

E. Rolls, The Anterior and Midcingulate Cortices and Reward, Cingulate Neurobiol. Dis, 2009.

E. T. Rolls, Emotion-related learning in patients with social and emotional changes associated with frontal lobe damage, J. Neurol. Neurosurg. Psychiatry, vol.57, pp.1518-1524, 1994.

E. T. Rolls and F. Grabenhorst, The orbitofrontal cortex and beyond: From affect to decision-making, 2008.

E. T. Rolls, C. McCabe, and J. Redoute, Expected value, reward outcome, and temporal difference error representations in a probabilistic decision task, Cereb. Cortex. issn, p.10473211, 2008.

E. T. Rolls, F. Grabenhorst, and G. Deco, Choice, difficulty, and confidence in the brain, Neuroimage. issn, p.10538119, 2010.

J. E. Rose and C. N. Woolsey, The orbitofrontal cortex and its connections with the mediodorsal nucleus in rabbit, sheep and cat, In: Res. Publ. Assoc. Res. Nerv. Ment. Dis. issn, p.917443, 1948.

P. H. Rudebeck and E. A. Murray, Dissociable Effects of Subtotal Lesions within the Macaque Orbital Prefrontal Cortex on Reward-Guided Behavior, J. Neurosci. 31, vol.29, pp.270-6474, 2011.

P. H. Rudebeck, Frontal Cortex Subregions Play Distinct Roles in Choices between Actions and Stimuli, J. Neurosci, vol.28, pp.270-6474, 2008.

P. H. Rudebeck and E. A. Murray, The orbitofrontal oracle: Cortical mechanisms for the prediction and evaluation of specific behavioral outcomes, 2014.

P. H. Rudebeck, Separate neural pathways process different decision costs, Nat. Neurosci, vol.9, p.10976256, 2006.

P. H. Rudebeck, Prefrontal mechanisms of behavioral flexibility, emotion regulation and value updating, Nat. Neurosci. 16, vol.8, p.10976256, 2013.

P. H. Rudebeck, Specialized Representations of Value in the Orbital and Ventrolateral Prefrontal Cortex: Desirability versus Availability of Outcomes, Neuron 95, vol.5, 2017.

M. F. S. Rushworth, Functional organization of the medial frontal cortex, 2007.

M. F. S. Rushworth, Valuation and decision-making in frontal cortex: one or many serial or parallel systems?, In: Curr. Opin. Neurobiol. 22, vol.6, pp.946-955, 2012.

P. A. Samuelson, A Note on the Pure Theory of Consumer's Behaviour, Economica. issn, p.130427, 1938.

M. I. Sandstrom and G. V. Rebec, Characterization of striatal activity in conscious rats: Contribution of NMDA and AMPA/kainate receptors to both spontaneous and glutamate-driven firing, Synapse. issn, p.8874476, 2003.

G. Schoenbaum, A. A. Chiba, and M. Gallagher, Orbitofrontal cortex and basolateral amygdala encode expected outcomes during learning, Nat. Neurosci. issn, p.10976256, 1998.

, Neural encoding in orbitofrontal cortex and basolateral amygdala during olfactory discrimination learning, J. Neurosci, vol.19, p.2706474, 1999.

G. Schoenbaum, Orbitofrontal lesions in rats impair reversal but not acquisition of go, no-go odor discriminations, Neuroreport 13, vol.6, p.9594965, 2002.

G. Schoenbaum, Encoding predicted outcome and acquired value in orbitofrontal cortex during cue sampling depends upon input from basolateral amygdala, Neuron 39.5, pp.474-478, 2003.

G. Schoenbaum, M. R. Roesch, and T. A. Stalnaker, Orbitofrontal cortex, decision-making and drug addiction, 2006.

F. Schrodt, Mario Becomes Cognitive, Top. Cogn. Sci, vol.9, p.17568765, 2017.

N. W. Schuck, Human Orbitofrontal Cortex Represents a Cognitive Map of State Space, Neuron. issn, p.10974199, 2016.

N. W. Schuck, R. Wilson, and Y. Niv, A State Representation for Reinforcement Learning and Decision-Making in the Orbitofrontal Cortex, Goal-Directed Decis. Mak, pp.259-278, 2018.

W. Schultz, P. Dayan, and P. R. Montague, A neural substrate of prediction and reward, 1997.

W. Schultz, Neuronal reward and decision signals: From theories to data, Physiol. Rev, vol.95, issue.3, pp.853-951, 2014.

J. K. Seamans, C. C. Lapish, and D. Durstewitz, Comparing the prefrontal cortex of rats and primates: Insights from electrophysiology, Neurotox. Res. issn, p.10298428, 2008.

H. Seo and D. Lee, Temporal filtering of reward signals in the dorsal anterior cingulate cortex during a mixed-strategy game, J. Neurosci. issn, p.2706474, 2007.

P. Sequeira, F. S. Melo, and A. Paiva, Emotion-based intrinsic motivation for reinforcement learning agents, Lect. Notes Comput. Sci. (including Subser. Lect. Notes Artif. Intell. Lect. Notes Bioinformatics). isbn, p.9783642245992, 2011.

J. T. Serences, Value-Based Modulations in Human Visual Cortex, 2008.

G. Sescousse, Processing of primary and secondary rewards: A quantitative meta-analysis and review of human functional neuroimaging studies, Neurosci. Biobehav. Rev, vol.37, pp.681-696, 2013.
URL : https://hal.archives-ouvertes.fr/hal-02341139

T. Shallice, Specific impairments of planning, In: Philos. Trans. R. Soc. Lond. B. Biol. Sci. issn, p.9628436, 1982.

T. Shallice, From neuropsychology to mental structure, 1988.

M. Shidara and B. J. Richmond, Anterior cingulate: Single neuronal signals related to degree of reward expectancy, 2002.

D. Silver, Mastering the game of Go with deep neural networks and tree search, Nature 529.7587, pp.484-489, 2016.

R. Singla, R. R. Ganta, and K. Vemuri, An Exergame Themed on the Power of Religious Belief for Stroke/Motor Rehabilitation, pp.1-6, 2017.

B. F. Skinner, The behavior of organisms: an experimental analysis. Appleton-Century, 1938.

B. F. Skinner, Science and human behavior. 92904. Simon and Schuster, 1965.

M. A. Sommer and R. H. Wurtz, What the Brain Stem Tells the Frontal Cortex. I. Oculomotor Signals Sent from Superior Colliculus to Frontal Eye Field Via Mediodorsal Thalamus, J. Neurophysiol. issn, p.223077, 2003.

T. A. Stalnaker, Basolateral Amygdala Lesions Abolish Orbitofrontal-Dependent Reversal Impairments, Neuron. issn, p.8966273, 2007.

T. A. Stalnaker, Neural correlates of stimulus-response and response-outcome associations in dorsolateral versus dorsomedial striatum, Front. Integr. Neurosci. issn, p.16625145, 2010.

T. A. Stalnaker, N. K. Cooch, and G. Schoenbaum, What the orbitofrontal cortex does not do, Nat. Neurosci. 18, vol.5, pp.1097-6256, 2015.

T. A. Stalnaker, Cholinergic interneurons use orbitofrontal input to track beliefs about current state, J. Neurosci. issn, p.15292401, 2016.

A. P. Steiner and A. D. Redish, Behavioral and neurophysiological correlates of regret in rat decision-making on a neuroeconomic task, Nat. Neurosci. 17, vol.7, pp.1097-6256, 2014.

C. E. Strait, T. C. Blanchard, and B. Y. Hayden, Reward value comparison via mutual inhibition in ventromedial prefrontal cortex, Neuron. issn, p.10974199, 2014.

C. Strannegård, Generic animats, Lect. Notes Comput. Sci. (including Subser. Lect. Notes Artif. Intell. Lect. Notes Bioinformatics). isbn, p.9783319637020, 2017.

C. Strannegård, Learning and decision-making in artificial animals, J. Artif. Gen. Intell, vol.9, issue.1, pp.55-82, 2018.

A. Strock, N. Rougier, and X. Hinaut, Using Conceptors to Transfer Between Long-Term and Short-Term Memory, Int. Conf. Artif. Neural Networks, pp.19-23, 2019.
URL : https://hal.archives-ouvertes.fr/hal-02387559

R. Sugden, Economic Choice Theory: An Experimental Analysis of Animal Behaviour, In: Econ. J. issn, p.130133, 1996.

L. P. Sugrue, G. S. Corrado, and W. T. Newsome, Matching behavior and the representation of value in the parietal cortex, pp.1782-1787, 2004.

R. S. Sutton and A. G. Barto, Reinforcement Learning: An Introduction, p.262193981, 1998.

Y. K. Takahashi, Expectancy-related changes in firing of dopamine neurons depend on orbitofrontal cortex, Nat. Neurosci. issn, p.10976256, 2011.

S. C. Tanaka, B. W. Balleine, and J. P. O'Doherty, Calculating consequences: Brain systems that encode the causal effects of actions, J. Neurosci. issn, p.2706474, 2008.

S. C. Tanaka, Prediction of immediate and future rewards differentially recruits cortico-basal ganglia loops, Behav. Econ. Prefer. Choices, Happiness, p.9784431554028, 2016.

E. L. Thorndike, Animal intelligence: An experimental study of the associative processes in animals, Psychol. Rev. Monogr, pp.96-9753, 1898.

S. M. Tom, The neural basis of loss aversion in decision-making under risk, 2007.

M. Topalidou, [Re] Interaction between cognitive and motor cortico-basal ganglia loops during decision making: a computational study, ReScience, HAL Id: hal-01201790, pp.0-6, 2015.

M. Topalidou, A computational model of dual competition between the basal ganglia and the cortex, eNeuro 5.6, ENEURO.0339-17, p.23732822, 2018.
URL : https://hal.archives-ouvertes.fr/hal-01925643

A. Tsuchida, B. B. Doll, and L. K. Fellows, Beyond reversal: A critical role for human orbitofrontal cortex in flexible learning from probabilistic feedback, J. Neurosci, vol.30, pp.16868-16875, 2010.

K.-I. Tsutsui, A dynamic code for economic object valuation in prefrontal cortex neurons, Nat. Commun. 7. issn, p.20411723, 2016.

H. B. M. Uylings, H. J. Groenewegen, and B. Kolb, Do rats have a prefrontal cortex?, In: Behavioural brain research, vol.146, issue.2, pp.3-17, 2003.

V. V. Valentin, A. Dickinson, and J. P. O'Doherty, Determining the Neural Substrates of Goal-Directed Learning in the Human Brain, J. Neurosci. 27, vol.15, pp.270-6474, 2007.

G. W. Van Hoesen, R. J. Morecraft, and B. A. Vogt, Connections of the monkey cingulate cortex, Neurobiol. cingulate cortex limbic thalamus, pp.249-284, 1993.

G. Vanni-Mercier, The hippocampus codes the uncertainty of cue-outcome associations: An intracranial electrophysiological study in humans, J. Neurosci, vol.29, pp.5287-5294, 2009.

F. Varela, Autopoiesis and a biology of intentionality, Autopoiesis Percept. A Work. with ESPRIT ?, 1992.

F. J. Varela, Organism: A Meshwork of Selfless Selves, 1991.

B. Verplanken and R. W. Holland, Motivated decision making: Effects of activation and self-centrality of values on choices and behavior, 2002.

P. F. Verschure, Distributed Adaptive Control: A theory of the Mind, Brain, Body Nexus, Biol. Inspired Cogn. Archit, vol.1, pp.55-72, 2012.

J. Vitay and F. H. Hamker, Timing and expectation of reward: A neurocomputational model of the afferents to the ventral tegmental area, Front. Neurorobot. issn: 16625218, 2014.

I. Vlachos, Context-dependent encoding of fear and extinction memories in a large-scale network model of the basal amygdala, PLoS Comput. Biol. issn: 1553734X, 2011.
URL : https://hal.archives-ouvertes.fr/inserm-00704803

J. von Neumann and O. Morgenstern, Theory of games and economic behavior, vol.625, p.625, 1944.

P. Voorn, Putting a spin on the dorsal-ventral divide of the striatum, Trends Neurosci. 27.8, pp.468-474, 2004.

A. E. Walker, A cytoarchitectural study of the prefrontal area of the macaque monkey, J. Comp. Neurol. issn, p.10969861, 1940.

J. D. Wallis, Cross-species studies of orbitofrontal cortex and value-based decision-making, Nat. Neurosci. 15, vol.1, p.10976256, 2012.

J. D. Wallis, K. C. Anderson, and E. K. Miller, Single neurons in prefrontal cortex encode abstract rules, Nature. issn, p.280836, 2001.

M. E. Walton, Functional specialization within medial frontal cortex of the anterior cingulate for evaluating effort-related decisions, J. Neurosci. issn, p.2706474, 2003.

M. E. Walton, Separable Learning Systems in the Macaque Brain and the Role of Orbitofrontal Cortex in Contingent Learning, Neuron 65, vol.6, pp.927-939, 2010.

M. E. Walton, Giving credit where credit is due: Orbitofrontal cortex and valuation in an uncertain world, Ann. N. Y. Acad. Sci. 1239, vol.1, p.17496632, 2011.

J. X. Wang, Prefrontal cortex as a meta-reinforcement learning system, Nat. Neurosci. 21, vol.6, pp.1097-6256, 2018.

T. Wang, J. Deng, and B. He, Classifying EEG-based motor imagery tasks by means of time-frequency synthesized spatial patterns, Clin. Neurophysiol. issn, p.13882457, 2004.

X.-J. Wang, Decision Making in Recurrent Neuronal Circuits, Neuron 60, vol.2, pp.215-234, 2008.

J. R. Wickens, Dopaminergic mechanisms in actions and habits, J. Neurosci. 27, vol.31, pp.8181-8183, 2007.

S. M. Williams and P. S. Goldman-Rakic, Characterization of the dopaminergic innervation of the primate frontal cortex using a dopamine-specific antibody, Cereb. Cortex. issn, p.14602199, 1993.

C. J. Wilson and P. M. Groves, Spontaneous firing patterns of identified spiny neurons in the rat neostriatum, Brain Res. issn, issue.81, pp.90211-90211, 1981.

R. C. Wilson, Orbitofrontal cortex as a cognitive map of task space, Neuron 81, vol.2, p.10974199, 2014.

Y. Xia, Nucleus accumbens medium spiny neurons target nondopaminergic neurons in the ventral tegmental area, J. Neurosci. issn, p.2706474, 2011.

Y. Yang and A. Raine, Prefrontal structural and functional brain imaging findings in antisocial, violent, and psychopathic individuals: A meta-analysis, 2009.

H. H. Yin, B. J. Knowlton, and B. W. Balleine, Lesions of dorsolateral striatum preserve outcome expectancy but disrupt habit formation in instrumental learning, Eur. J. Neurosci. issn, p.953816, 2004.

A. J. Yu and P. Dayan, Uncertainty, neuromodulation, and attention, 2005.

D. H. Zald, Meta-Analytic Connectivity Modeling Reveals Differential Functional Connectivity of the Medial and Lateral Orbitofrontal Cortex, Cereb. Cortex, vol.24, pp.1460-2199, 2014.

Z. Zhang, A neural network model for the orbitofrontal cortex and task space acquisition during reinforcement learning, PLoS Comput. Biol. 14, vol.1, pp.1-24, 2018.

A. Definitions

Definition 1. Credit assignment: The ability to learn that a particular outcome (in experiments, this is typically food or fluid) was produced by a particular choice.

Definition 2. Value-based decision-making: The ability to make informed choices that optimize subjective value.

Definition 3. Cognitive map: A neural representation of stimuli, actions and other sensory features that occur in association with outcomes in a multidimensional array. The cognitive map has been theorized to guide value-based decision-making.

Definition 4. Aspiration lesion: A technique for removing grey matter (that is, neurons) that is based on subpial aspiration of tissue. Lesions are typically carried out with the aid of an operating microscope.

Definition 5. Excitotoxic lesions: Lesions created using a technique for selectively removing grey matter (that is, neurons) and sparing white matter (that is, axons) that is based on the injection of neurotoxins. Injections are often carried out via a stereotaxic approach based on coordinates obtained from magnetic resonance images of the brain.