ABSTRACT
Mesolimbic dopamine activity occasionally exhibits ramping dynamics, reigniting debate on theories of dopamine signaling. This debate is ongoing partly because the experimental conditions under which dopamine ramps emerge remain poorly understood. Here, we show that during Pavlovian and instrumental conditioning, mesolimbic dopamine ramps are only observed when the inter-trial interval is short relative to the trial period. These results constrain theories of dopamine signaling and identify a critical variable determining the emergence of dopamine ramps.
ABSTRACT
How do we learn associations in the world (e.g., between cues and rewards)? Cue-reward associative learning is controlled in the brain by mesolimbic dopamine [1-4]. It is widely believed that dopamine drives such learning by conveying a reward prediction error (RPE) in accordance with temporal difference reinforcement learning (TDRL) algorithms [5]. TDRL implementations are "trial-based": learning progresses sequentially across individual cue-outcome experiences. Accordingly, a foundational assumption, often considered a mere truism, is that the more cue-reward pairings one experiences, the more one learns this association. Here, we disprove this assumption, thereby falsifying a foundational principle of trial-based learning algorithms. Specifically, when one group of head-fixed mice received ten times fewer experiences than another group over the same total time, a single experience produced as much learning as ten experiences in the other group. This quantitative scaling also holds for mesolimbic dopaminergic learning, with the increase in learning rate being so high that the group with fewer experiences exhibits dopaminergic learning in as few as four cue-reward experiences and behavioral learning in nine. An algorithm implementing reward-triggered retrospective learning explains these findings. The temporal scaling and few-shot learning observed here fundamentally change our understanding of the neural algorithms of associative learning.
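The "trial-based" assumption described in this abstract can be illustrated with a minimal sketch. It assumes the simplest possible setting, a single cue state followed immediately by a terminal reward, in which the temporal difference update reduces to a Rescorla-Wagner-style rule. The function name `td_cue_value` and the parameter values (`alpha`, `gamma`, `reward`) are hypothetical choices for illustration and are not taken from the study.

```python
def td_cue_value(n_pairings, alpha=0.1, gamma=1.0, reward=1.0):
    """Learned cue value and per-trial RPEs after n cue-reward pairings.

    Assumption: one cue state followed by a terminal reward, so the
    successor value is always 0 and the TD error is r + gamma*0 - V(cue).
    """
    value = 0.0
    rpes = []
    terminal_value = 0.0  # no further reward is expected after the outcome
    for _ in range(n_pairings):
        rpe = reward + gamma * terminal_value - value  # TD error at the outcome
        value += alpha * rpe                           # the update depends only on trial count
        rpes.append(rpe)
    return value, rpes


if __name__ == "__main__":
    v1, _ = td_cue_value(1)
    v10, _ = td_cue_value(10)
    # Under this rule, ten pairings always yield more learning than one,
    # regardless of how much time separates the pairings.
    print(v1, v10)
```

Because the update is applied once per pairing and has no term for the time between pairings, the learned value in this sketch depends only on the number of experiences. That is the assumption the abstract reports to be contradicted by the data.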
ABSTRACT
Impulsive choice, often characterized by excessive preference for small, short-term rewards over larger, long-term rewards, is a prominent feature of substance use and other neuropsychiatric disorders. The neural mechanisms underlying impulsive choice are not well understood, but growing evidence implicates nucleus accumbens (NAc) dopamine and its actions on dopamine D2 receptors (D2Rs). Because several NAc cell types and afferents express D2Rs, it has been difficult to determine the specific neural mechanisms linking NAc D2Rs to impulsive choice. Of these cell types, cholinergic interneurons (CINs) of the NAc, which express D2Rs, have emerged as key regulators of striatal output and local dopamine release. Despite these relevant functions, whether D2Rs expressed specifically in these neurons contribute to impulsive choice behavior is unknown. Here, we show that D2R upregulation in CINs of the mouse NAc increases impulsive choice as measured in a delay discounting task without affecting reward magnitude sensitivity or interval timing. Conversely, mice lacking D2Rs in CINs showed decreased delay discounting. Furthermore, CIN D2R manipulations did not affect probabilistic discounting, which measures a different form of impulsive choice. Together, these findings suggest that CIN D2Rs regulate impulsive decision-making involving delay costs, providing new insight into the mechanisms by which NAc dopamine influences impulsive behavior.
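Delay discounting, as measured in the task mentioned above, is often summarized with a hyperbolic model in which a reward of amount A delivered after delay D has subjective value A / (1 + kD). The sketch below assumes that standard model. The abstract does not state which model the authors fitted, and the function names, amounts, delays, and `k` values are hypothetical.

```python
def hyperbolic_value(amount, delay, k):
    """Subjective value of a delayed reward under hyperbolic discounting."""
    return amount / (1.0 + k * delay)


def prefers_small_soon(small, large, delay_large, k, delay_small=0.0):
    """True if the smaller, sooner reward has the higher discounted value."""
    v_small = hyperbolic_value(small, delay_small, k)
    v_large = hyperbolic_value(large, delay_large, k)
    return v_small > v_large


if __name__ == "__main__":
    # Hypothetical choice: 1 pellet now versus 3 pellets after 10 s.
    for k in (0.1, 0.5):
        print(k, prefers_small_soon(small=1.0, large=3.0, delay_large=10.0, k=k))
```

In this model a larger discount rate `k` corresponds to steeper devaluation of delayed rewards, that is, more impulsive choice. With the hypothetical values above, the choice flips from the large delayed reward at `k = 0.1` to the small immediate reward at `k = 0.5`.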
Subjects
Nucleus Accumbens , Receptors, Dopamine D2 , Mice , Animals , Nucleus Accumbens/metabolism , Receptors, Dopamine D2/metabolism , Dopamine/metabolism , Impulsive Behavior/physiology , Reward , Cholinergic Agents , Interneurons/metabolism , Receptors, Dopamine D1/metabolism
ABSTRACT
Learning to predict rewards based on environmental cues is essential for survival. It is believed that animals learn to predict rewards by updating predictions whenever the outcome deviates from expectations, and that such reward prediction errors (RPEs) are signaled by the mesolimbic dopamine system, a key controller of learning. However, instead of learning prospective predictions from RPEs, animals can infer predictions by learning the retrospective cause of rewards. Hence, whether mesolimbic dopamine instead conveys a causal associative signal that sometimes resembles RPE remains unknown. We developed an algorithm for retrospective causal learning and found that mesolimbic dopamine release conveys causal associations but not RPE, thereby challenging the dominant theory of reward learning. Our results reshape the conceptual and biological framework for associative learning.
Subjects
Association Learning , Dopamine , Limbic System , Reward , Animals , Dopamine/metabolism , Limbic System/metabolism , Cues , Mice
ABSTRACT
The basolateral amygdala (BLA) is critical for reward behaviors via a projection to the nucleus accumbens (NAc). Specifically, BLA-NAc projections are involved in reinforcement learning, reward-seeking, sustained instrumental responding, and risk behaviors. However, it remains unclear whether chronic stress interacts with BLA-NAc projection neurons to result in maladaptive behaviors. Here we take a chemogenetic, projection-specific approach to clarify how NAc-projecting BLA neurons affect avoidance, reward, and feeding behaviors in male mice. Then, we examine whether chemogenetic activation of NAc-projecting BLA neurons attenuates the maladaptive effects of chronic corticosterone (CORT) administration on these behaviors. CORT mimics the behavioral and neural effects of chronic stress exposure. We found a nuanced role of BLA-NAc neurons in mediating reward behaviors. Surprisingly, activation of BLA-NAc projections rescues CORT-induced deficits in novelty-suppressed feeding, a behavior typically associated with avoidance. Activation of BLA-NAc neurons also increases instrumental reward-seeking without affecting free-feeding in chronic CORT mice. Taken together, these data suggest that NAc-projecting BLA neurons are involved in chronic CORT-induced maladaptive reward and motivation behaviors.
ABSTRACT
Behavioral approaches utilizing rodents to study mood disorders have focused primarily on negative valence behaviors associated with potential threat (anxiety-related behaviors). However, for disorders such as depression, positive valence behaviors that assess reward processing may be more translationally valid and predictive of antidepressant treatment outcome. Chronic corticosterone (CORT) administration is a well-validated pharmacological stressor that increases avoidance in negative valence behaviors associated with anxiety [1-4]. However, whether chronic stress paradigms such as CORT administration also lead to deficits in positive valence behaviors remains unclear. We treated male C57BL/6J mice with chronic CORT and assessed both negative and positive valence behaviors. We found that CORT induced avoidance in the open field and novelty-suppressed feeding (NSF) tests. Interestingly, CORT also impaired instrumental acquisition, reduced sensitivity to a devalued outcome, reduced breakpoint in progressive ratio, and impaired performance in probabilistic reversal learning. Taken together, these results demonstrate that chronic CORT administration at the same dosage both induces avoidance in negative valence behaviors associated with anxiety and impairs positive valence behaviors associated with reward processing. These data suggest that CORT administration is a useful experimental system for preclinical approaches to studying stress-induced mood disorders.