Augmented reinforcer value and accelerated habit formation after repeated amphetamine treatment

https://doi.org/10.1016/j.euroneuro.2006.12.005Get rights and content

Abstract

Various processes might explain the progression from casual to compulsive drug use underlying the development of drug addiction. Two of these, accelerated stimulus–response (S–R) habit learning and augmented assignment of motivational value to reinforcers, could be mediated via neuroadaptations associated with long-lasting sensitization to psychostimulant drugs, i.e. augmented dopaminergic neurotransmission in the striatum. Here, we tested the hypothesis that both processes, which are often regarded as mutually exclusive alternatives, are present in amphetamine-sensitized rats. Amphetamine-sensitized rats showed increased responding for food under a random ratio schedule of reinforcement, indicating increased incentive motivational value of food. In addition, satiety-specific devaluation experiments under a random interval schedule of reinforcement showed that amphetamine-sensitized animals exhibit accelerated development of S–R habits. These data show that both habit formation and motivational value of reinforcers are augmented in amphetamine-sensitized rats, and suggest that the task demands determine which behavioral alteration is most prominently expressed.

Introduction

One of the defining characteristics of drug addiction is the occurrence of a compulsive pattern of drug use, which persists in the face of adverse social and health consequences (American Psychiatric Association, 2000, O'Brien and McLellan, 1996, Volkow and Li, 2004). Several influential hypotheses attempt to explain the progression from casual to compulsive drug use, which occurs during the development of addiction. According to the incentive sensitization theory, drug-induced adaptations in the neural circuits underlying incentive motivation cause drugs to gain an enhanced incentive value (i.e. exaggerated drug “wanting”, perhaps equivalent to drug craving), leading to excessive pursuit of drugs (Robinson and Berridge, 1993). A different theory argues that aberrant instrumental learning underlies the development of addictive behavior. Thus, instrumental learning for both drugs and natural rewards can progress through action–outcome (A–O) responding, during which performance of an action is determined by explicit knowledge of its consequences, to an automatic, stimulus–response (S–R) structure (commonly referred to as habits). Responses driven by an S–R structure are stimulus-driven and occur without the necessity for volitional control (Dickinson, 1994). In the case of drug abuse, accelerated and/or inappropriate S–R learning has been hypothesized to cause drug seeking behavior to become inflexible (Berke and Hyman, 2000, Everitt and Robbins, 2005, Tiffany, 1990).

Repeated exposure to drugs of abuse causes behavioral sensitization, a progressive and persistent increase in both the psychomotor response to drugs (Pierce and Kalivas, 1997, Robinson and Becker, 1986, Robinson and Berridge, 1993, Stewart and Badiani, 1993, Vanderschuren and Kalivas, 2000) and the reinforcing properties of drugs and natural reinforcers (Fiorino and Phillips, 1999, Lett, 1989, Piazza et al., 1989, Robinson and Berridge, 2003, Taylor and Horger, 1999, Valadez and Schenk, 1994, Vezina et al., 2002, Wyvell and Berridge, 2001). Behavioral sensitization is associated with an exaggerated dopaminergic response to drugs in both dorsal and ventral striatum (Kolta et al., 1985, Paulson and Robinson, 1995, Pierce and Kalivas, 1997, Robinson and Berridge, 1993, Robinson et al., 1988, Vanderschuren and Kalivas, 2000). Because of the involvement of ventral striatal dopamine in incentive salience attribution and the motivational influence of environmental stimuli on behavior (Berridge and Robinson, 1998, Cardinal et al., 2002, Schultz, 2002), this augmented release of dopamine has been hypothesized to critically contribute to exaggerated drug “wanting” (Robinson and Berridge, 1993, Robinson and Berridge, 2003). Interestingly, the dorsal striatum (Packard and Knowlton, 2002, White and McDonald, 2002, Yin and Knowlton, 2006), including its dopaminergic innervation (Faure et al., 2005, Packard and White, 1991), has been shown to be involved in S–R learning processes. Thus, though often regarded as mutually exclusive explanations for the development of drug addiction, both the increase in motivational value of drugs predicted by the incentive sensitization theory and the abnormal S–R learning proposed in the aberrant learning theory could be mediated by neural changes associated with drug sensitization.

We therefore hypothesized that amphetamine sensitization is associated with augmented incentive motivational as well as S–R learning processes. To test this hypothesis, we employed distinct schedules of reinforcement in operant conditioning paradigms and devaluation setups to elucidate responding based on S–R and A–O responding. Studies by Dickinson and colleagues have demonstrated that interval and ratio schedules of reinforcement in an operant conditioning paradigm rely on different associative structures (Dickinson, 1985, Dickinson et al., 1983). Ratio schedules, in which higher response rates are rewarded with more reinforcement in a linear fashion, promote responding based mainly on motivational value of the reinforcer. Animals trained on this schedule continue to demonstrate explicit knowledge of reinforcer value even after long periods of training, indicating that responding is determined by an A–O rather than an S–R structure. In contrast, interval schedules impose a maximum on the number of reinforcers that can be earned in a set time period, irrespective of response rate. Responding following extended training under an interval schedule can become independent of reinforcer value, demonstrating that it is based on S–R associations. Using the distinct properties of the two reward schedules, we tested the hypothesis that elevated response levels in sensitized animals are related to motivational value of the reinforcer using a ratio schedule. Increased response rates in amphetamine-treated animals would indicate enhanced motivational value of the reinforcer. Next, we investigated whether amphetamine-sensitized rats would show accelerated S–R learning using a satiety-specific devaluation paradigm in animals trained under an interval schedule of reinforcement. In this paradigm, the effect of decreased incentive value of a reinforcer can be tested by measuring rates of responding for a food reinforcer after animals have been prefed to satiation with that particular foodstuff before testing. After limited training, when responding is directed by A–O processes, responding is decreased by devaluation, but after extended training when an S–R relationship has developed, responding is insensitive to devaluation (Coutureau and Killcross, 2003, Dickinson, 1985). Earlier occurrence of insensitivity of responding to devaluation in amphetamine-treated animals would indicate accelerated S–R learning.

Section snippets

Experimental procedures

A total of 36 male Wistar rats (Harlan, Horst, the Netherlands) were used for both experiments combined. Rats were housed in groups of four in 54 × 34 × 18 cm (l × w × h) cages. The rats were allowed to acclimate to the reversed day–night light cycle (white light from 7 p.m. to 7 a.m.) and were handled 2–3 times per week for at least 2 weeks preceding experiments. They were fed ad libitum for the first week after arrival (Rat chow AMII, 10 mm, Hope Farms, Woerden, The Netherlands), then food-deprived

Increased responding for food in amphetamine-treated rats in the RR task

Analysis of response levels in the RR task was performed both for the first 10 min of each session (Fig. 1A) and for the entire 30-minute session (Fig. 1B). Significant main effect of sessions and session × drug treatment interactions were found using both time periods (Sessions 30 min: F(44,729) =19.10, p < 0.001; Session 10 min: F(44,792) = 27.08; p < 0.001; Session × drug treatment interaction 30 min: F(44,792) = 10.80; p < 0.005; Session × drug treatment interaction 10 min: F(44,792) = 2.61; p < 0.001). In the

Discussion

The present study demonstrates that a sensitizing regimen of amphetamine both augments the motivational value of a sucrose reinforcer and accelerates S–R learning. Amphetamine-treated rats responded markedly more than controls in a task where more action led to more reward delivery, whereas no differences were seen between amphetamine- and saline-treated controls when high response rates were not proportionally rewarded. This suggests that amphetamine treatment results in the enhancement of

Acknowledgements

Support for this project was provided by The Netherlands Organization for Scientific Research (NWO grant 903-47-092). We thank Prof. A. Dickinson for his valuable comments on the manuscript and help with the satiety devaluation experiments.

References (57)

  • T.E. Robinson et al.

    Enduring changes in brain and behavior produced by chronic amphetamine administration: a review and evaluation of animal models of amphetamine psychosis

    Brain Res.

    (1986)
  • T.E. Robinson et al.

    The neural basis of drug craving: an incentive-sensitization theory of addiction

    Brain Res. Brain Res. Rev.

    (1993)
  • T.E. Robinson et al.

    Persistent sensitization of dopamine neurotransmission in ventral striatum (nucleus accumbens) produced by prior experience with (+)− amphetamine: a microdialysis study in freely moving rats

    Brain Res.

    (1988)
  • W. Schultz

    Getting formal with dopamine and reward

    Neuron

    (2002)
  • J.D. Sokolowski et al.

    The role of accumbens dopamine in lever pressing and response allocation: Effects of 6-OHDA injected into core and dorsomedial shell

    Pharmacol. Biochem. Behav.

    (1998)
  • A. Valadez et al.

    Persistence of the ability of amphetamine preexposure to facilitate acquisition of cocaine self-administration

    Pharmacol. Biochem. Behav.

    (1994)
  • L.J.M.J. Vanderschuren et al.

    Lack of cross-sensitization of the locomotor effects of morphine in amphetamine-treated rats

    Neuropsychopharmacology

    (1999)
  • P. Voorn et al.

    Putting a spin on the dorsal–ventral divide of the striatum

    Trends Neurosci.

    (2004)
  • N.M. White et al.

    Multiple parallel memory systems in the brain of the rat

    Neurobiol. Learn. Mem.

    (2002)
  • American Psychiatric Association

    Diagnostic and Statistical Manual of Mental Disorders, IV-TR

    (2000)
  • B.W. Balleine et al.

    The effect of lesions of the basolateral amygdala on instrumental conditioning

    J. Neurosci.

    (2003)
  • A. Dickinson

    Actions and habits: the development of behavioral autonomy

    Philos. Trans. R. Soc. Lond., B

    (1985)
  • A. Dickinson et al.

    The effect of the instrumental training contingency on susceptibility to reinforcer devaluation

    Q. J. Exp. Psychol., B

    (1983)
  • A. Dickinson et al.

    Alcohol seeking by rats: action or habit?

    Q. J. Exp. Psychol., B

    (2002)
  • B.J. Everitt et al.

    Neural systems of reinforcement for drug addiction: from actions to habits to compulsion

    Nat. Neurosci.

    (2005)
  • A. Faure et al.

    Lesion to the nigrostriatal dopamine system disrupts stimulus–response habit formation

    J. Neurosci.

    (2005)
  • D.F. Fiorino et al.

    Facilitation of sexual behavior and enhanced dopamine efflux in the nucleus accumbens of male rats after d-amphetamine-induced behavioral sensitization

    J. Neurosci.

    (1999)
  • R. Ito et al.

    Dopamine release in the dorsal striatum during cocaine-seeking behavior under the control of a drug-associated cue

    J. Neurosci.

    (2002)
  • Cited by (123)

    • Dopamine sensitization by methamphetamine treatment prior to instrumental training delays the transition into habit in female rats

      2022, Behavioural Brain Research
      Citation Excerpt :

      Data here indicating that METH pretreatment in females trained to 160 R-O exposures acts to slow habit development of the instrumental task, as well as delaying devaluation of the Pavlovian task, may indicate that METH pretreatment is prolonging Pavlovian approach and therefore offsetting instrumental habit formation in female rats. Our results in male rats are aligned with previous reports that psychostimulants advance habitual behavior in males [19,38–40], albeit with considerably more instrumental training (320 R-O exposures) than in these previous studies (120 R-O exposures). This may be because d-amphetamine, which was used in most of the previous studies, may be acting differently than METH.

    View all citing articles on Scopus
    View full text