A time course analysis of satiety-induced instrumental outcome devaluation

Parkes, Shauna L.; Marchand, Alain R.; Ferreira, Guillaume; Coutureau, Etienne

doi:10.3758/s13420-016-0226-1

Published: 29 April 2016

Volume 44, pages 347–355, (2016)

Learning & Behavior

Shauna L. Parkes^1,2,3, Alain R. Marchand^2,3, Guillaume Ferreira^1,3 & Etienne Coutureau^2,3

Abstract

Sensory-specific satiety is commonly used in studies of decision making to selectively devalue a food reward. Devaluation is reflected in an immediate reduction in the subsequent intake of the food and in the performance of actions that gain access to that food. Despite its frequent use, the lasting effects of satiety-induced devaluation on instrumental actions are unknown. Here, we examined the time course and contextual dependency of sensory-specific satiety-induced devaluation on instrumental responding and consumption. Rats were trained to perform two instrumental actions for two distinct food rewards. Then, one of the instrumental outcomes was provided ad libitum for 1 hour in separate feeding cages and the effect of this devaluation was assessed 0, 2, or 5 hours after satiation. At a delay of 0 or 2 hours, both intake and instrumental responding were sensitive to the satiety treatment. That is, rats consumed less of the devalued outcome and responded less for the devalued outcome than for the valued outcome. By contrast, after 5 hours, rats showed sensitivity to devaluation in consumption but not in instrumental responding. Strikingly, sensitivity to devaluation was restored for the instrumental response after a 5 hour delay when devaluation was performed in the instrumental context. These results indicate that, in rats, specific satiety-induced devaluation endures and is context-independent for up to 2 hours post-satiation. At longer delays, the impact of sensory-specific satiety on instrumental responding is context-dependent, suggesting that contextual cues may be required for the value of specific outcomes to control instrumental responding.

Introduction

The performance of goal-directed instrumental actions in both humans and animals is determined by knowledge regarding the contingency between actions and their outcomes as well as the current motivational value of the outcome (Adams & Dickinson, 1981; Balleine & Dickinson, 1998; Colwill & Rescorla, 1985; Dickinson & Balleine, 1994). Outcome-specific devaluation has become a gold standard in the assessment of goal-directed behavior (Colwill & Rescorla, 1985). Devaluation of the instrumental outcome, usually a palatable food reward, is typically achieved via one of two procedures: conditioned taste aversion (CTA; Adams, 1982; Garcia & Koelling, 1967) or sensory-specific satiety. Sensory-specific satiety refers to the decrease in preference for a food recently eaten to satiety relative to other foods (Hetherington & Rolls, 1996; Rolls, 1986, 1990; Rolls, Rolls, Rowe, & Sweeney, 1981; Young, 1940), and its use is often favored over CTA as it induces a temporary, reversible devaluation of the outcome.

In humans and animals, specific satiation reduces both the value of the food reward and the performance of actions to obtain that reward. Thus, following devaluation of a food reward, the performance of instrumental actions for that food is suppressed compared to responding for another food with distinct sensory properties (e.g., Balleine, 2005; Balleine & Dickinson, 1992, 1998; Dickinson & Balleine, 1994, 2002). Typically, the effects of outcome devaluation via specific satiety on instrumental behavior in humans (Alvares et al., 2013; Gottfried, O’Doherty, & Dolan, 2003; Hogarth, Chase, & Baess, 2012; Schwabe, Tegenthoff, Hoffken, & Wolf, 2012; Tricomi, Balleine, & O'Doherty, 2009; Valentin, Dickinson, & O'Doherty, 2007) and animals (Balleine & Dickinson, 1998; Balleine & O'Doherty, 2010) are assessed immediately or shortly after satiation. This is likely due to the assumption that sensory-specific satiety represents a temporary decline in the value of the food, and over time the value of the food will recover. In humans, the effects of sensory-specific satiety are observed from 2 minutes (Rolls et al., 1981) to 24 hours after satiation (Weenen, Stafleu, & de Graaf, 2005). However, despite its frequent use in the assessment of goal-directed behavior, the precise time course of satiety-induced devaluation has not been systematically studied in animals. Understanding the time course therefore has the strong potential to influence the practice of future studies using the outcome devaluation task.

Here, we evaluated the effect of sensory-specific satiety-induced devaluation on instrumental performance and consumption at 0, 2, or 5 hours following satiation. Experiments 1 and 2 revealed that both consumption and instrumental behavior are influenced by specific satiety-induced devaluation up to 2 hours after satiation. However, after a 5-hour delay, only consumption was affected by outcome-specific satiation. This result is especially interesting as it suggests that, while the prefed outcome was less desirable relative to the non-prefed outcome, rats did not respond less for the prefed than for the non-prefed outcome. One obvious difference between the consumption and instrumental tests was that the former occurred in the same context as satiety-induced devaluation, whereas the latter did not. Consistent with the literature, we observed no effect of context when the test was conducted shortly after satiation. However, there is evidence that, under some conditions, instrumental actions remain unaffected by devaluation when devaluation occurs outside the instrumental context (Holman, 1975; Wilson, Sherman, & Holman, 1981). It was therefore plausible that the failure to observe an effect of devaluation on instrumental responding with a 5-hour delay was because satiation occurred in a context different to that used for instrumental responding. To test this hypothesis, in Experiment 3, devaluation was performed either in a different context, as in Experiments 1 and 2, or in the instrumental context. We showed that the instrumental response was sensitive to devaluation 5 hours after satiation only if devaluation was performed in the instrumental context. This finding provides novel evidence that contextual cues can bias action selection by modulating value representations.

Materials and methods

Experiment 1

The aim of this experiment was to explore the time course and recovery of the effect of sensory-specific satiety on instrumental responding. Briefly, rats were trained on two actions for distinct food rewards. Then, all rats were allowed to consume one food reward for 1 h before a choice extinction test. Critically, the choice test was given either immediately or 2 h or 5 h after the devaluation session (Fig. 1). Rats were given a choice consumption test immediately after the instrumental test.

Subjects and apparatus

Twenty-four experimentally naïve male outbred Long Evans rats (Janvier, France) served as subjects. They were housed in plastic boxes (two rats per box) located in a climate-controlled colony room, and were maintained on a 12-h light/dark cycle (lights on at 7:00 a.m.). All behavioral procedures occurred during the light phase of the cycle. Rats were handled daily for 4 days before the behavioral procedures and were put on a food deprivation schedule 2 days before behavioral procedures to maintain them at approximately 90 % of their ad libitum feeding weight. All experiments were conducted in agreement with the French (council directive 2013-118, 1 February 2013) and international (directive 2010-63, 22 September 2010, European Community) legislations and received approval # 5012053-A from the local ethics committee.

Training and testing took place in eight operant chambers (40 cm wide × 30 cm deep × 35 cm high; Imetronic, Pessac, France) enclosed in sound- and light-resistant shells. Each chamber was equipped with two pellet dispensers that delivered grain or sugar pellets (45 mg) into a recessed magazine when activated. The chambers contained two retractable levers that could be inserted to the left and the right of the magazine. An infrared photobeam crossed the magazine opening, allowing for the detection of head entries. Four LED house lights provided illumination of the operant chamber.

Behavioral procedures

Instrumental training

On Days 1 and 2, rats were given two sessions of magazine training. During each session, rats were confined to the operant chamber while 45 mg grain (BioServ; 3.35 kcal/g) and sugar (Test Diet; 3.4 kcal/g) food pellets were delivered at random 60-s intervals. Forty outcomes were delivered per session, 20 of each outcome. On Days 3–11, rats underwent instrumental training during which time two responses (left and right lever presses) were trained each with a different food pellet. Each session involved two presentations of each lever for a maximum of 10 min each or until 20 outcomes were earned; that is, rats could earn a maximum of 40 grain and 40 sugar pellets within each session. The inter-trial interval between lever presentations was 2.5 min. The order of the lever presentation was alternated and counterbalanced across rats and days. For the first 3 days, lever pressing was continuously reinforced. Then, the probability of the outcome given a response was gradually shifted over days using increasing random ratio (RR) schedules: a RR 5 schedule was used on Days 6–8 and a RR 10 schedule on Days 9–11.
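The reinforcement schedule described above can be sketched in a few lines of Python. This is only an illustration, not the authors' control software: it assumes that a random ratio (RR n) schedule reinforces each lever press independently with probability 1/n, which is the usual definition but is not spelled out in the text. The function names `ratio_for_day` and `simulate_lever_presentation` are hypothetical.

```python
import random

def ratio_for_day(day: int) -> int:
    """Ratio requirement per training day, as stated in the text:
    Days 3-5 continuous reinforcement, Days 6-8 RR 5, Days 9-11 RR 10."""
    if 3 <= day <= 5:
        return 1   # continuous reinforcement: every press is rewarded
    if 6 <= day <= 8:
        return 5
    if 9 <= day <= 11:
        return 10
    raise ValueError("instrumental training ran on Days 3-11 only")

def simulate_lever_presentation(ratio, n_presses, max_outcomes=20, rng=None):
    """Count outcomes earned during one lever presentation.

    Assumption: each press is reinforced with probability 1/ratio.
    The presentation ends once max_outcomes (20 in the text) are earned.
    """
    rng = rng or random.Random()
    earned = 0
    for _ in range(n_presses):
        if rng.random() < 1.0 / ratio:
            earned += 1
            if earned == max_outcomes:
                break
    return earned
```

For example, `simulate_lever_presentation(ratio_for_day(10), n_presses=150)` simulates one RR 10 lever presentation for a rat that presses 150 times; the press count is an arbitrary illustrative value.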

Outcome-specific devaluation

On Day 12, rats were given their first outcome devaluation test. In this test, rats received ad libitum access to one of the two food outcomes (20 g) for 1 h in distinct, polycarbonate feeding cages (42 × 28 × 20 cm) located in a different room to that used for training. Half of the rats in each response-outcome assignment received grain pellets and the remaining rats received sugar pellets. Next, all rats were given a 10 min choice extinction test in which both levers were available but no outcome was delivered. Critically, we manipulated the delay between the end of the devaluation session and the start of the test. Rats were either tested immediately (group 0-hr; n = 8) or at 2 h (group 2-hr; n = 8) or 5 h (group 5-hr; n = 8) after the end of the devaluation session (see Fig. 1). Rats in groups 2-hr and 5-hr were returned to their home cages during the delay period. All rats were given 1 day of retraining on the RR 10 schedule, and on Day 14 rats were given a second test with the other outcome devalued. Twenty-four hours before the first devaluation session, all rats were familiarized with the plastic feeding cages for 1 h and were allowed to consume four grain and four sugar pellets.
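The group structure of this test can be laid out as a small allocation table. The sketch below is a hypothetical illustration: the text states only the group sizes (n = 8 per delay) and that the devalued outcome was split evenly within each response-outcome assignment, so the full crossing of delay group, response-outcome assignment, and devaluation order is an assumption, as are the labels and the deterministic rotation (a real allocation would presumably be randomized).

```python
from itertools import product

DELAY_GROUPS = ["0-hr", "2-hr", "5-hr"]
# Hypothetical labels for the two response-outcome assignments.
RO_ASSIGNMENTS = ["left=grain/right=sugar", "left=sugar/right=grain"]
FIRST_DEVALUED = ["grain", "sugar"]

def build_design(n_rats=24):
    """Assign rats to cells of a fully crossed design (assumed, see lead-in)."""
    cells = list(product(DELAY_GROUPS, RO_ASSIGNMENTS, FIRST_DEVALUED))
    if n_rats % len(cells) != 0:
        raise ValueError("n_rats must be a multiple of the number of cells")
    design = []
    for rat_id in range(1, n_rats + 1):
        group, ro, first = cells[(rat_id - 1) % len(cells)]
        # The second test (Day 14) devalues the other outcome.
        second = "sugar" if first == "grain" else "grain"
        design.append({
            "rat": rat_id,
            "group": group,
            "ro_assignment": ro,
            "devalued_day12": first,
            "devalued_day14": second,
        })
    return design
```

With 24 rats this yields 8 rats per delay group, and within each group 4 rats have grain devalued on Day 12 and 4 have sugar devalued.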

Consumption test of specific satiety

Immediately after each extinction test (Days 12 and 14), rats were returned to the feeding cages and given a choice consumption test of satiety-induced devaluation. Rats received 10 min access to both of the food pellets (10 g each) and the total amount of each outcome (valued and devalued) was recorded.

Statistical analyses

All data were analysed using planned, orthogonal contrasts in a mixed-model analysis of variance (ANOVA) with alpha set at 0.05. Simple main effects analyses were used to establish the source of any significant interactions. Measures of effect size (partial η² for ANOVA and Cohen’s d for between-subjects contrasts with two groups, Experiment 3 only) are stated for each comparison and confidence intervals (CI; 95 % for the mean difference, standardized using the sample standard deviation units) are reported for each significant comparison. Data are presented as mean ± SEM.
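For readers unfamiliar with the two effect-size measures named above, the following sketch shows how they are conventionally computed. It is not the authors' analysis code. Partial η² is derived here from an F ratio and its degrees of freedom, and Cohen's d uses the pooled standard deviation of two independent groups; the choice of the pooled standard deviation is an assumption, since the text says only that differences were standardized in sample standard deviation units.

```python
import math
from statistics import mean, variance

def partial_eta_squared(f_value, df_effect, df_error):
    """Partial eta squared from an F ratio: SS_effect / (SS_effect + SS_error)."""
    return (f_value * df_effect) / (f_value * df_effect + df_error)

def cohens_d(group_a, group_b):
    """Cohen's d for two independent groups, using the pooled SD (assumed)."""
    n1, n2 = len(group_a), len(group_b)
    pooled_var = ((n1 - 1) * variance(group_a)
                  + (n2 - 1) * variance(group_b)) / (n1 + n2 - 2)
    return (mean(group_a) - mean(group_b)) / math.sqrt(pooled_var)
```

As a purely hypothetical check, F = 4.0 with 1 and 12 degrees of freedom gives a partial η² of 0.25.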

Experiment 2

As the groups of Experiment 1 were tested at different times of the day, in Experiment 2, we attempted to replicate the results of Experiment 1 with rats tested at the same time of the day.

Subjects and apparatus

Twenty-four experimentally naïve, male, Long-Evans rats (Janvier) served as subjects. The housing and training apparatus were the same as those described for Experiment 1.

Behavioral procedures

The training and testing procedures were identical to those described for Experiment 1 with one notable exception. For the outcome-specific devaluation tests, all groups were tested at the same time of day but received satiety-induced devaluation at different times of the day (see Fig. 2). Again, rats were given the devaluation treatment either immediately (group 0-hr; n = 8), 2 h (group 2-hr; n = 8) or 5 h (group 5-hr; n = 8) before the outcome-specific devaluation test.
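The timing logic of Experiment 2 amounts to working backwards from a common test time. The sketch below illustrates that arithmetic under stated assumptions: the 1 h devaluation session and the 0, 2, and 5 h delays come from the text, whereas the function name `devaluation_windows` and any particular test time passed to it are hypothetical, since the actual clock times are not given here.

```python
from datetime import datetime, timedelta

DEVALUATION_DURATION = timedelta(hours=1)   # 1 h ad libitum access
DELAYS_H = {"0-hr": 0, "2-hr": 2, "5-hr": 5}

def devaluation_windows(test_start):
    """Return (start, end) of the devaluation session for each group,
    so that every group begins the test at the same clock time."""
    windows = {}
    for group, delay in DELAYS_H.items():
        end = test_start - timedelta(hours=delay)
        windows[group] = (end - DEVALUATION_DURATION, end)
    return windows
```

For instance, if the common test were to start at 15:00 (an arbitrary value), group 5-hr would be satiated from 09:00 to 10:00, group 2-hr from 12:00 to 13:00, and group 0-hr from 14:00 to 15:00.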

Experiment 3

The previous experiments show that instrumental responses are sensitive to satiety-induced outcome devaluation 2 h, but not 5 h, after satiation. However, consummatory responses were sensitive to devaluation following a 5 h delay. In the previous experiments, the instrumental test was conducted in the training cages (i.e., a context where the rat had experienced the outcomes as valuable), whereas the consummatory test was conducted in the devaluation cages (i.e., a context where the rat had experienced the outcome as devalued). The context may therefore have influenced which outcome representation was retrieved to guide behavior (Bouton, 1993). To test this hypothesis, in the current experiment devaluation was performed either in the instrumental context, i.e., the operant box (group Same), or in a different context (group Different), as in Experiments 1 and 2.

Subjects and apparatus

Sixteen experimentally naïve, male, Long-Evans rats (Janvier) served as subjects. The housing and training apparatus were the same as those described for Experiments 1 and 2.

Behavioral procedures

Rats were trained as above to press two levers for two distinct food rewards. Twenty-four hours after the final training session, rats were given an outcome devaluation test. For half of the rats, outcome-specific devaluation occurred in the same operant boxes used for training and testing (group Same) and for the remaining rats devaluation occurred in the feeding cages used in Experiments 1 and 2 (group Different). All rats were familiarized with the plastic feeding cages the day prior to the first devaluation test and were allowed to consume four grain and four sugar pellets in the cages. Rats were given 1 h access to one of the two food outcomes (20 g) in a small glass feeding dish; half of the rats in each group were given grain pellets and the remaining half was given sugar pellets. After devaluation, rats were returned to their home cages for 5 h and were then placed back into the operant cages for the instrumental test. Immediately following this test, all rats were given a choice consumption test in the operant cages.