Introduction

According to the influential model of statistical learning PARSER (Perruchet & Vinter, 1998), when a to-be-learned sequence of items is not frequently repeated, the memory trace of this percept rapidly vanishes as a consequence of spontaneous decay and interference with similar material. In the present study, we specifically focus on the impact of interference on the learning of sequential material. In other words, we examine whether and how similarities between random and repeated visuo-spatial sequences induce interference in active representations in working memory and in the creation of sequential long-term memory representations. Indeed, when learning sequences in everyday life (e.g., sequencing movements in typing or sports, when playing an instrument, or sequencing sounds in speech), there is generally more or less overlap between the to-be-learnt items composing a sequence (e.g., movements or sounds), which is likely to induce interference between sequences and consequently to affect learning.

The effect of interference on sequence learning has been studied in humans by Page et al. (2013) with the Hebb repetition paradigm. In that study, the authors examined to what extent item-overlap is likely to affect learning in adults. Participants were presented with a repeated Hebb sequence of seven single-syllable words that were interposed by either non-overlapping or fully overlapping filler sequences. The fully overlapping filler sequences were composed of exactly the same items as those contained in the Hebb sequence, but in different orders (e.g., Hebb sequence = “flea, vase, disc, moss, shed, curb, soup”; filler sequence = “shed, disc, flea, soup, moss, curb, vase”), while the non-overlapping filler sequences did not share any item with the Hebb sequence (e.g., Hebb sequence = “flea, vase, disc, moss, shed, curb, soup”; filler sequence = “cow, cart, wink, seam, coin, arch, grown”). Results revealed reliable learning in the non-overlapping condition, while no learning was observed in the fully overlapping condition, suggesting that interference significantly affected learning. However, in our view, full overlap between items is very rare in natural learning situations. Taking the example of vocabulary acquisition in infants, not all words of a given language are anagrams of each other (e.g., eat and tea) and rather do, in many cases, share at most some phonemes only (e.g., / kɒpi / [copy] versus / kɒfi / [coffee]). As a consequence, mouth movements during speech production of different words are not systematically permutations of the same movements. The same is true for the acquisition of non-linguistic sequences such as sequences of movements when tapping out a phone number, learning to drive a car, or when playing an instrument.

As far as we know, only one study in humans used a semi-overlapping design, meaning that random and repeated sequence shared some but not all items (Saint-Aubin et al., 2015). However, as the aim of that study was different from ours, the degree of item-overlap between sequences was not controlled for and therefore conclusions about the effect of interference on sequence learning cannot be drawn.

The aim of the present study was thus to examine the impact of interference on sequential statistical learning by proposing a semi-overlapping design, which is more similar to what is usually observed in natural learning situations. To do so, we created an adaptation of the Serial Reaction Time task (Nissen & Bullemer, 1987) combined with the advantage of the Hebb repetition paradigm allowing to measure the impact of interference by inserting semi-overlapping random sequences between repeated sequences. In the present study, we tested a population of Guinea baboons in order to examine the impact of interference on sequence learning by controlling for language-related refreshing mechanisms, which are absent in non-human primates. Baboons were presented with sequences of three target locations on the touch screens. One random sequence was inserted between the presentations of the repeated sequence. In a first condition (Condition 1), the random sequences contained one location that was part of the repeated sequence, and this location was always presented at the same position (e.g., repeated sequence = 7 6 2, random sequence = 7 9 1 or 9 6 1 or 1 9 2, etc., random sequences were presented in random order). In a second condition (Condition 2), random sequences again contained one item of the repeated sequence but this time, it was located at one of the two remaining positions compared to the repeated sequence (e.g., repeated sequence = 7 6 2, random sequence = 9 7 1 or 9 1 7, etc., and the same for the two other locations 6 and 2, again presented in random order). In the third condition (Condition 3), we used a non-overlapping design, meaning that different locations were used for the repeated and the random sequences (e.g., repeated sequence = 7 6 2, random sequence = 4 9 1). In the present study, we hypothesize that the representation of the items of the repeated sequence and the connections between these items within the sequence (Burgess & Hitch, 2006) may interfere with the item representations of the overlapping random sequences (in Conditions 1 and 2) as a consequence of overwriting of features shared by these sequences (Oberauer & Kliegel, 2006). More precisely and according to previous studies (Page et al., 2013; Smalle et al., 2016), we expected to observe weaker or even no reliable learning in the two overlapping conditions but reliable learning in the non-overlapping condition. Moreover, we expected stronger interference in the “same-position” condition (Condition 1) compared to Condition 2. Indeed, imagine the following repeated sequence: 7 6 2, the co-occurrence between e.g., 7 and 6 should be more affected when being alternately presented with the following random sequence: 7 1 9 (Condition 1) compared to e.g., 1 9 7 (Condition 2), because 7 at the first position can here be associated to 6 or to 1, leading to an unstable representation of this association.

Methods

Participants

We tested 25 Guinea baboons (Papio papio, 16 females) living in a social group at the CNRS primate facility in Rousset (France). The baboons were housed in a 700 m2 outdoor enclosure with access to indoor housing.

Apparatus

Baboons had free access to 14 Automated Learning Devices for Monkeys (ALDMs, Fagot & Bonté, 2010; Fagot & Paleressompoulle, 2009) equipped with touch screens and a food dispenser. When entering the ALDM test box, baboons were identified by microchips implanted in each arm. The system saved the last trial the baboon had achieved before leaving the box, allowing him/her to continue the task later on where it had stopped. The experiment was controlled by E-Prime 2.0 software (Psychology Software Tools, Pittsburgh, PA, USA). All the baboons were familiar with touch-screen experimentation.

Materials and procedure

To begin a trial, baboons must press the yellow fixation crossFootnote 1 centered at the bottom of the screen. After pressing the cross, they saw a black screen that was divided into an invisible matrix of (3 × 3) cells, each containing a white cross in their center (see Fig. 1a and b). In this task, baboons needed to touch sequentially one red circle that moved on the screen in sequences of three target locations. When the baboon touched the first target position, it disappeared and was replaced by the white cross. The red circle then appeared at the second position and must again be touched before being presented with the last position of the sequence, where a last touch was required. Reward (grains of dry wheat) was delivered at the end of each sequence of three correct touches. In case of an error (i.e., the participant touched another location than the target one or failed to touch the screen within 5,000 ms), a green screen was displayed for 3,000 ms as a marker of failure.

Fig. 1
figure 1

a Representation of the touch screen with the nine locations and example of the experimental display and stimuli presentation for repeated and random sequences in Condition 1 (overlap at the same position). b Representation of the touch screen with the nine locations and example of the experimental display and stimuli presentation for repeated and random sequences in Condition 2 (overlap at a different position)

The experiment therefore consisted of three conditions: Two overlapping conditions (Conditions 1 and 2) and one non-overlapping condition (Condition 3). Each baboon completed all three conditions and the order of Conditions 1 and 2 was randomized. Thirteen baboons began with the first condition and subsequently completed the second one, while 12 baboons completed the second and then the first condition. The non-overlapping condition was part of a previous study conducted in our lab (Ordonez Magro et al., revision submitted). Each condition began with a random phase during which baboons were presented with three blocks of 100 random sequences of three positions. This phase was used as a familiarization phase at the beginning of the experiment as well as a phase allowing to “clean up” the memory trace of the previously learned repeated sequence to avoid interference between to-be-learned repeated sequences across conditions. The test phase began when the baboon achieved a performance higher than 80% correct within three consecutive blocks of 100 random trials. Response times (RTs) between the appearance of the red circle and the participant’s touch of each of the three positions for each sequence was recorded and served as a dependent variable. The present task follows a one-filler design, meaning that the presentations of the repeated sequence were spaced by one random sequence (i.e., random-repeated-random-repeated-…). Baboons performed five blocks of 100 trials, resulting in 250 presentations of the repeated sequence plus 250 presentations of the random sequence. To perform the 500 trials, baboons visited the test boxes in Condition 1 about 8.70 times on average, with an average of 65.96 trials per visit. In Condition 2, they visited the test boxes about 7.35 times with an average of 74.35 trials per visit. Finally, in Condition 3, baboons visited the test boxes about 8.12 times with an average of 69.85 trials per visit.

To avoid learning effects across conditions, participants were presented with a different repeated sequence in each condition (see Table 1 for an example). Repeated sequences were matched across positions for motor difficulty. To do so, a random phase of sequence production of six positions for 1,000 trials was conducted on 13 baboons. Based on these random trials, a baseline measure for all possible transitions from one location to another was computed by calculating mean RTs for each transition (e.g., from position 1 to 9), leading to a 9 × 9 matrix of mean RTs (see Appendix 1).

Table 1 Example of sequences presented in the three conditions for a given participant. Digits correspond to the nine screen locations. Repeated sequences are bolded, and the underlined digits correspond to the overlapping items

In this experiment, and in accordance with the serial recurrent network (SRN; Elman, 1990) model, learning was measured by comparing RTs between the three positions of the sequence. In a repeating three-position sequence such as e.g., 7 6 2, location 7 is always followed by location 6 and location 6 is always followed by location 2. Thus, location 7 (Position 1) is less predictable over trials, because of the previously presented random sequences, compared to locations 6 (Position 2) and 2 (Position 3), which benefit from the systematic presence of 7 or 7 6 presented just before. Thus, if learning takes place, we should observe faster RTs for Positions 2 and 3 compared to Position 1.

To be able to assess learning accurately, the mean RTs of the transition between Positions 1 and 2 (transition 1) and the transition between Positions 2 and 3 (transition 2) of the repeated sequences should be comparable. In our previous, study (Ordonez Magro et al., revision submitted), we created our repeated sequences by computing the RTs for the first and the second transition of all possible 504 triplets of the matrix and retained the triplets with the smallest difference of RTs (going from 2.06 ms to 15.55 ms), by making sure to choose sequences that were not too similar to each other (e.g., avoiding that all sequences begin with the same location). For the present study, we used novel sequences and thus retained the next five triplets of the list with the smallest difference of RTs (going from 6.97 ms to 18.22 ms), again making sure that they are not too similar too each other.

To create the random sequences of Condition 1, we computed, for each location, all sequences containing the given location at the same position and retained only those that did not contain the two remaining locations. For example, taking the repeated sequence 7 6 2, we computed all sequences starting with 7 and retained those that did not contain 6 and 2. We did the same for the location 6 and the location 2. This gave us a total of 90 different random sequences for Condition 1. Regarding Condition 2, we computed, for each location, all sequences that did contain the given location but at a different position and again retained those that did not contain the two remaining locations of the repeated sequence. Taking the example of the repeated sequence 7 6 2, we computed all random sequences containing the location 7 at position two or three and retained those that did not contain 6 and 2. We did the same for locations 6 and 2. This gave us a total of 180 different random sequences for Condition 2. The presentation order of the random sequences was randomized across trials.

Analyses

The ability of monkeys to learn the sequence was assessed by looking at the evolution of RTs on the three positions by computing linear regression analyses. The slope of the repeated sequence for the less predictable Position 1 and the mean slope of the predictable Positions 2 and 3 of the linear regression was used as an index of learning (see Fig. 2 for an example. To see the learning slopes for each baboon, each position, and each condition, see https://osf.io/6cpfm/?view_only=35017b02cc2947688636ee7620e1906f).

Fig. 2
figure 2

Evolution of response times (RTs) for all three positions of the repeated sequence and mean slopes for one individual (Mako) across the 250 trials for Condition 1

Results

We retained the 17 baboons (13 females, age range 3.58–25.16 y) who completed all three conditions (for the full data set, see https://osf.io/6cpfm/?view_only=35017b02cc2947688636ee7620e1906f). As in previous studies on regularity extraction in non-human primates (Malassis et al., 2018; Minier et al., 2016; Rey et al., 2019, 2022; Tosatto et al., 2022), analyses were only conducted on the repeated sequences (but see Appendix 3, for some analyses on the random sequences). For these sequences, baboons obtained a mean accuracy level of 99.0% (SD = 19.0%) in Condition 1 (item-overlap at the same position), of 99.1% (SD = 15.8%) in Condition 2 (item-overlap at a different position), and of 98.4% (SD = 15.7%) in Condition 3 (no overlap)Footnote 2. Incorrect trials were removed from the data set. We then removed RTs greater than 800 ms and subsequently conducted a recursive trimming procedure excluding RTs greater than two standard deviations from the mean for each of the three possible positions in a block of 50 trials and for each baboon (see Appendix 2 for the mean RTs for both repeated and random sequences as a function of block and condition). To measure learning of the repeated sequence, we looked at the evolution of RTs at each of the three positions in the sequence. Previous studies have shown that learning of the sequence leads to faster RTs on the predictable positions (i.e., Positions 2 and 3) compared to the less predictable position (i.e., Position 1) due to stronger contextual information (Elman, 1990; Malassis et al., 2018; Minier et al., 2016; Rey et al., 2019, 2020).

More precisely, as a measure of learning we computed, for each baboon, a slope on the RTs for each of the three positions of the repeated sequence and for each condition. We averaged the slopes of Positions 2 and 3 for which learning was possible and compared them with the slope of the less predictable Position 1Footnote 3. We ran a 3 (Condition: Condition 1, Condition 2, and Condition 3) × 2 (Position: Position 1 vs. mean of (Position 2 + 3)) repeated measures ANOVA on the learning slopes (see Fig. 3). Bayesian Factors (BF) are also reported, a major advantage of a Bayesian statistical framework is that it allows to quantify evidence in favor of, but also against, the presence of an effect, and hence both positive and null effects can be reliably interpreted (Kruschke et al., 2012; Wagenmakers, 2007). The Bayesian model comparison approach directly compares the null hypothesis to the alternative hypothesis (i.e., the effect of interest) and assesses evidence for the null effect and the effect of interest simultaneously (Dienes, 2014). Results are interpreted using the Bayes factor (BF), which reflects the likelihood ratio of two compared models. The BF10 is used to determine the likelihood ratio of the alternative model relative to the null model. A BF10 > 3 provides anecdotal evidence; a BF10 > 10 provides strong evidence, and a BF10 > 100 provides decisive evidence for the alternative hypothesis, while a BF01 < 0.33 provides anecdotal evidence; a BF01 < 0.10 provides strong evidence, and a BF01 < 0.01 provides decisive evidence for the null hypothesis (Jeffreys, 1961). The analyses were conducted with the JASP software package (JASP Team, 2021), using default settings for Cauchy prior distribution and the Monte Carlo Markov Chain Method for parameter estimation (Wagenmakers et al., 2018). Analyses revealed no significant main effect of Condition (F(2,32) = 0.367, p = .696, η2 = .022, BF01 = 8.12), a significant main effect of Position (F(1,16) = 27.184, p < .001, η2 = .629, BF10 > 100) with a steeper mean learning slope for Positions 2 and 3 (Mean = -.139, SD = .114) compared to Position 1 (Mean = -.020, SD = .108), and no significant interaction between Condition and Position (F(2,32) = 1.599, p = .218, η2 = .091, BF01 = 0.33). Thus, our findings showed faster mean RTs for the predictable Positions 2 and 3 compared to the less predictable Position 1 for all three conditions, and crucially, the mean RTs of the predictable Positions 2 and 3 were significantly different from 0 (Condition 1: Mean = -0.151, CI [-0.103, -0.199]; Condition 2: Mean = -0.144, CI [-0.097, -0.191]; Condition 3: Mean = -0.122, CI [-0.055, -0.189]), while this was not the case for Position 1 (Condition 1: Mean = 0.011, CI [0.058, -0.036]; Condition 2: Mean = -0.045, CI [0.004, -0.086]; Condition 3: Mean = -0.026, CI [0.036, -0.088]). Thus, findings revealed that learning occurred for all three conditions and the absence of a main effect of Condition as well as of an interaction between Condition and Position indicates that learning was similar across all conditions. In sum, it seems that interference did not affect learning significantly in the conditions tested in the present study using a semi-overlapping design.

Fig. 3
figure 3

Mean slopes for Position 1 and the average for Positions 2 and 3 for each condition: Condition 1 (overlapping item located at the same position), Condition 2 (overlapping item located at a different position), and Condition 3 (no overlap between random and repeated sequences). Error bars represent 95% confidence intervals

Discussion

In the present study, we examined the impact of interference on the learning of item sequences. According to the influential model of statistical learning PARSER (Perruchet & Vinter, 1998) and previous experimental studies on sequence learning (Page et al., 2013; Smalle et al., 2016), the learning of sequences of items is significantly hindered in situations in which interference is caused by the presentation of similar sequences. Note that most studies in humans use either fully overlapping random and repeated sequences (e.g., Attout et al., 2020; Bogaerts et al., 2016; Hebb, 1961; Szmalec et al., 2011), meaning that random sequences contain all items of the repeated sequence, but in a different order, or non-overlapping sequences (e.g., Hitch et al., 2009; Ordonez Magro et al., 2018, 2020, 2021;Page et al., 2013 ; Smalle et al., 2016), meaning that the repeated and the random sequences are composed of different items. To our knowledge, only one study in humans uses a semi-overlapping design, meaning that random and repeated sequence share some but not all items (Saint-Aubin et al., 2015). Moreover, previous studies in non-human primates examining the nature of regularity extraction in sequence learning (Malassis et al., 2018; Minier et al., 2016; Rey et al., 2019, 2022; Tosatto et al., 2022) usually use a design similar to the one introduced by Saffran et al. (1996), where baboons are repeatedly exposed to one and the same sequence of locations. This design, however, does not allow researchers to measure the impact of interference on sequence learning. The present study therefore proposes a learning paradigm mixing repeated and random sequences with a semi-overlapping design in non-human primates. In our semi-overlapping design, repeated and random sequences shared only one item, either located at the same position (Condition 1) or on one of the two other positions (Condition 2). This design is, in our opinion, more representative of what generally occurs in natural learning situations like, for instance, learning to play an instrument, the acquisition of sports, learning to drive a car, sequencing speech sounds, etc. Moreover, the present study is the first one to use a semi-overlapping learning paradigm in non-human primates, allowing us to measure interference in memory while controlling for language-related factors.

In contrast to previous studies (Page et al., 2013; Smalle et al., 2016) and to what we predicted on the basis of these studies, we did not observe a negative impact of interference on sequence learning. Indeed, our analyses showed that learning slopes were comparable across conditions. One potential explanation for this finding may be the fact that we used a semi-overlapping design rather than the fully overlapping design used in the previous studies. Indeed, a full overlap between all items of a sequence undeniably causes more interference as compared to partial overlap. However, our findings, even if unexpected, are not totally surprising, given that such interference would be a major handicap for learning in natural situations (for a discussion, see also Page et al., 2013).

Moreover, according to PARSER (Perruchet & Vinter, 1998), the impact of interference may become practically negligible for a percept, once it has been encountered repeatedly and its representation in long-term memory has become relatively robust. Thus, our finding that interference does not affect learning is compatible with the predictions of PARSER and with what is observed in natural learning situations involving a high level of interference.

The absence of a significant difference in learning the semi-overlapping repeated sequences (Conditions 1 and 2) compared to the non-overlapping repeated sequences (Condition 3) can also be explained in terms of frequency of occurrence. Indeed, in our semi-overlapping conditions the random sequences contained an item that was also part of the repeated sequence, which certainly increased the frequency of occurrence (during the experiment) of this given overlapping item. Thus, it is possible that the repetition of the same item in the random sequence might have strengthened its representation in long-term memory, which in turn would have facilitated the learning of item-item associations (i.e., co-occurrences between the three positions), via associative learning mechanisms (Majerus et al., 2012; Perruchet & Pacton, 2006; Perruchet & Vinter, 1998). This hypothesis is in line with sequence learning models (Page & Norris, 1998; Perruchet & Vinter, 1998), according to which learning is strongly dependent on repetition, a major component determining consolidation in long-term memory. There is indeed some evidence in our data that supports this hypothesis (see Fig. 3). Even if the difference between conditions is not significant, there was a slightly steeper learning slope for Positions 2 and 3 for the overlapping conditions (Conditions 1 and 2, -0.151 and -0.141, respectively) compared to the non-overlapping condition (Condition 3, -0.122) suggesting that the presentation of an overlapping item in the random sequences may have facilitated, rather than perturbed, the learning of the repeated sequence. Moreover, the observed absence of interference between random and repeated sequences can also be explained in terms of context signals. Indeed, according to the model of long-term learning proposed by Burgess and Hitch (2006), each new sequence is associated with a new set of context nodes, while a repeated sequence is associated with its own set of context signals and these connections are further strengthened with each repetition. Based on this, our random and repeated sequences, even if they had one item in common, might have been associated with different context nodes, consequently limiting the interference between random and repeated sequences.

Finally, our observations are in line with several previous studies on non-human primates, showing that monkeys like tamarins, macaques, and baboons can extract regular patterns from continuous sequences allowing them to learn predictable motor sequences by using statistical cues (Hauser et al., 2001; Heimbauer et al., 2012; Locurto et al., 2010, 2013; Malassis et al., 2018; Minier et al., 2016; Procyk et al., 2000; Rey et al., 2019, 2022; Tosatto et al., 2022; Wilson et al., 2013, 2015).

In sum, the present study used a semi-overlapping design to examine the influence of interference on memory forgetting in conditions that better mimic natural learning conditions compared to prior studies. In the present work, we observed that interference has not such a deleterious impact on sequence learning as would be expected based on the results of prior research. In contrast to the observation of previous studies, suggesting that no learning is possible when interference occurs (Page et al., 2013; Smalle et al., 2016), our findings are easy to reconcile with what is observed in natural sequence learning situations, where learning still takes place in an interfering environment.