Enhanced Reactivation of Remapping Place Cells during Aversive Learning

Introduction

Episodic memories include details about salient experiences from our past as well as information about the location in which these events occurred (Tulving, 1972). The hippocampus plays a central role in episodic memory and spatial decision-making (Pfeiffer and Foster, 2013, 2015; Wikenheiser and Redish, 2015; Yu and Frank, 2015), and this is thought to rely in part on its ability to create and store unique representations or maps of different spatial and nonspatial contexts (O’Keefe and Nadel, 1978; Eichenbaum, 2017). Alteration of the spatial representation, termed “remapping” (Muller and Kubie, 1987), is also thought to play a role in the encoding of episodic events occurring in familiar environments (Colgin et al., 2008). How this occurs without disrupting the representation of the physical and spatial features of the environment is not clear. Furthermore, the mechanisms through which episodic experiences are integrated into hippocampal representations during learning are not well understood.

Studies in which physical properties of the environment, such as the color or shape of the recording apparatus, are altered have shown that the infield firing rates of existing place fields (PFs) can change in response to changing sensory input without any change in place field location, a process termed “rate remapping” (Leutgeb et al., 2005). Further support for this concept has been found in studies in which changes in behavioral contingencies, such as turning direction on a plus or T-maze, also produce rate remapping (Frank et al., 2000; Wood et al., 2000; Ferbinteanu and Shapiro, 2003). However, studies of remapping during episodic memory encoding using reward or fear learning have reported changes in the place field locations of some, but not all, the recorded place cells (Moita et al., 2003; Smith and Mizumori, 2006; Dupret et al., 2010; Wang et al., 2012; Wu et al., 2017). This partial remapping, meaning remapping in a subset of the place cell population, suggests the possibility of a unique population of place cells that might dynamically encode information relating to events, while leaving a more spatially specific population to stably encode space. But how might episodic memory encoding take place such that it is restricted to one subpopulation, while leaving another unchanged?

One possibility is that encoding cells may be preferentially recruited into sharp-wave ripple (SWR)-associated replay events. Recent work has shown that in CA1, a distinct population of place cells can be identified on the basis of differential recruitment into replay events (Grosmark and Buzsáki, 2016). Further, patterns of activity associated with spatial novelty or reward learning have been shown to recur more frequently in these events (Singer and Frank, 2009; Ambrose et al., 2016). Last, these events have been shown to be important for the stabilization of place fields (Roux et al., 2017) and place cell assemblies in novel environments (van de Ven et al., 2016). However, it is not clear from this previous work whether these memory-encoding mechanisms occur in a specific population of hippocampal neurons.

We set out to explore the existence of an episodic encoding subpopulation of place cells in CA1 and to determine the neural coding mechanisms through which these cells may be selectively integrated into hippocampal networks during learning. Using a novel spatial decision-making task incorporating avoidance of aversive stimuli, we found that remapping cells were preferentially recruited into SWRs and replay events.

Results

Animals learn to avoid the shock arm when navigating to the food reward

In reward-learning paradigms, remapping occurs mainly around reward locations (Breese et al., 1989; Dupret et al., 2010; Gauthier and Tank, 2018). Thus, any effort to determine what intrinsic properties cause some place cells to remap while others do not is bound to be confounded by the locations of the place fields of the cells. To circumvent this limitation, we developed a novel decision-making task incorporating aversive stimuli, which produce spatially distributed remapping of hippocampal place cells (Moita et al., 2004; Wang et al., 2012; Wu et al., 2017). The task requires animals to use spatial knowledge about a stable, familiar environment in conjunction with information about changing episodic experiences occurring within this environment, leading to the creation of powerful episodic memories. Rats (n = 4) were trained to run laps on a track arranged such that they first ran to a feeder point for a small food reward, followed by a second short run to a choice point where they could freely choose, or alternatively be forced, to run along one of three choice arms to return to a goal location for a larger food reward (Fig. 1A). Rats were implanted with eyelid shock wires, and shock (a 1 s train of 1 ms pulses at 7 Hz, with intensity between 0.4 and 1.5 mA) could be triggered by breaking an infrared beam on the choice arms (Fig. 1B). During initial training, rats learned to navigate to the reward location; animals were never exposed to shock during this initial training. The experiment consisted of two sessions. In the first session [Fig. 1C, Session 1: control (CTRL)], rats ran a large number of laps in the absence of shock (laps: n = 132, 105, 97, and 114 for rats 1–4, respectively). The first and final 12 trials consisted of forced choice, with equal sampling of the three choice arms (i.e., each arm was run four times during both the PRE-learning and POST-learning epochs), and these were used to construct PRE-learning and POST-learning rate maps to assess baseline levels of place field stability. In the second session (Fig. 1C, Session 2: Learning), rats first ran five free-choice trials (Fig. 1F, PRE-learning) to determine their arm preferences. This was followed by 12 forced-choice trials (PRE), as in session 1, to establish the baseline rate maps. Shock was then introduced on two of the choice arms, making these the “shock” arms and the remaining arm the safe arm (the arm least preferred by the animal was chosen as the safe arm). A variable number of forced-choice trials was then run to introduce the new contingencies to the animal (see Materials and Methods), followed by free-choice trials allowing the experimenter to assess the preference of the animal for the safe arm. Once the animal reached 100% correct choices, we interleaved free-choice trials with forced-choice trials on the shock arms to record postlearning rate maps for all three choice arms (Fig. 1E, POST-learning); all but one of the animals (rat 2, one error) made 100% correct choices during this postlearning epoch (choices significantly nonrandom; binomial test, p < 0.001 for each animal; Fig. 1F). The latency with which animals entered the shock arms after learning was significantly greater than both the latency before introduction of the shock and the latency to enter the safe arm after learning (Kruskal–Wallis test followed by post hoc Dunn–Sidak test, H(95) = 17.8, p < 0.001; Fig. 1G); in subsequent remapping analysis, we velocity thresholded our data to remove times when the animal hesitated to enter a choice arm (see Materials and Methods). Further, there was a significant reduction in running velocity from PRE to POST that did not occur in CTRL sessions (Wilcoxon rank-sum test; CTRL: z = −0.81, p = 0.42; Learning: z = 3.3126, p = 0.0017; Fig. 1H).

Figure 1.
Figure 1.

A novel aversive spatial decision-making task. A, On each trial, animals first run to a small reward location (left) and then return to the main large reward location along 1 of the 3 choice arms (middle and right). Sampling of an arm could be forced by closing off the other arms (middle), or could be free choice (right), allowing an assessment of the arm preference of the animal. B, Infrared (IR) beams at the top of the choice arms could be used to trigger eyelid shock. A large food reward was delivered at the end of each trail (top right), while a small food reward was delivered at the top left to motivate the animal to leave the large reward zone. C, Two sessions were run by each of the 4 animals. In the first, animals simply ran along the track, receiving food reward at the reward locations. In the second, after an initial baseline period was recorded, aversive shock was introduced on 2 of the 3 choice arms. D, The sequence of trial types within a session, and their use for the remapping analysis. Note that in a subset of sessions where the animal did not appear to switch its behavior after the first set of “forced-choice-new contingencies” trials, a second set was then run, hence the recursive arrows. E, An example learning curve showing that Rat 2 eventually achieved 100% correct choices. Note that in the final “postlearning” period, forced-choice trials were interleaved with free-choice trials in order for the spatial map on the shock arms to be recorded. F, All animals (4 rats) learned the location of the safe arm (binomial test, p < 0.001 for each animal). Note that the experimenter chose the safe arm identity after first running 5 free-choice trials (prelearning) at the outset of the recording session to ensure that animals did not have a bias toward the safe arm before the shock was introduced. G, Latency to enter the shock arm after learning was greater than latency to enter the choice arm before introduction of shock and latency to enter the safe arm after learning. H, Running velocity, averaged within laps, dropped during POST compared with PRE in the learning sessions. *p < 0.05, **p < 0.001, n.s. indicates not significant.

A subset of CA1 place cells remap after aversive learning

Once dorsal CA1 tetrodes reached their target (Fig. 2A), experiments commenced. We recorded 228 place cells during the CTRL session (Fig. 2B) and 277 place cells during the Learning session (Fig. 2C; place cells defined as having a minimum spatial information score of 0.3 bits/spike, and a minimum peak firing rate of 3 Hz). The paths of the animal along the track were linearized and concatenated (Fig. 2C, bottom) to allow for the remapping analysis. We assessed remapping using the following three commonly used measures: (1) place field correlations; (2) mean rate differences; and (3) population vector correlations. Surprisingly, place field correlations for cells recorded during the Learning session were not significantly lower than for those recorded during the CTRL session (Wilcoxon rank-sum test, n = 228 and 277 units, z = 1.72, p = 0.085; Fig. 2D). This result indicated a lack of what is often referred to as “global” remapping, where place field locations change, but left open the possibility of rate remapping, whereby the in-field firing rates of place cells change in an uncoordinated fashion across the population. However, mean rate differences, typically used to assess rate remapping (Leutgeb et al., 2005), were not significantly greater after learning compared with control sessions (Wilcoxon rank-sum test, z = −1.43, p = 0.154; Fig. 2E). Finally, we examined correlations between population vectors corresponding to the firing rates of each cell at individual spatial bins (Fig. 2F, left). In contrast to the place field correlations and mean rate differences, population vector correlations after learning were significantly lower than those from the control session, indicating that remapping had indeed occurred [Wilcoxon rank-sum test, n = 800 (200 population vectors from each session), z = 7.31, p < 0.001; Fig. 2F, right]. This remapping was not an artifact of lower running velocities during POST as replacing the fastest PRE and slowest POST laps so as to have velocity match the two epochs (Wilcoxon rank-sum, z = −0.026, p = 0.98; Fig. 2G; see Materials and Methods) increased, rather than diminished, the magnitude of remapping (Wilcoxon sign-rank test, z = 2.54, p = 0.011; Fig. 2I).

Figure 2.
Figure 2.

Task learning induces partial remapping. A, Representative histologic section showing positions of tetrode tips in CA1 cell layer. B, Normalized rate maps for all cells recorded during the first (PRE) and final (POST) trials of session 1, in which no shock was introduced. C, As in B but from Session 2, in which shock was introduced on 2 of the choice arms after the PRE epoch. Schematic at bottom shows how the maze was linearized for place field analysis. White dashed lines outline the rate map of a single cell (horizontal), used in subsequent place field remapping and rate remapping analyses, and a single population vector (vertical), used in subsequent population vector remapping analysis. D, Two representative place cell rate maps from the same unit before and after introduction of the shock in Session 2 (left, outlined by horizontal dashed box in C). PRE-population rate and POST-population rate maps were not significantly less correlated during Session 2 than during control Session 1 (right). E, Rate remapping was not significantly greater in Session 2 than in Session 1. F, Two representative population vectors from the same maze locations before and after introduction of the shock in Session 2 (left, outlined by vertical dashed box in C). PRE-population and POST-population vectors were significantly less correlated during Session 2 than during control Session 1 (right). G, Selecting the slowest laps from the combined set of laps containing both the initial 5 free-choice laps and the PRE epoch, and the fastest laps from the POST epoch abolishes the velocity difference between the PRE and POST epochs. H, Using these velocity-matched laps for the remapping analysis does not reduce the magnitude of the population vector decorrelation, but rather increases it. *p < 0.05, **p < 0.001, n.s. indicates not significant.

What can explain the discrepancy between the place field correlations and mean rate differences, on the one hand, and the population vector correlations on the other? Partial remapping, in which only a subset of cells remap, could produce this result if the subset of remapping cells had place fields distributed across all spatial bins, as this would cause each pair of population vectors to be slightly more decorrelated than they otherwise would be. To confirm this, we first transformed the population vector data into scores that could be assigned to individual cells. We did this by performing a subtraction analysis in which population vector correlations were recalculated using population vectors in which a given place cell was omitted. A “remapping contribution” score was then calculated for each place cell as the difference between the population vector correlations using all place cells and the correlations using all but that particular cell. Thus, a cell that contributed to decorrelation received a positive remapping contribution score, and a cell that instead contributed to stability (i.e., the correlations were reduced with that cell omitted) received a negative remapping contribution score (Fig. 3A); for each of the four rats, we classified 25, 37, 39, and 32 place cells as remapping, and 21, 38, 26, and 59 as stable, respectively. We performed a stepwise regression to create a generalized linear model to determine which variables best predicted the final remapping contribution score; as variables, we included the place field correlation of each cell, its mean rate difference, as well as its maximal firing rate. The remapping contribution score for a given cell was best modeled as a combination of its place field correlations, its mean rate difference, as well as interactions (i.e., product of terms) between each of these variables and the maximal firing rate of the cell (t(275) = −10.5, 4.8, −11.3, 5.7, respectively; all p < 0.001; Fig. 3B). Thus, a cell whose fields remap but whose firing rate is relatively low will make a smaller contribution to remapping at the population level than a similar cell with a relatively high firing rate.

Figure 3.
Figure 3.

Splitting the place cell population into remapping and stable subpopulations. A, The contribution of each cell to PV decorrelation during Session 2 (Fig. 2F) was calculated. No clear bimodality was observed, suggesting a continuous distribution of place cells with highly remapping and stable cells at the extremes of the distribution. B, Stepwise multivariable linear regression showed that a cell’s PV remapping contribution was significantly correlated with its PF remapping (PF), its rate remapping (RR), and the products of these remapping values with its maximal firing rate (FR; t tests, all p < 0.001). C, Session 2 cells were split into remapping and stable groups according to whether they had positive or negative PV remapping contributions. Note that both classes of cells had place fields distributed across the full extent of the maze. D, Remapping cells were less stable than stable cells at all time points. However, they were significantly more stable at the final time point when the contingencies were well learned. **p < 0.001

To compare cells that remapped to cells that did not, we classified all cells with positive remapping contributions as remapping cells, and cells with negative contributions as “stable” cells (n = 133 remapping cells, 144 stable cells). Replotting the rate maps for cells classified in this way showed that both remapping and stable cells were distributed across the maze, and that the remapping population was composed both of cells whose place field locations changed and rate-remapping cells whose field locations did not change (Fig. 3C). We examined the stability of remapping and stable cells by calculating PV correlations between rate maps constructed from individual consecutive laps. We then calculated the average PV correlation for each pair of consecutive laps and grouped them according to whether the lap occurred before the contingency change, immediately after, or at the end of the session. Remapping cells were less stable than stable cells at all time points (Wilcoxon signed-rank test, z = −3.52, p < 0.001 at each time point; Fig. 3D), including before the contingency change. However, they were significantly more stable at the final time point (Kruskal–Wallis test, post hoc Dunn–Sidak; H(47) = 25.5, p < 0.001; stable cells: H(47) = 1.18, p = 0.56; Fig. 3D). This shows that remapping cells were initially unstable in their representations, but became stabilized over the course of learning.

Investigating place field properties of remapping and stable cells in greater detail, we found that place field sizes were smaller in remapping cells before learning, and increased across learning to a size not different from stable cells (Wilcoxon rank-sum test: PRE: z = −2.72, p = 0.007; POST: z = −0.46, p = 0.065; Remapping: z = −3.02, p = 0.003; Stable: z = −0.35, p = 0.73; Fig. 4A). The number of fields expressed by both remapping and stable cells increased with learning, though to a larger extent than in remapping cells (Wilcoxon rank-sum test: PRE: z = 0.12, p = 0.090; POST: z = 3.21, p = 0.0013; Wilcoxon signed-rank test: Remapping: z = −4.25, p < 0.001; Stable: z = −2.16, p = 0.031; Fig. 4B). Mean and maximal firing rates in remapping cells were lower in remapping cells before learning, and then rose across learning to a level not different from those of stable cells (mean rates: Wilcoxon rank-sum test: PRE: z = −4.00, p < 0.001; POST: z = −0.23, p = 0.82; Wilcoxon signed-rank test: Remapping: z = −3.98, p < 0.001; Stable: z = 0.78, p = 0.44; maximal rates: Wilcoxon rank-sum test: PRE: z = −4.55, p < 0.001; POST: z = −1.65, p = 0.094; Remapping: z = 3.45, p < 0.001; Stable: z = 0.15, p = 0.88; Fig. 4C,D). Stable cells carried more spatial information than remapping cells after learning (Wilcoxon rank-sum test: PRE: z = −0.96, p = 0.34; POST: z = −2.13, p = 0.04; Wilcoxon signed-rank test: Remapping: z = 1.31, p = 0.19; Stable: z = 1.28, p = 0.20; Fig. 4E). Confirming the validity of cell identification procedure, we found that the spatial distribution of remapping cell firing changed significantly across learning compared with stable cells, measured both as change in the rate map center of mass (Wilcoxon rank-sum test: z = 6.81, p < 0.001; Fig. 4F) and reduced rate map correlations (Wilcoxon rank-sum test: z = −10.59, p < 0.001; Fig. 4G). Next, we investigated burst firing and coordination of firing by the theta rhythm. Remapping cells did not have significantly greater burst index scores than stable cells (Mizuseki and Buzsáki, 2013; Wilcoxon rank-sum test: z = 1.69, p = 0.091; Fig. 5A). The magnitude of theta modulation, as defined by the mean resultant length of the circular distribution of spike theta phases, was also not significantly different between remapping and stable cells (Wilcoxon rank-sum test: z = −0.11, p = 0.091; Fig. 5B). However, the mean theta phases of significantly modulated remapping cells (126 of 133 cells, 94.7%) were shifted to later phases compared with stable cells (138 of 144 cells, 95.8%; common median test: z = 4.92, p = 0.027; Fig. 5C). These findings reveal that in addition to being more sensitive to learning, remapping cells also exhibited important differences in their coding properties compared with stable cells.

Figure 4.
Figure 4.

Place field properties of remapping and stable cells. A, Remapping cells had smaller place fields before the contingency change, which grew to a size not significantly different from that of stable cells after learning. B, The number of place fields expressed by remapping cells was not different from that of stable cells before learning, but increased after learning. C, Mean firing rates were initially lower in remapping cells, but increased across learning to a level not significantly different from stable cells. D, The same pattern observed in the mean firing rate data were observed in the maximal firing rate data. E, Spatial information was greater in stable cells after learning compared with remapping cells, but neither group changed significantly across learning. F, The centers of mass of remapping cell rate maps shifted significantly more than those of stable cells across learning. G, Similarly, rate maps of remapping cells were significantly more decorrelated after learning than those of stable cells. *p < 0.05, **p < 0.001, n.s. indicates not significant.

Figure 5.
Figure 5.

Bursting and theta modulation of remapping and stable cells. A, Remapping cells did not have higher burst index scores than stable cells. B, The degree of theta modulation, measured as the mean resultant length of the circular distribution of spike theta phases, was not different between remapping and stable cells. C, However, the mean theta phases of remapping cells were shifted to later phases compared with those of stable cells. *p < 0.05, n.s. indicates not significant.

Remapping cells increase their replay participation during learning

The main purpose of our study was to test whether awake replay of hippocampal place cell sequences, which has been proposed to function as a mechanism to stabilize plasticity following learning (Dupret et al., 2010; Carr et al., 2011), might serve such a function with regard to the partial remapping induced by aversive learning in the “Learning” sessions. We identified candidate replay events as brief increases in the population-wide firing rate coinciding with SWRs (Fig. 6A). Neither the ripple frequency of these events (Wilcoxon rank-sum test: z = 0.46, p = 0.65; Fig. 6B) nor their rate of generation (t(3) = −2.57, p = 0.082; Fig. 6C) changed across learning, but their duration did increase (Wilcoxon rank-sum test: z = −2.46, p = 0.014; Fig. 6D), consistent with a recent report showing that ripple duration increases with learning and memory demands in rats (Fernández-Ruiz et al., 2019).

Figure 6.
Figure 6.

Ripple properties. A, Local field potential recordings showed a large increase in power in the ripple band (100–250 Hz) during candidate replay events. B, There was a significant increase in the frequency of replay events after the introduction of shock on the choice arms. C, The oscillatory frequency of ripples was not significantly different across learning. D, The rate of ripple generation was not significantly different across learning. E, Ripples were significantly longer after learning. *p < 0.05, n.s. indicates not significant.

A candidate event was classified as a replay event if the correlation between decoded position and time within the event was greater than that of the 95th percentiles of each of two control distributions calculated from two separate shuffling procedures (see Materials and Methods). Replay events became more frequent after the contingency change (i.e., introduction of shock), indicating an important role for these events in learning (t(3) = −5.71, p = 0.0107; Fig. 4B). The very small number of replay events observed before the contingency change (PRE: 0 events in rats 1 and 2, 1 event in rat 3, 3 events in rat 4) precluded any further examination of replay before learning (these events were excluded from further analysis). In total, there were 98 significant replay events during the POST-all epoch, which consisted of all trials occurring after the contingency change (18, 27, 28, and 25 events in rats 1–4, respectively; Fig. 7AD).

Figure 7.
Figure 7.

Remapping cells show greater involvement in replay. AD, Example replay events showing the raw and ripple band-filtered LFP and spikes color coded by cell identity (remapping, red; stable, black) and ordered according to their location of peak firing (left; vertical dashed lines show the temporal borders of the event) and heat maps showing the posterior probabilities calculated using Bayesian decoding (right). E, Remapping cells fired more spikes per replay event than stable cells. F, There was a significant correlation between remapping contribution and number of spikes fired per replay event in the combined remapping and stable population. G, Remapping cells made a larger contribution to the correlation between elapsed time and decoded position within each replay event. H, Remapping cells were more likely than stable cells to participate within a given replay event. I, The proportion of the remapping cells population participating within any given event was larger than that of the stable population. J, The number of remapping cell spikes, normalized by the number of remapping cells, fired within a given replay event was larger than the corresponding normalized number of stable cell spikes. *p < 0.05, **p < 0.001.

We asked whether there was a difference between remapping and stable cells in terms of their participation within these replay events. Consistent with our hypothesis, we found that remapping cells fired significantly more spikes during replay events than stable cells (Wilcoxon rank-sum test: z = 3.03, p = 0.0024; Fig. 7E). Supporting this, we found a significant correlation between remapping contribution and the number of spikes fired during replay across the place cell population (n = 277 cells; Pearson correlation coefficient = 0.1879, p = 0.0017; Fig. 7F). To measure the contribution that individual cells made to the correlations between time and decoded position, which is the hallmark of hippocampal replay, we used a previously published cell-specific shuffling technique (Grosmark and Buzsáki, 2016). This analysis showed that remapping cells made significantly greater contributions than stable cells to the time–position correlation (Wilcoxon rank-sum test: z = 2.88, p = 0.0040; Fig. 7G). These results were not skewed by remapping cells firing a large number of spikes in a minority of replay events, as we found that remapping cells were more likely to fire at least one spike within any given event (Wilcoxon rank-sum test: z = 3.23, p = 0.0012; Fig. 7H). Further, examining individual events rather than cells confirmed that events contained a greater proportion of the remapping cell population (Wilcoxon signed-rank test: z = 4.87, p < 0.001; Fig. 7I) as well as a greater number of remapping cell spikes compared with stable cells (Wilcoxon signed-rank test: z = 4.97, p < 0.001; Fig. 7J).

To confirm that these differences were because of learning, we turned to the control sessions. Because all place fields will undergo a certain amount of change or fluctuation (Mankin et al., 2012) over time, which, like any biological process, will vary to some degree across units, we were able to split the control session place cell population using the same cell identification procedure into stable and unstable populations, with the unstable population being equivalent to the remapping population in the Learning sessions (n = 123 stable cells, n = 105 remapping cells). There were no significant differences between these two groups in terms of per cell contribution, spikes per event, or likelihood of participation (Wilcoxon rank-sum test: z = −1.60, −1.67, and −1.48; p = 0.11, 0.10, and 0.14, respectively; Fig. 8A–C). Further, the correlation between remapping contribution and spike count was abolished in the CTRL sessions (Pearson correlation coefficient = 0.085, p = 0.203; Fig. 8D). Repeating our analysis of individual replay events, we found no difference in the proportions of more and less stable cell populations active within the events (Wilcoxon signed-rank test: z = −1.21, p = 0.227; Fig. 8E), but we did find that the normalized number of stable cells spikes was significantly greater than for unstable cells (Wilcoxon signed-rank test: z = −2.35, p = 0.019; Fig. 8F).

Figure 8.
Figure 8.

Unstable cells do not show greater involvement in replay during CTRL sessions. A, Stable and unstable cells made similar contributions to the correlation between elapsed time and decoded position within replay events. B, Stable and unstable cells fired similar numbers of spikes during replay events. C, Stable and unstable cells had similar likelihoods to fire spikes within replay events. D, There was no correlation between the remapping contribution of a cell and the number of spikes it fired during replay. E, The proportions of the stable and unstable populations active within replay events were not significantly different. F, The number of spikes per stable cell within replay events was not significantly different from the number of spikes per unstable cell. **p < 0.001, n.s. indicates not significant.

Because animals ran more trials on the safe arm than on the shock arms (55.9% trials on the safe arm postcontingency change), we wondered whether differences in remapping and stable cell distribution across the three arm types (prechoice, safe, and shock) could interact with an experience-dependent effect on replay (e.g., more replay of the safe arm because of more safe arm experience) to produce the enhancement of replay activity in remapping cells. However, we did not observe any differences in mean firing rates across the three arm types for either remapping or stable cells (Kruskal–Wallis test: remapping cells: H(398) = 0.093, p = 0.95; stable cells: H(431) = 1.23, p = 0.54; Fig. 9A,B), suggesting that any experience-dependent effect on replay would affect both cell populations equally.

Figure 9.
Figure 9.

The firing of remapping and stable cells is evenly distributed across the maze. A, Mean firing rates of remapping cells were not different across the 3 different maze arm types. B, Similarly, there was no difference in firing across the 3 arm types in stable cells. n.s. indicates not significant.

Replay events may not represent planning

We plotted the maze positions at which each replay event occurred. The vast majority occurred at the main reward site in the interval between successive trials (Fig. 10A). In three of the four animals, there were a small number that occurred at the small reward location. In two of the animals, there were a small number of events immediately before entry into the choice arms; in rat 2, both events were decoded as forward replays up a shock arm, but the animal ultimately chose the safe arm; in rat 3, all six events occurred during forced-choice trials after the animal had learned the identity of the safe and shock arms, and four of these consisted of two pairs of replays, with the first representing a trajectory up a shock arm followed by a second up the safe arm, suggesting the possibility that they were involved in deliberation.

Figure 10.
Figure 10.

Replay events are unlikely to underlie route planning. A, Black dots represent the maze locations at which replay events occurred. Most replay events occurred at the large reward location where the animal remained confined for 10 s after each trial. A smaller number were also observed at the small reward location. A very small number was observed at the choice points, but only in rats 2 and 3. BF, No bias was observed for replay events to encode either forward or reverse trajectories (B), the prechoice or the choice arms (C), the safe or the shock arms (D), the next traversed choice arm (E), or the previously traversed arm (F). n.s. indicates not significant.

We examined the content of the individual replay events more closely in an effort to determine whether there was any further evidence that the events were guiding behavior. Forward replay events (i.e., in which the decoded position progresses in a forward direction around the maze) occurred with a frequency that was not different from reverse replay, arguing against an exclusive role in planning (binomial test: chance level = 50%, p = 0.86; Fig. 10B). We did not observe any bias of replay for paths along the choice arms compared with paths that only included the prechoice portion of the maze (i.e., the path from the top right to bottom left; chance level of encoding prechoice arm = 25%, p = 0.48; Figs. 1, 10C), any bias toward paths along safe arm (chance level of encoding safe arm = 33.3%, p = 0.72; Fig. 10D), or paths along the next arm (chance level = 33.3%, p = 0.82; Fig. 10E; this analysis omitted forced-choice trials), supporting our interpretation that replay did not appear to support route planning in this particular behavioral paradigm, and adding further evidence that the greater number of trajectories on the safe arm did not skew the content of replay. Finally, we found no bias toward replay of the previous choice arm (chance level, 33%; binomial test, p = 0.180; Fig. 10F; this analysis also omitted forced-choice trials), suggesting that replay was not reinforcing the spatial representation of the previous choice. Thus, the content of replay seems to correspond to trajectories distributed around the entire maze, and these trajectories appear to be selected randomly.

Together, our analyses of replay events during aversive learning suggest that the role of replay may be in facilitating the storage or stabilization of a new spatial memory. However, given the relatively small number of replay events and the possibility that not all events necessarily serve the same purpose, a role in planning for individual replay events cannot be ruled out.

Firing properties during sharp-wave ripples

We wondered whether a general upregulation of firing during SWRs might underlie the increased participation of remapping cells in replay events. We analyzed all SWRs that did not co-occur with statistically significant replay events from both the precontingency and postcontingency change periods (this analysis used the PRE-all epoch, consisting of PRE with the addition of five preceding free-choice trials, see Methods and Materials; PRE-all, n = 84; POST-all, n = 814). The rate of ripple occurrence did not increase significantly across learning (Fig. 11A). Calculating the average remapping cell and stable cell ripple-centered firing rates within each ripple, we found that remapping cells increased their firing rates across learning (Wilcoxon rank-sum test: remapping: z = −2.79, p = 0.005; stable: z = −0.88, p = 0.38) to a level that was significantly higher than that of stable cells (Wilcoxon signed-rank test: PRE-all: z = 0.24, p = 0.81; POST-all: z = 5.02, p < 0.001; Fig. 11B,C). Similar to replay, we found a significant correlation between remapping contribution and ripple firing rate for individual place cells after the contingency change (remapping and stable cells: Wilcoxon rank-sum test: z = 2.60, p = 0.009; pooled population: Pearson correlation coefficient = 0.132, p = 0.028; Fig. 11D,E); in CTRL sessions, this correlation was abolished (stable vs unstable cells: Wilcoxon rank-sum test: z = 0.98, p = 0.33; Pearson correlation coefficient = −0.0031, p = 0.963; Fig. 12A,B). The proportions of the remapping and stable populations active within any given event increased across learning, but this increase was significantly larger in remapping cells (Remapping: Wilcoxon rank-sum test: z = −4.58, p < 0.001; Stable: Wilcoxon rank-sum test: z = −2.45, p = 0.014; PRE-all: Wilcoxon signed-rank test: z = −0.17, p = 0.86; POST-all: Wilcoxon signed-rank test: z = 5.16, p < 0.001; Fig. 11F); this difference was reversed in CTRL sessions (Wilcoxon signed-rank test: z = −2.79, p = 0.005; difference from remapping: Wilcoxon rank-sum test: z = −5.27, p < 0.001; Fig. 12C).

Figure 11.
Figure 11.

Remapping cells have higher firing rates during non-replay-associated ripples. A, There was no significant difference in the rate of ripple generation after learning. B, The average ripple-centered remapping cell firing rate increased after learning. Solid line shows the median values calculated across all ripples; shaded area is the 95% confidence interval calculated using bootstrap sampling. C, This increased average rate was significantly higher than that of stable cells. D, Similarly, the ripple-centered firing rates of individual remapping cells were higher after learning than those of stable cells. E, There was a significant correlation between the remapping contributions and ripple-centered firing rates in the pooled remapping and stable cell population. F, The proportion of the remapping cell population active within a given ripple was higher after learning than that of the stable cell population. *p < 0.05, **p < 0.001, n.s. indicates not significant.

Figure 12.
Figure 12.

Ripple-centered firing of stable and unstable cells during CTRL sessions. A, Ripple-centered firing rates of individual unstable cells were no higher than those of stable cells during POST-all in CTRL sessions. B, Unlike in the Learning sessions, there was no correlation between remapping contribution and ripple-centered firing rate in the pooled stable and unstable cell populations. C, Learning session ripples contain a greater proportion of the remapping population relative to the stable population, while the opposite is observed in CTRL sessions. **p < 0.001, n.s. indicates not significant.

Discussion

Our data confirm previous findings that aversive learning in a familiar environment induces partial remapping (Wang et al., 2012; Wu et al., 2017). We show for the first time that the subset of cells that remap express an enhanced recruitment into awake SWRs and replay events compared with nonremapping stable cells. This enhanced recruitment of remapping cells cannot be explained by increases in in-field firing rates; though the firing rates of remapping cells did increase across learning, they did not surpass those of stable cells. Further, because remapping cell firing was distributed along all arms of the maze, differences in behavior at specific locations on the maze, such as increased latency to enter shock arms and greater choice preference for the safe arm after learning are also unlikely to explain this enhancement. Thus, our data uncover a link between the excitability of a cell, specifically during SWRs, and the reorganization of its spatial pattern of firing during locomotion, when theta oscillations dominate the local field potential.

While we have found it useful to classify cells as belonging to either remapping or stable place cell populations, we point out here that they may instead exist on a continuum with highly remapping and highly stable cells at the extremes, a possibility highlighted by our correlational analyses (Figs. 7F, 11E).

Replay as a memory storage mechanism occurring in remapping cells

Previous studies have shown that in reward learning, cells whose fields become clustered around the reward zones increase their firing during SWRs (Dupret et al., 2010). However, because reward increases firing rates during SWRs, and SWRs tend to involve cells that have place fields at the reward zone (Singer and Frank, 2009), it was not clear whether the enhanced contributions of reward zone cells was because of a unique role for these cells in replay, or simply a by-product of their place field locations. Our task, in which remapping and nonremapping cell place fields were both distributed along the track arms, circumvented this limitation, and our data show that cells involved in memory encoding increase their contributions during learning more than nonremapping stable cells. Thus, in our task, the location of the place field of a cell does not seem to be the primary determinant of its involvement in either SWRs or replay events.

Research into replay has focused on two roles it may play. First, there is substantial evidence for a role in trajectory planning. In typical goal-directed tasks, the trajectory, or trajectories, leading to the goal reward are thought to be replayed preferentially as a means of the animal planning and previewing where it must navigate to reach the goal (Pfeiffer and Foster, 2013; Widloski and Foster, 2022). Conversely, in a spatial paradigm featuring an aversive shock zone, but no goal zone, replay was shown to preferentially activate cells with fields in the shock zone (i.e., trajectories to be avoided; Wu et al., 2017). Similarly, replay has also been shown to encode a nonpreferred/unrewarded option when another rewarded option is available, indicating that replay can encode trajectories to avoid even in the absence of aversive stimuli (Gupta et al., 2010; Carey et al., 2019). Because replay has been reported both to encode trajectories to navigate, as well as those to avoid, it is impossible to make a prediction of replay content in our paradigm if replay were indeed to serve a role in planning; in our task, there is a part of the maze that is aversive and should be avoided, as well as safe arms that are preferentially navigated by the animal, and the studies cited above suggest that both should be replayed. Therefore, we cannot rule out a role for replay in trajectory planning in our task.

The second, not necessarily mutually exclusive, view of replay is that it underlies memory storage. Disruption of ripples during learning (Jadhav et al., 2012) and post-training periods (Girardeau et al., 2009) has been shown to impair rats trained on a hippocampus-dependent spatial memory task. Further, an increase in SWR incidence has been reported at novel goal locations, and the frequency of these events predicted task performance (Dupret et al., 2010). In a task designed to promote replay before memory-based choice, replay content was found to be decoupled from subsequent choice (Gillespie et al., 2021). Reverse replay, in particular, has been suggested to play an important role in memory encoding, as it is enhanced in novel environments and has been postulated to allow the linking of a reward with the sequence of places through which the animal had to travel to receive it (Foster and Wilson, 2006). Our data appear more consistent with this second view. First, we observed very few replay events at the choice points; instead, most events occurred at the large reward location, where the animal was confined for 10 s between trials. While this is not surprising, given that cessation of locomotion is necessary for brain state to transition from theta into slow-wave activity (Buzsáki, 1986), a prerequisite for SWRs, we would nevertheless expect a greater number of replays at the choice points if they were necessary for route planning/decision-making, as animals were free to stop at the choice points to deliberate over their options. Second, we did not observe any difference in the likelihood of involvement of the relatively neutral prechoice arms and the non-neutral (i.e., aversive or safe) choice arms. Third, we did not observe any preference of forward replay for the next traversed arm. Last, we did not find that forward replay occurred more frequently and, in fact, saw a trend, though not significant, toward a greater incidence of reverse replay. Together, these data indicate that replay during our task may serve to incorporate the spatial coding of the remapping cells into the larger hippocampal representation.

Ripples versus replay

Ripples are brief (duration, 50–100 ms), high-frequency (150–300 Hz) network oscillations within the hippocampus that emerge during nonexploratory states such as slow-wave sleep, quiet rest, grooming, and eating/drinking (Buzsáki, 1986, 2015), and disruption in their expression results in significant memory impairments (Girardeau et al., 2009; Nakashiba et al., 2009; Ego-Stengel and Wilson, 2010; Jadhav et al., 2012; Wang et al., 2015). Typically, only a subset of ripples is found to contain significant replay of place field sequences (Foster and Wilson, 2006; Tingley and Peyrache, 2020). This raises the question of whether replay events and nonreplay ripple events serve different functions. Our finding that remapping cells had enhanced participation in both kinds of events suggests that both serve the same underlying function in our task, namely, to consolidate the incorporation of emotional and contextual information into the spatial representation of the maze. Further, recent work suggests that the heuristic approaches typically used in the field to identify replay events only identify the most salient trajectories, and more sophisticated techniques suggest that most ripple events do contain replay (Krause and Drugowitsch, 2022); this suggests that our ripple results may be best viewed as a replication of our replay results. Together, these data suggest the existence of a mechanism by which remapping cell excitability is enhanced during SWRs, such as a differential response relative to stable cells to the decreased activity of cholinergic and septal GABAergic inputs when transitioning from theta (Buzsáki, 2002).

Conclusion

Our identification and examination of remapping cells show that increased coordination during ripples and replay events in these cells may facilitate the generation and stabilization of plasticity in the memory-encoding place cell network, thereby linking a spatially restricted experience to a representation of the larger environment in which it occurred. The enhancement of replay contribution in remapping cells suggests the existence of a mechanism to select those cells with the emergent coding properties that best support adaptive behavior for participation in replay. That these processes occur in a specific cell population, while a separate population of place cells exhibit stable spatial coding, demonstrates a mechanism whereby the hippocampus can both maintain a stable spatial representation of the environment while incorporating changing features of experiences that occur within that environment.