Introduction
Episodic memories include details about salient experiences from our past as well as information about the location in which these events occurred (Tulving, 1972). The hippocampus plays a central role in episodic memory and spatial decision-making (Pfeiffer and Foster, 2013, 2015; Wikenheiser and Redish, 2015; Yu and Frank, 2015), and this is thought to rely in part on its ability to create and store unique representations or maps of different spatial and nonspatial contexts (O’Keefe and Nadel, 1978; Eichenbaum, 2017). Alteration of the spatial representation, termed “remapping” (Muller and Kubie, 1987), is also thought to play a role in the encoding of episodic events occurring in familiar environments (Colgin et al., 2008). How this occurs without disrupting the representation of the physical and spatial features of the environment is not clear. Furthermore, the mechanisms through which episodic experiences are integrated into hippocampal representations during learning are not well understood.
Studies in which physical properties of the environment, such as the color or shape of the recording apparatus, are altered have shown that the infield firing rates of existing place fields (PFs) can change in response to changing sensory input without any change in place field location, a process termed “rate remapping” (Leutgeb et al., 2005). Further support for this concept has been found in studies in which changes in behavioral contingencies, such as turning direction on a plus or T-maze, also produce rate remapping (Frank et al., 2000; Wood et al., 2000; Ferbinteanu and Shapiro, 2003). However, studies of remapping during episodic memory encoding using reward or fear learning have reported changes in the place field locations of some, but not all, the recorded place cells (Moita et al., 2003; Smith and Mizumori, 2006; Dupret et al., 2010; Wang et al., 2012; Wu et al., 2017). This partial remapping, meaning remapping in a subset of the place cell population, suggests the possibility of a unique population of place cells that might dynamically encode information relating to events, while leaving a more spatially specific population to stably encode space. But how might episodic memory encoding take place such that it is restricted to one subpopulation, while leaving another unchanged?
One possibility is that encoding cells may be preferentially recruited into sharp-wave ripple (SWR)-associated replay events. Recent work has shown that in CA1, a distinct population of place cells can be identified on the basis of differential recruitment into replay events (Grosmark and Buzsáki, 2016). Further, patterns of activity associated with spatial novelty or reward learning have been shown to recur more frequently in these events (Singer and Frank, 2009; Ambrose et al., 2016). Last, these events have been shown to be important for the stabilization of place fields (Roux et al., 2017) and place cell assemblies in novel environments (van de Ven et al., 2016). However, it is not clear from this previous work whether these memory-encoding mechanisms occur in a specific population of hippocampal neurons.
We set out to explore the existence of an episodic encoding subpopulation of place cells in CA1 and to determine the neural coding mechanisms through which these cells may be selectively integrated into hippocampal networks during learning. Using a novel spatial decision-making task incorporating avoidance of aversive stimuli, we found that remapping cells were preferentially recruited into SWRs and replay events.
Results
Animals learn to avoid the shock arm when navigating to the food reward
In reward-learning paradigms, remapping occurs mainly around reward locations (Breese et al., 1989; Dupret et al., 2010; Gauthier and Tank, 2018). Thus, any effort to determine what intrinsic properties cause some place cells to remap while others do not is bound to be confounded by the locations of the place fields of the cells. To circumvent this limitation, we developed a novel decision-making task incorporating aversive stimuli, which produce spatially distributed remapping of hippocampal place cells (Moita et al., 2004; Wang et al., 2012; Wu et al., 2017). The task requires animals to use spatial knowledge about a stable, familiar environment in conjunction with information about changing episodic experiences occurring within this environment, leading to the creation of powerful episodic memories. Rats (n = 4) were trained to run laps on a track arranged such that they first ran to a feeder point for a small food reward, followed by a second short run to a choice point where they could freely choose, or alternatively be forced, to run along one of three choice arms to return to a goal location for a larger food reward (Fig. 1A). Rats were implanted with eyelid shock wires, and shock (a 1 s train of 1 ms pulses at 7 Hz, with intensity between 0.4 and 1.5 mA) could be triggered by breaking an infrared beam on the choice arms (Fig. 1B). During initial training, rats learned to navigate to the reward location; animals were never exposed to shock during this initial training. The experiment consisted of two sessions. In the first session [Fig. 1C, Session 1: control (CTRL)], rats ran a large number of laps in the absence of shock (laps: n = 132, 105, 97, and 114 for rats 1–4, respectively). The first and final 12 trials consisted of forced choice, with equal sampling of the three choice arms (i.e., each arm was run four times during both the PRE-learning and POST-learning epochs), and these were used to construct PRE-learning and POST-learning rate maps to assess baseline levels of place field stability. In the second session (Fig. 1C, Session 2: Learning), rats first ran five free-choice trials (Fig. 1F, PRE-learning) to determine their arm preferences. This was followed by 12 forced-choice trials (PRE), as in session 1, to establish the baseline rate maps. Shock was then introduced on two of the choice arms, making these the “shock” arms and the remaining arm the safe arm (the arm least preferred by the animal was chosen as the safe arm). A variable number of forced-choice trials was then run to introduce the new contingencies to the animal (see Materials and Methods), followed by free-choice trials allowing the experimenter to assess the preference of the animal for the safe arm. Once the animal reached 100% correct choices, we interleaved free-choice trials with forced-choice trials on the shock arms to record postlearning rate maps for all three choice arms (Fig. 1E, POST-learning); all but one of the animals (rat 2, one error) made 100% correct choices during this postlearning epoch (choices significantly nonrandom; binomial test, p < 0.001 for each animal; Fig. 1F). The latency with which animals entered the shock arms after learning was significantly greater than both the latency before introduction of the shock and the latency to enter the safe arm after learning (Kruskal–Wallis test followed by post hoc Dunn–Sidak test, H(95) = 17.8, p < 0.001; Fig. 1G); in subsequent remapping analysis, we velocity thresholded our data to remove times when the animal hesitated to enter a choice arm (see Materials and Methods). Further, there was a significant reduction in running velocity from PRE to POST that did not occur in CTRL sessions (Wilcoxon rank-sum test; CTRL: z = −0.81, p = 0.42; Learning: z = 3.3126, p = 0.0017; Fig. 1H).
A subset of CA1 place cells remap after aversive learning
Once dorsal CA1 tetrodes reached their target (Fig. 2A), experiments commenced. We recorded 228 place cells during the CTRL session (Fig. 2B) and 277 place cells during the Learning session (Fig. 2C; place cells defined as having a minimum spatial information score of 0.3 bits/spike, and a minimum peak firing rate of 3 Hz). The paths of the animal along the track were linearized and concatenated (Fig. 2C, bottom) to allow for the remapping analysis. We assessed remapping using the following three commonly used measures: (1) place field correlations; (2) mean rate differences; and (3) population vector correlations. Surprisingly, place field correlations for cells recorded during the Learning session were not significantly lower than for those recorded during the CTRL session (Wilcoxon rank-sum test, n = 228 and 277 units, z = 1.72, p = 0.085; Fig. 2D). This result indicated a lack of what is often referred to as “global” remapping, where place field locations change, but left open the possibility of rate remapping, whereby the in-field firing rates of place cells change in an uncoordinated fashion across the population. However, mean rate differences, typically used to assess rate remapping (Leutgeb et al., 2005), were not significantly greater after learning compared with control sessions (Wilcoxon rank-sum test, z = −1.43, p = 0.154; Fig. 2E). Finally, we examined correlations between population vectors corresponding to the firing rates of each cell at individual spatial bins (Fig. 2F, left). In contrast to the place field correlations and mean rate differences, population vector correlations after learning were significantly lower than those from the control session, indicating that remapping had indeed occurred [Wilcoxon rank-sum test, n = 800 (200 population vectors from each session), z = 7.31, p < 0.001; Fig. 2F, right]. This remapping was not an artifact of lower running velocities during POST as replacing the fastest PRE and slowest POST laps so as to have velocity match the two epochs (Wilcoxon rank-sum, z = −0.026, p = 0.98; Fig. 2G; see Materials and Methods) increased, rather than diminished, the magnitude of remapping (Wilcoxon sign-rank test, z = 2.54, p = 0.011; Fig. 2I).
What can explain the discrepancy between the place field correlations and mean rate differences, on the one hand, and the population vector correlations on the other? Partial remapping, in which only a subset of cells remap, could produce this result if the subset of remapping cells had place fields distributed across all spatial bins, as this would cause each pair of population vectors to be slightly more decorrelated than they otherwise would be. To confirm this, we first transformed the population vector data into scores that could be assigned to individual cells. We did this by performing a subtraction analysis in which population vector correlations were recalculated using population vectors in which a given place cell was omitted. A “remapping contribution” score was then calculated for each place cell as the difference between the population vector correlations using all place cells and the correlations using all but that particular cell. Thus, a cell that contributed to decorrelation received a positive remapping contribution score, and a cell that instead contributed to stability (i.e., the correlations were reduced with that cell omitted) received a negative remapping contribution score (Fig. 3A); for each of the four rats, we classified 25, 37, 39, and 32 place cells as remapping, and 21, 38, 26, and 59 as stable, respectively. We performed a stepwise regression to create a generalized linear model to determine which variables best predicted the final remapping contribution score; as variables, we included the place field correlation of each cell, its mean rate difference, as well as its maximal firing rate. The remapping contribution score for a given cell was best modeled as a combination of its place field correlations, its mean rate difference, as well as interactions (i.e., product of terms) between each of these variables and the maximal firing rate of the cell (t(275) = −10.5, 4.8, −11.3, 5.7, respectively; all p < 0.001; Fig. 3B). Thus, a cell whose fields remap but whose firing rate is relatively low will make a smaller contribution to remapping at the population level than a similar cell with a relatively high firing rate.
To compare cells that remapped to cells that did not, we classified all cells with positive remapping contributions as remapping cells, and cells with negative contributions as “stable” cells (n = 133 remapping cells, 144 stable cells). Replotting the rate maps for cells classified in this way showed that both remapping and stable cells were distributed across the maze, and that the remapping population was composed both of cells whose place field locations changed and rate-remapping cells whose field locations did not change (Fig. 3C). We examined the stability of remapping and stable cells by calculating PV correlations between rate maps constructed from individual consecutive laps. We then calculated the average PV correlation for each pair of consecutive laps and grouped them according to whether the lap occurred before the contingency change, immediately after, or at the end of the session. Remapping cells were less stable than stable cells at all time points (Wilcoxon signed-rank test, z = −3.52, p < 0.001 at each time point; Fig. 3D), including before the contingency change. However, they were significantly more stable at the final time point (Kruskal–Wallis test, post hoc Dunn–Sidak; H(47) = 25.5, p < 0.001; stable cells: H(47) = 1.18, p = 0.56; Fig. 3D). This shows that remapping cells were initially unstable in their representations, but became stabilized over the course of learning.
Investigating place field properties of remapping and stable cells in greater detail, we found that place field sizes were smaller in remapping cells before learning, and increased across learning to a size not different from stable cells (Wilcoxon rank-sum test: PRE: z = −2.72, p = 0.007; POST: z = −0.46, p = 0.065; Remapping: z = −3.02, p = 0.003; Stable: z = −0.35, p = 0.73; Fig. 4A). The number of fields expressed by both remapping and stable cells increased with learning, though to a larger extent than in remapping cells (Wilcoxon rank-sum test: PRE: z = 0.12, p = 0.090; POST: z = 3.21, p = 0.0013; Wilcoxon signed-rank test: Remapping: z = −4.25, p < 0.001; Stable: z = −2.16, p = 0.031; Fig. 4B). Mean and maximal firing rates in remapping cells were lower in remapping cells before learning, and then rose across learning to a level not different from those of stable cells (mean rates: Wilcoxon rank-sum test: PRE: z = −4.00, p < 0.001; POST: z = −0.23, p = 0.82; Wilcoxon signed-rank test: Remapping: z = −3.98, p < 0.001; Stable: z = 0.78, p = 0.44; maximal rates: Wilcoxon rank-sum test: PRE: z = −4.55, p < 0.001; POST: z = −1.65, p = 0.094; Remapping: z = 3.45, p < 0.001; Stable: z = 0.15, p = 0.88; Fig. 4C,D). Stable cells carried more spatial information than remapping cells after learning (Wilcoxon rank-sum test: PRE: z = −0.96, p = 0.34; POST: z = −2.13, p = 0.04; Wilcoxon signed-rank test: Remapping: z = 1.31, p = 0.19; Stable: z = 1.28, p = 0.20; Fig. 4E). Confirming the validity of cell identification procedure, we found that the spatial distribution of remapping cell firing changed significantly across learning compared with stable cells, measured both as change in the rate map center of mass (Wilcoxon rank-sum test: z = 6.81, p < 0.001; Fig. 4F) and reduced rate map correlations (Wilcoxon rank-sum test: z = −10.59, p < 0.001; Fig. 4G). Next, we investigated burst firing and coordination of firing by the theta rhythm. Remapping cells did not have significantly greater burst index scores than stable cells (Mizuseki and Buzsáki, 2013; Wilcoxon rank-sum test: z = 1.69, p = 0.091; Fig. 5A). The magnitude of theta modulation, as defined by the mean resultant length of the circular distribution of spike theta phases, was also not significantly different between remapping and stable cells (Wilcoxon rank-sum test: z = −0.11, p = 0.091; Fig. 5B). However, the mean theta phases of significantly modulated remapping cells (126 of 133 cells, 94.7%) were shifted to later phases compared with stable cells (138 of 144 cells, 95.8%; common median test: z = 4.92, p = 0.027; Fig. 5C). These findings reveal that in addition to being more sensitive to learning, remapping cells also exhibited important differences in their coding properties compared with stable cells.
Remapping cells increase their replay participation during learning
The main purpose of our study was to test whether awake replay of hippocampal place cell sequences, which has been proposed to function as a mechanism to stabilize plasticity following learning (Dupret et al., 2010; Carr et al., 2011), might serve such a function with regard to the partial remapping induced by aversive learning in the “Learning” sessions. We identified candidate replay events as brief increases in the population-wide firing rate coinciding with SWRs (Fig. 6A). Neither the ripple frequency of these events (Wilcoxon rank-sum test: z = 0.46, p = 0.65; Fig. 6B) nor their rate of generation (t(3) = −2.57, p = 0.082; Fig. 6C) changed across learning, but their duration did increase (Wilcoxon rank-sum test: z = −2.46, p = 0.014; Fig. 6D), consistent with a recent report showing that ripple duration increases with learning and memory demands in rats (Fernández-Ruiz et al., 2019).
A candidate event was classified as a replay event if the correlation between decoded position and time within the event was greater than that of the 95th percentiles of each of two control distributions calculated from two separate shuffling procedures (see Materials and Methods). Replay events became more frequent after the contingency change (i.e., introduction of shock), indicating an important role for these events in learning (t(3) = −5.71, p = 0.0107; Fig. 4B). The very small number of replay events observed before the contingency change (PRE: 0 events in rats 1 and 2, 1 event in rat 3, 3 events in rat 4) precluded any further examination of replay before learning (these events were excluded from further analysis). In total, there were 98 significant replay events during the POST-all epoch, which consisted of all trials occurring after the contingency change (18, 27, 28, and 25 events in rats 1–4, respectively; Fig. 7A–D).
We asked whether there was a difference between remapping and stable cells in terms of their participation within these replay events. Consistent with our hypothesis, we found that remapping cells fired significantly more spikes during replay events than stable cells (Wilcoxon rank-sum test: z = 3.03, p = 0.0024; Fig. 7E). Supporting this, we found a significant correlation between remapping contribution and the number of spikes fired during replay across the place cell population (n = 277 cells; Pearson correlation coefficient = 0.1879, p = 0.0017; Fig. 7F). To measure the contribution that individual cells made to the correlations between time and decoded position, which is the hallmark of hippocampal replay, we used a previously published cell-specific shuffling technique (Grosmark and Buzsáki, 2016). This analysis showed that remapping cells made significantly greater contributions than stable cells to the time–position correlation (Wilcoxon rank-sum test: z = 2.88, p = 0.0040; Fig. 7G). These results were not skewed by remapping cells firing a large number of spikes in a minority of replay events, as we found that remapping cells were more likely to fire at least one spike within any given event (Wilcoxon rank-sum test: z = 3.23, p = 0.0012; Fig. 7H). Further, examining individual events rather than cells confirmed that events contained a greater proportion of the remapping cell population (Wilcoxon signed-rank test: z = 4.87, p < 0.001; Fig. 7I) as well as a greater number of remapping cell spikes compared with stable cells (Wilcoxon signed-rank test: z = 4.97, p < 0.001; Fig. 7J).
To confirm that these differences were because of learning, we turned to the control sessions. Because all place fields will undergo a certain amount of change or fluctuation (Mankin et al., 2012) over time, which, like any biological process, will vary to some degree across units, we were able to split the control session place cell population using the same cell identification procedure into stable and unstable populations, with the unstable population being equivalent to the remapping population in the Learning sessions (n = 123 stable cells, n = 105 remapping cells). There were no significant differences between these two groups in terms of per cell contribution, spikes per event, or likelihood of participation (Wilcoxon rank-sum test: z = −1.60, −1.67, and −1.48; p = 0.11, 0.10, and 0.14, respectively; Fig. 8A–C). Further, the correlation between remapping contribution and spike count was abolished in the CTRL sessions (Pearson correlation coefficient = 0.085, p = 0.203; Fig. 8D). Repeating our analysis of individual replay events, we found no difference in the proportions of more and less stable cell populations active within the events (Wilcoxon signed-rank test: z = −1.21, p = 0.227; Fig. 8E), but we did find that the normalized number of stable cells spikes was significantly greater than for unstable cells (Wilcoxon signed-rank test: z = −2.35, p = 0.019; Fig. 8F).
Because animals ran more trials on the safe arm than on the shock arms (55.9% trials on the safe arm postcontingency change), we wondered whether differences in remapping and stable cell distribution across the three arm types (prechoice, safe, and shock) could interact with an experience-dependent effect on replay (e.g., more replay of the safe arm because of more safe arm experience) to produce the enhancement of replay activity in remapping cells. However, we did not observe any differences in mean firing rates across the three arm types for either remapping or stable cells (Kruskal–Wallis test: remapping cells: H(398) = 0.093, p = 0.95; stable cells: H(431) = 1.23, p = 0.54; Fig. 9A,B), suggesting that any experience-dependent effect on replay would affect both cell populations equally.
Replay events may not represent planning
We plotted the maze positions at which each replay event occurred. The vast majority occurred at the main reward site in the interval between successive trials (Fig. 10A). In three of the four animals, there were a small number that occurred at the small reward location. In two of the animals, there were a small number of events immediately before entry into the choice arms; in rat 2, both events were decoded as forward replays up a shock arm, but the animal ultimately chose the safe arm; in rat 3, all six events occurred during forced-choice trials after the animal had learned the identity of the safe and shock arms, and four of these consisted of two pairs of replays, with the first representing a trajectory up a shock arm followed by a second up the safe arm, suggesting the possibility that they were involved in deliberation.
We examined the content of the individual replay events more closely in an effort to determine whether there was any further evidence that the events were guiding behavior. Forward replay events (i.e., in which the decoded position progresses in a forward direction around the maze) occurred with a frequency that was not different from reverse replay, arguing against an exclusive role in planning (binomial test: chance level = 50%, p = 0.86; Fig. 10B). We did not observe any bias of replay for paths along the choice arms compared with paths that only included the prechoice portion of the maze (i.e., the path from the top right to bottom left; chance level of encoding prechoice arm = 25%, p = 0.48; Figs. 1, 10C), any bias toward paths along safe arm (chance level of encoding safe arm = 33.3%, p = 0.72; Fig. 10D), or paths along the next arm (chance level = 33.3%, p = 0.82; Fig. 10E; this analysis omitted forced-choice trials), supporting our interpretation that replay did not appear to support route planning in this particular behavioral paradigm, and adding further evidence that the greater number of trajectories on the safe arm did not skew the content of replay. Finally, we found no bias toward replay of the previous choice arm (chance level, 33%; binomial test, p = 0.180; Fig. 10F; this analysis also omitted forced-choice trials), suggesting that replay was not reinforcing the spatial representation of the previous choice. Thus, the content of replay seems to correspond to trajectories distributed around the entire maze, and these trajectories appear to be selected randomly.
Together, our analyses of replay events during aversive learning suggest that the role of replay may be in facilitating the storage or stabilization of a new spatial memory. However, given the relatively small number of replay events and the possibility that not all events necessarily serve the same purpose, a role in planning for individual replay events cannot be ruled out.
Firing properties during sharp-wave ripples
We wondered whether a general upregulation of firing during SWRs might underlie the increased participation of remapping cells in replay events. We analyzed all SWRs that did not co-occur with statistically significant replay events from both the precontingency and postcontingency change periods (this analysis used the PRE-all epoch, consisting of PRE with the addition of five preceding free-choice trials, see Methods and Materials; PRE-all, n = 84; POST-all, n = 814). The rate of ripple occurrence did not increase significantly across learning (Fig. 11A). Calculating the average remapping cell and stable cell ripple-centered firing rates within each ripple, we found that remapping cells increased their firing rates across learning (Wilcoxon rank-sum test: remapping: z = −2.79, p = 0.005; stable: z = −0.88, p = 0.38) to a level that was significantly higher than that of stable cells (Wilcoxon signed-rank test: PRE-all: z = 0.24, p = 0.81; POST-all: z = 5.02, p < 0.001; Fig. 11B,C). Similar to replay, we found a significant correlation between remapping contribution and ripple firing rate for individual place cells after the contingency change (remapping and stable cells: Wilcoxon rank-sum test: z = 2.60, p = 0.009; pooled population: Pearson correlation coefficient = 0.132, p = 0.028; Fig. 11D,E); in CTRL sessions, this correlation was abolished (stable vs unstable cells: Wilcoxon rank-sum test: z = 0.98, p = 0.33; Pearson correlation coefficient = −0.0031, p = 0.963; Fig. 12A,B). The proportions of the remapping and stable populations active within any given event increased across learning, but this increase was significantly larger in remapping cells (Remapping: Wilcoxon rank-sum test: z = −4.58, p < 0.001; Stable: Wilcoxon rank-sum test: z = −2.45, p = 0.014; PRE-all: Wilcoxon signed-rank test: z = −0.17, p = 0.86; POST-all: Wilcoxon signed-rank test: z = 5.16, p < 0.001; Fig. 11F); this difference was reversed in CTRL sessions (Wilcoxon signed-rank test: z = −2.79, p = 0.005; difference from remapping: Wilcoxon rank-sum test: z = −5.27, p < 0.001; Fig. 12C).
Discussion
Our data confirm previous findings that aversive learning in a familiar environment induces partial remapping (Wang et al., 2012; Wu et al., 2017). We show for the first time that the subset of cells that remap express an enhanced recruitment into awake SWRs and replay events compared with nonremapping stable cells. This enhanced recruitment of remapping cells cannot be explained by increases in in-field firing rates; though the firing rates of remapping cells did increase across learning, they did not surpass those of stable cells. Further, because remapping cell firing was distributed along all arms of the maze, differences in behavior at specific locations on the maze, such as increased latency to enter shock arms and greater choice preference for the safe arm after learning are also unlikely to explain this enhancement. Thus, our data uncover a link between the excitability of a cell, specifically during SWRs, and the reorganization of its spatial pattern of firing during locomotion, when theta oscillations dominate the local field potential.
While we have found it useful to classify cells as belonging to either remapping or stable place cell populations, we point out here that they may instead exist on a continuum with highly remapping and highly stable cells at the extremes, a possibility highlighted by our correlational analyses (Figs. 7F, 11E).
Replay as a memory storage mechanism occurring in remapping cells
Previous studies have shown that in reward learning, cells whose fields become clustered around the reward zones increase their firing during SWRs (Dupret et al., 2010). However, because reward increases firing rates during SWRs, and SWRs tend to involve cells that have place fields at the reward zone (Singer and Frank, 2009), it was not clear whether the enhanced contributions of reward zone cells was because of a unique role for these cells in replay, or simply a by-product of their place field locations. Our task, in which remapping and nonremapping cell place fields were both distributed along the track arms, circumvented this limitation, and our data show that cells involved in memory encoding increase their contributions during learning more than nonremapping stable cells. Thus, in our task, the location of the place field of a cell does not seem to be the primary determinant of its involvement in either SWRs or replay events.
Research into replay has focused on two roles it may play. First, there is substantial evidence for a role in trajectory planning. In typical goal-directed tasks, the trajectory, or trajectories, leading to the goal reward are thought to be replayed preferentially as a means of the animal planning and previewing where it must navigate to reach the goal (Pfeiffer and Foster, 2013; Widloski and Foster, 2022). Conversely, in a spatial paradigm featuring an aversive shock zone, but no goal zone, replay was shown to preferentially activate cells with fields in the shock zone (i.e., trajectories to be avoided; Wu et al., 2017). Similarly, replay has also been shown to encode a nonpreferred/unrewarded option when another rewarded option is available, indicating that replay can encode trajectories to avoid even in the absence of aversive stimuli (Gupta et al., 2010; Carey et al., 2019). Because replay has been reported both to encode trajectories to navigate, as well as those to avoid, it is impossible to make a prediction of replay content in our paradigm if replay were indeed to serve a role in planning; in our task, there is a part of the maze that is aversive and should be avoided, as well as safe arms that are preferentially navigated by the animal, and the studies cited above suggest that both should be replayed. Therefore, we cannot rule out a role for replay in trajectory planning in our task.
The second, not necessarily mutually exclusive, view of replay is that it underlies memory storage. Disruption of ripples during learning (Jadhav et al., 2012) and post-training periods (Girardeau et al., 2009) has been shown to impair rats trained on a hippocampus-dependent spatial memory task. Further, an increase in SWR incidence has been reported at novel goal locations, and the frequency of these events predicted task performance (Dupret et al., 2010). In a task designed to promote replay before memory-based choice, replay content was found to be decoupled from subsequent choice (Gillespie et al., 2021). Reverse replay, in particular, has been suggested to play an important role in memory encoding, as it is enhanced in novel environments and has been postulated to allow the linking of a reward with the sequence of places through which the animal had to travel to receive it (Foster and Wilson, 2006). Our data appear more consistent with this second view. First, we observed very few replay events at the choice points; instead, most events occurred at the large reward location, where the animal was confined for 10 s between trials. While this is not surprising, given that cessation of locomotion is necessary for brain state to transition from theta into slow-wave activity (Buzsáki, 1986), a prerequisite for SWRs, we would nevertheless expect a greater number of replays at the choice points if they were necessary for route planning/decision-making, as animals were free to stop at the choice points to deliberate over their options. Second, we did not observe any difference in the likelihood of involvement of the relatively neutral prechoice arms and the non-neutral (i.e., aversive or safe) choice arms. Third, we did not observe any preference of forward replay for the next traversed arm. Last, we did not find that forward replay occurred more frequently and, in fact, saw a trend, though not significant, toward a greater incidence of reverse replay. Together, these data indicate that replay during our task may serve to incorporate the spatial coding of the remapping cells into the larger hippocampal representation.
Ripples versus replay
Ripples are brief (duration, 50–100 ms), high-frequency (150–300 Hz) network oscillations within the hippocampus that emerge during nonexploratory states such as slow-wave sleep, quiet rest, grooming, and eating/drinking (Buzsáki, 1986, 2015), and disruption in their expression results in significant memory impairments (Girardeau et al., 2009; Nakashiba et al., 2009; Ego-Stengel and Wilson, 2010; Jadhav et al., 2012; Wang et al., 2015). Typically, only a subset of ripples is found to contain significant replay of place field sequences (Foster and Wilson, 2006; Tingley and Peyrache, 2020). This raises the question of whether replay events and nonreplay ripple events serve different functions. Our finding that remapping cells had enhanced participation in both kinds of events suggests that both serve the same underlying function in our task, namely, to consolidate the incorporation of emotional and contextual information into the spatial representation of the maze. Further, recent work suggests that the heuristic approaches typically used in the field to identify replay events only identify the most salient trajectories, and more sophisticated techniques suggest that most ripple events do contain replay (Krause and Drugowitsch, 2022); this suggests that our ripple results may be best viewed as a replication of our replay results. Together, these data suggest the existence of a mechanism by which remapping cell excitability is enhanced during SWRs, such as a differential response relative to stable cells to the decreased activity of cholinergic and septal GABAergic inputs when transitioning from theta (Buzsáki, 2002).
Conclusion
Our identification and examination of remapping cells show that increased coordination during ripples and replay events in these cells may facilitate the generation and stabilization of plasticity in the memory-encoding place cell network, thereby linking a spatially restricted experience to a representation of the larger environment in which it occurred. The enhancement of replay contribution in remapping cells suggests the existence of a mechanism to select those cells with the emergent coding properties that best support adaptive behavior for participation in replay. That these processes occur in a specific cell population, while a separate population of place cells exhibit stable spatial coding, demonstrates a mechanism whereby the hippocampus can both maintain a stable spatial representation of the environment while incorporating changing features of experiences that occur within that environment.