$$\rightleftharpoonup{xx}$$
$$\longleftharp{xx}$$,
$$\longrightharp{xx}$$,
When recorded from the scalp, brain electrical activity is observed as continuous and regular changes in voltages over time. This electrical activity is called electroencephalogram (EEG) and was first described by Hans Berger in the late twenties of the last century1. Subsequent seminal studies described the EEG as a compound time series, in which different rhythmic or repetitive patterns can be observed2,3,4. Nowadays, the EEG is typically divided into five well-established frequency bands, delta, theta, alpha, beta, and gamma, which are associated with the different sensory and cognitive process.
For years, the study of brain oscillations using EEG was restricted to either analysis of the spectrum in the ongoing activity or changes in oscillatory activity elicited by non-periodic sensory events. In the last decades, different methodologies have been implemented for modulating ongoing EEG oscillations and exploring the effects of such modulations on perceptual and cognitive processes, including the presentation of rhythmic sensory stimulation for inducing neural entrainment. The term neural entrainment refers to the synchronization of neural activity with the periodic properties of sensory stimuli. This process leads to the generation of steady-state evoked potentials (i.e., EEG oscillations locked to the periodic properties of the driving stimuli). Steady-state evoked potentials are most commonly elicited by visual, auditory, and vibrotactile stimulation, using either transient stimuli presented at a constant rate or continuous stimulation modulated in amplitude at the frequency of interest. Whereas somatosensory steady-state evoked potentials (SSSEPs) are recorded in response to repetitive tactile stimulation5,6, steady-state visually evoked potentials (SSVEPs) are generally elicited by the periodic presentation of luminance flickers, pictures, and faces7,8. Auditory steady-state responses (ASSRs) are usually generated by trains of transient acoustic stimuli or by the continuous presentation of amplitude-modulated tones9,10.
The extraction of steady-state evoked potentials from the measured EEG essentially relies on averaging subsequently acquired EEG epochs time-locked to the stimulus11. Due to the periodicity of the responses, they can be analyzed in both time and frequency domains. After the frequency-domain transformation, the sensory response is observed as peaks of amplitude at the presentation rate or modulation frequency of the external stimuli, and their corresponding harmonics. These procedures (time-domain averaging and the subsequent frequency-domain transformation) have been essential for developing a hearing test based on the detection of ASSR methods with clinical purposes12,13,14,15,16.
Furthermore, the classical time-domain averaging of EEG epochs has been extremely useful for analyzing physiological processes such as the generation and extinction of SSVEP17,18. Presenting consecutive trains of flicker lights and averaging subsequent epochs within a recording, Wacker et al.19 observed that the phase-locking index of the SSVEP rapidly increased during the first 400 ms of stimulation and remained high afterwards. They also reported that robust visual entrainment was established between 700-1 100 ms after stimulus onset. A certain degree of entrainment remained effective after the offset of the stimulation train, which lasted approximately three periods of the oscillatory response17,19. Those behaviors have been interpreted as the engaging/disengaging effect of the observed oscillations, which is a consequence of the nonlinear information processing in the human visual system17. Alternatively, it is known that under certain experimental conditions, the flicker stimulation can elicit on-responses at the beginning, and off-responses at the end of stimulation trains instead of neural entrainment18.
The main assumption to average consecutively acquired EEG epochs is that the EEG signal represents a linear combination of the sensory response and the background noise20. Furthermore, the amplitude, frequency, and phase of the oscillatory response are assumed to be stationary, whereas the background noise is considered as a random activity. However, in cases in which this assumption is not met, the response amplitude computed after several epochs do not necessarily correspond to the instantaneous amplitude of the evoked potential.
It has been recently reported that the ASSR generated in the brainstem of rats adapts to the continuous presentation of amplitude-modulated tones (i.e., the response amplitude decrease exponentially over time)21,22. Adaptation has been interpreted as a neural mechanism that reflects the loss of novelty of a monotonously repetitive sensory stimulus, increasing the sensitivity to relevant fluctuations in the acoustic environment23,24. In the auditory pathway, adaptation may enhance speech comprehension in noisy environments. Furthermore, this process may be a part of existing mechanisms to monitor the auditory feedback of one's own voice to control the speech production.
Analyzing the time evolution of the 40 Hz ASSR in humans, Van Eeckhoutte et al.25 observed a significant but small decrease in the response amplitude over time (around -0.0002 µV/s based on the group analysis, when assuming a linear decrease over time). Consequently, these authors concluded that the 40 Hz ASSR in humans does not adapt to the stimulation. In humans, non-stationary behaviors have been observed when analyzing the stability of the SSVEP26. These authors observed that the amplitude of the fundamental frequency and the second harmonic of the SSVEP were stationary in only 30% and 66.7% of the subjects they tested, respectively. The phases of both SSVEP frequency components, although relatively stable over time, exhibited small drifts26.
Therefore, although the classical time-domain averaging of subsequently acquired epochs allows exploring of stationary properties of the neural entrainment, this methodology needs to be revised when long-term dynamics of the entrainment is the focus of the research, or when the averaging of short-term dynamics is corrupted by the occurrence of long-term dynamics. To characterize non-stationary behaviors of the steady-state responses, the evoked response computed at a given time window should not be compromised by those computed in the preceding EEG segments. In other words, the evoked potential should be extracted from the background noise without epochs being time-domain averaged with the preceding EEG segments.
In this study, a method for assessing the dynamics of neural entrainment is presented. Steady-state responses are repetitively recorded in response to the same stimulation, where consecutive recordings are interleaved by a resting interval of three times the length of the experimental run. Considering that if the time evolution of the physiological response is the same in different independent experimental runs (independent recordings), recordings are column-wise averaged. In other words, epochs corresponding to the same location in the different recordings are averaged, without averaging epochs within a recording. Consequently, the response amplitude computed at any stimulation interval will correspond to the instantaneous amplitude of the evoked potential. The sensory responses can be either analyzed in the time-domain or transformed into the frequency-domain, depending on the aim of the experiment. In any case, the amplitudes can be plotted as a function of time to analyze time evolution of the steady-state response. Generation and extinction of the steady-state evoked potentials can be assessed by restricting the analysis to the first and last epochs of the recordings.
The dynamics of the neural entrainment can be analyzed using other approaches, such as narrowband filtering single-trial measurements around the frequency of interest and computing the envelope of the power signal using low-pass filtering25 and the Hilbert transformation27. Compared to these methodologies, the column-wise averaging of epochs allows computing steady-state parameters based on signals with the higher signal-to-noise ratio (SNR). Recently, Kalman filtering has emerged as a promising technique for the estimation of 40-Hz ASSR amplitudes28,29,30. Implementation of Kalman filtering can improve the detection of steady-state responses closer to the electrophysiological threshold and reduce the time of the hearing test29. Furthermore, stationary responses are not needed to be assumed when a Kalman filtering approach is used to estimate the ASSR amplitude30. Nevertheless, only one study has analyzed the time evolution of ASSRs using Kalman filtering25. The conclusion of the study is that the 40-Hz ASSR amplitude is stable over the stimulation interval. Therefore, Kalman filtering needs to be tested in conditions under which the ASRR is not stationary.
Although time consuming, the column-wise averaging method is model-free and does not need initialization values and/or a priori definitions of the noise behavior. Furthermore, since it does not involve convergence times, the column-wise averaging may provide a more reliable representation of the onset of neural entrainment. Therefore, the results obtained with the column-wise averaging method can be considered as the ground truth for analyzing dynamics of the neural entrainment using Kalman filtering.
This description of the protocol is based on an example of SSVEP. However, it is important to note that the method presented here is modality-independent, such that it can also be used to analyze the time evolution of SSSEP and ASSR.