$$\rightleftharpoonup{xx}$$
$$\longleftharp{xx}$$,
$$\longrightharp{xx}$$,
The purpose of this protocol is to convey a procedure for a standard experiment on psychophysics. Psychophysics is the study of perception phenomena through the measure of behavioral responses, elicited by sensory inputs1,2,3. Usually, human psychophysics is an inexpensive and essential tool to implement in imaging or neurophysiological experiments4. However, it is never easy to select the most appropriate psychophysical method out of many that exist, and the selection somewhat depends on experience and preference. Nevertheless, we encourage beginners to revise available methodologies thoroughly in order to learn about selection criteria5,6,7. Here, we provide a procedure for performing a 2IFC task, which many researchers frequently use for studying perceptual processes such as working memory8, decision making9,10, or time perception11,12,13.
To guide the readers along the method, we recreate a report on the perceptual duration of visual (V), auditory (A), and audiovisual (AV) intervals of aperiodic sequences of pulses. We will refer to this task as an aperiodic interval discrimination (AID) task13. When attempting to describe this paradigm in psychophysics jargon, it would be a class-A, type-1, performance-based, criterion-dependent discrimination task that uses a non-adaptive method of the constants and a hyperbolic tangent (tanh) model to calculate a differential threshold. Even when such a characterization sounds somewhat entangled, we will use it to introduce the reader to some general aspects of psychophysics, hoping to provide decision criteria for new experiments and maybe even the possibility of tailoring the current protocol to other needs.
Any psychophysical experiment, such as a 2IFC task, requires implementing stimuli, a task, a method, an analysis, and a measurement6. The goal is to obtain the psychometric function that better accounts for the measured performance14. A 2IFC task consists of presenting to participants, who are naïve to the purpose of the experiment, trials of two sequential stimuli. After comparing the stimuli, they report the outcome by selecting one, and only one, out of two possible responses that better suits their perception.
With stimuli, we refer to technical considerations about the sensory modality under study. A class-A experiment consists of the comparison of stimuli of the same modality within a trial, whereas class-B experiments include cross-modal comparisons. Other essential considerations about stimuli include their implementation, such as the technical ways of modulating stimuli within a required range. For example, if we want to find the just noticeable difference (JND) between two flutter frequencies vibrating on the skin15, we need a precision stimulator to generate frequencies within the confines of flutter (i.e., 4 - 40 Hz). In other words, the dynamic operating range of the technical elements depend on the dynamic spectrum of each sensory modality.
Selecting a task is about the perceptual phenomenon under study. For instance, finding whether two stimuli are the same, or equivalent, may rely on different brain mechanisms than those resolving if a stimulus is longer or shorter than a reference16 (as in the AID paradigm). Intrinsically, stimuli selection defines the type of obtained responses. Type-1 experiments, sometimes closely related to the so-called performance experiments, include correct or incorrect responses. In contrast, a type-2 experiment (or appearance experiment) produces mostly qualitative answers that depend on the participant's criteria and not on any explicitly imposed criterion; in other words, criterion-independent experiments. It is noteworthy that 2IFC task responses are criterion dependent because, in every trial, the standard stimulus (sometimes called base or reference stimulus) constitutes the criterion on which the comparison's perception depends.
The method may refer to three things; first, it may refer to the mechanism for selecting the range of stimuli to test or, in other words, to an already known range of stimulus variability, as opposed to adaptive methods aimed to establish the adequate range17. These adaptive matters are recommended for quickly finding detection and discrimination thresholds and for minimum trial repetitions18. Also, adaptive methods are optimal for pilot experiments. The second definition of a method is the scale of stimuli modulations (e.g., the method of the constants) or a logarithmic scale. The selected scale may or may not be a direct consequence of the outcome of an adaptive method, but primarily, it regards the dynamics of the studied sensory modality. Lastly, the method also refers to the number of trials and their presentation order.
As for analysis, it relates to the statistics of experimental measurements. Regardless of selecting appropriate analytical methods for comparisons between test and control groups, psychophysics is mostly about measuring absolute or differential thresholds between two conditions (e.g., presence vs. absence of a stimulus, or the JND between two stimuli), particularly in 2IFC19. Such measurements derive from psychometric functions (i.e., continuous models of behavior as a function of the probability of detecting or discerning one of the conditions at stake). Selecting the model function depends on the scale or, in other words, on the spacing of the values of the independent variable. Functions such as cumulative normal, logistic, Quick, and Weibull are appropriate for values spaced linearly, whereas Gumbel and log-Quick are better suited for logarithmic spacing. Alternative models also exist, such as the tanh employed in the AID task. Importantly, selecting a correct model depends on the parameters of interest, as considered in the design of the experiment20. After fitting the data to a model, it should be possible to derive two parameters: α and β parameters. In the case of a logistic function typically employed in a 2IFC paradigm, α refers to the abscissas value projecting to the point of subjective equality (i.e., at half the logistic). The β parameter refers to the slope at α value (i.e., the steepness of the transition between conditions). Finally, a parameter commonly obtained out of a psychometric curve is the differential limen21 (DL). In a 2IFC experiment, the DL relates to β, but strictly, corresponds to the minimum perceived difference between two intervals. The formula to determine the DL is the following equation (1).
(1)
Here, x stands for independent variable values projecting at a 0.75 and 0.25 performance measured directly at the sigmoidal curve. Up until this point, we have covered only some generalities about psychometric functions. We recommend further study of estimating and interpreting psychometric functions, with these and other parameters22.
Other technical aspects to consider when implementing a psychophysical experiment are related to equipment and software. Memory and speed capacities of commercial computers nowadays are usually optimal for processing in high-fidelity visual and auditory tasks. Moreover, the dynamic resolution of complementary material, such as noise-blocking headphones, speakers, and monitors, must fulfill the sampling rate at which the sensory modalities operate (e.g., frequency, amplitude, contrast, and refreshing rate). Also, software programs such as PsychToolbox23 and PsychoPy24 are easy to implement and highly efficient at synchronizing tasks' events and equipment.
The previously described AID task assembles many of the topics described above for a 2IFC paradigm. Interestingly, it explores the perception of V, A, and AV intervals in the range of milliseconds, where most of the brain's processes occur25,26,27. Paradoxically, it is also a challenging lapse for studying vision, which, compared to audition, begets a somewhat constrained sampling rate28. In this sense, multimodal comparisons require additional theoretical scopes12,29,30. Sometimes, they need further tailoring to encompass a common modulation spectrum or to achieve congruent interpretations.
This protocol focuses on a discrimination task (i.e., a 2IFC where a base stimulus, also called reference or standard, is contrasted against a set of comparison or test stimuli to find a JND or, in other words, a discrimination threshold). Here, the task is set to study the capacity of humans to discriminate time intervals of V, A, or AV aperiodic patterns of pulses13. We provide information on creating and parameterizing stimuli, as well as on analyses of accuracy and reaction times. Importantly, we discuss how to interpret subjects' time perception from the psychometrics statistical outcome parameters, and some experimental and analytical alternatives within topics of a 2IFC psychophysical method.