Research Article

Computational Modeling of Affective User Experience Using Multimodal Physiological and Behavioral Signals

DOI:

10.3791/69823

April 7th, 2026

In This Article

Summary

Loading...
$$\rightleftharpoonup{xx}$$ $$\longleftharp{xx}$$, $$\longrightharp{xx}$$,

This protocol describes a computational framework that models the affective user experience by integrating physiological and behavioral signals in a multimodal fashion, using techniques for correlation-based feature learning and multimodal fusion. This protocol proposes and tests a framework for multimodal affective modelling on the AMIGOS benchmark dataset.

Abstract

Loading...
$$\rightleftharpoonup{xx}$$ $$\longleftharp{xx}$$, $$\longrightharp{xx}$$,

This work proposes a reproducible computational protocol for multimodal affective modeling that utilizes physiological signals. The goal of the protocol is to enable offline emotion recognition by integrating multiple bio signals using a unified deep learning framework. The proposed work consists of five steps: data collection, preprocessing, feature alignment, multimodal fusion, and evaluation. EEG, ECG, and GSR signals from publicly accessible AMIGOS data were used as the experimental baseline in this work. Bio signals were pre-processed and normalized to extract modality-specific features. Heterogeneous feature spaces were aligned across modalities using Deep Canonical Correlation Analysis, followed by a multimodal fusion network for classifying an affective state. The protocol has been evaluated with offline experiments and compared to conventional fusion and classification models using standard performance metrics such as accuracy, precision, recall, F1-score, and AUC. This study focuses on the development and validation of a computational framework for multimodal affective user experience modeling rather than the deployment of a real-time interactive system. With 92.1% accuracy for UX-affective state prediction and 94.2% F1-score for valence-arousal classification, the results consistently outperformed baseline models on emotional dimensions. These findings verified the effectiveness of the proposed multimodal fusion workflow for computational affective modeling by benchmarking physiological data.

Introduction

Loading...
$$\rightleftharpoonup{xx}$$ $$\longleftharp{xx}$$, $$\longrightharp{xx}$$,

The complex interplay of thinking, feeling, and action shapes how people think and act. Affective computing is studying these relationships by leveraging cross-disciplinary knowledge from neuroscience, psychology, and artificial intelligence to build systems that are capable of analysing, understanding, and reacting to human emotion. This area has been increasingly applied to human–technology communication by incorporating expressive consciousness into responsive AI structures, making technology interact not just with intellectual but also with emotive conditions, resulting in more individualized and emotion-aware user knowledge. An emotion, a complex mental pro....

Access restricted. Please log in or start a trial to view this content.

Protocol

Loading...
$$\rightleftharpoonup{xx}$$ $$\longleftharp{xx}$$, $$\longrightharp{xx}$$,

The AMIGOS dataset used in this study is publicly available and was collected with prior institutional review board approval and informed consent, as reported in the original publication. This study involves only secondary analysis of the dataset, and no additional ethical approval was required.

The present method uses feature alignment and multimodal fusion approaches to handle multimodal physiological and behavioral data in order to describe perception–emotion correlations. This study proposes a computational model for affective user experience (UX) in interactive exhibitions, leveraging multimodal biophysical sensing and AI-based emotion....

Access restricted. Please log in or start a trial to view this content.

Results

Loading...
$$\rightleftharpoonup{xx}$$ $$\longleftharp{xx}$$, $$\longrightharp{xx}$$,

Assessment of the proposed system
To assess the proposed system, it performed experiments on the publicly available AMIGOS dataset, which provides synchronized measurements of EEG, ECG, GSR, video, and audio of 40 users exposed to emotionally engaging stimuli. For the purpose of this research, the authors used data from 33 participants (following preprocessing and removal of incomplete trials), resulting in 1,320 valid samples on the valence and arousal dimensions. The assessment emphasized emotion c.......

Access restricted. Please log in or start a trial to view this content.

Discussion

Loading...
$$\rightleftharpoonup{xx}$$ $$\longleftharp{xx}$$, $$\longrightharp{xx}$$,

Spatial, environmental, and physical interaction contexts, such as spatial layout, crowd density, or ambient environmental conditions, are explicitly not given in the AMIGOS dataset. Thus, such factors are also not directly modelled in the current experiments. The suggested computational framework for Affective User Experience (UX) modeling progresses much further than the base paper's foundational concepts that dealt with user, task-oriented child–robot interaction employing biophysical emotion detection. Gene.......

Access restricted. Please log in or start a trial to view this content.

Disclosures

Loading...
$$\rightleftharpoonup{xx}$$ $$\longleftharp{xx}$$, $$\longrightharp{xx}$$,

The authors have no conflicts of interest.

Acknowledgements

Loading...
$$\rightleftharpoonup{xx}$$ $$\longleftharp{xx}$$, $$\longrightharp{xx}$$,

The authors acknowledge the support of the School of Space Design and the School of Industrial Design at Hongik University. The authors also thank the exhibition partners and participants for their contributions to the study.

....

Access restricted. Please log in or start a trial to view this content.

Materials

List of materials used in this article
NameCompanyCatalog NumberComments
DatasetAMIGOS dataset40 participants; EEG (128 Hz), ECG (1000 Hz), GSR (1000 Hz), facial video, self-reported valence/arousal labelsMultimodal ground truth data for affective state modeling
Physiological SensorsEEG headsetEmotiv EPOC+ (14 channels, 128 Hz)Capturing brain activity related to attention, arousal, and engagement
ECG sensorBiopac MP150 or equivalent (1000 Hz)Heart rate variability and arousal
GSR/EDA sensorShimmer GSR+ or equivalent (1000 Hz)Skin conductance as measure of arousal
Behavioral SensorsEye-tracking deviceTobii Pro X2-60 or equivalentRecording gaze fixation and saccades
Facial expression recordingHigh-resolution video camera; analyzed with OpenFace (AUs, gaze vectors)Extracting facial Action Units (AUs) and gaze cues
Environmental InputsAudio-visual recording setupMicrophone + Camera (synchronized with stimuli)Capturing contextual stimuli during exhibition
Software / ToolkitsOpenFaceOpen-source facial behavior analysis toolkitExtracting Action Units (AUs), gaze direction
MATLAB / Python (NumPy, SciPy, scikit-learn)Signal preprocessing (resampling, z-score normalization, PSD computation)Data preprocessing and feature extraction
TensorFlowv2.13 / PyTorchv2.0Deep learning framework for DCCA and MMFNModel implementation and training
Algorithms / ModelsDeep Canonical Correlation Analysis (DCCA)Nonlinear feature alignment methodLearning correlated latent representations across modalities
Multimodal Fusion Network (MMFN)BiLSTM + Attention-based fusion layersHierarchical fusion of heterogeneous modalities for UX state classification
Evaluation MetricsAccuracy, Precision, Recall, F1-Score, Cohen’s Kappa, AUC-ROC, Confusion MatrixImplemented with scikit-learn / TensorFlow metricsModel performance assessment
Computing HardwareWorkstation / GPU clusterNVIDIA RTX 3080 (10GB) or equivalent, 32 GB RAM, Intel i9 processorModel training and simulation

Reprints and Permissions

Request permission to reuse the text or figures of this JoVE article

Request Permission

Tags

Affective ModelingMultimodal FusionPhysiological SignalsEmotion RecognitionDeep Learning FrameworkEEG SignalsECG SignalsGSR SignalsFeature AlignmentOffline Experiments

Related Articles