Research Article

Development of Interactive Artificial Intelligence Tools for Personalized Somatosensory and Rhythm Evaluation in Intelligent Music Education Platforms

DOI:

10.3791/69058

⸱

December 19th, 2025

In This Article

Summary

Loading...
$$\rightleftharpoonup{xx}$$ $$\longleftharp{xx}$$, $$\longrightharp{xx}$$,

This study presents a reproducible somatosensory music-learning protocol combining residual LSTM recognition with TRPO for adaptive difficulty. It covers preprocessing, FFT features, training, personalization, and evaluation. On a public dataset, the hybrid model reached Acc 95.0 / P 93.5 / R 94.6 / F1 94.2 over three subject-disjoint folds.

Abstract

Loading...
$$\rightleftharpoonup{xx}$$ $$\longleftharp{xx}$$, $$\longrightharp{xx}$$,

Traditional music education often lacks interactivity and real-time adaptability, especially in remote settings. This study introduces a personalized somatosensory framework, TRPO-ResLSTM, for music education platforms. The system captures movement, rhythm, and response time, preprocesses data with Wiener filtering and Z-score normalization, and extracts features via FFT. Gesture recognition is performed by DeepRes-LSTM, while adaptive difficulty is regulated by TRPO reinforcement learning. Incremental learning ensures personalization across sessions. Experiments on a publicly available, anonymized gesture-rhythm dataset (n = 2,730 samples; training/validation/test split 70/15/15) show superior performance over multimodal baselines, achieving 95% accuracy, 93.5% precision, 94.6% recall, and 94.2% F1-score. Ablation studies confirm the individual contributions of TRPO and Res-LSTM. The innovation of this protocol lies in integrating reinforcement learning with residual temporal modeling for adaptive gesture recognition, enabling stable yet personalized learning. This work demonstrates that adaptive, gesture-responsive tools can enhance engagement, personalization, and progressive skill development in intelligent music education. Limitations include reliance on a single dataset and the need for real-learner validation, which define directions for future work.

Introduction

Loading...
$$\rightleftharpoonup{xx}$$ $$\longleftharp{xx}$$, $$\longrightharp{xx}$$,

Recent advances in artificial intelligence (AI) and somatosensory technology are reshaping music education by enabling learners to interact with music through body movements, where gestures are translated into notes, rhythms, or controls for virtual instruments1,2. These interactive features enhance engagement, retention, and creativity compared to traditional classroom instruction, and somatosensory tools allow students to practice rhythm, coordination, and expression through body percussion, conducting gestures, and ensemble simulations3. Combined with AI-driven adaptive pathways, lea....

Access restricted. Please log in or start a trial to view this content.

Protocol

Loading...
$$\rightleftharpoonup{xx}$$ $$\longleftharp{xx}$$, $$\longrightharp{xx}$$,

This study analyzed anonymized, publicly available data and did not involve human subjects or animals. Therefore, additional ethical approval was not required.

1. Overview

This protocol describes a reproducible framework for somatosensory music education based on deep residual LSTM recognition and Trust Region Policy Optimization (TRPO) for adaptive difficulty control. It includes dataset preparation, preprocessing, frequency-domain feature extraction, model architecture, training, personalization, and evaluation. Figure 1 illustrates the end-to-end workflow

Access restricted. Please log in or start a trial to view this content.

Results

Loading...
$$\rightleftharpoonup{xx}$$ $$\longleftharp{xx}$$, $$\longrightharp{xx}$$,

Experimental Setup
The TRPO-ResLSTM framework was implemented in Python 3.10.1 with GPU acceleration. The computing environment, motion-sensing hardware, and Python libraries are listed in the Table of Materials. Data were split into subject-disjoint training/validation/test partitions as shown in Table 1 (70/15/15). Key hyperparameters are summarized in Table 2. Three models were evaluated: baseline TRPO, baseline ResLSTM, and the integrated TRPO-ResLSTM. This set.......

Access restricted. Please log in or start a trial to view this content.

Discussion

Loading...
$$\rightleftharpoonup{xx}$$ $$\longleftharp{xx}$$, $$\longrightharp{xx}$$,

This study proposes a hybrid protocol, TRPO-ResLSTM, that integrates reinforcement learning and residual temporal modeling for gesture-based music education. By combining the stability of Trust Region Policy Optimization (TRPO) with the sequence-learning capacity of residual LSTMs, the framework delivers real-time gesture recognition together with adaptive difficulty control, enabling personalized feedback and progressive skill acquisition25. To ensure reproducibility, subject-disjoint folds, fixe.......

Access restricted. Please log in or start a trial to view this content.

Disclosures

Loading...
$$\rightleftharpoonup{xx}$$ $$\longleftharp{xx}$$, $$\longrightharp{xx}$$,

The authors declare no conflicts of interest.

Acknowledgements

Loading...
$$\rightleftharpoonup{xx}$$ $$\longleftharp{xx}$$, $$\longrightharp{xx}$$,

The authors thank their colleagues for constructive feedback on the study design and manuscript preparation. This work received no specific grant from any funding agency in the public, commercial, or not-for-profit sectors.

....

Access restricted. Please log in or start a trial to view this content.

Materials

List of materials used in this article
NameCompanyCatalog NumberComments
Accelerometer sensor dataKaggle (Public domain)Multimodal input signals (motion patterns, timing features) included in dataset
GPU workstationNVIDIA Corporation, USATraining hardware: NVIDIA RTX 3080 (10 GB), 32 GB RAM, Ubuntu 20.04
Hand - joint position dataKaggle (Public domain)Somatosensory input for gesture recognition
Matplotlib (v3.7)https://matplotlib.orgVisualization library for plotting figures and performance metrics
NumPy (v1.23)https://numpy.orgNumerical computation library for array operations
Public music gesture and rhythm datasetKaggle (Public domain)Anonymized dataset of 2,730 samples recording body responses to tempo and beat; used for training/validation/testing (70/15/15)
Python 3.10.1Python Software Foundation, https://www.python.orgProgramming environment for model implementation and analysis
PyTorch (v1.13)https://pytorch.orgDeep learning framework for implementing ResLSTM and TRPO modules
scikit - learn (v1.2)https://scikit-learn.orgMachine learning utilities for preprocessing and evaluation
SciPy (v1.10)https://scipy.orgScientific computing library (used for Wiener filtering)

References

Loading...
$$\rightleftharpoonup{xx}$$ $$\longleftharp{xx}$$, $$\longrightharp{xx}$$,
  1. Wei, J., Karuppiah, M., Prathik, A. College music education and teaching based on AI techniques. Comput Electr Eng. 100, 107851(2022).
  2. Yu, X., et al. Developments and applications of artificial intelligence in music education. Technol. 11 (2), ....

Access restricted. Please log in or start a trial to view this content.

Reprints and Permissions

Request permission to reuse the text or figures of this JoVE article

Request Permission

Tags

Intelligent Music EducationSomatosensory EvaluationGesture RecognitionRhythm EvaluationTRPO Reinforcement LearningResLSTM ModelAdaptive DifficultyIncremental LearningFeature ExtractionPersonalized Learning

Related Articles