Why does Mann-Whitney U testing matter for target validation?

Mann-Whitney U testing identifies statistically significant differences in voice features between COPD and RTI groups, supporting functional target validation and reducing mechanistic ambiguity in biomarker selection.

How does principal component analysis support independent variable isolation?

Principal component analysis reduces dimensionality and isolates key voice features, enabling clearer interpretation of independent variables that drive disease classification in the discovery pipeline.

What do quantitative AUC and confusion matrix outputs enable?

Quantitative AUC and confusion matrix outputs provide objective measures of model performance, supporting reliable comparison of classification accuracy and informing go/no-go decisions in R&D workflows.

Why are cross-validation and replication critical for collaboration?

Cross-validation and replication ensure model robustness and reproducibility, enabling cross-functional teams to trust and adopt machine learning-based diagnostics in collaborative research settings.

What statistical analysis capabilities are required before implementation?

Robust statistical analysis, including nonparametric testing and principal component analysis, is essential to validate feature selection and model performance prior to broader implementation in biopharma pipelines.

Machine Learning-Based Cough Tone Classification: Diagnostic Exploration of Chronic Obstructive Pulmonary Disease and Respiratory Tract Infections

888 views

06:22 min

September 19th, 2025

10.3791/68222-v

September 19th, 2025

888 views

Xiaohan Liu^*¹ , Tengteng Li^*² , Jingxin Zhang¹ , Jiutang Sun¹ , Jinghua Li² , Zhulv Zhang² , Benzhang Zhao³ , Jianjun Wu³ , Yu Lu¹^,² , Tao Lu¹

¹Beijing University of Chinese Medicine, ²Institute of Information on Traditional Chinese Medicine, Chinese Academy of Traditional Chinese Medicine, ³Beijing University of Chinese Medicine Third Affiliated Hospital

^* These authors contributed equally

This study effectively accomplished the automated classification of two distinct categories by acquiring cough sound data from patients diagnosed with chronic obstructive pulmonary disease (COPD) and respiratory tract infections (RTI), utilizing an integration of speech signal processing techniques and machine learning algorithms.