Why does Mann-Whitney U testing matter for target validation?

Mann-Whitney U testing identifies statistically significant differences in voice features between COPD and RTI groups, supporting functional target validation and reducing mechanistic ambiguity in biomarker selection.

How does principal component analysis support independent variable isolation?

Principal component analysis reduces dimensionality and isolates key voice features, enabling clearer interpretation of independent variables that drive disease classification in the discovery pipeline.

What do quantitative AUC and confusion matrix outputs enable?

Quantitative AUC and confusion matrix outputs provide objective measures of model performance, supporting reliable comparison of classification accuracy and informing go/no-go decisions in R&D workflows.

Why are cross-validation and replication critical for collaboration?

Cross-validation and replication ensure model robustness and reproducibility, enabling cross-functional teams to trust and adopt machine learning-based diagnostics in collaborative research settings.

What statistical analysis capabilities are required before implementation?

Robust statistical analysis, including nonparametric testing and principal component analysis, is essential to validate feature selection and model performance prior to broader implementation in biopharma pipelines.

Op machine learning gebaseerde classificatie van hoesttonen: diagnostische verkenning van chronische obstructieve longziekte en luchtweginfecties

888 views

06:22 min

September 19th, 2025

10.3791/68222-v

September 19th, 2025

888 views

Xiaohan Liu^*¹ , Tengteng Li^*² , Jingxin Zhang¹ , Jiutang Sun¹ , Jinghua Li² , Zhulv Zhang² , Benzhang Zhao³ , Jianjun Wu³ , Yu Lu¹^,² , Tao Lu¹

¹Beijing University of Chinese Medicine, ²Institute of Information on Traditional Chinese Medicine, Chinese Academy of Traditional Chinese Medicine, ³Beijing University of Chinese Medicine Third Affiliated Hospital

^* These authors contributed equally

Deze studie heeft de geautomatiseerde classificatie van twee verschillende categorieën effectief bereikt door hoestgeluidsgegevens te verkrijgen van patiënten met de diagnose chronische obstructieve longziekte (COPD) en luchtweginfecties (RTI), met behulp van een integratie van spraaksignaalverwerkingstechnieken en machine learning-algoritmen.