Laryngeal motility assessment is essential for diagnosing and managing laryngeal disorders. However, paralysis evaluations suffer from high inter-rater variability, necessitating a more objective and quantitative approach. This study introduces a novel AI-driven pipeline that leverages computer vision techniques to classify 155 video-laryngoscopies into unilateral paralysis (n = 68), bilateral paralysis (n = 50), and healthy laryngeal function (n = 37). Our approach includes several advancements over existing literature. We extract the vocal fold positions from each video and automatically identify the most informative, noise-cleaned video segments for classification. We define novel movement-based features that quantitatively capture the restricted mobility characteristics of laryngeal paralysis. These features are used to train two classification models using a 5-fold cross-validation strategy: one model for binary classification (healthy vs. paralyzed) and the other for multi-class classification (healthy vs. unilateral paralysis vs. bilateral paralysis). To assess the importance of these features, we conduct an ablation study using Shapley values. Our method achieves a precision of 0.83, sensitivity (recall) of 0.85, F1-score of 0.84, and balanced accuracy of 0.85 for distinguishing between healthy and paralyzed individuals. For multi-class classification (healthy vs unilateral paralysis vs bilateral paralysis), our model achieves a precision of 0.80, sensitivity of 0.83, F1-score of 0.81, and a balanced accuracy of 0.83. These results highlight the effectiveness of our method and underscore the relevance of our features, further validated by the ablation study. Our AI-grounded approach enhances the accuracy and reliability of automatic laryngeal motility assessment. By introducing novel metrics to quantify paralysis severity, we provide a more objective, reproducible, and clinically valuable evaluation tool.

Artificial Intelligence in Otolaryngology: Redefining Automatic Laryngeal Paralysis Assessment for Optimal Care

Ferrari, Marco;Nicolai, Piero;
2025

Abstract

Laryngeal motility assessment is essential for diagnosing and managing laryngeal disorders. However, paralysis evaluations suffer from high inter-rater variability, necessitating a more objective and quantitative approach. This study introduces a novel AI-driven pipeline that leverages computer vision techniques to classify 155 video-laryngoscopies into unilateral paralysis (n = 68), bilateral paralysis (n = 50), and healthy laryngeal function (n = 37). Our approach includes several advancements over existing literature. We extract the vocal fold positions from each video and automatically identify the most informative, noise-cleaned video segments for classification. We define novel movement-based features that quantitatively capture the restricted mobility characteristics of laryngeal paralysis. These features are used to train two classification models using a 5-fold cross-validation strategy: one model for binary classification (healthy vs. paralyzed) and the other for multi-class classification (healthy vs. unilateral paralysis vs. bilateral paralysis). To assess the importance of these features, we conduct an ablation study using Shapley values. Our method achieves a precision of 0.83, sensitivity (recall) of 0.85, F1-score of 0.84, and balanced accuracy of 0.85 for distinguishing between healthy and paralyzed individuals. For multi-class classification (healthy vs unilateral paralysis vs bilateral paralysis), our model achieves a precision of 0.80, sensitivity of 0.83, F1-score of 0.81, and a balanced accuracy of 0.83. These results highlight the effectiveness of our method and underscore the relevance of our features, further validated by the ablation study. Our AI-grounded approach enhances the accuracy and reliability of automatic laryngeal motility assessment. By introducing novel metrics to quantify paralysis severity, we provide a more objective, reproducible, and clinically valuable evaluation tool.
2025
File in questo prodotto:
File Dimensione Formato  
s42979-025-04606-w.pdf

accesso aperto

Tipologia: Published (Publisher's Version of Record)
Licenza: Creative commons
Dimensione 4.31 MB
Formato Adobe PDF
4.31 MB Adobe PDF Visualizza/Apri
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11577/3573672
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? ND
  • OpenAlex 0
social impact