Back To Schedule
Thursday, October 28 • 9:00pm - Friday, December 3 • 6:00pm
Phoneme Mappings for Online Vocal Percussion Transcription

Log in to save this to your schedule, view media, leave feedback and see who's attending!

Vocal Percussion Transcription (VPT) aims at detecting vocal percussion sound events in a beatboxing performance and classifying them into the correct drum instrument class (kick, snare, or hi-hat). To do this in an online (real-time) setting, however, algorithms are forced to classify these events within just a few milliseconds after they are detected. The purpose of this study was to investigate which phoneme-to-instrument mappings are the most robust for online transcription purposes. We used three different evaluation criteria to base our decision upon: frequency of use of phonemes among different performers, spectral similarity to reference drum sounds, and classification separability. With these criteria applied, the recommended mappings would potentially feel natural for performers to articulate while enabling the classification algorithms to achieve the best performance possible. Given the final results, we provided a detailed discussion on which phonemes to choose given different contexts and applications.


Alejandro Luezas

Roli / Queen Mary University of London

Charalampos Saitis

Queen Mary University of London

Mark Sandler

Queen Mary University of London

Thursday October 28, 2021 9:00pm - Friday December 3, 2021 6:00pm EST