Paper ID | AUD-10.6 |
Paper Title |
KARAOKE KEY RECOMMENDATION VIA PERSONALIZED COMPETENCE-BASED RATING PREDICTION |
Authors |
Yuan Wang, Santa Clara University, United States; Shigeki Tanaka, NTT DOCOMO, INC., Japan; Keita Yokoyama, Hsin-Tai Wu, DOCOMO Innovations, Inc., United States; Yi Fang, Santa Clara University, United States |
Session | AUD-10: Music Information Retrieval and Music Language Processing 2: Singing Voice |
Location | Gather.Town |
Session Time: | Wednesday, 09 June, 14:00 - 14:45 |
Presentation Time: | Wednesday, 09 June, 14:00 - 14:45 |
Presentation |
Poster
|
Topic |
Audio and Acoustic Signal Processing: [AUD-MIR] Music Information Retrieval and Music Language Processing |
IEEE Xplore Open Preview |
Click here to view in IEEE Xplore |
Virtual Presentation |
Click here to watch in the Virtual Conference |
Abstract |
Karaoke machines have become a popular choice for many people's daily entertainment. In this paper, we address a novel task of recommending a suitable key for a user to sing a given song to meet his or her vocal competence, by proposing the Personalized Competence-based Rating Prediction (PCRP) model. Specifically, we learn the song embedding vectors from the sequences of songs' notes, and then design a history encoder with recurrent units to extract users’ vocal information from the history rating records and utilize a rating decoder based on the Transformer. The experimental results on a real world karaoke rating dataset demonstrate the effectiveness of the proposed approach. |