Paper ID | SPE-29.2 | ||
Paper Title | IMPROVING ULTRASOUND TONGUE CONTOUR EXTRACTION USING U-NET AND SHAPE CONSISTENCY-BASED REGULARIZER | ||
Authors | Ming Feng, Yin Wang, Tongji University, China; Kele Xu, Huaimin Wang, Bo Ding, National University of Defense Technology, China | ||
Session | SPE-29: Speech Processing 1: Production | ||
Location | Gather.Town | ||
Session Time: | Wednesday, 09 June, 16:30 - 17:15 | ||
Presentation Time: | Wednesday, 09 June, 16:30 - 17:15 | ||
Presentation | Poster | ||
Topic | Speech Processing: [SPE-SPRD] Speech Production | ||
IEEE Xplore Open Preview | Click here to view in IEEE Xplore | ||
Abstract | B-mode ultrasound tongue imaging is widely used to visualize the tongue motion, due to its appearing properties. Extracting the tongue surface contour in the B-mode ultrasound image is still a challenge, while it is a prerequisite for further quantitative analysis. Recently, deep learning-based approach has been adopted in this task. However, the standard deep models fail to address faint contour when the ultrasound wave goes parallel to the tongue surface. To address the faint or missing contours in the sequence, we explore the shape consistency-based regularizer, which can take sequential information into account. By incorporating the regularizer, the deep model not only can extract frame-specific contours, but also can enforce the similarity between the contours extracted from adjacent frames. Extensive experiments are conducted both on the synthetic and real ultrasound tongue imaging dataset and the results demonstrate the effectiveness of proposed method. |