Article overview
Speaker dependent acoustic-to-articulatory inversion using real-time MRI of the vocal tract
Author: Tamás Gábor Csapó
Date: 4 Aug 2020
Source: arXiv, 2008.02098
Abstract: Acoustic-to-articulatory inversion (AAI) methods estimate articulatory
movements from the acoustic speech signal, which can be useful in several tasks
such as speech recognition, synthesis, talking heads and language tutoring.
Most earlier inversion studies are based on point-tracking articulatory
techniques (e.g. EMA or XRMB). The advantage of rtMRI is that it provides
dynamic information about the full midsagittal plane of the upper airway, with
a high 'relative' spatial resolution. In this work, we estimated midsagittal
rtMRI images of the vocal tract for speaker dependent AAI, using MGC-LSP
spectral features as input. We applied FC-DNNs, CNNs and recurrent neural
networks, and have shown that LSTMs are the most suitable for this task. As
objective evaluation we measured normalized MSE, Structural Similarity Index
(SSIM) and its complex wavelet version (CW-SSIM). The results indicate that the
combination of FC-DNNs and LSTMs can achieve smooth generated MR images of the
vocal tract, which are similar to the original MRI recordings (average CW-SSIM:
0.94).
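
The abstract names normalized MSE and SSIM as objective measures for comparing generated MR images with the original recordings, but does not give the formulas. The Python sketch below illustrates one way such scores could be computed for a pair of grayscale images. Everything in it is an assumption made for illustration, not the paper's evaluation code: the function names `normalized_mse` and `global_ssim` are hypothetical, the MSE is normalized by the mean squared value of the reference (other normalizations exist), and the SSIM is computed over a single window covering the whole image instead of the usual sliding local window. CW-SSIM is not covered.

```python
import numpy as np


def normalized_mse(reference, estimate):
    """MSE divided by the mean squared value of the reference image.

    Assumption: this is one common normalization; the paper may use another.
    """
    reference = np.asarray(reference, dtype=np.float64)
    estimate = np.asarray(estimate, dtype=np.float64)
    return np.mean((reference - estimate) ** 2) / np.mean(reference ** 2)


def global_ssim(x, y, data_range=1.0, k1=0.01, k2=0.03):
    """Simplified SSIM using one window that spans the whole image.

    The standard SSIM averages this quantity over local windows; the
    single-window form here is only meant to show the structure of the formula.
    """
    x = np.asarray(x, dtype=np.float64)
    y = np.asarray(y, dtype=np.float64)
    # Stabilizing constants, with the conventional defaults k1=0.01, k2=0.03.
    c1 = (k1 * data_range) ** 2
    c2 = (k2 * data_range) ** 2
    mu_x, mu_y = x.mean(), y.mean()
    var_x, var_y = x.var(), y.var()
    cov_xy = ((x - mu_x) * (y - mu_y)).mean()
    numerator = (2 * mu_x * mu_y + c1) * (2 * cov_xy + c2)
    denominator = (mu_x ** 2 + mu_y ** 2 + c1) * (var_x + var_y + c2)
    return numerator / denominator


if __name__ == "__main__":
    # Synthetic stand-ins for an original and a generated image (not real MRI data).
    rng = np.random.default_rng(0)
    original = rng.random((64, 64))
    generated = np.clip(original + 0.1 * rng.standard_normal(original.shape), 0.0, 1.0)
    print("NMSE:", normalized_mse(original, generated))
    print("SSIM (global):", global_ssim(original, generated))
```

With identical inputs the normalized MSE is 0 and the global SSIM is 1; scores move away from those values as the generated image departs from the original.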