I’m a senior research scientist and lead the Voice team at Naver Cloud, Korea (from Jan 2023; Naver Corporation from Mar 2017 to Dec 2022). I’m also an adjunct professor in Artificial Intelligence Institute at Seoul National University, Seoul, Korea (from Aug 2022).
I received my Ph.D. degree in department of Electrical and Electronics at Yonsei University, Seoul, Korea. During my Ph.D., I served my internships at Microsoft Research Asia, Beijing, China and Qualcomm Technologies Inc., San Diego, CA.
My research interests include speech synthesis and its real-world applications. Specifically, I develop a high-quality TTS api for cloud services (Clova Voice Pro, Clova Dubbing), an automatic TTS modeling with smartphone recordings (Voice Maker), and a hybrid TTS system combining deep learning and unit-selection TTS models (Clova AI speaker, Naver Maps navigation, Naver News anchor).
If you are interested in me, feel free to contact me.
Download my CV
Period VITS: Variational inference with explicit pitch modeling for end-to-end emotional speech synthesis [paper][demo]
Yuma Shirahata, Ryuichi Yamamoto, Eunwoo Song, Ryo Terashima, Jae-Min Kim, Kentaro Tachibana
Proc. ICASSP, 2023, pp. 4299-4303.
HierSpeech: Bridging the gap between text and speech by hierarchical variational inference using self-supervised representations for speech synthesis [paper][demo]
Sang-Hoon Lee, Seung-Bin Kim, Ji-Hyun Lee, Eunwoo Song, Min-Jae Hwang, Seong-Whan Lee
Proc. NeurIPS, 2022, pp. 16624-16636.
TTS-by-TTS 2: Data-selective augmentation for neural speech synthesis using ranking support vector machine with variational autoencoder [paper][demo]
Eunwoo Song, Ryuichi Yamamoto, Ohsung Kwon, Chan-Ho Song, Min-Jae Hwang, Suhyeon Oh, Hyun-Wook Yoon, Jin-Seob Kim, Jae-Min Kim
Proc. INTERSPEECH, 2022, pp. 1941-1945.
Parallel waveform synthesis [Slides]
Samsung Research, Sep 2022
Data-selective TTS augmentation [Slides]
Naver Engineering Day, Jul 2022
Voice synthesis and applications [Slides]
KAIST and SNU, Apr 2022
Introduction to text-to-speech [Slides]
Naver Engineering Day, Apr 2021
Deep learning-based text-to-speech [Slides]
Yonsei Univ. and Korea Univ., Apr 2021