I am a third-year Ph.D. student advised by Prof. Kenji Kawaguchi at the School of Computing (SoC), National University of Singapore (NUS). Previously, I graduated from Tsinghua University with a B.S. degree in Electronic Engineering. I’ve had the fortune to work with Prof. Dongmei Li at Tsinghua University. Afterwards, I joined Center for Speech and Language Technologies (CSLT) as a research intern with Dr. Lantian Li and Prof. Dong Wang. Then I became an intern in ASR Oteam, Tencent Inc. in Beijing and did research in ASR and Multimodal Learning, organizing ICPR MSR 2022 with Dr. Jian Kang, etc. My research centers on data-centric and generative AI, with a particular emphasis on curating high-quality datasets to enhance the performance and trustworthiness of models such as diffusion and multimodal language models. My previous interest also focuses on audio processing. Email: weida_liang[at]u.nus.edu |
![]() |
![]() |
National University of Singapore
2022.8 - Present Ph.D. Student in Computer Science Advisor: Prof. Kenji Kawaguchi |
![]() |
Tencent
2022.3 - 2022.6 Research Intern Advisor: Dr. Jian Kang |
![]() |
Center for Speech and Language Technologies
2021.8 - 2022.4 Research Intern Advisor: Prof. Dong Wang |
![]() |
Tsinghua University
2017.8 - 2021.6 B.S. in Electronic Engineering Advisor: Prof. Dongmei Li |
![]() |
ICPR 2022 Challenge on Multi-Modal Subtitle Recognition. ICPR 2022 accepted
|
![]() |
Enhanced exemplar autoencoder with cycle consistency loss in any-to-one voice conversion. Interspeech 2022 submitted
Weida Liang,
Lantian Li,
Dong Wang,
Wenqiang Du
|