Weida Liang

I am a third-year Ph.D. student advised by Prof. Kenji Kawaguchi at the School of Computing (SoC), National University of Singapore (NUS). Previously, I graduated from Tsinghua University with a B.S. degree in Electronic Engineering. I’ve had the fortune to work with Prof. Dongmei Li at Tsinghua University. Afterwards, I joined Center for Speech and Language Technologies (CSLT) as a research intern with Dr. Lantian Li and Prof. Dong Wang. Then I became an intern in ASR Oteam, Tencent Inc. in Beijing and did research in ASR and Multimodal Learning, organizing ICPR MSR 2022 with Dr. Jian Kang, etc.

My research centers on data-centric and generative AI, with a particular emphasis on curating high-quality datasets to enhance the performance and trustworthiness of models such as diffusion and multimodal language models. My previous interest also focuses on audio processing.

Email: weida_liang[at]u.nus.edu
Github / Google Scholar / Wechat

profile photo

Research Experience

National University of Singapore
2022.8 - Present

Ph.D. Student in Computer Science
Advisor: Prof. Kenji Kawaguchi
Tencent
2022.3 - 2022.6

Research Intern
Advisor: Dr. Jian Kang
Center for Speech and Language Technologies
2021.8 - 2022.4

Research Intern
Advisor: Prof. Dong Wang
Tsinghua University
2017.8 - 2021.6

B.S. in Electronic Engineering
Advisor: Prof. Dongmei Li

Papers

ICPR 2022 Challenge on Multi-Modal Subtitle Recognition.

ICPR 2022 accepted
Shan Huang, SHEN HUANG, Li Lu, PENGFEI HU, Lijuan Wang, Xiang Wang*, Jian Kang, Weida Liang, Lianwen Jin, Yuliang Liu, Yaqiang Wu
project page

Enhanced exemplar autoencoder with cycle consistency loss in any-to-one voice conversion.

Interspeech 2022 submitted

Weida Liang, Lantian Li, Dong Wang, Wenqiang Du
pdf / code / project page

Patent

A cycle loss based voice conversion device, 2022

Honors & Awards

Meritorious Winner in Mathematical Contest in Modeling, 2019
Bronze Medal in Chinese Mathematical Olympiad, 2017