Tony Zhang
Projects
MERLIon CCS Challenge: A English-Mandarin code-switching child-directed speech corpus for language identification and diarization
To enhance the reliability and robustness of language identification (LID) and language diarization (LD) systems for heterogeneous populations and scenarios, there is a need for speech processing models to be trained on datasets that feature diverse language registers and speech patterns.
Twin-S: A Digital Twin for Skull-base Surgery
Purpose: Digital twins are virtual interactive models of the real world, exhibiting identical behavior and properties. In surgical applications, computational analysis from digital twins can be used, for example, to enhance situational awareness.
PQLM - Multilingual Decentralized Portable Quantum Language Model
With careful manipulation, malicious agents can reverse engineer private information encoded in pre-trained language models. Security concerns motivate the development of quantum pre-training. In this work, we propose a highly portable quantum language model (PQLM) that can easily transmit information to downstream tasks on classical machines.
A New Approach to Extract Fetal Electrocardiogram Using Affine Combination of Adaptive Filters
The detection of abnormal fetal heartbeats during pregnancy is important for monitoring the health conditions of the fetus. While adult ECG has made several advances in modern medicine, noninvasive fetal electrocardiography (FECG) remains a great challenge.
End-to-End Lyrics Recognition with Self-supervised Learning
Lyrics recognition is an important task in music processing. Despite traditional algorithms such as the hybrid HMM-TDNN model achieving good performance, studies on applying end-to-end models and self-supervised learning (SSL) are limited.