Victor Chen's Projects
Official github repo for ACLUE, an evaluation benchmark focused on ancient Chinese language comprehension
A curated list of different papers and datasets in various areas of audio-visual processing
awesome-audio-visual-robustness
Audio Codec Speech processing Universal PERformance Benchmark
深度学习500问,以问答形式对常用的概率知识、线性代数、机器学习、深度学习、计算机视觉等热点问题进行阐述,以帮助自己及有需要的读者。 全书分为18个章节,50余万字。由于水平有限,书中不妥之处恳请广大读者批评指正。 未完待续............ 如有意合作,联系[email protected] 版权所有,违权必究 Tan 2018.06
The unofficial repository of Dynamic-SUPERB.
Pytorch code for End-to-End Audiovisual Speech Recognition
Web Retrieval and Mining 2020 (CSIE 5137) - Programming Homework 2
This contains my machine learning notes in latex form
Pre-trained Wav2vec2.0 for Mandarin
mlpack: a scalable C++ machine learning library --
Official repository for the paper Multimodal Transformer Distillation for Audio-Visual Synchronization (ICASSP 2024).
NTU_FinTech
Official repository for the paper Push-Pull: Characterizing the Adversarial Robustness for Audio-Visual Active Speaker Detection.
Self-Supervised Speech Pre-training and Representation Learning Toolkit.
Official repository for the paper Singing Voice Graph Modeling for SingFake Detection (Interspeech 2024).
Awesome speech/audio LLMs, representation learning, and codec models
Official repository for the paper VocaLiST: An Audio-Visual Synchronisation Model for Lips and Voices