xjchengit

followers: 31 following: 267 repos: 27 gists: 0

Name: Victor Chen

Type: User

Company: National Taiwan University

Bio: Ph.D. Student @ EECS, NTU.

Twitter: xjchen_ntu

Location: Taipei, Taiwan

Blog: xjchen.tech

Hi there 👋

👨🏼‍💻 I'm Victor, a Ph.D. student at National Taiwan University (NTU), advised by Prof. Hung-yi Lee and Prof. J.-S. Roger Jang. I work on audio-visual learning and speech processing. Before NTU, I obtained a B.S. degree in Computer Science and Information Engineering (CSIE) from National Taiwan University of Science and Technology (Taiwan Tech).

🏠 Personal Website: https://xjchen.tech | 📖 Publications: Google Scholar | 📩 Email: [email protected]

Victor Chen's Projects

aclue

Official github repo for ACLUE, an evaluation benchmark focused on ancient Chinese language comprehension

awesome-audio-visual

A curated list of different papers and datasets in various areas of audio-visual processing

awesome-audio-visual-deepfake

awesome-audio-visual-robustness

codec-superb

Audio Codec Speech processing Universal PERformance Benchmark

ctrprediction

deeplearning-500-questions

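500 Questions on Deep Learning: explains frequently asked topics in probability, linear algebra, machine learning, deep learning, computer vision, and related areas in question-and-answer form, to help the author and any readers who need it. The book has 18 chapters and more than 500,000 characters. Given the author's limited expertise, readers are kindly asked to point out any shortcomings. To be continued... For collaboration, contact [email protected]. All rights reserved; infringement will be pursued. Tan 2018.06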

dynamic-superb

The unofficial repository of Dynamic-SUPERB.

end-to-end-lipreading

PyTorch code for End-to-End Audiovisual Speech Recognition

ir-programming-hw2

Web Retrieval and Mining 2020 (CSIE 5137) - Programming Homework 2

machine-learning-notes

My machine learning notes in LaTeX form

mandarin-wav2vec2

Pre-trained Wav2vec2.0 for Mandarin

mlpack

mlpack: a scalable C++ machine learning library

mtdvocalist

Official repository for the paper Multimodal Transformer Distillation for Audio-Visual Synchronization (ICASSP 2024).

ntu_fintech

NTU_FinTech

ntust-algorithmichw

ntust-complier

ntust-infosecurity

push-pull

Official repository for the paper Push-Pull: Characterizing the Adversarial Robustness for Audio-Visual Active Speaker Detection.

rcurtin

s3prl

Self-Supervised Speech Pre-training and Representation Learning Toolkit.

singgraph

Official repository for the paper Singing Voice Graph Modeling for SingFake Detection (Interspeech 2024).

speech-trident

Awesome speech/audio LLMs, representation learning, and codec models

spot-adv-by-vocoder

victor-leetcode

vocalist

Official repository for the paper VocaLiST: An Audio-Visual Synchronisation Model for Lips and Voices

xjchengit

xjchengit.github.io
