The av-convtasnet from jusperlee

av-convtasnet's Introduction

Hey 👋🏽, I'm Kai Li!

My name is Kai Li (Chinese name: 李凯). I'm a second-year master's student in the Department of Computer Science and Technology, Tsinghua University, supervised by Prof. Xiaolin Hu (胡晓林). I am also a member of the TSAIL Group directed by Prof. Bo Zhang (张钹) and Prof. Jun Zhu (朱军). I am an intern at Tencent AI Lab, mainly doing research on causal speech separation, supervised by Yi Luo (罗艺).

🤗 I open-source my work whenever I can.

🤗 I am currently doing research on multimodal speech separation, and I am interested in other speech tasks (e.g., pre-trained models and neuroscience). If you would like to collaborate, please contact me. Many thanks.

🔖 Homepages

Kai Li · Jusper Lee · cslikai.cn

📅 News

2023.07: 🎲 One paper was accepted to ECAI 2023.
2023.05: 🧩 Two papers were accepted to Interspeech 2023.
2023.05: 🎉 We won first prize 🥇 in the Cinematic Sound Demixing Track 23 on Leaderboards A and B.
2023.05: 🎉 We won first prize 🥇 and the Best Application Award at ASC23.
2023.04: 🎲 One paper appeared on arXiv.
2023.02: 🧩 One paper was accepted to ICASSP 2023.
2023.01: 🧩 One paper was accepted to ICLR 2023.

📰 Selected Publications:

See Google Scholar for a full list of publications.

Speech Separation

An efficient encoder-decoder architecture with top-down attention for speech separation. Kai Li, Runxuan Yang, Xiaolin Hu. ICLR 2023.
An Audio-Visual Speech Separation Model Inspired by Cortico-Thalamo-Cortical Circuits. Kai Li, Fenghua Xie, Hang Chen, Kexin Yuan, Xiaolin Hu. arXiv 2022.
Speech Separation Using an Asynchronous Fully Recurrent Convolutional Neural Network. Xiaolin Hu, Kai Li, Weiyi Zhang, Yi Luo, Jean-Marie Lemercier, Timo Gerkmann. NeurIPS 2021.

Neuroscience

Inferring mechanisms of auditory attentional modulation with deep neural networks. Ting-Yu Kuo, Yuanda Liao, Kai Li, Bo Hong, Xiaolin Hu. Neural Computation 2022.

Cloud Removal

PMAA: A Progressive Multi-scale Attention Autoencoder Model for High-Performance Cloud Removal from Multi-temporal Satellite Imagery. Xuechao Zou, Kai Li, Junliang Xing, Pin Tao#, Yachao Cui. ECAI 2023.

Super Resolution

A Survey of Single Image Super Resolution Reconstruction. Kai Li, Shenghao Yang, Runting Dong, Jianqiang Huang, Xiaoying Wang. IET Image Processing 2020.
Single Image Super-resolution Reconstruction of Enhanced Loss Function with Multi-GPU Training. Jianqiang Huang, Kai Li, Xiaoying Wang. ISPA 2019.

av-convtasnet's Issues

About the pretrained model

How do I use the pretrained model?

Pretrained model missing keys

Hi,

I tried to load the pretrained models for VoxCeleb2 and LRS3, but they are missing many keys such as video.*, feats_conv.*, etc. Is the pretrained model not meant for the AV_model module?
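One way to narrow down a report like this is to compare the parameter names the model expects with the names stored in the checkpoint, grouped by their top-level prefix. The snippet below is a minimal sketch in plain Python. The helper `summarize_key_mismatch` and the example key names are hypothetical and not taken from this repository. With PyTorch, the two inputs could be `model.state_dict().keys()` and the keys of the dictionary returned by `torch.load`, and some checkpoints may nest the weights under an extra key, which is an assumption worth checking first.

```python
from collections import Counter


def summarize_key_mismatch(model_keys, checkpoint_keys):
    """Count missing and unexpected parameter names per top-level prefix."""
    model_keys, checkpoint_keys = set(model_keys), set(checkpoint_keys)
    missing = model_keys - checkpoint_keys      # expected by the model, absent from the file
    unexpected = checkpoint_keys - model_keys   # stored in the file, unknown to the model

    def by_prefix(keys):
        # "video.conv.weight" -> "video"
        return dict(Counter(key.split(".", 1)[0] for key in keys))

    return {"missing": by_prefix(missing), "unexpected": by_prefix(unexpected)}


# Hypothetical key names, chosen only to mirror the prefixes mentioned in the issue.
model_keys = [
    "video.conv.weight",
    "video.conv.bias",
    "feats_conv.0.weight",
    "audio.encoder.weight",
]
checkpoint_keys = ["audio.encoder.weight", "audio.decoder.weight"]

report = summarize_key_mismatch(model_keys, checkpoint_keys)
print(report)
```

If whole prefixes are missing, as in this made-up example, the checkpoint was probably saved from a different module than the one being constructed. If only the prefix text differs, renaming keys before loading may be enough.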

A code problem during training

Great job!
When I train the model on my own dataset, I run into a problem with "training_epoch_end".
The details are as follows:
File "/home/AVtasnet_linux/Trainer/trainer.py", line 122, in
main(opt)
File "/home/AVtasnet_linux/Trainer/trainer.py", line 112, in main
trainer.fit(system)
File "/root/anaconda3/envs/nichang/lib/python3.8/site-packages/pytorch_lightning/trainer/trainer.py", line 770, in fit
self._call_and_handle_interrupt(
................
pytorch_lightning.utilities.exceptions.MisconfigurationException: training_epoch_end expects a return of None. HINT: remove the return statement in training_epoch_end.

Training stopped at epoch 0 when it reached 100%.
Could anyone help me?
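The exception message itself points at the fix: in the PyTorch Lightning version shown in the traceback, `training_epoch_end` has to return None, so any value computed there should be logged rather than returned. The following is a minimal sketch of that pattern. To keep it runnable without dependencies it uses a small stand-in class instead of the real `LightningModule`. The names `StandInModule`, `System`, and `train_loss_epoch` are hypothetical, and the loss is a placeholder, not the repository's actual training code.

```python
class StandInModule:
    """Tiny stand-in for a LightningModule; it only records what gets logged."""

    def __init__(self):
        self.logged = {}

    def log(self, name, value):
        self.logged[name] = value


class System(StandInModule):
    def training_step(self, batch, batch_idx):
        # Placeholder loss: the mean of the batch values.
        loss = sum(batch) / len(batch)
        return {"loss": loss}

    def training_epoch_end(self, outputs):
        # Aggregate the per-step losses and log the result.
        avg_loss = sum(out["loss"] for out in outputs) / len(outputs)
        self.log("train_loss_epoch", avg_loss)
        # No return statement here: the hook must return None.


system = System()
batches = [[1.0, 3.0], [2.0, 4.0]]
outputs = [system.training_step(batch, idx) for idx, batch in enumerate(batches)]
result = system.training_epoch_end(outputs)
print(result, system.logged)
```

In a real module the same change would mean deleting the `return` line from `training_epoch_end` and keeping or adding a `self.log(...)` call for any metric that was previously returned.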

Any other link to download the dataset?
