
av-convtasnet's Introduction

Hey 👋🏽, I'm Kai Li!



My name is Kai Li (Chinese name: 李凯). I'm a second-year master's student in the Department of Computer Science and Technology, Tsinghua University, supervised by Prof. Xiaolin Hu (胡晓林). I am also a member of the TSAIL Group directed by Prof. Bo Zhang (张拨) and Prof. Jun Zhu (朱军). I am an intern at Tencent AI Lab, mainly doing research on causal speech separation, supervised by Yi Luo (罗艺).

🤗 These works are open source to the best of my ability.

🤗 I am currently doing research on multimodal speech separation, and am interested in other speech tasks (e.g., pre-training models and neuroscience). If you would like to collaborate, please contact me. Many thanks.

🔖 Homepages

Kai Li | Jusper Lee | cslikai.cn

📅 News

  • 2023.07: 🎲 One paper was accepted by ECAI 2023.
  • 2023.05: 🧩 Two papers were accepted by Interspeech 2023.
  • 2023.05: 🎉 We won first prize 🥇 in the Cinematic Sound Demixing Track 23 on Leaderboards A and B.
  • 2023.05: 🎉 We won first prize 🥇 and the Best Application Award at ASC23.
  • 2023.04: 🎲 One paper appeared on arXiv.
  • 2023.02: 🧩 One paper was accepted by ICASSP 2023.
  • 2023.01: 🧩 One paper was accepted by ICLR 2023.

📰 Selected Publications:

See Google Scholar for a full list of publications.

Speech Separation

Neuroscience

Cloud Removal

Super Resolution

av-convtasnet's People

Contributors

jusperlee


av-convtasnet's Issues

pretrained model missing keys

Hi,

I tried to load the pretrained models for VoxCeleb2 and LRS3, but they are missing many keys, such as video.* and feats_conv.*. Are the pretrained models not meant for the AV_model module?
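One way to narrow down a report like this is to compare the parameter names the model expects with the names stored in the checkpoint. The sketch below does that on plain lists of key names, so it runs without PyTorch; in practice the two lists would typically come from `model.state_dict().keys()` and from the keys of the loaded checkpoint dictionary. The helper names and the example keys are hypothetical and are not taken from the av-convtasnet code.

```python
from collections import Counter


def diff_state_dict_keys(model_keys, checkpoint_keys):
    """Return (missing, unexpected) key lists for a model/checkpoint pair."""
    model_keys, checkpoint_keys = set(model_keys), set(checkpoint_keys)
    missing = sorted(model_keys - checkpoint_keys)     # expected by the model, absent from the checkpoint
    unexpected = sorted(checkpoint_keys - model_keys)  # present in the checkpoint, unused by the model
    return missing, unexpected


def count_by_prefix(keys):
    """Count keys per top-level module name, e.g. 'video.conv.weight' -> 'video'."""
    return dict(Counter(key.split(".", 1)[0] for key in keys))


# Hypothetical key names, for illustration only.
model_keys = ["video.conv.weight", "video.conv.bias", "feats_conv.weight", "encoder.weight"]
checkpoint_keys = ["encoder.weight", "module.decoder.weight"]

missing, unexpected = diff_state_dict_keys(model_keys, checkpoint_keys)
print("missing by module:", count_by_prefix(missing))
print("unexpected:", unexpected)
```

If whole sub-modules such as video.* show up as missing, the checkpoint was probably saved from a different architecture or saved without those sub-modules. If the same names appear on both sides with a different prefix, the cause is more likely a wrapper prefix that needs to be stripped before loading.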

Some code problem about training

Great job!
When I train the model on my own dataset, I run into a problem with "training_epoch_end". The details are as follows:
  File "/home/AVtasnet_linux/Trainer/trainer.py", line 122, in <module>
    main(opt)
  File "/home/AVtasnet_linux/Trainer/trainer.py", line 112, in main
    trainer.fit(system)
  File "/root/anaconda3/envs/nichang/lib/python3.8/site-packages/pytorch_lightning/trainer/trainer.py", line 770, in fit
    self._call_and_handle_interrupt(
................
pytorch_lightning.utilities.exceptions.MisconfigurationException: training_epoch_end expects a return of None. HINT: remove the return statement in training_epoch_end.

Training stopped at epoch 0 when it reached 100%.
Could anyone help me?
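The hint in the exception points at the usual fix: the epoch-end hook should record its aggregated metrics and return nothing. The sketch below shows that shape with a stand-in class so that it runs without PyTorch Lightning. The class name, the metric name, and the assumption that each step output is a dict with a "loss" entry are all hypothetical; in a real LightningModule, the `log` method is provided by Lightning rather than defined by hand.

```python
class SeparationSystem:
    """Stand-in for a LightningModule subclass; only the epoch-end hook is modeled."""

    def __init__(self):
        self.logged = {}

    def log(self, name, value):
        # Stand-in for LightningModule.log, which records a metric for the loggers.
        self.logged[name] = value

    def training_epoch_end(self, outputs):
        # Assumption: outputs is a list with one dict per batch, each holding a "loss" value.
        losses = [float(out["loss"]) for out in outputs]
        self.log("train_loss_epoch", sum(losses) / len(losses))
        # No return statement here: the hook is expected to return None.


system = SeparationSystem()
result = system.training_epoch_end([{"loss": 1.0}, {"loss": 3.0}])
print(result, system.logged)
```

Any value the original hook returned (for example, a dict of averaged losses) would be logged this way instead of being returned.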
