Giter Club home page Giter Club logo

Helin Wang's Projects

adamwr icon adamwr

Implements https://arxiv.org/abs/1711.05101 AdamW optimizer, cosine learning rate scheduler and "Cyclical Learning Rates for Training Neural Networks" https://arxiv.org/abs/1506.01186 for PyTorch framework

asc_triplet icon asc_triplet

triplet loss on Acoustic Scene Classification-PyTorch

at-gcn icon at-gcn

Pytorch implementation of the paper : Modeling Label Dependencies for Audio Tagging with Graph Convolutional Network

atresn-net icon atresn-net

Capturing attentive temporal relations in semantic neighborhood for ASC

attention-based_atrous_cnn icon attention-based_atrous_cnn

Pytorch code for the paper 'Attention-based Atrous Convolutional Neural Networks: Visualisation and Understanding Perspectives of Acoustic Scenes', by Zhao Ren, Qiuqiang Kong, Jing Han, Mark Plumbley, Björn Schuller.

aty-tts icon aty-tts

Aty-TTS: Improving fairness for spoken language understanding in atypical speech with Text-to-Speech

audioset_raw icon audioset_raw

Download and create a tfreader for the audioset dataset

automatic_speech_annotator icon automatic_speech_annotator

Automatic speech annotator processing speech with voice activaty detection, overlapping speech detection, speaker diarization and automatic speech recognition

babycry-sound-detection icon babycry-sound-detection

PyTorch implementations of neural network models for Babycry sound detection, including training process and test demo. Based on DCASE2017 Task2: Detection of rare sound events.

bnm icon bnm

code of Towards Discriminability and Diversity: Batch Nuclear-norm Maximization under Label Insufficient Situations (CVPR2020 oral)

cmu-thesis icon cmu-thesis

Code for Yun Wang's PhD Thesis: Polyphonic Sound Event Detection with Weak Labeling

consingan icon consingan

Official PyTorch implementation of "Improved Techniques for Training Single-Image GANs"

dcase-2020-task1a-code icon dcase-2020-task1a-code

A pytorch implementation of the paper : Acoustic Scene Classification with Multiple Decision Schemes.

dcase2020-task6-pku icon dcase2020-task6-pku

A Pytorch implementation of the DCASE2020 Task6 by PKU team : Automated Audio Captioning With Temporal Attention

duta-vc icon duta-vc

Source code and demo for INTERPSEECH 2023 paper: DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion Probabilistic Model

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.