Giter Club home page Giter Club logo

makabakas's Projects

binaural-source-localization-cnn icon binaural-source-localization-cnn

A Deep Convolutional Neural Network (DCNN) designed for the task of localizing human speech to 168 location classes using binaural microphone inputs.

danet-tensorflow icon danet-tensorflow

Tensorflow implementation of "Speaker-independent Speech Separation with Deep Attractor Network"

deep-clustering icon deep-clustering

A tensorflow implementation for Deep clustering: Discriminative embeddings for segmentation and separation

gllim icon gllim

a flexible Matlab toolbox for Gaussian Locally Linear Mapping

humhum icon humhum

a project of digital video and audio processing

mcse icon mcse

Multi-channel speech enhancement system (MVDR beamformer + several postfilters)

messl icon messl

Model-based EM Source Separation and Localization

news_spark icon news_spark

基于Spark2.x新闻网大数据实时分析可视化系统项目

nn-gev icon nn-gev

Neural network supported GEV beamformer

phd-thesis icon phd-thesis

Hagen Wierstorf - Perceptual Assessment of Sound Field Synthesis, PhD thesis, TU Berlin

resample icon resample

audio resample algorithm based weighted-sinc

research icon research

Additional material for my scientific publications

sherpa-onnx icon sherpa-onnx

Real-time speech recognition using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, Raspberry Pi, x86_64 servers, websocket server/client, C/C++, Python, Kotlin

speech_feature_extractor icon speech_feature_extractor

Some useful features of speech process, such as MFCC, gammatone filterbank, GFCC, spectrum(power spectrum and log-power spectrum), Amplitude Modulation Spectrum(AMS) and so on.

spherical-array-processing icon spherical-array-processing

A collection of MATLAB routines for acoustical array processing on spherical harmonic signals, commonly captured with a spherical microphone array.

voice-identification icon voice-identification

Project to explore Speaker and Voice Identification. To follow will be further Speech Recognition tasks.

wavloc icon wavloc

End-to-End binaural sound localization

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.