techthiyanes
Name: Thiya
Type: User
Bio: Data Scientist
Location: Bengaluru
Voice Activity Detection based on Deep Learning & TensorFlow
⚡VoltaML is a lightweight library to convert and run your ML/DL deep learning models in high performance inference runtimes like TensorRT, TorchScript, ONNX and TVM.
Lightweight library to accelerate Stable-Diffusion, Dreambooth into fastest inference models with single line of code 🔥 🔥
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
In defence of metric learning for speaker recognition
🔥 Visual Prompt Tuning [ECCV 2022] https://arxiv.org/abs/2203.12119
Official implementation of VQ-Diffusion
Minimalist implementation of VQ-VAE in PyTorch
Implementation of VQ-VAE for audio
Code and Experiments for ACL-IJCNLP 2021 Paper "Mind Your Outliers! Investigating the Negative Impact of Outliers on Active Learning for Visual Question Answering."
Just playing with getting VQGAN+CLIP running locally, rather than having to use colab.
JAX implementation of VQGAN
Official implementation of VQMIVC: One-shot (any-to-any) Voice Conversion @ Interspeech 2021 + Online playing demo!
VRT: A Video Restoration Transformer
[SIGGRAPH Asia 2022] VToonify: Controllable High-Resolution Portrait Video Style Transfer
Source code for AAAI 2022 paper: Unified Named Entity Recognition as Word-Word Relation Classification
Research code for the paper "Fine-tuning wav2vec2 for speaker recognition" found at https://arxiv.org/abs/2109.15053
A CNN-based optical image ship wake detector.
Watson Openscale sample assets, notebooks and apps.
Wav2Keyword is a keyword spotting (KWS) model based on Wav2Vec 2.0. It achieves state-of-the-art results on the Speech Commands datasets V1 and V2.
This repository contains the code for "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020.
Extension of Wav2Lip repository for processing high-quality videos.
A port of the original Wav2Lip model from PyTorch to TensorFlow.
Official code for Wav2Seq
Incorporating a KenLM language model with the HuggingFace implementation of the Wav2Vec2CTC model using beam search decoding
Live speech recognition using Facebook's wav2vec 2.0 model.
Docker setup for the HF wav2vec2-sprint