techthiyanes Goto Github PK

followers: 18.0 following: 210.0 repos: 7.2K gists: 2.0

Name: Thiya

Type: User

Bio: Data Scientist

Location: Bengaluru

Thiya's Projects

voice_activity_detection

Voice Activity Detection based on Deep Learning & TensorFlow

voltaml

⚡VoltaML is a lightweight library to convert and run your ML/DL deep learning models in high performance inference runtimes like TensorRT, TorchScript, ONNX and TVM.

voltaml-fast-stable-diffusion

Lightweight library to accelerate Stable-Diffusion, Dreambooth into fastest inference models with single line of code 🔥 🔥

vosk-api

Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

voxceleb_trainer

In defence of metric learning for speaker recognition

vpt

🔥 Visual Prompt Tuning [ECCV 2022] https://arxiv.org/abs/2203.12119

vq-diffusion

Official implementation of VQ-Diffusion

vq-vae

Minimalist implementation of VQ-VAE in Pytorch

vq-vae-audio

Implementation of VQ-VAE for audio

vqa-outliers

Code and Experiments for ACL-IJCNLP 2021 Paper "Mind Your Outliers! Investigating the Negative Impact of Outliers on Active Learning for Visual Question Answering."

vqgan-clip

Just playing with getting VQGAN+CLIP running locally, rather than having to use colab.

vqgan-jax

JAX implementation of VQGAN

vqmivc

Official implementation of VQMIVC: One-shot (any-to-any) Voice Conversion @ Interspeech 2021 + Online playing demo!

vrt

VRT: A Video Restoration Transformer

vtoonify

[SIGGRAPH Asia 2022] VToonify: Controllable High-Resolution Portrait Video Style Transfer

w2ner

Source code for AAAI 2022 paper: Unified Named Entity Recognition as Word-Word Relation Classification

w2v2-air-traffic

w2v2-speaker

Research code for the paper "Fine-tuning wav2vec2 for speaker recognition" found at https://arxiv.org/abs/2109.15053

wakenet

A CNN-based optical image ship wake detector.

warp-drive

watson-openscale-samples

Watson Openscale sample assets, notebooks and apps.

wav2keyword

Wav2Keyword is keyword spotting(KWS) based on Wav2Vec 2.0. This model shows state-of-the-art in Speech commands dataset V1 and V2.

wav2lip

This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020.

wav2lip-hq

Extension of Wav2Lip repository for processing high-quality videos.

wav2lip-in-tensorflow

Here We have Transformed the original wav2lip model (which was built in pytorch) from pytorch to tensorflow

wav2seq

Official code for Wav2Seq

wav2vec2-indonesian

wav2vec2-kenlm

Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding

wav2vec2-live

An live speech recognition using Facebooks wav2vec 2.0 model.

wav2vec2-sprint

docker for HF wav2vec2-sprint

techthiyanes Goto Github PK

Thiya's Projects

Recommend Projects

Recommend Topics

Recommend Org