techthiyanes Goto Github PK

followers: 18 following: 210 repos: 7.2K gists: 2

Name: Thiya

Type: User

Bio: Data Scientist

Location: Bengaluru

Thiya's Projects

visual_taste_approximator

Visual Taste Approximator (VTA) is a very simple tool that helps anyone create an automatic replica of themselves that can approximate their own personal visual taste

visualvoice

Audio-Visual Speech Separation with Cross-Modal Consistency

vit-gpt2

vit-pytorch

Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in PyTorch

vit-pytorch-1

PyTorch reimplementation of the Vision Transformer (An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale)

vit-vqgan

JAX implementation of ViT-VQGAN

vitdet

Unofficial implementation of Exploring Plain Vision Transformer Backbones for Object Detection

vitmatte

Boosting Image Matting with Pretrained Plain Vision Transformers

vitmem

Image memorability estimation

vitrina

👀 VITRina: VIsual Token Representations

vits

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

vits-fast-fine-tuning

A VITS fine-tuning pipeline for fast speaker-adaptation TTS and any-to-any voice conversion

vits2_pytorch

Unofficial VITS2 TTS implementation in PyTorch

vits_diffusion

vitsinger

Singing Voice Speech modeling test

vizier

Python-based research interface for blackbox and hyperparameter optimization, based on Google's internal Vizier Service.

vizseq

An Analysis Toolkit for Natural Language Generation (Translation, Captioning, Summarization, etc.)

vl-t5

PyTorch code for "Unifying Vision-and-Language Tasks via Text Generation" (ICML 2021)

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

vllm_backend

vmoe

vnext

Next-generation video instance recognition framework on top of Detectron2, which supports SeqFormer (ECCV Oral) and IDOL (ECCV Oral)

vocalist

Official repository for the paper VocaLiST: An Audio-Visual Synchronisation Model for Lips and Voices

vocgan

VocGAN: A High-Fidelity Real-time Vocoder with a Hierarchically-nested Adversarial Network

vocode-python

🤖 Build voice-based LLM agents. Modular + open source.

vocos

Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis

vod

Retrieval-augmented LMs, at scale

voicefilter

Unofficial PyTorch implementation of Google AI's VoiceFilter system

voicefixer

General Speech Restoration

voicefixer_main

General Speech Restoration
