techthiyanes

followers: 18 following: 210 repos: 7.2K gists: 2

Name: Thiya

Type: User

Bio: Data Scientist

Location: Bengaluru

Thiya's Projects

visual-clustering

Visual Clustering: Clustering Plotted Data by Image Segmentation

visual-question-answering-for-medical-domain

visual-spatial-reasoning

VSR: A probing benchmark for spatial understanding of vision-language models.

visualkeras

Visualkeras is a Python package to help visualize Keras (either standalone or included in TensorFlow) neural network architectures. It allows easy styling to fit most needs. This module supports layered style architecture generation which is great for CNNs (Convolutional Neural Networks), and a graph style architecture, which works great for most models including plain feed-forward networks.

visualvoice

Audio-Visual Speech Separation with Cross-Modal Consistency

vit-gpt2

vit-pytorch

Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in PyTorch

vit-pytorch-1

PyTorch reimplementation of the Vision Transformer (An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale)

vit-vqgan

JAX implementation of ViT-VQGAN

vitdet

Unofficial implementation of Exploring Plain Vision Transformer Backbones for Object Detection

vitmem

Image memorability estimation

vits-fast-fine-tuning

This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and any-to-any voice conversion

vits_diffusion

vitsinger

Singing Voice Speech modeling test

vizier

Python-based research interface for blackbox and hyperparameter optimization, based on Google's internal Vizier Service.

vizseq

An Analysis Toolkit for Natural Language Generation (Translation, Captioning, Summarization, etc.)

vl-t5

PyTorch code for "Unifying Vision-and-Language Tasks via Text Generation" (ICML 2021)

vmoe

vnext

Next-generation video instance recognition framework on top of Detectron2, which supports SeqFormer (ECCV Oral) and IDOL (ECCV Oral)

vocalist

Official repository for the paper VocaLiST: An Audio-Visual Synchronisation Model for Lips and Voices

vocgan

VocGAN: A High-Fidelity Real-time Vocoder with a Hierarchically-nested Adversarial Network

voice_activity_detection

Voice Activity Detection based on Deep Learning & TensorFlow

voicefilter

Unofficial PyTorch implementation of Google AI's VoiceFilter system

voicefixer

General Speech Restoration

voicefixer_main

General Speech Restoration

voiceme

Repository for the paper: VoiceMe: Personalized voice generation in TTS

voicesplit

VoiceSplit: Targeted Voice Separation by Speaker-Conditioned Spectrogram

voltaml

⚡VoltaML is a lightweight library to convert and run your ML/DL models in high-performance inference runtimes like TensorRT, TorchScript, ONNX and TVM.

voltaml-fast-stable-diffusion

Lightweight library to accelerate Stable-Diffusion and Dreambooth into the fastest inference models with a single line of code 🔥 🔥

vosk-api

Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
