techthiyanes Goto Github PK
Name: Thiya
Type: User
Bio: Data Scientist
Location: Bengaluru
Name: Thiya
Type: User
Bio: Data Scientist
Location: Bengaluru
Visual Clustering: Clustering Plotted Data by Image Segmentation
VSR: A probing benchmark for spatial undersranding of vision-language models.
Visualkeras is a Python package to help visualize Keras (either standalone or included in TensorFlow) neural network architectures. It allows easy styling to fit most needs. This module supports layered style architecture generation which is great for CNNs (Convolutional Neural Networks), and a graph style architecture, which works great for most models including plain feed-forward networks.
Audio-Visual Speech Separation with Cross-Modal Consistency
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
Pytorch reimplementation of the Vision Transformer (An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale)
JAX implementation ViT-VQGAN
Unofficial implementation of Exploring Plain Vision Transformer Backbones for Object Detection
Image memorability estimation
This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and any-to-any voice conversion
Singing Voice Speech modeling test
Python-based research interface for blackbox and hyperparameter optimization, based on Google's internal Vizier Service.
An Analysis Toolkit for Natural Language Generation (Translation, Captioning, Summarization, etc.)
PyTorch code for "Unifying Vision-and-Language Tasks via Text Generation" (ICML 2021)
Next-generation Video instance recognition framework on top of Detectron2 which supports SeqFormer(ECCV Oral) and IDOL(ECCV Oral))
Official repository for the paper VocaLiST: An Audio-Visual Synchronisation Model for Lips and Voices
VocGAN: A High-Fidelity Real-time Vocoder with a Hierarchically-nested Adversarial Network
Voice Activity Detection based on Deep Learning & TensorFlow
Unofficial PyTorch implementation of Google AI's VoiceFilter system
General Speech Restoration
General Speech Restoration
Repository for the paper: VoiceMe: Personalized voice generation in TTS
VoiceSplit: Targeted Voice Separation by Speaker-Conditioned Spectrogram
⚡VoltaML is a lightweight library to convert and run your ML/DL deep learning models in high performance inference runtimes like TensorRT, TorchScript, ONNX and TVM.
Lightweight library to accelerate Stable-Diffusion, Dreambooth into fastest inference models with single line of code 🔥 🔥
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.