Roberto Mazzotta's Projects
🤖 Assemble, configure, and deploy autonomous AI Agents in your browser.
Retrieval-Augmented Video Generation for Telling a Story
Official implementation of AnimateDiff.
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head
An experimental open-source attempt to make GPT-4 fully autonomous.
🔊 Text-Prompted Generative Audio Model
BuboGPT: Enabling Visual Grounding in Multi-Modal LLMs
Constructing a personalized ChatGPT bot capable of accepting voice recordings
CoTracker is a model for tracking any point (pixel) on a video.
LearnWeb3-Ethereum dApp and Cryptocurrency
LearnWeb3-Fake-Voice
Specify what you want it to build, the AI asks for clarification, and then builds it.
LearnWeb3-Image-Generator
LearnWeb3-Img-Identifier
ImageBind One Embedding Space to Bind Them All
🌟 The Multi-Agent Framework: Given one line Requirement, return PRD, Design, Tasks, Repo
A reference containing Styles and Keywords that you can use with MidJourney AI. There are also pages showing resolution comparison, image weights, and much more!
LearnWeb3-NFT Hardhat
Nx plugin for structuring a monorepo with domains and layers
A pandoc LaTeX template to convert markdown files to PDF or LaTeX.
LearnWeb3-PDF-Chatbot
Interact privately with your documents using the power of GPT, 100% privately, no data leaks
one-click deepfake (face swap)
Superpower plugin for ChatGPT
Config files for my GitHub profile.
Text2Room generates textured 3D meshes from a given text prompt using 2D text-to-image models (ICCV2023).
Track-Anything is a flexible and interactive tool for video object tracking and segmentation, based on Segment Anything, XMem, and E2FGVI.
🆙 Upscayl - Free and Open Source AI Image Upscaler for Linux, MacOS and Windows built with Linux-First philosophy.