zhikanggfu Goto Github PK
Type: User
Type: User
Alpha-CLIP: A CLIP Model Focusing on Wherever You Want
AppAgent: Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps.
Official implementation for "You Only Look at Screens: Multimodal Chain-of-Action Agents" (stay tuned and more will be updated)
A collection of resources and papers on Diffusion Models
A collection of recent papers on building autonomous agent. Two topics included: RL-based / LLM-based agents.
A curated list of reinforcement learning with human feedback resources (continually updated)
[Arxiv] A Survey on Video Diffusion Models
Fine-tuning ChatGLM-6B with PEFT | 基于 PEFT 的高效 ChatGLM 微调
a state-of-the-art-level open visual language model | 多模态预训练模型
Let us control diffusion models!
[Arxiv 2023] Official pytorch implementation of "ControlVideo: Training-free Controllable Text-to-Video Generation"
Code for the paper "Training Diffusion Models with Reinforcement Learning"
DDPO for finetuning diffusion models, implemented in PyTorch with LoRA support
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch
[ICCV2023 Oral] PyTorch implementation of DiffusionDet (https://arxiv.org/abs/2211.09788)
what I learned about fine-tuning stable diffusion
A fast library for AutoML and tuning. Join our Discord: https://discord.gg/Cppx2vSPVP.
FreeVC: Towards High-Quality Text-Free One-Shot Voice Conversion
Generative Models by Stability AI
Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback"
Multiagent Reinforcement Learning Research Project
[NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences for Text-to-image Generation
Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference
High-Resolution Image Synthesis with Latent Diffusion Models
Digital Avatar Conversational System - Linly-Talker. 😄✨ Linly-Talker is an intelligent AI system that combines large language models (LLMs) with visual models to create a novel human-AI interaction method. 🤝🤖 It integrates various technologies like Whisper, Linly, Microsoft Speech Services, and SadTalker talking head generation system. 🌟🔬
Easy-to-use LLM fine-tuning framework (LLaMA-2, BLOOM, Falcon, Baichuan, Qwen, ChatGLM2)
Llama中文社区,最好的中文Llama大模型,完全开源可商用
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.