zhikanggfu

followers: 0 following: 0 repos: 49 gists: 0

Type: User

zhikanggfu's Projects

alphaclip

Alpha-CLIP: A CLIP Model Focusing on Wherever You Want

appagent

AppAgent: Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps.

auto-ui

Official implementation for "You Only Look at Screens: Multimodal Chain-of-Action Agents" (stay tuned and more will be updated)

awesome-diffusion-models

A collection of resources and papers on Diffusion Models

awesome-large-multimodal-agents

awesome-papers-autonomous-agent

A collection of recent papers on building autonomous agents. Two topics are included: RL-based and LLM-based agents.

awesome-rlhf

A curated list of resources on reinforcement learning from human feedback (continually updated)

awesome-video-diffusion-models

[Arxiv] A Survey on Video Diffusion Models

chatglm-efficient-tuning

Fine-tuning ChatGLM-6B with PEFT | Efficient ChatGLM fine-tuning based on PEFT

cogvlm

A state-of-the-art open visual language model | Multimodal pretrained model

controlvideo

[Arxiv 2023] Official pytorch implementation of "ControlVideo: Training-free Controllable Text-to-Video Generation"

ddpo

Code for the paper "Training Diffusion Models with Reinforcement Learning"

ddpo-pytorch

DDPO for finetuning diffusion models, implemented in PyTorch with LoRA support

diffusers

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch

diffusion-policies-for-offline-rl

diffusiondet

[ICCV2023 Oral] PyTorch implementation of DiffusionDet (https://arxiv.org/abs/2211.09788)

finetune-sd

What I learned about fine-tuning Stable Diffusion

flaml

A fast library for AutoML and tuning. Join our Discord: https://discord.gg/Cppx2vSPVP.

freevc

FreeVC: Towards High-Quality Text-Free One-Shot Voice Conversion

hh-rlhf

Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback"

hmp2g

Multiagent Reinforcement Learning Research Project

imagereward

[NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences for Text-to-image Generation

latent-consistency-model

Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference

latent-diffusion

High-Resolution Image Synthesis with Latent Diffusion Models

linly-talker

Digital Avatar Conversational System - Linly-Talker. 😄✨ Linly-Talker is an intelligent AI system that combines large language models (LLMs) with visual models to create a novel method of human-AI interaction. 🤝🤖 It integrates technologies such as Whisper, Linly, Microsoft Speech Services, and the SadTalker talking-head generation system. 🌟🔬

llama-factory

Easy-to-use LLM fine-tuning framework (LLaMA-2, BLOOM, Falcon, Baichuan, Qwen, ChatGLM2)

llama2-chinese

Llama Chinese community: the best Chinese Llama large language model, fully open source and available for commercial use
