
awj2021's Projects

ape

[ICCV 2023] Code for "Not All Features Matter: Enhancing Few-shot CLIP with Adaptive Prior Refinement"

audioldm

AudioLDM: Generate speech, sound effects, music and beyond, with text.

awesome-face-tasks

A collection of interesting face tasks, e.g., talking face video generation, 3D face reconstruction, face restoration, and so on.

clipn

ICCV 2023: CLIPN for Zero-Shot OOD Detection: Teaching CLIP to Say No

da-fusion

Effective Data Augmentation With Diffusion Models

ddim

Denoising Diffusion Implicit Models

diffae

Official implementation of Diffusion Autoencoders

diffmic

[MICCAI 2023] DiffMIC: Dual-Guidance Diffusion Network for Medical Image Classification

diffusion-classifier

Diffusion Classifier leverages pretrained diffusion models to perform zero-shot classification without additional training

froster

The official repository for the ICLR 2024 paper "FROSTER: Frozen CLIP is a Strong Teacher for Open-Vocabulary Action Recognition"

geneface

GeneFace: Generalized and High-Fidelity 3D Talking Face Synthesis; ICLR 2023; Official code

iclr24

Official code for the ICLR 2024 paper "A Hard-to-Beat Baseline for Training-free CLIP-based Adaptation"

interview

Interview questions: common knowledge points in computer vision and deep learning

ip-adapter

The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images from an image prompt.

lra-diffusion

This is the source code of LRA-diffusion for learning from noisy labels

mkt

Official implementation of "Open-Vocabulary Multi-Label Classification via Multi-Modal Knowledge Transfer".

mm-diffusion

[CVPR'23] MM-Diffusion: Learning Multi-Modal Diffusion Models for Joint Audio and Video Generation

odise

Official PyTorch implementation of ODISE: Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models [CVPR 2023 Highlight]

promix

PyTorch Code for ProMix: Combating Label Noise via Maximizing Clean Sample Utility

psla

Code for the TASLP paper "PSLA: Improving Audio Tagging With Pretraining, Sampling, Labeling, and Aggregation".

pvt

Pyramid Transformer Networks for our own dataset.

quilt1m

[NeurIPS 2023 Oral] Quilt-1M: One Million Image-Text Pairs for Histopathology.
