tomchen-ctj

followers: 21 following: 97 repos: 49 gists: 0

Name: Tongjia

Type: User

Bio: [email protected]

Blog: https://tomchen-ctj.github.io/

Hi there👋, it's Tongjia, feel free to call me Tom

working on video understanding

Tongjia's Projects

actionclip

This is the official implementation of the paper "ActionCLIP: A New Paradigm for Action Recognition"

adapt-image-models

[ICLR'23] AIM: Adapting Image Models for Efficient Video Understanding

annotated_latex_equations

Examples of how to create colorful, annotated equations in LaTeX using TikZ.

ask-anything

ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.

awesome-anything

General AI methods for Anything: AnyObject, AnyGeneration, AnyModel, AnyTask, AnyX

awesome-diffusion-models

A collection of resources and papers on Diffusion Models and Score-based Models, a dark horse in the field of Generative Models

awesome-multimodal-large-language-models

✨✨ Latest Papers and Datasets on Multimodal Large Language Models, and Their Evaluation.

awesome-multimodal-ml

Reading list for research topics in multimodal machine learning

awesome-video-domain-adaptation

A comprehensive collection of awesome research and other items about video domain adaptation

awesome-vision-and-language

A curated list of awesome vision and language resources (still under construction... stay tuned!)

awesome-visual-transformer

A collection of papers about transformers in vision. Awesome Transformer with Computer Vision (CV)

awesome_prompting_papers_in_computer_vision

A curated list of prompt-based papers in computer vision and vision-language learning.

clip

Contrastive Language-Image Pretraining

contrastivecrop

[CVPR 2022 Oral] Crafting Better Contrastive Views for Siamese Representation Learning

coop

Prompt Learning for Vision-Language Models

cpl

Official implementation of our EMNLP 2022 paper "CPL: Counterfactual Prompt Learning for Vision and Language Models"

cv_project

Code for the 2021 HNU EEIT Computer Vision course project

cvpr23-loveu-aqtc

[CVPRW'23] First Place Solution to the CVPR'2023 AQTC Challenge

efficient-video-recognition

fame

Code for Motion-aware Contrastive Video Representation Learning via Foreground-background Merging (CVPR 2022)

howtocook

A guide to cooking at home for programmers.

l2p

Learning to Prompt (L2P) for Continual Learning @ CVPR22 and DualPrompt: Complementary Prompting for Rehearsal-free Continual Learning @ ECCV22

latex-ocr

pix2tex: Using a ViT to convert images of equations into LaTeX code.

lavila

Code release for "Learning Video Representations from Large Language Models"

llama

Inference code for LLaMA models

llava

Large Language-and-Vision Assistant built towards multimodal GPT-4 level capabilities.

llm-in-vision

Recent LLM-based CV and related works. Welcome to comment/contribute!

minigpt-4

MiniGPT-4: Enhancing Vision-language Understanding with Advanced Large Language Models

moviechat

🔥 chat with over 10k frames of video!

mulimgviewer

MulimgViewer is a multi-image viewer that can open multiple images in one interface, which is convenient for image comparison and image stitching.
