Hi there👋, it's Tongjia, feel free to call me Tom

Working on video understanding.

Tongjia's Projects

actionclip

Official implementation of the paper "ActionCLIP: A New Paradigm for Action Recognition"

ask-anything

ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.

awesome-anything

General AI methods for Anything: AnyObject, AnyGeneration, AnyModel, AnyTask, AnyX

awesome-diffusion-models

A collection of resources and papers on Diffusion Models and Score-based Models, a dark horse in the field of Generative Models

clip

Contrastive Language-Image Pretraining

contrastivecrop

[CVPR 2022 Oral] Crafting Better Contrastive Views for Siamese Representation Learning

coop

Prompt Learning for Vision-Language Models

cpl

Official implementation of our EMNLP 2022 paper "CPL: Counterfactual Prompt Learning for Vision and Language Models"

cv_project

Code for the 2021 HNU EEIT Computer Vision course project

fame

Code for "Motion-aware Contrastive Video Representation Learning via Foreground-background Merging" (CVPR 2022)

l2p

Learning to Prompt (L2P) for Continual Learning @ CVPR22 and DualPrompt: Complementary Prompting for Rehearsal-free Continual Learning @ ECCV22

latex-ocr

pix2tex: Using a ViT to convert images of equations into LaTeX code.

lavila

Code release for "Learning Video Representations from Large Language Models"

llama

Inference code for LLaMA models

llava

Large Language-and-Vision Assistant built towards multimodal GPT-4 level capabilities.

llm-in-vision

Recent LLM-based CV and related works. Comments and contributions are welcome!

minigpt-4

MiniGPT-4: Enhancing Vision-language Understanding with Advanced Large Language Models

mulimgviewer

MulimgViewer is a multi-image viewer that can open multiple images in one interface, which is convenient for image comparison and image stitching.
