Giter Club home page Giter Club logo

To preview/edit localy with docker

docker-compose up

docker-compose.yml file is used to create a container that is reachable under http://localhost:4000. Changes _data/data.yml will be visible after a while.

YANAN WANG's Projects

actiongenome icon actiongenome

A video database bridging human actions and human-object relationships

additional-emotiw-dataset icon additional-emotiw-dataset

Additional datasets for the group-based cohesion and emotion understanding tasks. It contains situation description text for static and dynamic visual data.

bottom-up-attention icon bottom-up-attention

Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome

cmu-multimodalsdk icon cmu-multimodalsdk

CMU MultimodalSDK is a machine learning platform for development of advanced multimodal models as well as easily accessing advanced multimodal datasets.

dest_agqa icon dest_agqa

The official implementation of Learning Fine-Grained Visual Understanding for Video Question Answering via Decoupling Spatial-Temporal Modeling (BMVC 2022 Spotlight).

distillation_methods icon distillation_methods

[ICLR 2020] Contrastive Representation Distillation (CRD), and benchmark of recent knowledge distillation methods

driving-with-llms icon driving-with-llms

PyTorch implementation for the paper "Driving with LLMs: Fusing Object-Level Vector Modality for Explainable Autonomous Driving"

examples icon examples

A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.

fcm-client icon fcm-client

A simple client for Firebase Cloud Messaging (which replaces Google Cloud Messaging (GCM))

gbnet icon gbnet

Bridging Knowledge Graphs to Generate Scene Graphs, ECCV 2020

graph-rcnn.pytorch icon graph-rcnn.pytorch

Pytorch code for our ECCV 2018 paper "Graph R-CNN for Scene Graph Generation" and other papers

graphclip_vgt icon graphclip_vgt

Video Graph Transformer for Video Question Answering (ECCV'22)

home-robot icon home-robot

Mobile manipulation research tools for roboticists

iccv-2023-papers icon iccv-2023-papers

ICCV 2023 Papers: Discover cutting-edge research from ICCV 2023, the leading computer vision conference. Stay updated on the latest in computer vision and deep learning, with code included. ⭐ support visual intelligence development!

kp-gnn icon kp-gnn

Source code for how powerful are K-hop message passing graph neural networks (Neurips 2022)

linkbert icon linkbert

[ACL 2022] LinkBERT: A Knowledgeable Language Model 😎 Pretrained with Document Links

llava icon llava

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

low-rank-multimodal-fusion icon low-rank-multimodal-fusion

This is the repository for "Efficient Low-rank Multimodal Fusion with Modality-Specific Factors", Liu and Shen, et. al. ACL 2018. This repo will be populated shortly afterwards.

merlot icon merlot

MERLOT: Multimodal Neural Script Knowledge Models

mfn icon mfn

Code for Memory Fusion Network, AAAI 2018

moma icon moma

A dataset for multi-object multi-actor activity parsing

neural-motifs icon neural-motifs

Code for Neural Motifs: Scene Graph Parsing with Global Context (CVPR 2018)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.