Giter Club home page Giter Club logo
  • πŸ‘‹ Hi, I’m @haoranD
  • πŸ‘€ I’m interested in deep learning and its wide applications and theories
  • 🌱 I’m mostly focused on deep learning based computer vision and time series.
  • πŸ’žοΈ I’m looking wide collaborations
  • πŸ“« [email protected]

Haoran Duan's Projects

groundingdino icon groundingdino

Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

groundinglmm icon groundinglmm

Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses that are seamlessly integrated with object segmentation masks.

gta icon gta

[ICLR2024] GTA: A Geometry-Aware Attention Mechanism for Multiview Transformers

guidance icon guidance

A guidance language for controlling large language models.

hands-on-rl icon hands-on-rl

Free course that takes you from zero to Reinforcement Learning PRO πŸ¦ΈπŸ»β€πŸ¦ΈπŸ½

haorand icon haorand

Config files for my GitHub profile.

har-dl icon har-dl

Pytorch Human Activity Recognition based on wearable sensors

home-robot icon home-robot

Mobile manipulation research tools for roboticists

how-do-vits-work icon how-do-vits-work

(ICLR 2022 Spotlight) Official PyTorch implementation of "How Do Vision Transformers Work?"

html4vision icon html4vision

A simple HTML visualization tool for computer vision research :hammer_and_wrench:

human-pose-estimation.pytorch icon human-pose-estimation.pytorch

The project is an official implement of our ECCV2018 paper "Simple Baselines for Human Pose Estimation and Tracking(https://arxiv.org/abs/1804.06208)"

idify icon idify

Make ID photo right in the browser.

ijepa icon ijepa

Official codebase for I-JEPA, the Image-based Joint-Embedding Predictive Architecture. First outlined in the CVPR paper, "Self-supervised learning from images with a joint-embedding predictive architecture."

im2flow icon im2flow

Im2Flow: Motion Hallucination from Static Images for Action Recognition (CVPR 2018)

image-model icon image-model

Using Adversarial Networks to Generate Faces from Depth Maps

image2paragraph icon image2paragraph

Transform Image into Unique Paragraph with ChatGPT, BLIP2, OFA, GRIT, Segment Anything, ControlNet.

imagebind icon imagebind

ImageBind One Embedding Space to Bind Them All

imu-human-pose-pytorch icon imu-human-pose-pytorch

This is an official Pytorch implementation of "Fusing Wearable IMUs with Multi-View Images for Human Pose Estimation: A Geometric Approach, CVPR 2020".

incognitopilot icon incognitopilot

An AI code interpreter for sensitive data, powered by GPT-4 or Llama 2.

instantid icon instantid

InstantID : Zero-shot Identity-Preserving Generation in Seconds πŸ”₯

instruct2act icon instruct2act

Instruct2Act: Mapping Multi-modality Instructions to Robotic Actions with Large Language Model

interngpt icon interngpt

InternGPT / InternChat allows you to interact with ChatGPT by clicking, dragging and drawing using a pointing device.

internimage icon internimage

[CVPR 2023 Highlight] InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    πŸ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. πŸ“ŠπŸ“ˆπŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❀️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.