Giter Club home page Giter Club logo

Hi there, I'm Gen Luo. 👋

  • 🌱 I’m a Ph.D student in Media Analytics and Computing Lab (MAC), Artificial Intelligence Department, School of Informatics, Xiamen University, China.
  • ❤️ My research interests are in vision-and-language learning and efficient training.

luogen1996's github stats

luogen1996's Projects

conv4d icon conv4d

4D Convolution Operator for TensorFlow

datasets icon datasets

TFDS is a collection of datasets ready to use with TensorFlow, Jax, ...

detectron2 icon detectron2

Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.

detr icon detr

End-to-End Object Detection with Transformers

doodle_generator icon doodle_generator

Multi-process data generators for the kaggle google doodle recognition competition.

ensemble-objdet icon ensemble-objdet

A basic ensemble method for object detection. Given bounding boxes from multiple object detectors, output a single cohesive set of bounding boxes.

examples icon examples

A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.

fairseq icon fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

fgd icon fgd

Focal and Global Knowledge Distillation for Detectors (CVPR 2022)

iep-ref icon iep-ref

Inferring and Executing Programs for Visual Reasoning

image_transformer icon image_transformer

Pytorch implementation of the image transformer for unconditional image generation

keras-transformer icon keras-transformer

Keras library for building (Universal) Transformers, facilitating BERT and GPT models

keras-yolo3 icon keras-yolo3

A Keras implementation of YOLOv3 (Tensorflow backend)

lavin icon lavin

[NeurIPS 2023] Official implementations of "Cheap and Quick: Efficient Vision-Language Instruction Tuning for Large Language Models"

llava icon llava

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

llava-hr icon llava-hr

LLaVA-HR: High-Resolution Large Language-Vision Assistant

mattnet icon mattnet

MAttNet: Modular Attention Network for Referring Expression Comprehension

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.