Giter Club home page Giter Club logo

algorithms-notes's Introduction

Hi there, I'm Siteng Huang (黄思腾 in Chinese)! 👋 You can also call me Kyon Huang.

Hi! I am Siteng Huang (黄思腾 in Chinese). I received my Ph.D. degree from Zhejiang University (Hangzhou, China) in June 2024, affiliated with a joint-supervision program with Westlake University (Hangzhou, China) at Machine Intelligence Laboratory (MiLAB) and advised by Prof. Donglin Wang. Before that, I received my B.Eng. Degree from School of Computer Science, Wuhan University (Wuhan, China) in June 2019.

Twitter GitHub GitHub Google Scholar

🔍 Currently, My research has centered on multi-modal large models, especially vision-language models (VLMs), including

  • Generation/AIGC: text-to-image/video (T2I/V) generationADI,SimM, customized & controllable generationADI,SimM, test-time diffusion interventionSimM,VGDiffZero, multi-modal large language models (MLLMs)Cobra, PiTe
  • Understanding: text-video retrieval (TVR)VoP, compositional zero-shot learning (CZSL)Troika, few-shot learning (FSL)AGAM,HTS, visual groundingVGDiffZero,DARA
  • Transfer: parameter-efficient fine-tuning (PEFT/PETL)VoP,DARA,Sparse-Tuning, meta-learningMRN, domain adaptationPDA
  • Embodied AI: vision-language-action models (VLAs)QUAR-VLA, foundation models for robotics

I am always looking for related collaborations, and most of them have produced top-level publications. Feel free to drop me an email if you are interested!

💬 News:

  • [July 1, 2024] Two papers (PiTe and QUAR-VLA) got accepted for ECCV 2024.
  • [June 4, 2024] I successfully defended my dissertation. So many thanks to my Ph.D. committee (Prof. Xiaogang Jin, Prof. Mai Xu, Prof. Changxin Gao, Prof. Fajie Yuan, Prof. Peidong Liu, Prof. Xiaofei Li) and my advisor!
  • [May 5, 2024] Our Cobra was selected for VALSE 2024 Annual Progress Representation. Thanks to all the committee for the approval!
  • [March 29, 2024] Troika got accepted as VALSE 2024 Poster!
  • [March 21, 2024] Cobra, an efficient multi-modal large language model, was released. Project page has been available. The paper has been featured by Hugging Face Daily Papers! Demo has been available!
  • [March 13, 2024] One paper about parameter-efficient tuning for visual grounding got accepted for ICME 2024 (Oral).
  • [February 27, 2024] Awarded as Zhejiang University 2024 Outstanding Graduates!
  • [February 27, 2024] Three papers (ADI, Troika, SimM) as first/co-first author got accepted for CVPR 2024. Congratulations to all collaborators!
  • [December 13, 2023] The paper of VGDiffZero on diffusion model-based zero-shot visual grounding got accepted for ICASSP 2024. Congratulations to all collaborators!
  • [December 9, 2023] One paper on VLM-based unsupervised domain adaptation got accepted for AAAI 2024.
  • [July 24, 2023] 2023 Scholar Metrics was released by Google Scholar. Our paper "DSANet: Dual Self-Attention Network for Multivariate Time Series Forecasting" ranked 8th of the CIKM 2019 conference according to the citations, and 26th within five years.
  • [April 2, 2023] The paper of RL-CZSL about reference-limited compositional learning got accepted for ICMR 2023. Congratulations to all collaborators!
  • [February 28, 2023] The paper of VoP about parameter-efficient text-video retrieval got accepted for CVPR 2023. Congratulations to all collaborators!

📫 Contact me by:

  • Email: siteng.huang[at]gmail.com (Please change [at] to @)

algorithms-notes's People

Contributors

bighuang624 avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar

algorithms-notes's Issues

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.