xmu-xiaoma666's Projects
pytorch implementation of deep learning models
An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"
CV面试中的常见算法
A ✨special ✨ repository to show myself on my homepage.
The official repository for “Image Captioning via Dynamic Path Customization”.
ECCV2022 论文/代码/解读合集,极市团队整理
ECCV2022-Paper-List
ECCV 2022 论文开源项目合集,同时欢迎各位大佬提交issue,分享ECCV 2020开源项目
收集 ECCV 最新的成果,包括论文、代码和demo视频等,欢迎大家推荐!
🍀 Pytorch implementation of various Attention Mechanisms, MLP, Re-parameter, Convolution, which is helpful to further understand papers.⭐⭐⭐
深度学习/计算机视觉/多模态/机器学习/人工智能零基础理论/实战教程汇总分享
⭐⭐⭐FightingCV Paper Reading, which helps you understand the most advanced research work in an easier way 🍀 🍀 🍀
This repository contains 2 tools: - A py3 Lib for NLP & image-caption metrics - Code for a two-tailed t-test with paired samples. It will reveals whether the difference of two results is significant. In this code, we complete evaluation code for Spice details(*i.e.*,Object, Relation, Attribute, Color, Count, and Size ).
Leetcode is all you need
Towards Local Visual Modeling for Image Captioning
An official implementation for "Knowing What to Learn: A Metric-Oriented Focal Mechanism for Image Captioning"
Unofficial implementation of MLP-Mixer: An all-MLP Architecture for Vision
Pytorch-Image-Classification
Pytorch implement ion of RepMLP
Official Code for "Knowing what it is: Semantic-enhanced Dual Attention Transformer" (TMM2022)
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
helper tools for attention visualization in deep learning
An official implementation for "X-CLIP: End-to-End Multi-grained Contrastive Learning for Video-Text Retrieval"
A pytorch implementation of “X-Dreamer: Creating High-quality 3D Content by Bridging the Domain Gap Between Text-to-2D and Text-to-3D Generation”
X-LLM: Bootstrapping Advanced Large Language Models by Treating Multi-Modalities as Foreign Languages
A pytorch implementation of “ X-Mesh: Towards Fast and Accurate Text-driven 3D Stylization via Dynamic Textual Guidance”
🔥🔥🔥YOLOAir:Including YOLOv5, YOLOv7, Transformer, YOLOX, YOLOR and other networks... Support to improve backbone, head, loss, IoU, NMS...The original version was created based on YOLOv5