Topic: vision-transformer Goto Github
Some thing interesting about vision-transformer
Some thing interesting about vision-transformer
vision-transformer,Vision-Centric BEV Perception: A Survey
User: 4dvlab
vision-transformer,Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks
User: adithya-s-k
Home Page: https://docs.cognitivelab.in
vision-transformer,Official Pytorch Implementation of: "ImageNet-21K Pretraining for the Masses"(NeurIPS, 2021) paper
Organization: alibaba-miil
vision-transformer,An all-in-one toolkit for computer vision
Organization: alibaba
vision-transformer,EVA Series: Visual Representation Fantasies from BAAI
Organization: baaivision
vision-transformer,Scene Text Recognition with Permuted Autoregressive Sequence Models (ECCV 2022)
User: baudm
Home Page: https://huggingface.co/spaces/baudm/PARSeq-OCR
vision-transformer,A PyTorch implementation of "MobileViT: Light-weight, General-purpose, and Mobile-friendly Vision Transformer"
User: chinhsuanwu
Home Page: https://arxiv.org/abs/2110.02178
vision-transformer,An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites
User: cmhungsteve
vision-transformer,[ICLR 2023 Spotlight] Vision Transformer Adapter for Dense Predictions
User: czczup
Home Page: https://arxiv.org/abs/2205.08534
vision-transformer,Extract markdown and images from PDFs, URLs, docs, slides, and more, ready for multimodal LLMs. ⚡
User: emcf
Home Page: https://thepi.pe
vision-transformer,[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!
Organization: foundationvision
vision-transformer,[ECCV 2022] Official repository for "MaxViT: Multi-Axis Vision Transformer". SOTA foundation models for classification, detection, segmentation, image quality, and generative modeling...
Organization: google-research
vision-transformer,Scenic: A Jax Library for Computer Vision Research and Beyond
Organization: google-research
vision-transformer,[CVPR 2021] Official PyTorch implementation for Transformer Interpretability Beyond Attention Visualization, a novel method to visualize classifications by Transformer based networks.
User: hila-chefer
vision-transformer,Efficient AI Backbones including GhostNet, TNT and MLP, developed by Huawei Noah's Ark Lab.
Organization: huawei-noah
vision-transformer,[NeurIPS 2021] You Only Look at One Sequence
Organization: hustvl
Home Page: https://arxiv.org/abs/2106.00666
vision-transformer,InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output
Organization: internlm
vision-transformer,Explainability for Vision Transformers
User: jacobgil
vision-transformer,This is an official implementation for "Contextual Transformer Networks for Visual Recognition".
Organization: jdai-cv
Home Page: https://arxiv.org/pdf/2107.12292.pdf
vision-transformer,SwinIR: Image Restoration Using Swin Transformer (official repository)
User: jingyunliang
Home Page: https://arxiv.org/abs/2108.10257
vision-transformer,VRT: A Video Restoration Transformer (official repository)
User: jingyunliang
Home Page: https://arxiv.org/abs/2201.12288
vision-transformer,A collection of papers about Transformer in the field of medical image analysis.
User: junyuchen245
vision-transformer,Repository of Vision Transformer with Deformable Attention (CVPR2022) and DAT++: Spatially Dynamic Vision Transformerwith Deformable Attention
User: leaplabthu
Home Page: https://arxiv.org/abs/2309.01430
vision-transformer,pix2tex: Using a ViT to convert images of equations into LaTeX code.
User: lukas-blecher
Home Page: https://lukas-blecher.github.io/LaTeX-OCR/
vision-transformer,Hierarchical Image Pyramid Transformer - CVPR 2022 (Oral)
Organization: mahmoodlab
vision-transformer,[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
Organization: mcg-nju
Home Page: https://arxiv.org/abs/2203.12602
vision-transformer,This is a collection of our NAS and Vision Transformer work.
Organization: microsoft
vision-transformer,EfficientViT is a new family of vision models for efficient high-resolution vision.
Organization: mit-han-lab
vision-transformer,[ECCV] Swin2SR: SwinV2 Transformer for Compressed Image Super-Resolution and Restoration. Advances in Image Manipulation (AIM) workshop ECCV 2022. Try it out! over 3.3M runs https://replicate.com/mv-lab/swin2sr
User: mv-lab
Home Page: https://arxiv.org/abs/2209.11345
vision-transformer,This repository contains demos I made with the Transformers library by HuggingFace.
User: nielsrogge
vision-transformer,[ICLR 2024] Official PyTorch implementation of FasterViT: Fast Vision Transformers with Hierarchical Attention
Organization: nvlabs
Home Page: https://arxiv.org/abs/2306.06189
vision-transformer,Official PyTorch Implementation of MambaVision: A Hybrid Mamba-Transformer Vision Backbone
Organization: nvlabs
Home Page: https://arxiv.org/abs/2407.08083
vision-transformer,Official PyTorch implementation of VoxFormer [CVPR 2023 Highlight]
Organization: nvlabs
vision-transformer,A general representation model across vision, audio, language modalities. Paper: ONE-PEACE: Exploring One General Representation Model Toward Unlimited Modalities
Organization: ofa-sys
vision-transformer,OpenMMLab Detection Toolbox and Benchmark
Organization: open-mmlab
Home Page: https://mmdetection.readthedocs.io
vision-transformer,OpenMMLab Pre-training Toolbox and Benchmark
Organization: open-mmlab
Home Page: https://mmpretrain.readthedocs.io/en/latest/
vision-transformer,[ECCV2024] Video Foundation Models & Data for Multimodal Understanding
Organization: opengvlab
vision-transformer,Awesome List of Attention Modules and Plug&Play Modules in Computer Vision
User: pprp
vision-transformer,[NeurIPS 2021] [T-PAMI] Global Filter Networks for Image Classification
User: raoyongming
Home Page: https://gfnet.ivg-research.xyz/
vision-transformer,SOTA Semantic Segmentation Models in PyTorch
User: sithu31296
vision-transformer,[CVPR 2024] Alpha-CLIP: A CLIP Model Focusing on Wherever You Want
User: sunzey
Home Page: https://aleafy.github.io/alpha-clip
vision-transformer,Towhee is a framework that is dedicated to making neural data processing pipelines simple and fast.
Organization: towhee-io
Home Page: https://towhee.io
vision-transformer,A curated list of foundation models for vision and language tasks
Organization: uncbiag
vision-transformer,A comprehensive list [SAMRS@NeurIPS'23, RVSA@TGRS'22, RSP@TGRS'22] of our research works related to remote sensing, including papers, codes, and citations. Note: The repo for [TGRS'22] "An Empirical Study of Remote Sensing Pretraining" has been moved to: https://github.com/ViTAE-Transformer/RSP
Organization: vitae-transformer
vision-transformer,Unofficial implementation for [ECCV'22] "Exploring Plain Vision Transformer Backbones for Object Detection"
Organization: vitae-transformer
vision-transformer,The official repo for [NeurIPS'22] "ViTPose: Simple Vision Transformer Baselines for Human Pose Estimation" and [TPAMI'23] "ViTPose++: Vision Transformer for Generic Body Pose Estimation"
Organization: vitae-transformer
vision-transformer,UNetFormer: A UNet-like transformer for efficient semantic segmentation of remote sensing urban scene imagery, ISPRS. Also, including other vision transformers and CNNs for satellite, aerial image and UAV image segmentation.
User: wanglibo1995
vision-transformer,CAIRI Supervised, Semi- and Self-Supervised Visual Representation Learning Toolbox and Benchmark
Organization: westlake-ai
Home Page: https://openmixup.readthedocs.io
vision-transformer,(ICLR 2022 Spotlight) Official PyTorch implementation of "How Do Vision Transformers Work?"
User: xxxnell
Home Page: https://arxiv.org/abs/2202.06709
vision-transformer,ICCV2021, Tokens-to-Token ViT: Training Vision Transformers from Scratch on ImageNet
Organization: yitu-opensource
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.