Tanmay Gupta's Projects
AdReal technology for Desktop
Matlab code for creating an edge map written as a part of my blog ahumaninmachinesworld.blogspot.in
Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome
Bottom-up features extractor implemented in PyTorch.
Implementation of catamull clark subdivision using half edge data structure
Source Code for Android Course Example Applications
Graphical Models and EM for Crowdsourcing
Deep learning with cats (^._.^)
A copy of https://github.com/anuranbaka/OpenDTAM with a main cpp file for both tracking and mapping
A python wrapper for Edge Boxes object proposal generation
Fourier Series expansion of periodic rectangular wave
GIT: A Generative Image-to-text Transformer for Vision and Language
A task-agnostic vision-language architecture as a step towards General Purpose Vision
MATLAB code for Image Blending
Image captioning codebase in pytorch(finetunable cnn in branch "with_finetune";diverse beam search can be found in 'dbs' branch; self-critical training is under my self-critical.pytorch repository.)
Learning phrase grounding from captioned images through InfoNCE bound on mutual information
Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)
Visualize effects of neural net architectural choices on 2D data
A strong HOI Detection model without Frills!
Photographic Image Synthesis with Cascaded Refinement Networks
Faster R-CNN (Python implementation) -- see https://github.com/ShaoqingRen/faster_rcnn for the official MATLAB version
Utility functions and classes for building Artificial Intelligence systems in Python
Fork of ruotianluo/pytorch-faster-rcnn with a simplified script to extract boxes, scores, features etc from any set of images and dump them in a directory
RGBD segmentation
Taming Transformers for High-Resolution Image Synthesis
Organize your experiments into discrete steps that can be cached and reused throughout the lifetime of your research project.