Topic: captioning Goto Github
Some thing interesting about captioning
Some thing interesting about captioning
captioning,A gradio based image captioning tool that uses the GPT-4-Vision API to generate detailed descriptions of images.
User: 42lux
captioning,S2VT (seq2seq) video captioning with bahdanau & luong attention implementation in Tensorflow
User: adrianhsu
Home Page: http://www.cs.utexas.edu/users/ml/papers/venugopalan.iccv15.pdf
captioning,CaMEL: Mean Teacher Learning for Image Captioning. ICPR 2022
Organization: aimagelab
captioning,Positive-Augmented Contrastive Learning for Image and Video Captioning Evaluation. CVPR 2023
Organization: aimagelab
captioning,With a Little Help from your own Past: Prototypical Memory Networks for Image Captioning. ICCV 2023
Organization: aimagelab
captioning,
Organization: andrew-ng-s-number-one-fan
captioning,Image caption extension for A1111 Webui 👁️📜🖋️
User: anshler
captioning,A tool to streamline AI image captioning
User: archangelaries
captioning,Tools for the evaluation of audio captioning.
Organization: audio-captioning
captioning,Python code for handling the Clotho dataset.
Organization: audio-captioning
Home Page: https://zenodo.org/record/3490684
captioning,Audio captioning baseline system for DCASE 2020 challenge.
Organization: audio-captioning
Home Page: http://dcase.community/challenge2020/task-automatic-audio-captioning
captioning,Online professional courses that are captioned and/or subtitled
Organization: cd2bit
Home Page: https://airtable.com/shr4C4ccaiyTQDDSg
captioning,An attempt to solve image captioning (in Vietnamese language) regarding ball sports contexts.
User: congphase
captioning,[CVPR 2022] X-Trans2Cap: Cross-Modal Knowledge Transfer using Transformer for 3D Dense Captioning
User: curryyuan
captioning,CapDec: SOTA Zero Shot Image Captioning Using CLIP and GPT2, EMNLP 2022 (findings)
User: davidhuji
captioning,Using LLMs and pre-trained caption models for super-human performance on image captioning.
User: davidmchan
captioning,Sample app demonstrating adding live captions to Twilio Video rooms
Organization: deepgram-devs
captioning,Sample app to display live captioning to a WebRTC video session with the Deepgram API.
Organization: deepgram-devs
captioning,Fully-Convolutional Point Networks for Large-Scale Point Clouds
User: drethage
captioning,A public repository with key information about the EBU Timed Text (EBU-TT) format.
Organization: ebu
captioning,Toolkit for supporting the EBU-TT Live specification
Organization: ebu
Home Page: http://ebu.github.io/ebu-tt-live-toolkit/
captioning,Captioning code in PyTorch
User: elbayadm
captioning,My notes on some Deep Learning papers
User: elbayadm
captioning,A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)
Organization: facebookresearch
Home Page: https://mmf.sh/
captioning,A curated list of zero-shot captioning papers
User: feielysia
captioning,A small program that automatically captures live speech and outputs as an onscreen caption via OBS
User: foxiest
captioning,Official python implementation of R3-Transformer
User: hassanhub
captioning,A Tennis dataset and models for event detection & commentary generation
User: haydenfaulkner
captioning,[NLPCC'23] ZeroGen: Zero-shot Multimodal Controllable Text Generation with Multiple Oracles PyTorch Implementation
User: imkett
Home Page: https://arxiv.org/abs/2306.16649
captioning,An App with Voice Assisted Image Captioning and VQA For Visually Challenged Individuals
User: j0sal
captioning,SimpleSubtitleEditor for Blender
User: jamesruan
captioning,Audio Captioning datasets for PyTorch.
User: labbeti
Home Page: https://aac-datasets.readthedocs.io/
captioning,Metrics for evaluating Automated Audio Captioning systems, designed for PyTorch.
User: labbeti
Home Page: https://aac-metrics.readthedocs.io/
captioning,Code for "Aligning Linguistic Words and Visual Semantic Units for Image Captioning", ACM MM 2019
User: ltguo19
captioning,A Pytorch implementation of Attention on Attention module (both self and guided variants), for Visual Question Answering
User: lucidrains
captioning,Medical image captioning using OpenAI's CLIP
User: mauville
captioning,VisText is a benchmark dataset for semantically rich chart captioning.
Organization: mitvis
Home Page: http://vis.csail.mit.edu/pubs/vistext/
captioning,Smart-I is an android application aimed at helping the visually impaired using artificial intelligence and cloud computing.
User: naivehobo
captioning,Python program to generate memes.
User: nikhilkumarsingh
captioning,What and How Well You Performed? A Multitask Learning Approach to Action Quality Assessment [CVPR 2019]
User: paritoshparmar
Home Page: https://arxiv.org/abs/1904.04346
captioning,Indonesian Image Captioning using Attention-based Semantic Compositional Networks
User: rayandrew
captioning,Some papers about *diverse* image (a few videos) captioning
User: ryanliut
captioning,Automatically describing the content of an image in Persian
Organization: sharif-slpl
captioning,JavaScript bookmarklet for viewing YouTube video transcripts in a popout window.
User: stevennyman
captioning,[CVPR21] Visual Semantic Role Labeling for Video Understanding (https://arxiv.org/abs/2104.00990)
User: theshadow29
Home Page: https://vidsitu.org/
captioning,A base model for image captions.
User: wangleihitcs
captioning,A Base Tensorflow Project for Medical Report Generation
User: wangleihitcs
captioning,Modifying LAVIS' BLIP2 Q-former with models pretrained on Japanese datasets.
User: zhaopeiduo
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.