Name: Ying Shen
Type: User
Company: University of Illinois Urbana-Champaign
Bio: I am a Ph.D. student interested in Multimodal Machine Learning, Natural Language Processing, Computer Vision, Generative Models, and Deep Learning.
Blog: https://yingshen-ys.github.io/
Ying Shen's Projects
This repo contains the code for our paper An Image is Worth 32 Tokens for Reconstruction and Generation
Easily create a beautiful website using Academic and Hugo
This is Pytorch Implementation Code for adding new features in code of Segment-Anything. Here, the features support batch-input on the full-grid prompt (automatic mask generation) with post-processing: removing duplicated or small regions and holes, under flexible input image size
Magenta: Music and Art Generation with Machine Intelligence
Multi-Genre Natural Language Inference
Official implementation of SEED-LLaMA (ICLR 2024).
EMNLP 2017 (Oral): Tensor Fusion Network for Multimodal Sentiment Analysis Code
An implementation of the TrueSkill rating system for Python
[ICML 2024] TrustLLM: Trustworthiness in Large Language Models
Official implementation of "ViewCrafter: Taming Video Diffusion Models for High-fidelity Novel View Synthesis"
[CVPR 2024] Real-Time Open-Vocabulary Object Detection