Yuxuan Wang's Projects
A collection of recent advances propelled by large language models (LLMs), spanning domains including Vision, Audio, Agents, Robotics, fundamental sciences such as Mathematics, and Omni-modal models.
Awesome-LLM: everything you need to know about Large Language Models
🔥🔥🔥Latest Papers, Codes and Datasets on Vid-LLMs.
CVPR and NeurIPS poster examples and templates. May we have in-person poster sessions again soon!
[ACL2023] Shuo Wen Jie Zi is a new learning paradigm that enhances the semantic understanding ability of Chinese PLMs with dictionary knowledge and the structure of Chinese characters
A reading list on hallucination in generative models
An unofficial, community-maintained collection of the university's course materials
Open academic research on improving LLaMA to a SOTA LLM
Language Modeling Research Hub, a comprehensive compendium for enthusiasts and scholars studying language models (LMs), with a particular focus on large language models (LLMs)
Accelerating the development of large multimodal models (LMMs) with lmms-eval
[EMNLP2022] We propose a new collaborative reasoning method on multimodal graphs for multimodal dialogue
Multi-modal Dialogue Scene & Session Discrimination
A multimodal BART baseline for AVSD (Audio-Visual Scene-Aware Dialog)
Pressure-testing Large Video-Language Models (LVLMs): multimodal retrieval from LVLMs at varying video lengths to measure accuracy
Natural Language Processing Tutorial for Deep Learning Researchers
Multimodal Dialogue Understanding and Generation
Official repository for the paper PLLaVA
清华大学计算机系课程攻略 Guidance for courses in the Department of Computer Science and Technology, Tsinghua University
VideoHallucer: the first comprehensive benchmark for hallucination detection in large video-language models (LVLMs)
[ACL2023] VSTAR is a multimodal dialogue dataset with scene and topic transition information