Giter Club home page Giter Club logo

Men Tianyi's Projects

abstract-state-seqmodel icon abstract-state-seqmodel

Code for EMNLP 2023 paper "Emergence of Abstract State Representations in Embodied Sequence Modeling"

agent-smith icon agent-smith

[ICML2024] Agent Smith: A Single Image Can Jailbreak One Million Multimodal LLM Agents Exponentially Fast

autodroid icon autodroid

Source code for the paper "Empowering LLM to use Smartphone for Intelligent Task Automation"

babyai icon babyai

BabyAI platform. A testbed for training agents to understand and execute language commands.

cogvlm icon cogvlm

a state-of-the-art-level open visual language model | 多模态预训练模型

gpt_academic icon gpt_academic

为ChatGPT/GLM提供图形交互界面,特别优化论文阅读/润色/写作体验,模块化设计,支持自定义快捷按钮&函数插件,支持Python和C++等项目剖析&自译解功能,PDF/LaTex论文翻译&总结功能,支持并行问询多种LLM模型,支持清华chatglm2等本地模型。兼容复旦MOSS, llama, rwkv, newbing, claude, claude2等

gym icon gym

A toolkit for developing and comparing reinforcement learning algorithms.

gym-cooking icon gym-cooking

gym-cooking: Code for "Too many cooks: Bayesian inference for coordinating multi-agent collaboration", Winner of the CogSci 2020 Computational Modeling Prize in High Cognition, and a NeurIPS 2020 CoopAI Workshop Best Paper.

label-words-are-anchors icon label-words-are-anchors

Repository for Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning

llm-transparency-tool icon llm-transparency-tool

LLM Transparency Tool (LLM-TT), an open-source interactive toolkit for analyzing internal workings of Transformer-based language models. *Check out demo at* https://huggingface.co/spaces/facebook/llm-transparency-tool-demo

lm-arithmetic icon lm-arithmetic

Code for the paper "A Mechanistic Interpretation of Arithmetic Reasoning in Language Models using Causal Mediation Analysis"

othello_world icon othello_world

Emergent world representations: Exploring a sequence model trained on a synthetic task

pyvene icon pyvene

Stanford NLP Python Library for Understanding and Improving PyTorch Models via Interventions

r-judge icon r-judge

R-Judge: Benchmarking Safety Risk Awareness for LLM Agents

smartplay icon smartplay

SmartPlay is a benchmark for Large Language Models (LLMs). Uses a variety of games to test various important LLM capabilities as agents. SmartPlay is designed to be easy to use, and to support future development of LLMs.

synapse icon synapse

Trajectory-as-Exemplar Prompting with Memory for Computer Control

toolbench icon toolbench

ToolBench, an evaluation suite for LLM tool manipulation capabilities.

tora icon tora

ToRA is a series of Tool-integrated Reasoning LLM Agents designed to solve challenging mathematical reasoning problems by interacting with tools.

tree-of-thought-llm icon tree-of-thought-llm

[NeurIPS 2023] Tree of Thoughts: Deliberate Problem Solving with Large Language Models

webarena icon webarena

Code repo for "WebArena: A Realistic Web Environment for Building Autonomous Agents"

webshop icon webshop

[NeurIPS 2022] 🛒WebShop: Towards Scalable Real-World Web Interaction with Grounded Language Agents

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.