Implement additional online RL algorithms

<a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="/us

Hey <a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url=

<a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="/us

Implement A2C about trlx HOT 14 CLOSED

carperai commented on May 18, 2024

Implement A2C

from trlx.

Comments (14)

ML-Chen commented on May 18, 2024 1

@geighz Would love to sometime later this week! What's your username in the CarperAI Discord server? I'm mic.

from trlx.

RobertKirk commented on May 18, 2024 1

Note that if you have a sufficiently generic implementation of PPO, you have A2C for free: https://github.com/vwxyzjn/a2c_is_a_special_case_of_ppo

from trlx.

shermansiu commented on May 18, 2024 1

(The discussions for this are currently ongoing in the #trlx channel on the CarperAI Discord.)

from trlx.

ML-Chen commented on May 18, 2024

Hello, I'd like to be assigned to this issue!

from trlx.

geighz commented on May 18, 2024

Hey @ML-Chen , I am wondering how you are doing, would you like to pair up and tackle this issue?

from trlx.

manavgarg commented on May 18, 2024

Hey guys, I am interested in this too. Possible to include me here as well?

from trlx.

ML-Chen commented on May 18, 2024

@manavgarg Sure! Just add me on Discord.

from trlx.

promiseve commented on May 18, 2024

Might be late but also interested

from trlx.

LouisCastricato commented on May 18, 2024

Is this active?

from trlx.

ML-Chen commented on May 18, 2024

Working on this now

from trlx.

shermansiu commented on May 18, 2024

Same, collabing with @ML-Chen.

from trlx.

shermansiu commented on May 18, 2024

(I'm Blitz on Discord)

from trlx.

shermansiu commented on May 18, 2024

Yes, we're aware of that. It's just a matter of making the configurations nice so that it's easily accessible to end-users. There are also other RL algorithms that can be added.

from trlx.

shermansiu commented on May 18, 2024

The pull request: #183

from trlx.

Recommend Projects

React

A declarative, efficient, and flexible JavaScript library for building user interfaces.

Vue.js

🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

Typescript

TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

TensorFlow

An Open Source Machine Learning Framework for Everyone

Django

The Web framework for perfectionists with deadlines.

Laravel

A PHP framework for web artisans

D3

Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

javascript

JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

web

Some thing interesting about web. New door for the world.

server

A server is a program made to process requests and deliver data to clients.

Machine learning

Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

Visualization

Some thing interesting about visualization, use data art

Game

Some thing interesting about game, make everyone happy.

Recommend Org

Facebook

We are working to build community through open source technology. NB: members must have two-factor auth.

Microsoft

Open source projects and samples from Microsoft.

Google

Google ❤️ Open Source for everyone.

Alibaba

Alibaba Open Source for everyone

D3

Data-Driven Documents codes.

Tencent

China tencent open source team.

Comments (14)

Related Issues (20)

Recommend Projects

Recommend Topics

Recommend Org