Comments (6)
Let's bring this up today during TRLX weekly :)
from trlx.
@simoninithomas This looks great, as I mentioned in the TRLX weekly I'd love to help out creating quick-start colab notebooks. Do you have any vision in mind, or should I just give it a shot and put up a PR for review?
from trlx.
@thedch I don't have for now a vision in mind. I was thinking on a quickstart with a simple model that we can train and test. If you have some ideas open a new PR so that we can exchange on it 🤗
from trlx.
Is there any support group for trlx.. saw some comments for trlx weekly meeting?
from trlx.
TRLX weekly is Monday at 1pm est in the discord. It's more of an engineering standup meeting than office hours. We've done office hours in the past though, let me know if this is something that interests you :)
from trlx.
Good day Louis,
Yes, please.
How do I sign up for office hours?
from trlx.
Related Issues (20)
- strange design HOT 1
- Use tiny models for the tests
- About the weight of word embedding being nan HOT 1
- Direct Policy Optimization HOT 4
- Add support for safetensors
- sanity check: PPO `log_ratio` should be zero when training is disabled HOT 1
- 8-bit inference
- Sanity check: SFT Model should be frozen (PPO) HOT 2
- support base model + multi adapter for actor, critic, ref and reward model
- Reward model negative numbers meaning HOT 2
- ppo using GLM2-6b as a backbone? HOT 1
- Implement Asynchronous PPO
- Add support for Falcon 7B/40B HOT 1
- Add support for LLaMA2 HOT 1
- Model does not load in the expected dtype HOT 5
- Caught signal 7 (Bus error: nonexistent physical address) HOT 5
- ILQL training batch2 tensor dimensions error HOT 2
- RuntimeError: module must have its parameters and buffers on device HOT 4
- Unable to load the trained model to do the inference HOT 8
- Memory occupy with multi GPUs Training HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from trlx.