Comments (4)
The problem somehow solved itself
from rl4lms.
Could you may be run your program with CUDA_LAUNCH_BLOCKING=1 python scripts/training/train_text_generation.py --config_path scripts/training/task_configs/iwslt2017/t5_ppo.yml
so that we can get a better error reports?
from rl4lms.
How did you create the Python environment?
from rl4lms.
Hey Tatiana, can you send a mail?
I am testing this and perhaps we can become collaborators, here is my email [email protected].
Anyone can feel free to mail as well if they are interested in collaborating, we could be working on similar problems.
from rl4lms.
Related Issues (20)
- Resuming from checkpoint is potentially problematic for IMDB since the splits are resampled HOT 1
- In the paper, what is the detail setting of supervised learning? Is SL has additional supervised data?
- [Question] End-to-end example
- Error with Accelerate integration + NLPO HOT 1
- Bug while loading t5 base model HOT 1
- How can I inference data with the model after PPO training?
- Using GPT-2
- 'GPT2Model' object has no attribute 'first_device'
- model.generate.scores returning two scores
- CPU Support Minor Bug
- is multi-dimensional reward supported?
- Memory issue in metric evals?
- Reproducing existing results on NarrativeQA
- NLPO Code Error and Query About gymnasium vs gym Usage
- Pip install error with gym and torch HOT 2
- Do you have any plans to apply the recently published Reinforced Self-Training (ReST)?
- Is PPO really better than SFT (in general)? under the condition of same amount of data HOT 1
- how to stop env parallel multi-process to debug env.step()?
- Upgrade to torch 2.0 HOT 1
- Question about the classifier used for IntentAccuracyDailyDialog.
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from rl4lms.