Comments (3)
@comlhj1114 hello! Thanks for reporting this issue with loading pretrained models in DDP mode. It looks like you're correct about the behavior you've observed.
For your Case 1 and Case 2 where the model isn't loading properly, it seems there might be a need for additional handling in our DDP setup to enable loading pretrained weights directly when initializing a model from a YAML file.
The common workaround for now, as you've found, is to load the model directly from the pretrained .pt file before calling .train(), as shown in your Case 3. That's a valid approach when deploying to a multi-GPU setting.
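For readers without the original report at hand, the workaround can be sketched roughly as follows. This is a minimal sketch, not the exact code from Case 3: the checkpoint name `yolov8n.pt`, the dataset file `coco8.yaml`, the epoch count, and the helper names `build_train_kwargs` and `train_from_pretrained` are all assumptions made for illustration. It relies on the `YOLO(...)` constructor accepting a `.pt` path and on `train()` accepting a list of GPU indices as `device` for multi-GPU training.

```python
def build_train_kwargs(data, epochs, devices):
    """Collect keyword arguments for YOLO.train().

    Hypothetical helper: passing a list of GPU indices as `device`
    is how multi-GPU (DDP) training is requested.
    """
    return {"data": data, "epochs": epochs, "device": list(devices)}


def train_from_pretrained(weights, **train_kwargs):
    """Load a model from a pretrained .pt checkpoint, then train it.

    Hypothetical helper: the model is built from the checkpoint itself
    rather than from a YAML file, so the weights are already in place
    before .train() is called.
    """
    # Imported here so the helper above can be used without ultralytics installed.
    from ultralytics import YOLO

    model = YOLO(weights)  # architecture and weights both come from the .pt file
    return model.train(**train_kwargs)
```

A call such as `train_from_pretrained("yolov8n.pt", **build_train_kwargs("coco8.yaml", 3, [0, 1]))` would then train on GPUs 0 and 1, assuming both files are available locally or can be downloaded.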
Meanwhile, I'll forward this issue to our development team to consider improving the handling of pretrained models in DDP configurations. If you have further insights or would like to contribute to a solution, we encourage you to follow and possibly contribute to this discussion on GitHub. Your feedback is invaluable! 😊
from ultralytics.
@comlhj1114 thanks for the PR, we have a review by @Laughing-q pending on it, and if everything looks good we should have it merged this week :)
@glenn-jocher Thank you for your rapid and valuable feedback. I will try to contribute!