Giter Club home page Giter Club logo

Comments (10)

LittleLittleCloud avatar LittleLittleCloud commented on June 9, 2024

@michaelgsharp my best guess is there's overflow when calculating focal loss?

public override Tensor forward(Tensor classifications, Tensor regressions, Tensor anchors, Tensor annotations)

from machinelearning.

Cilouche avatar Cilouche commented on June 9, 2024

Is there a solution or suggestion? please

from machinelearning.

LittleLittleCloud avatar LittleLittleCloud commented on June 9, 2024

@Cilouche which coco dataset are you using, could you share a link?

from machinelearning.

Cilouche avatar Cilouche commented on June 9, 2024

data : https://drive.google.com/drive/folders/1-dQPRdQ-MRp6mrPhnpng5pcgJsTZMg23 ,

I used this site https://drive.google.com/drive/folders/1-dQPRdQ-MRp6mrPhnpng5pcgJsTZMg23 to convert them to coco format

from machinelearning.

LittleLittleCloud avatar LittleLittleCloud commented on June 9, 2024

@Cilouche which site? The site link seems to be the same with the data you share

from machinelearning.

Cilouche avatar Cilouche commented on June 9, 2024

Yes sorry the site :
https://roboflow.com/convert/pascal-voc-xml-to-coco-json?ref=blog.roboflow.com

I've also noticed that once the database is large, there's a loss Nan
example:
data = 100, epoch=8; all is well except for the precesion is low ~ 0.69

but from data ~= 1200 images, epoch= 5, 8 , 11 ; losses converge rapidly towards Nan

from machinelearning.

LittleLittleCloud avatar LittleLittleCloud commented on June 9, 2024

Update

I got it reproduced on my second training, thanks

Origianl post

Hey @Cilouche some updates here: I can't reproduce the NaN loss error using your dataset in the latest model builder main branch. Maybe it's already been fixed.

We haven't released model builder yet, but you can verify the latest bit in mlnet cli > 16.18.2 by installing mlnet-win-x64 and try object detection there. mlnet cli and model builder shares the same AutoML service so if you didn't see NaN error from mlnet cli, then you probably also won't see NaN error from model builder

steps to verify

mlnet object-detection --dataset /path/to/coco.json --device cuda --epoch 1

from machinelearning.

Cilouche avatar Cilouche commented on June 9, 2024

Any suggestion or solution to bypass this problem, please ?
Thanks

from machinelearning.

LittleLittleCloud avatar LittleLittleCloud commented on June 9, 2024

Try a smaller batch size, Maybe 1?

And GPU training doesn't produce NaN loss, is that also an option for you(training on GPU)

from machinelearning.

Cilouche avatar Cilouche commented on June 9, 2024

It's works on GPU thanks.

from machinelearning.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.