Giter Club home page Giter Club logo

Comments (11)

veikkoeeva avatar veikkoeeva commented on May 18, 2024 1

@Anipik Ah, sorry, I got delayed. I'm currently in www.arctic15.com and rather tied. See the linked PR for further discussion and the last comment. I should be able to get onto it the next weekend, knock on the tree. Naturally if you need to do something in the meantime, feel free to do so. :)

from machinelearning.

Anipik avatar Anipik commented on May 18, 2024

@veikkoeeva can you please paste the error output ?

from machinelearning.

veikkoeeva avatar veikkoeeva commented on May 18, 2024

@Anipik Hmm, indeed... It failed on the assert, but there might be something written to stdout before that (a refactoring idea, might be good to assert the terms directly). I'm not on that code currently, but I'll check in about 14 hours.

from machinelearning.

Anipik avatar Anipik commented on May 18, 2024

@veikkoeeva generally it mentions the line and the file name where the matching of the outputs failed.

from machinelearning.

veikkoeeva avatar veikkoeeva commented on May 18, 2024

@Anipik I took a look and tried to fix a bit, but I could lend a hand a bit. Do you happen to know where the results to LogisticRegression-bin-norm-CV-breast-cancer-rp.txt are written? This eludes me somehow. The result file looks like this

LogisticRegression
AUC	Accuracy	Positive precision	Positive recall	Negative precision	Negative recall	Log-loss	Log-loss reduction	F1 Score	AUPRC	/l2	/ot	/nt	Learner Name	Train Dataset	Test Dataset	Results File	Run Time	Physical Memory	Virtual Memory	Command Line	Settings	
0,9945	0,969373	0,959559	0,952772	0,975316	0,977394	0,134393	85,57494	0,956039	0,988987	0,1	0,001	1	LogisticRegression	%Data%		%Output%	99	0	0	maml.exe CV tr=LogisticRegression{l1=1.0 l2=0.1 ot=1e-3 nt=1} threads=- dout=%Output% data=%Data% seed=1 xf=BinNormalizer{col=Features numBins=5}	/l2:0,1;/ot:0,001;/nt:1	

and I suppose the problem is visible. In the PR I have gone through the places the code uses one-by-one and the results in some other files, such as in LogisticRegression-bin-norm-CV-breast-cancer-out.txt look consistent with the assertion data currently (though the runner still reports it as a failure, I'll check that later).

from machinelearning.

TomFinley avatar TomFinley commented on May 18, 2024

Ah that's interesting @veikkoeeva , thanks for bringing this up.

The rp files are written by the so-called ResultProcessor, the code for which lies in the src/Microsoft.ML.ResultProcessor project... so this LogisticRegression-bin-norm-CV-breast-cancer-rp.txt file, I would expect is the result of running ResultProcessor on top of the LogisticRegression-bin-norm-CV-breast-cancer-out.txt I'd expect to see alongside it.

from machinelearning.

Anipik avatar Anipik commented on May 18, 2024

Yeah the problem is that the decimal separator in your language pack is comma instead of decimal.
and we just match the rp files using string matching

The matching is being done here https://github.com/Anipik/machinelearning/blob/master/test/Microsoft.ML.TestFramework/BaseTestPredictorsMaml.cs#L211

The fix could be https://github.com/Anipik/machinelearning/blob/master/test/Microsoft.ML.TestFramework/BaseTestBaseline.cs#L618
instead of directly using . as a data separator you can dynamically obtain the decimal separator

from machinelearning.

Anipik avatar Anipik commented on May 18, 2024

@danmosemsft can you add a non-english queue for this repo too ?

from machinelearning.

veikkoeeva avatar veikkoeeva commented on May 18, 2024

@TomFinley Thanks, I'll see if get the rest fixed today (it's 19:00 here).

@Anipik Hmm, good to know. As you can see, I've tried to fix all instances that print numbers in non-invariant way. It occurred I could fix the comparison, but then one would have files that aren't easy to diff, say, when asking help here or comparing oneself. If this approach is OK, I think I should add a note to the commit about this.

My locale is fi-FI, by the way.

from machinelearning.

danmoseley avatar danmoseley commented on May 18, 2024

@danmosemsft can you add a non-english queue for this repo too ?

@Anipik feel free to open an issue, and make the addition, if it's analogous to corefx's.

from machinelearning.

Anipik avatar Anipik commented on May 18, 2024

@veikkoeeva is this issue resolved ?

from machinelearning.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.