Example with kknn: <div class="highlight highlight-source-r notranslate position-r

I also found some cases which fail or produce warnings: <div class="highlight high

<a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="/us

Sometimes we reduce variables to only 1 factor level - but not all learners can work with such variables about mlrmbo HOT 22 CLOSED

mlr-org commented on May 29, 2024

Sometimes we reduce variables to only 1 factor level - but not all learners can work with such variables

from mlrmbo.

Comments (22)

berndbischl commented on May 29, 2024

Daniel, doesnt this only concern PREDICTION?
So what you say in 2) makes no real sense? ("learning")
Because in focussearch we never train a model?

And if so, I think I already took care of this in mlr in a general way so it should not happen anymore?

from mlrmbo.

danielhorn commented on May 29, 2024

Yes, 2) does not make real sense. I think, we allready discussed this a while ago.

I will test it again on monday with Karin

from mlrmbo.

berndbischl commented on May 29, 2024

If you test it, whatever the result, please add a uni test!

from mlrmbo.

danielhorn commented on May 29, 2024

Tested and unit test added.

We found both test cases, that work fine and that fail. Look at e0bab5e

from mlrmbo.

danielhorn commented on May 29, 2024

Have a look at these examples. The first one succeeds, the second one produces an error. The only difference is the second numeric param in the second function.

library(mlrMBO)

f1 = function(x)
  ifelse(x$disc1 == "a", 2 * x$num1 - 1, 1 - x$num2)
ps1 = makeParamSet(
  makeNumericParam("num1", lower = -2, upper = 1),
  makeNumericParam("num2", lower = -1, upper = 2),
  makeDiscreteParam("disc1", values = c("a", "b"))
)

f2 = function(x)
  ifelse(x$disc1 == "a", 2 * x$num1 - 1, 1 - x$num1)
ps2 = makeParamSet(
  makeDiscreteParam("disc1", values = c("a", "b")),
  makeNumericParam("num1", lower = 0, upper = 1)
)

ctrl = makeMBOControl(iters = 2, init.design.points = 10, infill.opt.focussearch.points = 100)
lrn = makeLearner("regr.kknn")
mbo(f1, ps1, learner = lrn, control = ctrl)
mbo(f2, ps2, learner = lrn, control = ctrl)

from mlrmbo.

jakobbossek commented on May 29, 2024

What is the status here? At the moment both examples of Daniels previous post work fine.

from mlrmbo.

danielhorn commented on May 29, 2024

I'm not sure if I am missing something at the moment, but have a look at this example:

library(mlrMBO)
par.set = makeParamSet(
  makeNumericVectorParam("x", len = 5, lower = 0, upper = 1),
  makeDiscreteParam("z", values = 1:10)
)
f = function(x) sum(x$x) + as.numeric(x$z)
learner = makeBaggingWrapper(makeLearner("regr.lm"), 2L)
learner = setPredictType(learner, "se")
control =  makeMBOControl( init.design.points = 5L, iters = 2L, save.on.disk.at = numeric(0L))
control = setMBOControlInfill(control, crit = "ei")
res = mbo(f, par.set, learner = learner, control = control)

from mlrmbo.

jakobbossek commented on May 29, 2024

Ok, this one fails.

from mlrmbo.

KarinSchork commented on May 29, 2024

I also found some cases which fail or produce warnings:

library(mlrMBO)

fun = function(x) {
  ifelse(x$disc1 == "a", (x$num1-0.3)^2*(x$num1+2)*(x$num1+4)*(x$num1+0.1), (x$num1+0.2)^2*(x$num1-1.1)^2)
}
ps = makeParamSet(
  makeDiscreteParam("disc1", values = c("a", "b")),
  makeNumericParam("num1", lower = 0, upper = 1)
)

learner1 = makeBaggingWrapper(makeLearner("regr.lm"), bw.iters = 10L)
learner1 = setPredictType(learner1, "se")
learner2 = makeBaggingWrapper(makeLearner("regr.blackboost"), bw.iters = 10L)
learner2 = setPredictType(learner2, "se")
learner3 = makeBaggingWrapper(makeLearner("regr.mob"), bw.iters = 10L)
learner3 = setPredictType(learner3, "se")
learner4 = makeBaggingWrapper(makeLearner("regr.crs"), bw.iters = 10L)
learner4 = setPredictType(learner4, "se")


controlMBO = makeMBOControl(init.design.points = 10, 
  iters = 5, save.on.disk.at = numeric(0L))
setMBOControlInfill(controlMBO, crit = "lcb", opt = "focussearch")

set.seed(2274)
mbo(fun, ps, learner = learner1, control = controlMBO)
set.seed(2274)
mbo(fun, ps, learner = learner2, control = controlMBO)
set.seed(2274)
mbo(fun, ps, learner = learner3, control = controlMBO)
set.seed(2274)
mbo(fun, ps, learner = learner4, control = controlMBO)

from mlrmbo.

berndbischl commented on May 29, 2024

@jakobbossek
Please check if this is all tested and works.

If so we can close

from mlrmbo.

mllg commented on May 29, 2024

learner3 triggered something mob specific and should be fixed now.

from mlrmbo.

jakobbossek commented on May 29, 2024

learner2 caused an error because of no propose.time argument in extras on model fail. Is fixed now.

from mlrmbo.

berndbischl commented on May 29, 2024

The warning we see in learner1 seems to be simply bad luck. In the bagging we select the "disc1" only in rows where it is "b".

We possibly want to stratify on the factors, but this might be hard....

from mlrmbo.

berndbischl commented on May 29, 2024

An easier option here would be to simply learn on the non-constant features. But then we get problems in predict. We can handle this via a preproc wrapper. We simply store what was constant in training (+removed) and remove it also in prediction. Maybe this is best for now.

from mlrmbo.

jakob-r commented on May 29, 2024

We agreed, that we want to check the initial design if all factor level are present. If not an error is thrown. This is an easy fast fix.

from mlrmbo.

jakobbossek commented on May 29, 2024

How does this solve the problem in cases where we use bagging / a bagging wrapper?
Even if all factors levels are covered for all discrete parameters in the initial design the training sets which are generated for bagging might be awkward and contain only a single factor level.

from mlrmbo.

jakobbossek commented on May 29, 2024

ping

from mlrmbo.

berndbischl commented on May 29, 2024

i think somebody needs to summarize the status here.
so we still have examples that fails? how important are they?

i do have a general solution for all of these problems, i think, using the new vtreat package. but we cannot do that now, and should do that in mlr

from mlrmbo.

ja-thomas commented on May 29, 2024

Just rerun the examples


f = function(x) {
  ifelse(x$disc1 == "a", (x$num1-0.3)^2*(x$num1+2)*(x$num1+4)*(x$num1+0.1), (x$num1+0.2)^2*(x$num1-1.1)^2)
}
ps = makeParamSet(
  makeDiscreteParam("disc1", values = c("a", "b")),
  makeNumericParam("num1", lower = 0, upper = 1)
)


fun = makeSingleObjectiveFunction(fn = f, par.set = ps, has.simple.signature = FALSE) 

learner1 = makeBaggingWrapper(makeLearner("regr.lm"), bw.iters = 10L)
learner1 = setPredictType(learner1, "se")
learner2 = makeBaggingWrapper(makeLearner("regr.blackboost"), bw.iters = 10L)
learner2 = setPredictType(learner2, "se")
learner3 = makeBaggingWrapper(makeLearner("regr.mob"), bw.iters = 10L)
learner3 = setPredictType(learner3, "se")
learner4 = makeBaggingWrapper(makeLearner("regr.crs"), bw.iters = 10L)
learner4 = setPredictType(learner4, "se")


controlMBO = makeMBOControl()
controlMBO = setMBOControlTermination(controlMBO, iters = 5)
controlMBO = setMBOControlInfill(controlMBO, crit = "cb", opt = "focussearch")

des = generateDesign(n = 10, ps)

set.seed(2274)
mbo(fun, des, learner = learner1, control = controlMBO) # fails
 > Error in `contrasts<-`(`*tmp*`, value = contr.funs[1 + isOF[nn]]) : 
  contrasts can be applied only to factors with 2 or more levels 

set.seed(2274)
mbo(fun, des, learner = learner2, control = controlMBO) # works
set.seed(2274)
mbo(fun, des, learner = learner3, control = controlMBO) # fails
>  Error in trainLearner.regr.mob(.learner = list(id = "regr.mob", type = "regr",  : 
  Failed to fit party::mob. Some coefficients are estimated as NA 

set.seed(2274)
mbo(fun, des, learner = learner4, control = controlMBO) # warnings
> There were 50 or more warnings (use warnings() to see the first 50)
> warnings()
> Warning messages:
> 1: In krscvNOMAD(xz = xz, y = y, degree.max = degree.max,  ... :
   optimal degree equals search maximum (3): rerun with larger degree.max

1 and 3 fail, 2 runs, 4 gives warnings.

So it depends on the learner and a general fix has to be in mlr, I'm not sure if we can solve it directly in MBO

from mlrmbo.

mllg commented on May 29, 2024

Now all example works and I struggle to reproduce something. This was maybe resolved during code cleanup (I've added some drop=FALSE, replaced sapply with vapply etc.).

If someone has a working example, please post.

from mlrmbo.

ja-thomas commented on May 29, 2024

I would suggest we close here until someone can produce a similar problem again.

ok? @berndbischl @jakobbossek @mllg @jakob-r

Otherwise @berndbischl can/should have a look

from mlrmbo.

mllg commented on May 29, 2024

Agreed.

from mlrmbo.

Sometimes we reduce variables to only 1 factor level - but not all learners can work with such variables about mlrmbo HOT 22 CLOSED

Comments (22)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent