Dependencies: lightgbm==2.2.2 treelite==0.32 LightGBM was tr

<a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="/us

In our experiment, origin lightgbm predict result is not equal treelite predict result about treelite HOT 12 CLOSED

dmlc commented on July 28, 2024

In our experiment, origin lightgbm predict result is not equal treelite predict result

from treelite.

Comments (12)

henriezhang commented on July 28, 2024 1

@hcho3 well done!

from treelite.

henriezhang commented on July 28, 2024

when set the num_trees=1 the result is similar:
origin model result:

treelite retule:

when set num_trees=2 the result is begin different:
origin model result:

treelite result:

from treelite.

hcho3 commented on July 28, 2024

@henriezhang Is it possible to post your model?

from treelite.

henriezhang commented on July 28, 2024

my model is more than 2G, how post to you?

from treelite.

hcho3 commented on July 28, 2024

@henriezhang Dropbox or Google Drive link will work. I’ll make sure the bug is fixed.

from treelite.

henriezhang commented on July 28, 2024

please give me your email? I send you a simple model have the same problem

from treelite.

hcho3 commented on July 28, 2024

[email protected]

from treelite.

henriezhang commented on July 28, 2024

@hcho3 I have sanded you the model to your email.

from treelite.

hcho3 commented on July 28, 2024

@henriezhang #81 should fix the bug. Thanks so much for reporting!

from treelite.

henuxhj commented on July 28, 2024

On the same experiment parameters（num_tree=500, depth=20, leaves=255 ）, treelite cost more memory than before， It generate more source code than be before. And treelite write source code on disk after generate total source code. It may cause OOM .
Is there any way to solve this problem?
Thanks

from treelite.

hcho3 commented on July 28, 2024

@henuxhj This is because your model has high cardinality categorical features, and Treelite used to truncate high categorical values. So the larger code you get is the correct code (in the sense of giving correct prediction).

Please set parallel_comp option when compiling, to reduce memory consumption. This will break the 500 tree models into smaller pieces.

model.export_lib(toolchain='gcc', libpath='./a.so',
                verbose=True, params={'parallel_comp':500})

from treelite.

henuxhj commented on July 28, 2024

thank you so much， I'll try.

from treelite.

Recommend Projects

In our experiment, origin lightgbm predict result is not equal treelite predict result about treelite HOT 12 CLOSED

Comments (12)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent