Giter Club home page Giter Club logo

decision-tree-id3's People

Contributors

arokem avatar bryandeng avatar fabianp avatar kjacks21 avatar mechcoder avatar nickynicolson avatar onordander avatar svaante avatar tomdlt avatar vighneshbirodkar avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar

decision-tree-id3's Issues

Some advice for pruning

HI, I think the 134th line in the function __prune (in tree.py), the condition for pruning should be changed from 'node_error_rate < children_error_rate' to 'node_error_rate<=children_error_rate'. Because node_error_rate==children_error_rate means that increase the complexity of the model won't increase the accuracy of classification. Thus, to prevent from overfittting, the condition should be changed to 'node_error_rate<=children_error_rate'.

are there any more parameters?

Was using this library and I noticed that there aren't parameters in the documentation for Id3Estimator(). Are there any parameters that we can set? Maybe something similar to sklearn's DecisionTreeClassifier() where you can choose max_depth, criterion, etc.

import error sklearn.externals

Hello,

By importing id3 I encounter this error on my setup. Is it possible that importing six from sklearn.externals is outdated?

File "/usr/local/lib/python3.7/site-packages/id3/__init__.py", line 3, in <module> from .export import export_graphviz File "/usr/local/lib/python3.7/site-packages/id3/export.py", line 1, in <module> from sklearn.externals import six ImportError: cannot import name 'six' from 'sklearn.externals' (/usr/local/lib/python3.7/site-packages/sklearn/externals/__init__.py)

thank you

repeating features don't work?

When I run clf = Id3Estimator(is_repeating=True) and clf = Id3Estimator(is_repeating=False) with the toy data in the demos, I get same decision trees which both contain repeating features.

Exception has occurred: ValueError New attribute value not found in train data.

I got problem when split dataset into train and validation set. If validation set contains attribute value that never seen in train set, this exception will occurs. This problem occurs because LabelEncoder execute during training (fit). How to make sure all attribute value from my dataset converted by LabelEncoder ?

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.