Giter Club home page Giter Club logo

Comments (6)

juyjuyy avatar juyjuyy commented on September 27, 2024 1

Thank you for your response @guillermo-navas-palencia. After executing the analysis, I have known the properties of binning_table, but I don't get it now. I'm not too good at math, so I don't know the difference between the normalized Jensen-Shannon divergence applied to js and the Jeffrey divergence applied to iv.

  • Can you explain it or suggest some tutorial documentation explaining this difference?
  • And, the main problem here is that I need iv to choose important fields in the dataset, but I cannot calculate, then rank them for my experiment. It raises the question "Can js replace iv in my problem? How can I prove it?"

from optbinning.

guillermo-navas-palencia avatar guillermo-navas-palencia commented on September 27, 2024 1

I use the Jensen Shannon for binary and multiclass. See https://en.wikipedia.org/wiki/Jensen%E2%80%93Shannon_divergence.
image

Yes, you can rank by JS. Both are divergence measures.

from optbinning.

guillermo-navas-palencia avatar guillermo-navas-palencia commented on September 27, 2024

Hi @juyjuyy.

Note that binning quality score for multiclass target is slightly different: https://github.com/guillermo-navas-palencia/optbinning/blob/master/optbinning/binning/metrics.py#L347. It replaces the IV with the normalized Jensen-Shannon divergence. The js property can be retrieved from the multiclass binning table: https://github.com/guillermo-navas-palencia/optbinning/blob/master/optbinning/binning/metrics.py#L347.

from optbinning.

juyjuyy avatar juyjuyy commented on September 27, 2024

Can you suggest some documents related to the problem I have? @guillermo-navas-palencia

from optbinning.

juyjuyy avatar juyjuyy commented on September 27, 2024

Thank you for your response @guillermo-navas-palencia ,

  1. I wonder why both IV and JS value exists in the binning table with binary classification, but the multi-class classification exists only JS. Could you explain the theory for that? Is IV not used for multi-class problem?
  2. I can use the rule of thumb applied for IV to choose features such that IV above 0.1 is the strong predictor. If JS is the value calculated for ranking, how can I know which value I can choose for feature selection?
    Please help me, I really need your reply.

from optbinning.

guillermo-navas-palencia avatar guillermo-navas-palencia commented on September 27, 2024
  1. The IV is a divergence measure only suitable for binary target. The JS divergence generalizes the IV allowing multiple categories (multi-class problem).
  2. The IV, unlike JS, is unbounded. I found, experimentally, that IV is commonly 5-10 times larger than JS for a binary target. Therefore, a value above 0.02 might work, although I suppose that depends on the number of classes. This is a problem I haven't investigated.

from optbinning.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.