graphing_tools's Introduction

Hi, I'm Jack

I'm a data scientist living in Seattle, WA. You can find more about me on LinkedIn or my website. I'm passionate about helping others learn and collaborate.

graphing_tools's People

Contributors

jackbenn

graphing_tools's Issues

Plot quantile lines on CDF

The CDF plot would be more useful with quantile lines: horizontal from the left edge to the curve, and vertical from the curve down to the axis.

Its options should also match those of multihist more closely.
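As a rough sketch of the quantile-line idea, the helper below computes the two guide segments for each requested probability on an empirical CDF. The name `quantile_guides` and its parameters are hypothetical and not part of graphing_tools; the quantile rule (smallest sample value whose empirical CDF reaches p) is one assumed choice among several.

```python
import math


def quantile_guides(data, probs=(0.25, 0.5, 0.75)):
    """Return (p, q, horizontal, vertical) guides for an empirical CDF.

    Each segment is ((x0, y0), (x1, y1)). Hypothetical helper, not an
    existing graphing_tools function.
    """
    xs = sorted(data)
    n = len(xs)
    x_min = xs[0]
    guides = []
    for p in probs:
        # Smallest sample value whose empirical CDF is at least p.
        k = max(math.ceil(p * n) - 1, 0)
        q = xs[k]
        horizontal = ((x_min, p), (q, p))  # from the left edge to the curve
        vertical = ((q, 0.0), (q, p))      # from the axis up to the curve
        guides.append((p, q, horizontal, vertical))
    return guides
```

With matplotlib, each pair of segments could then be drawn with `Axes.hlines` and `Axes.vlines`, or with plain `Axes.plot` calls.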

KDE plot with error ranges

Create a plot that shows the KDE together with a confidence interval for the density at each point, giving a lower and an upper curve alongside the base curve. Presumably the interval would be wider in low-density areas; the overall distance between the upper and lower bounds would also be a good measure of the uncertainty.
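The issue doesn't say how the interval should be computed. One possible approach, shown here only as an assumption, is a pointwise bootstrap percentile band: resample the data, re-fit the KDE, and take percentiles of the density at each grid point. The function names, the fixed bandwidth, and the defaults are all hypothetical.

```python
import math
import random


def gaussian_kde(sample, grid, bandwidth):
    """Evaluate a Gaussian KDE of `sample` at each point of `grid`."""
    norm = 1.0 / (len(sample) * bandwidth * math.sqrt(2.0 * math.pi))
    return [
        norm * sum(math.exp(-0.5 * ((x - s) / bandwidth) ** 2) for s in sample)
        for x in grid
    ]


def kde_band(sample, grid, bandwidth, n_boot=200, level=0.95, seed=0):
    """Return (lower, base, upper) density curves over `grid`.

    Assumed method: pointwise bootstrap percentile band.
    """
    rng = random.Random(seed)
    base = gaussian_kde(sample, grid, bandwidth)
    boots = []
    for _ in range(n_boot):
        # Resample with replacement and re-fit the KDE.
        resample = rng.choices(sample, k=len(sample))
        boots.append(gaussian_kde(resample, grid, bandwidth))
    lo_idx = int((1.0 - level) / 2.0 * (n_boot - 1))
    hi_idx = (n_boot - 1) - lo_idx
    lower, upper = [], []
    for j in range(len(grid)):
        # Percentiles of the bootstrap densities at this grid point.
        column = sorted(b[j] for b in boots)
        lower.append(column[lo_idx])
        upper.append(column[hi_idx])
    return lower, base, upper
```

The mean of `upper[j] - lower[j]` over the grid would be one candidate for the overall uncertainty measure mentioned above.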

Allow use of distributions in qp matrix

The QQ and PP plots handle distributions, but the CDF plots and the QP matrix do not.

As part of that, the various plots should change scatter->plot so that points can be joined by a line (appropriate for dist vs dist plots).

Probability plot alignment

The probability plots aren't aligned quite right. In particular, the spacing for matching distributions should probably be at the midpoints of the bins, and the probability plot shouldn't use a strict inequality.
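One reading of this, sketched below under assumptions, is to place the n sample points at probabilities (i + 0.5)/n, the midpoints of n equal-probability bins, and to define the empirical CDF with `<=` rather than `<`. The function names are hypothetical, and the standard normal is only a stand-in for whatever reference distribution is being compared.

```python
from statistics import NormalDist


def midpoint_positions(n):
    """Probabilities at the midpoints of n equal-probability bins."""
    return [(i + 0.5) / n for i in range(n)]


def ecdf(sample, x):
    """Empirical CDF with a non-strict inequality: fraction of values <= x."""
    return sum(1 for s in sample if s <= x) / len(sample)


def qq_points(sample, dist=NormalDist()):
    """Pair theoretical quantiles at bin midpoints with sorted sample values."""
    probs = midpoint_positions(len(sample))
    return [(dist.inv_cdf(p), x) for p, x in zip(probs, sorted(sample))]
```

With midpoint positions, the smallest and largest probabilities are 0.5/n and 1 - 0.5/n, so the theoretical quantiles stay finite at both ends.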

Handle bounded kdes

The histogram should be able to do bounded KDEs, bounded on one side or both. It might offer a choice between reflection and transformation, though there may be a better option. Possible parameters:

kde_bounds: tuple of lower and upper bounds, with None meaning that side is unbounded.
kde_bounding: 'reflect', 'transform'
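A minimal sketch of the 'reflect' option, assuming a Gaussian kernel with a fixed bandwidth: mirror the sample across each finite bound, evaluate the KDE over the augmented points, and set the density to zero outside the bounds. `kde_bounds` follows the parameter proposed above; the function name and the rest of the signature are hypothetical.

```python
import math


def reflected_kde(sample, grid, bandwidth, kde_bounds=(None, None)):
    """Gaussian KDE on `grid` with reflection at the finite bounds.

    Hypothetical helper. With both bounds set, a single reflection per
    side is only an approximation (higher-order reflections are ignored).
    """
    lower, upper = kde_bounds
    # Augment the sample with its mirror image across each finite bound.
    points = list(sample)
    if lower is not None:
        points += [2.0 * lower - s for s in sample]
    if upper is not None:
        points += [2.0 * upper - s for s in sample]
    # Normalize by the original sample size so the mass inside the bounds is 1.
    norm = 1.0 / (len(sample) * bandwidth * math.sqrt(2.0 * math.pi))
    density = []
    for x in grid:
        outside = (lower is not None and x < lower) or (
            upper is not None and x > upper
        )
        if outside:
            density.append(0.0)
        else:
            total = sum(
                math.exp(-0.5 * ((x - p) / bandwidth) ** 2) for p in points
            )
            density.append(norm * total)
    return density
```

The 'transform' option would instead map the data to an unbounded scale (for example with a log for a lower bound at zero), fit there, and map the density back; it is not sketched here.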

Option to line up on integers

Add an option to encourage the bins to line up on integers, powers of ten, or (less frequently) small powers of two or five times those. There's a tradeoff between getting the right number of bins and minimizing the power of two or five.

Possible multiples of powers of 10:
0: 1
1: 2, 5
2: 2.5, 4
3: 1.25, 8
4: 1.6, 6.25
5: 3.125, 3.2

We then choose the step size that minimizes the sum of
alpha times the number of steps (the level in the list above), plus
the square of the log of the ratio between the requested number of bins and the number of bins using that step size

Not sure of the proper value of alpha yet; that will take experimentation, and maybe it will be adjustable. Likely the default will prevent anything past 1 or 2, and if adjustable it will probably be capped at 5.
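The selection rule above could be sketched as follows. This assumes "number of steps" means the level (0 to 5) in the list of multiples, and it treats the number of bins as the continuous ratio range/step rather than a rounded count. The function name, the `max_level` parameter, and the default `alpha=0.1` are placeholders, since the right value of alpha is still undecided.

```python
import math

# Level -> multiples of a power of ten, as listed above.
MULTIPLES = {
    0: [1.0],
    1: [2.0, 5.0],
    2: [2.5, 4.0],
    3: [1.25, 8.0],
    4: [1.6, 6.25],
    5: [3.125, 3.2],
}


def choose_step(data_range, requested_bins, alpha=0.1, max_level=2):
    """Return (step, level) minimizing alpha * level + log(bin ratio) ** 2.

    Hypothetical helper; alpha=0.1 is a placeholder, not a tuned value.
    """
    ideal = data_range / requested_bins
    exponent = math.floor(math.log10(ideal))
    best = None
    # Search the decade containing the ideal step and its two neighbors.
    for power in (exponent - 1, exponent, exponent + 1):
        for level in range(max_level + 1):
            for multiple in MULTIPLES[level]:
                step = multiple * 10.0 ** power
                n_bins = data_range / step
                cost = alpha * level + math.log(requested_bins / n_bins) ** 2
                if best is None or cost < best[0]:
                    best = (cost, step, level)
    return best[1], best[2]
```

For example, a range of 100 with 20 requested bins gives an ideal step of 5, which is a level-1 multiple; a large alpha pushes the choice back to a plain power of ten.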
