Giter Club home page Giter Club logo

estimators's People

Contributors

ataymano avatar cheng-tan avatar jackgerrits avatar juip avatar lalo avatar marco-rossi29 avatar olgavrou avatar orenmichaely avatar pmineiro avatar sheetallahabar avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

estimators's Issues

[RLOS2021]Update ccb estimator to use the new interface

  • Add interface for ccb estimator (base.py) - similar to slates, but reward is list of the same length as probabilities

  • Make FirstSlotEstimator implementation:

facade over bandits estimator that is feeding only data from the first slot. (+ tests)

More descriptive variable names

n -> ips_numerator (this can also be called snips_numerator)
d -> snips_denominator
N -> total_events
Ne -> offline_policy_events
c -> max_ips_numerator

p_log -> p_online_policy (probability of chosen action by online policy)
p_pred -> p_offline_policy
r -> reward

[RLOS2021]Design package structure of estimator library

Send out a package structure proposal for estimator library. Two use cases the design needs to address:

  1. for users to consume this library, a clear structure to find the interfaces to use for their problem type (cb, ccb, slates, ca etc.)
  2. for people to add new estimators, a clear structure to know which interfaces need to be implemented.

[RLOS2021]Make the library a proper python package

Since the package is not structured with a setup.py it is hard to install and consume. We would like to be able to install it like so:

python -m pip install git+https://github.com/VowpalWabbit/estimators.git

To Do List

Estimators

  • ips
  • weighted ips (aka snips)
  • mle
  • DR with importance weights transformation (truncate, clipped, mle-like, etc.) - dm, dr, switch estimator, ips, are all special cases of this. SN-versions are also possible

Components

  • Python
  • Testing framework
  • C++
  • C#

Confidence interval

[RLOS2021]Adding tests for estimator lib

Two type of test cases can be add:

  • Tests for correctness
  • Maybe some general tests for all problem types, e.g. verify output is in correct format, confidence interval always converges and not negative and some other sanity checks.

[RLOS2021] Other improvements

  • For intervals store min_bound and max_bound in tuple instead of List
  • For Interval get() method set alpha default value
  • Change inconsistent variable names { (w and p_over_p), (num and den in get method) }
  • Imports in the beginning of the file: change here

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.