barmpy's People

Contributors

dvbuntu

barmpy's Issues

Implement Posterior Mean Model

BARN models currently only return a single ensemble from the posterior distribution (i.e. a single MCMC replicate). BART, however, allows returning an average over multiple MCMC iterations. Doing such averaging means the final model approximates the expected value of the posterior distribution, not just a single sample from it. This may improve modeling results in some contexts, especially if the variance in the posterior is relatively large (measured by the model sigma estimate).

Practically, there are a few considerations. First, because successive MCMC iterations are correlated, we only want to sample every so many steps (anecdotally, the integrated autocorrelation time is about 7 steps, but that depends on the problem, ensemble size, and other parameters). From a computational perspective, we can save some effort when a model within the ensemble stays the same (i.e. declines to transition) between two samples in the average: in that case, we can simply double-weight that model. This requires some additional bookkeeping beyond saving every Kth ensemble separately.

The actual output should probably be saved as a new ensemble model (possibly a barmpy.barn.BARN object itself), just with num_nets*M total networks, where M is the number of posterior samples to average over. The final output should also divide by M to ensure it's an average; alternatively, we can divide the weights of the final NN layer by M and sum over the various ensembles.
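The thinning and double-weighting bookkeeping above might be sketched roughly as follows. This is a hedged toy sketch, not barmpy's actual API: `propose` stands in for one MCMC transition of the whole ensemble, and `predict_one` for a single ensemble's prediction.

```python
# Toy sketch of thinned posterior-mean sampling with double-weighting.
# `propose` and `predict_one` are hypothetical stand-ins, not barmpy API.

def posterior_mean_samples(state, propose, K=7, M=10):
    """Collect M samples, thinning by K MCMC steps between each.

    If the state declines to transition between two recorded samples,
    bump the previous sample's weight instead of storing a duplicate.
    """
    samples = []                     # list of [state, weight] pairs
    for _ in range(M):
        for _ in range(K):           # thin: skip K correlated steps
            state = propose(state)
        if samples and samples[-1][0] is state:
            samples[-1][1] += 1      # unchanged model: double-weight it
        else:
            samples.append([state, 1])
    return samples

def posterior_mean_predict(samples, predict_one, x):
    """Weighted average over posterior samples, divided by total weight M."""
    M = sum(w for _, w in samples)
    return sum(w * predict_one(s, x) for s, w in samples) / M
```

With real BARN ensembles, the weighted list could instead be flattened into one big ensemble of num_nets*M networks whose final-layer weights are scaled by 1/M, as described above.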

Develop a contribution guide and code of conduct

To better encourage and manage user-submitted contributions like new methods and custom callbacks, we should add both a contribution guide walking through the process as well as a code of conduct to set expectations.

The contribution guide can be a markdown file with a small example, say a custom callback. It should walk a user through the steps of integrating such a feature into barmpy. Namely:

  1. Fork/branch with a single new feature implemented and a unit test created (if applicable).
  2. Pull request created describing the contribution.
  3. Review by one of the barmpy maintainers.
  4. Possible revisions.
  5. Merging of complete features.
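The small example in the guide might look something like this. This is only a hypothetical sketch: the callback signature (a callable receiving the current error and returning whether to stop) is an assumption for illustration, not barmpy's actual callback interface.

```python
# Hypothetical custom callback for the contribution guide's worked example.
# The signature (error in, stop-flag out) is an assumption, not barmpy API.

class EarlyStop:
    """Stop MCMC iteration once error improvement stalls below `tol`."""

    def __init__(self, tol=1e-4):
        self.tol = tol
        self.last = float("inf")

    def __call__(self, error):
        improved = self.last - error > self.tol
        self.last = min(self.last, error)
        return not improved          # True means "stop iterating"
```

A guide entry like this would then show the matching unit test and where the callback hooks into the training loop.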

As a practical matter, features added by the primary developers (i.e. Dr. Van Boxel) will likely continue on the main branch directly for now.

The code of conduct can be a short statement, likely as part of the contribution guide, asserting how to engage with the barmpy community. We can pull some examples from https://docs.github.com/en/communities/setting-up-your-project-for-healthy-contributions/adding-a-code-of-conduct-to-your-project, but the short answer will be treating people with respect, understanding that different opinions can exist, and keeping discussion within barmpy focused on the development of this project (i.e. not wider mathematical discussion, however fun that may be).

Implement BART in `barmpy` library

It'd be great to have a Python implementation of BART in barmpy! Note that BARTPy exists, but it hasn't been updated in several years. It would still serve as an excellent starting place.

This issue should also include some refactoring of barmpy.barn so that generic routines can be shared between BARN and BART. That will help with future features like BAR-Support Vector Machines and the like.
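One possible shape for that refactor is a shared base class holding the generic Metropolis-Hastings machinery, with the model-specific pieces (proposal and likelihood) as overrides. All names here are illustrative assumptions, not barmpy's actual classes, and the subclass below uses toy stand-ins rather than real networks or trees.

```python
# Hedged sketch of a BARN/BART refactor: generic MCMC loop in a base
# class, model-specific proposal/likelihood in subclasses.  Names and
# toy logic are assumptions, not barmpy's API.
import abc
import math
import random

class EnsembleSampler(abc.ABC):
    """Generic Metropolis-Hastings step, shared by BARN and BART."""

    @abc.abstractmethod
    def propose(self, member):
        """Propose a new ensemble member (e.g. grow/shrink a NN or tree)."""

    @abc.abstractmethod
    def log_likelihood(self, member):
        """Log evidence for one member given the data."""

    def step(self, member):
        cand = self.propose(member)
        log_a = self.log_likelihood(cand) - self.log_likelihood(member)
        if math.log(random.random()) < log_a:
            return cand              # accept transition
        return member                # reject, keep current member

class ToyBARN(EnsembleSampler):
    """Toy subclass: 'member' is just a neuron count."""

    def propose(self, member):
        return member + random.choice([-1, 1])   # add/remove one neuron

    def log_likelihood(self, member):
        return -abs(member - 3)      # toy target: prefer 3 neurons
```

A BART subclass would override the same two methods with tree grow/prune proposals, leaving `step` untouched.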

Port `barmpy` to PyMC

PyMC is a Python library for fitting Bayesian models with Markov chain Monte Carlo (MCMC). BARN essentially fits that mold, so it would be instructive and potentially useful to port barmpy to that ecosystem. PyMC takes a different approach from sklearn, however, so there may be a bit of a learning curve. Some good first steps:

  1. Understand PyMC-BART.
  2. Port BARN to PyMC, using PyMC-BART as a starting point.
  3. Extract only needed MCMC components from PyMC to be used in BARN, keeping sklearn compatibility.

Implement Bayesian Additive Regression Support Vector Machines

BART and BARN exist, but Support Vector Machines (SVMs) are another machine learning method that might be useful to ensemble this way, giving us Bayesian Additive Regression SVMs (BARS).

BARS will most likely define its state space as the hyperparameters of the kernel (e.g. $d$ in the polynomial kernel, $(\langle x_i, x_j\rangle +1)^d$). What's required then is a transition probability, $T(d,d')$ (say, 50-50 on $d \pm 1$), a prior probability, $P(d)$ (perhaps some kind of discrete distribution in this case), and the evidence likelihood. This last one is tricky: if we train the SVMs the way we train NNs in BARN, then the learned parameters are not part of the MCMC state space. So we might as well approximate the integral over them with the maximum likelihood estimate, just as in BARN (perhaps using the exact same logic).
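A single MCMC step over the degree might look like this hedged sketch. The exponential prior is a toy assumption, and `log_evidence` is a hypothetical stand-in: in practice it would come from fitting an SVM with the candidate degree and taking the maximum-likelihood approximation discussed above.

```python
# Toy sketch of one Metropolis-Hastings step on the polynomial degree d.
# The prior and the `log_evidence` callable are assumptions, not barmpy code.
import math
import random

def log_prior(d, lam=1.0):
    """Toy discrete prior favoring small degrees: P(d) proportional to exp(-lam*d)."""
    return -lam * d

def mh_step_degree(d, log_evidence):
    """One MH step on kernel degree d, proposing 50-50 on d +- 1."""
    d_new = d + random.choice([-1, 1])
    if d_new < 1:
        return d                     # reject invalid degree outright
    log_a = (log_prior(d_new) + log_evidence(d_new)
             - log_prior(d) - log_evidence(d))
    if math.log(random.random()) < log_a:
        return d_new                 # accept proposed degree
    return d                         # reject, keep current degree
```

Since the 50-50 proposal is symmetric, the $T(d,d')$ terms cancel in the acceptance ratio, leaving only the prior and evidence terms.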

Practically, we can use SVMs from sklearn. We'll need an extra argument for the kernel, and the kernel choice can affect the prior and transition function as needed. So maybe start with a polynomial kernel, then try a Gaussian kernel, slowly adding more kernels and generalizing as we go.

After implementing, we should do some extensive analysis of how well BARS does on benchmark data. Is it better than BARN? Maybe it's faster? This could make a great paper.
