cvxgrp / cocp

Source code for the examples accompanying the paper "Learning convex optimization control policies."

License: Apache License 2.0

Languages: Jupyter Notebook 99.95%, Python 0.05%

Topics: convex-optimization, learning, control-systems, differentiable-programming

cocp's Introduction

Learning convex optimization control policies

This repository accompanies the paper Learning convex optimization control policies. It contains the source code for the examples therein as IPython notebooks.

Our examples make use of the Python package cvxpylayers to differentiate through convex optimization problems.

Abstract

Many control policies used in various applications determine the input or action by solving a convex optimization problem that depends on the current state and some parameters. These types of control policies are tuned by varying the parameters in the optimization problem, such as the linear quadratic regulator weights, to obtain good performance, judged by application-specific metrics. Our paper introduces a method to automate this process, by adjusting the parameters using an approximate gradient of the performance metric with respect to the parameters. Our procedure relies on recently developed methods that can efficiently evaluate the derivative of the solution of a convex optimization problem with respect to its parameters.
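As a rough illustration of the tuning loop the abstract describes, the sketch below adjusts one parameter of a scalar control policy by gradient descent on a simulated cost. Everything in it is an assumption made for illustration and is not taken from the paper or the notebooks: the scalar system, the policy (whose convex problem happens to have a closed-form solution), and the names `policy`, `simulated_cost` and `tune`. It also swaps in a central finite difference for the gradient, instead of differentiating through a solver with cvxpylayers.

```python
import numpy as np

A, B = 1.0, 1.0  # hypothetical scalar dynamics: x_next = A*x + B*u + w


def policy(x, theta):
    # u = argmin_u  u^2 + theta * (A*x + B*u)^2, solved in closed form.
    return -theta * A * B * x / (1.0 + theta * B**2)


def simulated_cost(theta, noise):
    # Average stage cost x^2 + u^2 along one simulated trajectory.
    x, total = 0.0, 0.0
    for w in noise:
        u = policy(x, theta)
        total += x**2 + u**2
        x = A * x + B * u + w
    return total / len(noise)


def tune(theta, noise, steps=50, lr=0.01, eps=1e-4):
    # Gradient descent on the simulated cost; the gradient is a central
    # finite difference evaluated on the same noise sequence.
    for _ in range(steps):
        grad = (simulated_cost(theta + eps, noise)
                - simulated_cost(theta - eps, noise)) / (2 * eps)
        theta = max(theta - lr * grad, 1e-3)  # keep the weight positive
    return theta


noise = np.random.default_rng(0).normal(size=2000)
theta0 = 0.1
theta1 = tune(theta0, noise)
cost_before = simulated_cost(theta0, noise)
cost_after = simulated_cost(theta1, noise)
print(f"theta: {theta0} -> {theta1:.3f}")
print(f"cost:  {cost_before:.3f} -> {cost_after:.3f}")
```

Reusing one fixed noise sequence makes the simulated cost a deterministic function of `theta`, so the finite difference is well behaved. A real problem would need a solver for the policy and a differentiable layer for the gradient.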

Citing

@article{agrawal2019cocp,
    author       = {Agrawal, Akshay and Barratt, Shane and Boyd, Stephen and Stellato, Bartolomeo},
    title        = {Learning Convex Optimization Control Policies},
    journal      = {arXiv},
    archivePrefix = {arXiv},
    eprint = {1912.09529},
    primaryClass = {math.OC},
    year    = {2019},
}

cocp's Issues

potential error in code

Hi, thanks for the work on this. I am looking at the simulate() function in markowitz_tuning.ipynb.

def simulate(ht, ut):
    ret = torch.exp(logreturn1p_dist.sample())
    ht = ret * (ht + ut)                  # <----- this line
    return ht, ret

From my understanding, ht should be (ret * (ht + ut)) / (ret.T @ (ht + ut)), as per the formula for how holdings evolve in Section 5.3 of the paper. With that formula the holdings sum to 1 (as holdings should), while with the line in the code they don't.
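To make the difference concrete, here is a minimal NumPy sketch of the normalized update as quoted in this issue. The function name `simulate_normalized`, the independent lognormal return model, and the example holdings and trades are all hypothetical. The notebook itself uses torch and its own `logreturn1p_dist`.

```python
import numpy as np


def simulate_normalized(ht, ut, rng, mu=0.0, sigma=0.05):
    # Hypothetical return model: independent lognormal gross returns per asset.
    ret = np.exp(rng.normal(mu, sigma, size=ht.shape))
    value = ret * (ht + ut)  # post-trade holdings grown by the returns
    # value.sum() is the same quantity as ret @ (ht + ut).
    return value / value.sum(), ret


rng = np.random.default_rng(0)
ht = np.array([0.5, 0.3, 0.2])     # example holdings, summing to 1
ut = np.array([-0.1, 0.05, 0.05])  # example trades, summing to 0
ht_next, ret = simulate_normalized(ht, ut, rng)
unnormalized = ret * (ht + ut)     # what the line flagged in the issue computes

print("normalized sum:  ", ht_next.sum())
print("unnormalized sum:", unnormalized.sum())
```

The unnormalized vector sums to the portfolio's gross return over the period rather than to 1, which is the discrepancy the issue points out.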

Is Continuous ARE supported?

Hi,

I would appreciate getting your help with the LQR example.
Following the LQR example, I notice that for a continuous problem, e.g., cart pole, with linearized dynamics at the fixed point, the LQR solution doesn't work. It returns "inf."

For example,

Consider the following dynamics matrices:
A = np.array([[0, 0, 1, 0],
              [0, -0.71707317, 0, 0],
              [0, 0, 0, 1],
              [0, 15.77560976, 0, 0]])

B = np.array([[0],
              [0],
              [0.97560976],
              [-1.46341463]])

The following code doesn't work. Note that I changed the discrete ARE from your notebook to the continuous ARE.

###################################################
import cvxpy as cp

# n, m, W, Q0 and R0 are assumed to be defined beforehand
P = cp.Variable((n, n), PSD=True)
R0cvxpy = cp.Parameter((m, m), PSD=True)

objective = cp.trace(P @ W)
constraints = [cp.bmat([
    [R0cvxpy, B.T @ P],
    [P @ B, Q0 + A.T @ P + P @ A]
]) >> 0, P >> 0]
R0cvxpy.value = R0
result = cp.Problem(cp.Maximize(objective), constraints).solve()
P_lqr = P.value
print(result)
##################################################

Returns: Inf

UserWarning: Solution may be inaccurate. Try another solver, adjusting the solver settings, or solve with verbose=True for more information.

I appreciate any help you can provide.

Best regards,
Solomon
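
One way to cross-check these matrices independently of the LMI formulation is to solve the continuous-time ARE directly with SciPy's `solve_continuous_are`. The sketch below does that and is not the repository's method. The issue does not give Q0 or R0, so the identity weights `Q` and `R` here are assumptions.

```python
import numpy as np
from scipy.linalg import solve_continuous_are

A = np.array([[0, 0, 1, 0],
              [0, -0.71707317, 0, 0],
              [0, 0, 0, 1],
              [0, 15.77560976, 0, 0]])
B = np.array([[0],
              [0],
              [0.97560976],
              [-1.46341463]])
Q = np.eye(4)  # assumed state weight (not given in the issue)
R = np.eye(1)  # assumed input weight (not given in the issue)

# Solve A'P + PA - P B R^{-1} B'P + Q = 0 for P.
P = solve_continuous_are(A, B, Q, R)

# LQR gain K = R^{-1} B'P and the residual of the Riccati equation.
K = np.linalg.solve(R, B.T @ P)
residual = A.T @ P + P @ A - P @ B @ K + Q
closed_loop_eigs = np.linalg.eigvals(A - B @ K)

print("max |residual|:", np.abs(residual).max())
print("max Re(eig(A - BK)):", closed_loop_eigs.real.max())
```

If this returns a finite P with a small residual, the value trace(P @ W) gives a reference point against which the output of the semidefinite program can be compared.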

backtest portfolio optimisation - is it possible?

Hi all,
@SteveDiamond @davidhallac @echu @bodono

I'm just wondering about the Markowitz portfolio optimisation section of this paper.
How would you backtest it? Would you retrain the model every time you rebalance?

I tried to follow the code, but it's a bit hard to tell how to use the weights in a backtest.

Kind regards,
Andrew
