zhengyaojiang

followers: 462 following: 30 repos: 30 gists: 0

Name: Zhengyao Jiang

Type: User

Company: University College London

Bio: PhD student at UCL, interested in offline reinforcement learning (RL), data-efficient RL, and neuro-symbolic methods for RL.

Twitter: zhengyaojiang

Location: London, UK

Blog: zhengyaojiang.github.io

Zhengyao Jiang's Projects

awesome-decentralized-llm

Collection of LLM resources that can be used to build products you can "own" or to perform reproducible research.

cardboard-unity

Google Cardboard

d4rl

A benchmark for offline reinforcement learning.

decision-transformer

Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling.

draw_convnet

dreamerv2

Mastering Atari with Discrete World Models

gradientinduction

Framework of DataLog Neural Program Synthesis

graphbackup

Code release for Graph Backup: Data Efficient Backup Exploiting Markovian Transitions https://arxiv.org/abs/2205.15824

gtg

Source code of "Grid-to-Graph: Flexible Spatial Relational Inductive Biases for Reinforcement Learning" (AAMAS 2021).

inline_asm_snake

latentplan

Code release for Efficient Planning in a Compact Latent Action Space (ICLR 2023) https://arxiv.org/abs/2208.10291

mentalvr

Virtual reality controlled by mental commands and voice

neural-style

Neural style in TensorFlow! :art:

nlrl

Source code of Neural Logic Reinforcement Learning (https://arxiv.org/abs/1904.10729)

ntp

End-to-End Differentiable Proving

olps

Online Portfolio Selection toolbox

pdf-to-markdown

Convert PDF files into markdown files

pgportfolio

PGPortfolio: Policy Gradient Portfolio, the source code of "A Deep Reinforcement Learning Framework for the Financial Portfolio Management Problem" (https://arxiv.org/pdf/1706.10059.pdf).

ray

A high-performance distributed execution engine

rl-portfolio-management

An attempt to replicate "A Deep Reinforcement Learning Framework for the Financial Portfolio Management Problem" https://arxiv.org/abs/1706.10059 (includes an OpenAI Gym environment)