Xiang's Projects
Code for the AISTATS 2023 paper "A Statistical Analysis of Polyak-Ruppert Averaged Q-learning".
Code for Cdiscount image classification, a Kaggle competition to categorize seven million products into up to five thousand classes.
DQN and its variants in deep reinforcement learning. Currently it includes only vanilla DQN, average DQN and median DQN.
On the Convergence of FedAvg on Non-IID Data
PyTorch implementation of generative adversarial networks.
Convex optimizers for LASSO, including subgradient, projected gradient, proximal gradient, smoothing, Lagrangian and stochastic gradient descent variants.
Code for LocalPower
Xiang Li's personal homepage
Home of various optimization algorithms.
Library for training machine learning models with privacy for training data
Collection of reinforcement learning algorithms
A PyTorch implementation of RL algorithms. Currently it includes TRPO, ClipPPO, A2C, GAIL and ADCV.
Experiment code for the paper https://arxiv.org/abs/2404.01245