Giter Club home page Giter Club logo

rmtl's Introduction

Multi-Task Recommendations with Reinforcement Learning

Source code of Multi-Task Recommendations with Reinforcement Learning

Code for RetailRocket Dataset. Please convert data into MDP format.

Model Code

  • layers: stores common network structures

    • critic: critic network
    • esmm: esmm(actor) network, can introduce other MTL models as actor inside slmodels
    • layers: classical Embedding layers and MLP layers
  • slmodels: SL baseline models

  • agents: RL models

  • train: training-related configuration

  • env.py: offline sampling simulation environment

  • RLmain.py: main RL training program

  • SLmain.py: SL training main program

  • dataset

    • rtrl:retrailrocket dataset(Convert to MDP format:)[timestamp,sessionid,itemid,pay,click], [itemid,feature1,feature2,..],6:2:2

How to run it

MTL baselines

python3 SLmain.py --model_name=esmm

RMTL

python3 RLmain.py python3 SLmain.py --model_name=esmm --polish=1

Result:

test: best auc: 0.732444172986328 100%|████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 134/134 [00:07<00:00, 19.14it/s] task 0, AUC 0.7273702846096346, Log-loss 0.20675417715656488 task 1, AUC 0.7247954179346048, Log-loss 0.048957254763240504

Citation:

Please cite with the below bibTex if you find it helpful to your research.

@inproceedings{liu2023multi,
  title={Multi-Task Recommendations with Reinforcement Learning},
  author={Liu, Ziru and Tian, Jiejie and Cai, Qingpeng and Zhao, Xiangyu and Gao, Jingtong and Liu, Shuchang and Chen, Dayou and He, Tonghao and Zheng, Dong and Jiang, Peng and others},
  booktitle={Proceedings of the ACM Web Conference 2023},
  pages={1273--1282},
  year={2023}
}

rmtl's People

Contributors

lzr-s avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.