rl-verification-project's Introduction

RL-verification-project

Class project for CSCI 699: Safe-Learning Enabled Autonomous Systems Spring 2024

Project Overview

This project develops and evaluates a reinforcement learning model for managing classroom attendance during an ongoing epidemic. The model aims to balance the dual objectives of minimizing infection risks and maximizing in-person educational activities. The project employs PyTorch for model development and the Z3 theorem prover for verification of the learned policies.

Dependencies

Python 3.8+
PyTorch
tqdm
matplotlib
numpy
z3-solver
pandas

Key Components

Model Definition (Model class): A neural network model defined using PyTorch. It predicts policies based on the current state, which includes community risk and the number of infected individuals.
Infection Dynamics (get_infected_students_apprx_sir function): Simulates the number of new infections based on current policies and epidemiological parameters.
Policy Evaluation (get_label function): Generates labels for training by evaluating the consequences of different allowed attendance levels on infection spread.
Training Loop (train function): Executes the training process over a specified number of epochs, adjusting model weights based on observed rewards and the effectiveness of actions taken under various simulated conditions.
Verification (verify function): Uses the Z3 solver to check if the learned model adheres to desired safety conditions under high-risk scenarios.
Visualization (visualize_model_behavior function): Visualizes the model's behavior over a range of inputs to assess policy consistency and effectiveness.

Setup and Running

Installation

Install the required packages using pip:

pip install torch tqdm matplotlib numpy z3-solver pandas

Recommend Projects

React

A declarative, efficient, and flexible JavaScript library for building user interfaces.

Vue.js

🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

Typescript

TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

TensorFlow

An Open Source Machine Learning Framework for Everyone

Django

The Web framework for perfectionists with deadlines.

Laravel

A PHP framework for web artisans

D3

Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

javascript

JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

web

Some thing interesting about web. New door for the world.

server

A server is a program made to process requests and deliver data to clients.

Machine learning

Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

Visualization

Some thing interesting about visualization, use data art

Game

Some thing interesting about game, make everyone happy.

Recommend Org

Facebook

We are working to build community through open source technology. NB: members must have two-factor auth.

Microsoft

Open source projects and samples from Microsoft.

Google

Google ❤️ Open Source for everyone.

Alibaba

Alibaba Open Source for everyone

D3

Data-Driven Documents codes.

Tencent

China tencent open source team.

eondula / rl-verification-project Goto Github PK