eleurent / phd-thesis

My PhD thesis. I defended on the 30th of October, 2020! See https://github.com/eleurent/phd-defense/

License: MIT License

TeX 99.71% Python 0.29%

phd-dissertation phd-thesis phd-thesis-latex

phd-thesis's Issues

Chap. 2: Literature Review

Overview: how have researchers approached this problem?
What obstacles did they encounter, and how did they try to address them?

Idea: a general framework, with a practical obstacle at each modeling step: the real world is hard to reduce to an MDP.

We should discuss low-level control / motion planning approaches. Where should that go?

Chap. 1: Introduction

Main sections:

Intro: Moral philosophy / ethics and the science of decision-making
Nuts and bolts of AD
Context and Scope: What is the problem, and what are our objectives? (medium-term decisions) Where does the difficulty lie? (uncertainty, interactions) What tools do we have to tackle it?
Contributions: how did we approach the problem? Which obstacles did we overcome?
Outline

Leftbar environments break to next page after the first line.

Expected behavior: the environment either wraps nicely across the page break, or breaks before its first line.

Remove any mention of the following terms

in this paper
in this work
supplementary material
Q^*, V^*, pi^* (use \star)
i.e., e.g., etc. (use macros)
\ref
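
A minimal sketch of how this checklist could be automated, assuming the chapters are plain `.tex` files passed on the command line. The `FORBIDDEN` table and the `find_forbidden` helper are hypothetical names introduced for illustration, not part of the repository, and the regular expressions are only approximations of the terms listed above.

```python
import re

# Hypothetical patterns mirroring the checklist above (assumption: adjust as needed).
FORBIDDEN = {
    "in this paper": re.compile(r"in this paper", re.IGNORECASE),
    "in this work": re.compile(r"in this work", re.IGNORECASE),
    "supplementary material": re.compile(r"supplementary material", re.IGNORECASE),
    # Matches ^* and ^{*}, which should be written with \star instead.
    "^* superscript": re.compile(r"\^\{?\*\}?"),
    # Plain abbreviations not preceded by a backslash or a letter.
    "plain i.e./e.g./etc.": re.compile(r"(?<![\\\w])(?:i\.e\.|e\.g\.|etc\.)"),
    # Bare \ref; \cref and \eqref are not matched.
    "\\ref": re.compile(r"\\ref\b"),
}


def find_forbidden(text):
    """Return a list of (line_number, label, line) for each forbidden term found."""
    hits = []
    for lineno, line in enumerate(text.splitlines(), start=1):
        # Ignore everything after an unescaped % (LaTeX comment).
        code = re.split(r"(?<!\\)%", line, maxsplit=1)[0]
        for label, pattern in FORBIDDEN.items():
            if pattern.search(code):
                hits.append((lineno, label, line.strip()))
    return hits


if __name__ == "__main__":
    import sys
    from pathlib import Path

    # Usage (assumed): python check_terms.py chapter1.tex chapter2.tex
    for path in map(Path, sys.argv[1:]):
        for lineno, label, line in find_forbidden(path.read_text(encoding="utf-8")):
            print(f"{path}:{lineno}: {label}: {line}")
```

Commented-out text is skipped on purpose, so old drafts left behind a `%` do not trigger false alarms.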

Solve all TODOs

Chap. 3: Problem Statement

Go back over the checklist from Chap. 2, specifying the choices made in this thesis (as opposed to the SOTA)

State: positions and velocities of objects, lanes, potentially any information.
Actions: we do not want to optimize low-level control (e.g. comfort), but short-term semantic objectives. Discrete structure.
Dynamics: kinematic bicycle model. Behavior models for the other agents, taken from traffic simulation. They must be able to react to the vehicle's behavior. State-feedback controller for the ego vehicle.
Reward: as simple as possible: velocity and collision. In particular, we do not want to specify every aspect of the behavior through the reward (e.g. safety distance). If possible, we only give our final objectives and hope to see the appropriate behavior emerge (the safety distance is necessary and optimal because of the uncertainty about the behavior of the vehicle ahead, particularly in a worst-case setting as opposed to an average-case one)
Implementation: a few words on highway-env, with a pointer to the appendix.
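
To make the Dynamics and Reward items concrete, here is a minimal sketch of one textbook form of the kinematic bicycle model (rear-axle reference point, explicit Euler integration) together with a velocity-plus-collision reward. It is an illustration only, not the implementation used in highway-env or in the thesis: the names `VehicleState`, `bicycle_step` and `reward`, as well as the wheelbase, time step, speed range and collision penalty, are all assumptions.

```python
import math
from dataclasses import dataclass


@dataclass
class VehicleState:
    x: float        # longitudinal position [m]
    y: float        # lateral position [m]
    heading: float  # yaw angle [rad]
    speed: float    # forward speed [m/s]


def bicycle_step(state, acceleration, steering, wheelbase=2.5, dt=0.1):
    """One explicit-Euler step of a rear-axle kinematic bicycle model.

    wheelbase [m] and dt [s] are assumed values chosen for illustration.
    """
    return VehicleState(
        x=state.x + state.speed * math.cos(state.heading) * dt,
        y=state.y + state.speed * math.sin(state.heading) * dt,
        heading=state.heading + state.speed / wheelbase * math.tan(steering) * dt,
        speed=state.speed + acceleration * dt,
    )


def reward(speed, crashed, speed_range=(20.0, 30.0), collision_penalty=1.0):
    """Velocity term in [0, 1] minus a penalty on collision (assumed weights)."""
    low, high = speed_range
    speed_term = min(max((speed - low) / (high - low), 0.0), 1.0)
    return speed_term - collision_penalty * float(crashed)


# Example: drive straight for one step while accelerating.
state = VehicleState(x=0.0, y=0.0, heading=0.0, speed=10.0)
next_state = bicycle_step(state, acceleration=1.0, steering=0.0)
```

Note that nothing in this reward mentions a safety distance: in line with the item above, any such behavior would have to emerge from planning under uncertainty about the other vehicles.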

Change the color of leftbars

Especially that of theorembar?
