Giter Club home page Giter Club logo

wangdomg's Projects

social_lstm_pedestrian_prediction icon social_lstm_pedestrian_prediction

The aim of the project is to predict the trajectories of pedestrians using lstm neural networks. The project starts from the paper "Social LSTM: Human Trajectory Prediction in Crowded Spaces - Alexandre Alahi, Kratarth Goel, Vignesh Ramanathan, Alexandre Robicquet, Li Fei-Fei, Silvio Savarese - Stanford University - The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016, pp. 961-971", and its official implementation (https://github.com/vvanirudh/social-lstm-tf) and makes some modifications. This is the Multimedia communication course project made during my final year of the bachelor degree under the supervision of professor Nicola Conci and his phd student Niccolò Bisagno. The modifications introduced are two: - To every simulated pedestrian add the input goal; the goal is the final position (in x and y coordinates) of that pedestrian when it disappears from the video. This modification should improve the predicted trajectory of that pedestrian because of the introduction of this new information - The grid created for every pedestrian in the original project to identify nearby pedestrians is replaced with an array containing the position(in x and y coordinates) of the others pedestrians in distance order, from the closest to the farther. This modification should improve the model results beacuse it presents relevant informations in order to the neural network. Then these two modifications were combined in to a single model. Every model has been evaluated in the test videos with different parameters and in conclusion the model with the two modifications (goal and array) combined performed better than any other model. Also the two modify models performed better than the original model. These results can be seen in the report at page 12 and 13. Unfortunately I haven't the time to translate the report in english, because now is in italian, but the result table at page 12 an 13 should be pretty clear. Technical details: - Programming language: Python 2.7 - Neural networks library used: Tensorflow 1.5 - External libraries: CUDA 8.0, CUDNN 6.0 - OS: Linux, Ubuntu 16.04 distrubution License: GPL v3

strnn icon strnn

Predicting the Next Location: A Recurrent Model with Spatial and Temporal Contexts

torchsample icon torchsample

High-Level Training, Data Augmentation, and Utilities for Pytorch

tpu icon tpu

Reference models and tools for Cloud TPUs.

uu_cf icon uu_cf

Python实现的基于User的协同过滤算法

web_kg icon web_kg

爬取百度百科中文页面,抽取三元组信息,构建中文知识图谱

xflow icon xflow

Distributed LR、 FM and MVM model on Parameter Server. FTRL and SGD Optimization Algorithm.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.