Giter Club home page Giter Club logo

Comments (3)

os1a avatar os1a commented on June 25, 2024

Thanks for your interest in our code.

We can share with you most of the processing code which results in obtaining the structure of our processed dataset including both the images and floats files.

I have added two files: extract_frames.py and parse_annotation.py. You need to run them in that order. This will create the structure of the dataset including images and floats files.

Then you need to write your own script which creates the scene.txt file for each scene. The structure of the scene.txt file is already explained in the README:
scene.txt: each line represent one testing sequence and has the following format: tracking_id img_0,img_1,img_2,img_future.

This can be created by iterating over the files (which are generated from the above two scripts) and then writing the scene.txt file in the format described above. Unfortunately our script for creating the scene.txt file operates on a private different strcuture and thus cannot be shared. However, writing your own script should not be too hard and you are welcome to write for clarification.

Best,

from multimodal-future-prediction.

AlexanderRadovic avatar AlexanderRadovic commented on June 25, 2024

Ah this is perfect, thanks so much for the rapid response! Skimming through the files, I think this is exactly what I needed.

Best,
-Alex

from multimodal-future-prediction.

rafalk342 avatar rafalk342 commented on June 25, 2024

@os1a
Can you explain a little more how scene.txt files were created? I understand the format and that first three frames are x, x + 15, x + 30 and the fourth is x + 180, but I don't know:

  • how are tracking_ids chosen and if they matter?
  • what determines the number of lines per file?
  • can it happen that in the fourth frame the object is out of the picture?

I also have one question regarding parse_annotation.py:

  • is it safe to filter occluded images? Can it cause some prediction problems when ground truth is occluded?

from multimodal-future-prediction.

Related Issues (12)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.