Comments (2)

wohler commented on May 16, 2024

I'm not familiar with Snakemake or Apache Airflow. Judging from their introductions, they seem very similar to Jenkins to me: workflows for building things. That's where the similarity with Ziggy ends.

Ziggy is designed as a factory for processing data that’s coming in from a current mission. It handles large volumes of data on thousands of processors, and a pipeline may take hours or days to run. Ziggy preserves records and data products, and provides data accountability across a long span of time. In contrast, the Apache Airflow documentation says you can run your workflow hourly or daily. Although these tools can do extremely large analyses, they seem to be aimed at a much more informal setting, such as a laboratory or research group.

Ziggy also predates these tools by a decade, and is certified to process data for NASA missions. I don't see these tools on our group's accredited tool list.

That said, it would be worthwhile to describe similar tools in our README and more thoroughly answer your question. Thank you.

I didn't quite follow the second half of your question. We've released the Kepler pipeline to GitHub (github.com/nasa/kepler-pipeline) for study only, as this version can't be compiled or run due to licensing issues. Note that this code also includes the bulk of the K2 work.

We are also applying Ziggy to the TESS pipeline as we speak. While the Ziggy version that does this will be pushed to GitHub, the TESS pipeline will not. Once we decide to make the TESS source public, in our experience it will take at least a year to authorize it for deployment to GitHub. So don't hold your breath for that one!

from ziggy.

lgrcia commented on May 16, 2024

Thank you for your detailed response. I overlooked the certification requirements that are inherent to NASA and the importance of ensuring data prosperity (something not tackled by the tools I mentioned).

Regarding my second question, I appreciate the clarification that the released pipelines cannot be directly compiled. Since I am only interested in specific tasks (such as TPS), I will focus on separate implementations that can be run locally.

