
Comments (2)

hongzimao commented on July 27, 2024

Thanks for your questions.

  1. The scheduler needs to know how many executors are available when it makes a scheduling decision. In other words, the optimal scheduling decision can change with the number of available executors. For example, when only 1 executor is available, Decima has to assign that executor to the most important node. But if there are 50 free executors, Decima might assign executors to two parent stages in a job (i.e., when executors are plentiful, running only one parent stage can be suboptimal).

  2. We associate the parallelism with the DAG mainly because we want to reduce the problem complexity. In our paper, Section 5.2 (the parallelism limit part), the paragraph "Decima’s action specifies job-level parallelism, as opposed fine-grained stage-level parallelism...." explains this point in more detail.
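To make point 1 concrete, here is a minimal sketch in Python. It is not Decima's actual policy or code from decima-sim. It is a hand-written greedy heuristic, and the `Stage` class, the `assign_executors` function, and the example numbers are all hypothetical. It only illustrates that the same rule produces different decisions when the number of free executors changes.

```python
from dataclasses import dataclass


@dataclass
class Stage:
    """Hypothetical runnable stage (not a decima-sim class)."""
    name: str
    num_tasks: int        # remaining tasks; also the max useful executors
    task_duration: float  # assumed seconds per task


def assign_executors(runnable, free_executors):
    """Toy greedy rule: give each executor to the stage that would still
    take the longest to finish after receiving one more executor."""
    assignment = {s.name: 0 for s in runnable}
    for _ in range(free_executors):
        # A stage cannot use more executors than it has tasks.
        candidates = [s for s in runnable if assignment[s.name] < s.num_tasks]
        if not candidates:
            break  # leftover executors stay idle

        def est_finish(s):
            work = s.num_tasks * s.task_duration
            return work / (assignment[s.name] + 1)

        best = max(candidates, key=est_finish)
        assignment[best.name] += 1
    return assignment


# Two parent stages of one job (made-up numbers).
stages = [Stage("A", num_tasks=10, task_duration=2.0),
          Stage("B", num_tasks=4, task_duration=1.0)]

print(assign_executors(stages, 1))   # the single executor goes to stage A
print(assign_executors(stages, 50))  # both parent stages run in parallel
```

With one executor the heuristic serves only the heavier stage; with 50 it covers both parent stages, which is why the scheduler's input has to include the number of available executors.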

Hope these help!

from decima-sim.

jahidhasanlinix commented on July 27, 2024

Well explained.

I just have a question regarding #1. You said Decima will have to assign the executor to the most important node. My question is: how does it actually prioritize the executor for such a job? For example, suppose I have multiple jobs in the DAG and some nodes are free or have no dependencies. How can I teach the patterns of the nodes so that the executor completes the job in an optimized way (I guess this is where the ML concept is involved)? Is there any particular parameter you would point to so I can understand it better? Also, which part of the code lets the DAG do this with the GNN, and how is the GNN actually triggered from the DAG job state?

Another thing, based on what I have found and understood so far, that I want to share and get your opinion on: is it possible to create a DAG without an RDD in the Spark environment? I tried to figure this out but did not find a good explanation. Could you add some points here?

Again, thank you so much. I look forward to getting some valuable information from you.

