Giter Club home page Giter Club logo

repair-partitioning's People

Contributors

dgromov avatar yankrasny avatar

Stargazers

 avatar  avatar  avatar

Watchers

 avatar  avatar

repair-partitioning's Issues

Need to record fragment frequencies

See printWordFragments for code that can be adopted to store the fragments that were found. Then scan the wordIDs for each fragment and record its frequency.

That ^ seems like it would be O(n^2). Ok, think about this some more.

calcOffsets crashes, maybe stack overflow

We get a seg fault in the middle of calcOffsets, where our current node is at an invalid mem location. I think the tree might be really huge, and calcOffsets recurses twice, so perhaps we have a stack overflow.

Is the algorithm still correct if stringToWordIDs() doesn't run?

Calling the code as a library using the runRepairPartitioning() function circumvents the functionality in main(). The wordID will not be set properly, but I'm not sure if that's really a problem for repair.

  1. Investigate
  2. Adapt main() to work with wordIDs instead of text

Refactor command line params

Add numLevelsDown or maxRecursiveCalls.
Remove old ones you're not using.

Also refactor jinru's code to take your params

Implementing maxNumCalls

In the getBestSubset function, If we hit maxNumCalls then set a variable to true. So if a recursive call seta that variable to true, that also sets yours to true. At that point just return.

The calling code checks that variable and if it's true, reruns the getBestSubset function with different params to try and reduce numCalls

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.