Giter Club home page Giter Club logo

Comments (9)

saini avatar saini commented on July 28, 2024 3

The number of Node folders represents the number of processes that were run in parallel to carry out the clone detection. The numeric argumnet N in the command ‘Python controller.py N’ tells the controller script to cary out clone detection using N processes. For systems where memory is low, N should be 1. Each process will reserve the amount of memory which is specified in the xmx and xms arguments to jvm.

from sourcerercc.

zhuwq585 avatar zhuwq585 commented on July 28, 2024 1

@dyangUCI @saini
Thanks for your quick response.
I'd like to know the results for the three sampled projects. In my computer, there is no "query" file under /NODE_1/output8.0 after executing "python controller.py 1". I'd like to know what's the matter.

Did you find the matter? (If U still remember it...)

from sourcerercc.

dyangUCI avatar dyangUCI commented on July 28, 2024

sorry, I cannot regenerate your error. When I ran the command "python tokenizer.py zipblocks", there will be data under file_blocks_stats. Did you unzip the folder test-env.tgz? Maybe that's the issue?

from sourcerercc.

Kaka727 avatar Kaka727 commented on July 28, 2024

@dyangUCI
Thanks for your response. This time I retry this command and the file "file_blocks_stats" really contains some contents as below.
image

However, the file "file-tokens" is still empty. I'd like to know if this is the case under your environment.
Thanks~

from sourcerercc.

dyangUCI avatar dyangUCI commented on July 28, 2024

Hi, I found the issue in the tokenizer: there's some extra info we once collected for Java functions for some specific experiments and abandoned later on, but the code remains in tokenizer.py, causing index out of range failures, so the results files are not complete. Please pull the git project now and rerun the tokenizer.py, it should be correct now.

from sourcerercc.

dyangUCI avatar dyangUCI commented on July 28, 2024

There will be 56 blocks in the tokens file. The stats file contains both file stats and block stats, 61 lines in total. You can check the results on your end accordingly.

from sourcerercc.

Kaka727 avatar Kaka727 commented on July 28, 2024

Yeah, thanks very much!
this time it really works~

from sourcerercc.

Kaka727 avatar Kaka727 commented on July 28, 2024

But I still have some questions below.
First, in my computer, the results for block-level clones of sampled projects are null. Is it right?
Second, I'd like to know what do Node_1, Node_2, and so on represent for?
Thanks~

from sourcerercc.

Kaka727 avatar Kaka727 commented on July 28, 2024

@dyangUCI @saini
Thanks for your quick response.
I'd like to know the results for the three sampled projects. In my computer, there is no "query" file under /NODE_1/output8.0 after executing "python controller.py 1". I'd like to know what's the matter.

from sourcerercc.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.