Comments (9)
The number of Node folders represents the number of processes that were run in parallel to carry out the clone detection. The numeric argumnet N in the command ‘Python controller.py N’ tells the controller script to cary out clone detection using N processes. For systems where memory is low, N should be 1. Each process will reserve the amount of memory which is specified in the xmx and xms arguments to jvm.
from sourcerercc.
@dyangUCI @saini
Thanks for your quick response.
I'd like to know the results for the three sampled projects. In my computer, there is no "query" file under /NODE_1/output8.0 after executing "python controller.py 1". I'd like to know what's the matter.
Did you find the matter? (If U still remember it...)
from sourcerercc.
sorry, I cannot regenerate your error. When I ran the command "python tokenizer.py zipblocks", there will be data under file_blocks_stats. Did you unzip the folder test-env.tgz? Maybe that's the issue?
from sourcerercc.
@dyangUCI
Thanks for your response. This time I retry this command and the file "file_blocks_stats" really contains some contents as below.
However, the file "file-tokens" is still empty. I'd like to know if this is the case under your environment.
Thanks~
from sourcerercc.
Hi, I found the issue in the tokenizer: there's some extra info we once collected for Java functions for some specific experiments and abandoned later on, but the code remains in tokenizer.py, causing index out of range failures, so the results files are not complete. Please pull the git project now and rerun the tokenizer.py, it should be correct now.
from sourcerercc.
There will be 56 blocks in the tokens file. The stats file contains both file stats and block stats, 61 lines in total. You can check the results on your end accordingly.
from sourcerercc.
Yeah, thanks very much!
this time it really works~
from sourcerercc.
But I still have some questions below.
First, in my computer, the results for block-level clones of sampled projects are null. Is it right?
Second, I'd like to know what do Node_1, Node_2, and so on represent for?
Thanks~
from sourcerercc.
@dyangUCI @saini
Thanks for your quick response.
I'd like to know the results for the three sampled projects. In my computer, there is no "query" file under /NODE_1/output8.0 after executing "python controller.py 1". I'd like to know what's the matter.
from sourcerercc.
Related Issues (20)
- There is something wrong in step2 when detect file clone
- the website your paper give has some tools,but i cant open,it is useless now
- result format of code-clone detection in block-level HOT 1
- 0 and 1 token clones not detected
- Corrupt lines in pair file HOT 1
- How can I specify min line threshold?
- Why controller.py can not run? Could anybody help me? HOT 5
- Changing the value of threshold
- cat: 'clone-detector/NODE_*/output8.0/query_*': No such file or directory HOT 1
- Q How to resolve controller.execute() error : One or more nodes failed during Step Search.
- How to create the clone mapping in C or C++? HOT 2
- Provide the patch which makes the block-level tokenizer.py compatible with current python version HOT 4
- failed in testing tokenizer with tokenizer-sample-input HOT 1
- Failed to run block-level tokenizer HOT 16
- Where is block-level tokenizer and partial index algorithm HOT 1
- collector.py doesn't generate query_*, only report.csv HOT 1
- Is it possible that a type-1 clone won't be reported as a clone by SourcererCC? HOT 2
- Bug Report: Tokenizer failed to handle the final line of listed projects when missing newline character
- How to get the similarity score of a clone pair?
- Help. Is there any tutorial for incremental SCC?
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from sourcerercc.