Comments (3)
Do you have python-louvain installed?
from fastconsensus.
You are right, this issue occurred when I used the community
from networkx.algorithms
instead of python-louvain
, it might be helpful to comment in the imports like:
import community as cm # python-louvain community
By the way, I realized that issue afterwards, a bit extended your algorithm (mostly I/O formats) and added it to the Clubmark benchmarking framework: https://github.com/eXascaleInfolab/clubmark/blob/master/algorithms/fast_consensus.py
Moreover, we submitted a paper to BigData '19, where your algorithm is evaluated together with almost a dozen others. Overall, it has high accuracy but is much less scalable than Louvain being not always robust to small network pertubations. There main (technical) issue is that convergence of the Fast Consensus Louvain (FCoL) significantly and non-linearly depends on the number of worker processes in the multiprocessing pool
even fixing all the algorithm-related parameters (n_p
, tau
, delta
), which should not happen in theory.
Namely, FCoL converges on amazon
and dblp
(https://snap.stanford.edu/data/#communities) within couple of hours for the default values of tau
, delta
and n_workers
= n_p
= 5, but it can not converge within several days (basically, it hangs) setting n_workers <= 2
for the same n_p
= 5. In practice, the difference there is only in the time when the workers are started. It seems, when all workers are started simultaneously then Louvain yields (approximately) the same results and does not require any consensus-related processing at all (probably, they have a tie breaking strategy depending on a low-resolution timer, system time). But when the workers are started in distinct time and Louvain yields distinct partitions then your algorithm fails to converge (hangs) on the outlined datasets for the default tau
, independently on delta
(I tried all values in the range 0.02
.. 0.1
with the step 0.01
) and setting n_p = 5
. It seems, for n_p = 10
on the single worker the convergence also never happens on that datasets. So, you might need to investigate the outlined issue further and reconsider either the default tau
or something else to ensure the proper convergence.
from fastconsensus.
Thanks a lot for your feedback. I will look into the issues you mentioned for real world datasets and see if it can be improved.
We are additionally doing an extensive analysis of the procedure on real world datasets so we hope to have this fixed soon.
from fastconsensus.
Related Issues (6)
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from fastconsensus.