Giter Club home page Giter Club logo

bisecting-kmeans-blog's People

Contributors

dennyglee avatar yu-iskw avatar

Stargazers

 avatar  avatar

Watchers

 avatar  avatar  avatar

bisecting-kmeans-blog's Issues

Modified source code to store & return the entire tree structure of bisecting k-means?

Hello, thanks for putting up this repo on Spark implementation of bisecting k-means. Very helpful.

Quick question to follow up on the "What's Next?" section of your intro to bisecting k-means clustering in MLlib 1.6: https://github.com/yu-iskw/bisecting-kmeans-blog/blob/master/blog-article.md

For the first ticket you mentioned [SPARK-11664] "Add methods to get bisecting k-means cluster structure (to get the full cluster tree?)", I checked its JIRA ticket (https://issues.apache.org/jira/browse/SPARK-11664) and the status was marked "resolved". However, in Spark MLlib's latest implementation (2.4.4) as follows, I didn't find this tree structure, or dendrogram to be a built-in output:

PySpark MLlib 2.4.4 official documentation:
https://spark.apache.org/docs/latest/api/python/pyspark.mllib.html#pyspark.mllib.clustering.BisectingKMeans
https://spark.apache.org/docs/latest/api/python/pyspark.mllib.html#pyspark.mllib.clustering.BisectingKMeansModel

Scala MLlib 2.4.4 official documentation:
https://spark.apache.org/docs/latest/api/scala/index.html#org.apache.spark.mllib.clustering.BisectingKMeans
https://spark.apache.org/docs/latest/api/scala/index.html#org.apache.spark.mllib.clustering.BisectingKMeansModel

We also looked up into their source code, and it does not seem to have the hierarchical tree structure stored as built-in output?

If the tree structure is not available in Spark MLlib 2.4.4 BisectingKMeans, would you by any chance know if anyone has modified the source code to get the tree structure with the modified code published?

Thanks!

Security Policy violation SECURITY.md

This issue was automatically created by Allstar.

Security Policy Violation
Security policy not enabled.
A SECURITY.md file can give users information about what constitutes a vulnerability and how to report one securely so that information about a bug is not publicly visible. Examples of secure reporting methods include using an issue tracker with private issue support, or encrypted email with a published key.

To fix this, add a SECURITY.md file that explains how to handle vulnerabilities found in your repository. Go to https://github.com/yu-iskw/bisecting-kmeans-blog/security/policy to enable.

For more information, see https://docs.github.com/en/code-security/getting-started/adding-a-security-policy-to-your-repository.


This issue will auto resolve when the policy is in compliance.

Issue created by Allstar. See https://github.com/ossf/allstar/ for more information. For questions specific to the repository, please contact the owner or maintainer.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.