Topic: minhash Goto Github
Some thing interesting about minhash
Some thing interesting about minhash
minhash,Fast MinHash Distances algorithms collection
User: ain0n
minhash,A Clojure library for querying large data-sets on similarity
User: andrewmcloud
minhash,Sketching Algorithms for Clojure (bloom filter, min-hash, hyper-loglog, count-min sketch)
Organization: bigmlcom
minhash,plagiarism detector
User: blackinkgj
minhash,JS implementation of probabilistic data structures: Bloom Filter (and its derived), HyperLogLog, Count-Min Sketch, Top-K and MinHash
User: callidon
Home Page: https://callidon.github.io/bloom-filters/
minhash,There are Python 2.7 codes and learning notes for Spark 2.1.1
User: cheng-lin-li
Home Page: https://cheng-lin-li.github.io/Spark
minhash,Elasticsearch plugin for b-bit minhash algorism
Organization: codelibs
minhash,A Minwise Hashing Method for Addressing Relationship Extraction from Text
User: davidsbatista
minhash,Web Scraping, Document Deduplication & GPT-2 Fine-tuning with a newly created scam dataset.
User: davidsvy
minhash,C++ Implementations of sketch data structures with SIMD Parallelism, including Python bindings
User: dnbaker
minhash,Locality Sensitive Hashing In R
User: dselivanov
minhash,Quickly estimate the similarity between many sets
User: duhaime
Home Page: https://duhaime.github.io/minhash/
minhash,Dynatrace hash library for Java
Organization: dynatrace-oss
minhash,SetSketch: Filling the Gap between MinHash and HyperLogLog
Organization: dynatrace-research
minhash,Easy-to-use Java similarity algorithms for text and numeric-series
User: edduarte
minhash,MinHash, LSH, LSH Forest, Weighted MinHash, HyperLogLog, HyperLogLog++, LSH Ensemble and HNSW
User: ekzhu
Home Page: https://ekzhu.github.io/datasketch
minhash,Genomic neighbor typing of bacterial pathogens using MinHash :rat:
User: esteinig
minhash,A method to mine beyond-pairwise relationships using Min-Hashing for large-scale pattern discovery
User: gibranfp
minhash,A simple audio fingerprinting system
User: gurushida
minhash,cross-architecture binary comparison database
User: h4sh5
minhash,Software to identify plasmid sequence data from metagenome using logistic regression and Minhash
User: haradama
minhash,Massive Sparse Data Clustering Based on Frequent Items (SIGMOD 2023)
User: huangqiang
minhash,Python library for detecting near duplicate texts in a corpus at scale using Locality Sensitive Hashing, as described in chapter three of Mining Massive Datasets.
User: justinbt1
minhash,Minhash and maxhash library in Python, combining flexibility, expressivity, and performance.
User: lgautier
minhash,Union, intersection, and set cardinality in loglog space
Organization: liveramp
minhash,Poster presented at RECOMB 2017
User: luizirber
minhash,Rust implementation of sourmash core functionality
User: luizirber
minhash,Locality Sensitive Hashing using MinHash in Python/Cython to detect near duplicate text documents
User: mattilyra
minhash,HyperLogLog with intersection
User: mbrg
minhash,An improved method of locality-sensitive hashing for scalable instance matching. In this study, we propose a scalable approach for automatically identifying similar candidate instance pairs in very large datasets utilizing minhash-lsh-algorithm in C#.
User: mehmetaydar
Home Page: https://link.springer.com/article/10.1007/s10115-018-1199-5
minhash,BagMinHash - Minwise Hashing Algorithm for Weighted Sets
User: oertl
minhash,ProbMinHash – A Class of Locality-Sensitive Hash Algorithms for the (Probability) Jaccard Similarity
User: oertl
minhash,TreeMinHash: Fast Sketching for Weighted Jaccard Similarity Estimation
User: oertl
minhash,find similar text files quickly
User: ppw0
minhash,Locality Sensitive Hashing
User: serega
minhash,Quickly search, compare, and analyze genomic and metagenomic data sets.
Organization: sourmash-bio
Home Page: http://sourmash.readthedocs.io/en/latest/
minhash,A database for signatures of public genomic sources
Organization: sourmash-bio
Home Page: https://wort.sourmash.bio
minhash,Weighted MinHash implementation on CUDA (multi-gpu).
Organization: src-d
minhash,Minhash clustering of text documents
User: steven-s
minhash,k-shingling for text to help compare similarity
User: steven-s
minhash,Probabilistic data structures for OCaml
User: travisbrady
minhash,Using Jaccard-Similarity and Minhashing to determine similarity between two text documents
User: tsunny007
minhash,Document store that periodically checks for changes in web documents
Organization: vokter
minhash,MinHash and LSH index written in Rust for Node.js
Organization: wherefortravel
minhash,A resistome profiler for Graphing Resistance Out Of meTagenomes
User: will-rowe
minhash,Detect and visualize text reuse
Organization: yaledhlab
Home Page: https://lab-apps.s3-us-west-2.amazonaws.com/intertext/redesign/index.html
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.