Giter Club home page Giter Club logo

fastlsh's Introduction

FastLSH_LOGO

Build Status

This is a C++ implementation of E2LSH and C2LSH. More detail to come. The build won't pass now because travis don't support hadoop environment. I will try to fix it.

##Attention: The current version requires certain programs installedfor the full functionality. It has been tested on Ubuntu 16.04 with the following programs

  • Hadoop 2.7.3
  • OpenMP 4.5
  • Memcached 1.4.34
  • Redis 3.2
  • OpenMPI 2.02
  • Boost 1.5
  • Spark 2.1.0

If you can not fulfill the requisites, try to use CMakeList_safe.txt in the repo and comment the lines where compile error happens.(i.e. Line 32-34 and 42-43 in src/LSH.cpp as well as 9 and 14 in test/driver.cpp) I will creat a safa brach soon.

##Parameters

  • N the number of vectors in the N dataset
  • Q the number of vectors in the query dataset
  • D the number of dimensions
  • L the number of group hash
  • K the number of lines in each hash group -- the larger the value, the fewer the collisions
  • W the number of bucket width

TODO

  • Access to MPI
  • Access to Redis
  • Access to memcached
  • Distributed computing
  • Parameter generator
  • Rename function names
  • Rafactor, apply design patterns

System Structure

system structure

Authors

Acknowledgments

  • Gan, J., Feng, J., Fang, Q., & Ng, W. (2012, May). Locality-sensitive hashing scheme based on dynamic collision counting. In Proceedings of the 2012 ACM SIGMOD International Conference on Management of Data (pp. 541-552). ACM.

fastlsh's People

Contributors

peterxuyaohai avatar nokcwong2 avatar joshuawnc avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.