Giter Club home page Giter Club logo

zingg's Introduction

The Problem

Real world data contains multiple records belonging to the same customer. These records can be in single or multiple systems and they have variations across fields which makes it hard to combine them together, especially with growing data volumes. This hurts customer analytics - establishing lifetime value, loyalty programs or marketing channels is impossible when the base data is not linked. No AI algorithm for segmentation can produce right results when there are multiple copies of the same customer lurking in the data. No warehouse can live up to its promise if the dimension tables have duplicates.

# Zingg - Data Silos

Why Zingg

Zingg is an ML based tool for entity resolution. The following features set Zingg apart from other tools and libraries

  • Ability to handle any entity like customer, patient, supplier, product etc
  • Ability to connect to disparate data sources. Local and cloud file systems in any format, enterprise applications and relational, NoSQL and cloud databases and warehouses
  • Ability to scale to large volumes of data. See why this is important and Zingg performance numbers
  • Interactive training data builder using active learning that builds models on frugally small training samples to high accuracy. Shows records and asks user to mark yes, no, cant say on the cli.
  • Ability to define domain specific functions to improve matching
  • Out of the box support for English as well as Chinese, Thai, Japanese, Hindi and other languages

Zingg is useful for

  • Building unified and trusted views of customers and suppliers across multiple systems
  • Large Scale Entity Resolution for AML, KYC and other fraud and compliance scenarios
  • Deduplication and data quality
  • Identity Resolution
  • Integrating data silos during mergers and acquisitions
  • Data enrichment from external sources
  • Establishing customer households

# Zingg - Data Mastering At Scale with ML

Demo

See Zingg in action here

Getting Started

The easiest way to get started with Zingg is through Docker and by running the prebuilt models.

docker pull zingg/zingg:0.3.2
docker run -it zingg/zingg:0.3.2 bash
./scripts/zingg.sh --phase match --conf examples/febrl/config.json

Check the step by step guide for more details.

The Story

What is the backstory behind Zingg?

Documentation

Check detailed Zingg documentation

Community

Be part of the conversation in the Zingg Community Slack

Reporting bugs and contributing

Want to report a bug or request a feature? Let us know on Slack, or open an issue

Want to commit code? Lets talk on Slack

Book Office Hours

If you want to schedule a 30-min call with our team to help you get set up, please book a slot here.

Asking questions

If you have a question or issue while using Zingg, kindly log a question and we will reply very fast :-)

License

Zingg is licensed under AGPL v3.0 - which means you have the freedom to distribute copies of free software (and charge for them if you wish), that you receive source code or can get it if you want it, that you can change the software or use pieces of it in new free programs, and that you know you can do these things.

Need a different license? Write to us.

Acknowledgements

Zingg would have not have been possible without the excellent work below:

zingg's People

Contributors

sonalgoyal avatar navinrathore avatar dependabot[bot] avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.