Giter Club home page Giter Club logo

soda-sql's Introduction

Soda logo

Data testing, monitoring, and profiling for SQL-accessible data.

License: Apache 2.0 Slack Pypi Soda SQL Build soda-sql



Install from the command-line

✔ Access comprehensive documentation

✔ Compatible with Snowflake, Amazon Redshift, BigQuery, and more

Write tests in a YAML file

Run programmatic scans to test data quality


Got 5 minutes? Try the interactive demo!

Example scan YAML file

table_name: breakdowns
metrics:
  - row_count
  - missing_count
  - missing_percentage
...
# Validates that a table has rows
tests:
  - row_count > 0

# Tests that numbers in the column are entered in a valid format as whole numbers
columns:
  incident_number:
    valid_format: number_whole
    tests:
      - invalid_percentage == 0

# Tests that no values in the column are missing
  school_year:
    tests:
      - missing_count == 0

# Tests for duplicates in a column
  bus_no:
    tests:
      - duplicate_count == 0

# Compares row count between datasets
sql_metric: 
  sql: |
    SELECT COUNT(*) as other_row_count
    FROM other_table
  tests:
    - row_count == other_row_count

Play

Install

Collaborate

Contributors ✨

Thanks goes to these wonderful people! (emoji key)


Vijay Kiran

💻

abhishek khare

💻

Jelte Hoekstra

💻 📖

Cor

💻 📖

Milan Aleksić

🚇

Ayoub Fakir

💻

Alex Tonkonozhenko

💻

Todd de Quincey

💻

Antonin Jousson

💻

Jonas

🚇

cwouter

💻

Janet R

📖

Bastien Boutonnet

💻

Tom Baeyens

💻

AlessandroLollo

💻

mmigdiso

💻

ericmuijs

💻

Lieven Govaerts

💻

Milan Lukac

💻

This project follows the all-contributors specification. Contributions of any kind are welcome!

Open Telemetry Tracking

Soda-sql collects statistical usage and performance information via the Open Telemetry framework to help the Soda Core developers team proactively track performance issues and understand how users interact with the tool. The information is strictly limited to usage and performance and does not contain Personal Identifying Information. It will be used for internal purposes only. Soda will keep the data in its raw form for a maximum of 5 years. If some information needs to be kept for longer, it will be done in aggregated form only.

Users can find more information about the tracked information, and opt-out of tracking by consulting the reference section of docs.soda.io

soda-sql's People

Contributors

alessandrolollo avatar allcontributors[bot] avatar antoninj avatar bastienboutonnet avatar cwouter avatar dirkgroenen avatar fakirayoub avatar ilhamikalkan avatar janet-can avatar jczuurmond avatar lgov avatar m1n0 avatar marciogualtieri avatar milanaleksic avatar mmigdiso avatar sintezcs avatar toddy86 avatar tombaeyens avatar tonkonozhenko avatar vijaykiran avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.