Topic: data-quality
Something interesting about data-quality
data-quality,pyDVL is a library of stable implementations of algorithms for data valuation and influence function computation
Organization: aai-institute
Home Page: https://pydvl.org
data-quality,The Lakehouse Engine is a configuration driven Spark framework, written in Python, serving as a scalable and distributed engine for several lakehouse algorithms, data flows and utilities for Data Products.
Organization: adidas
Home Page: https://adidas.github.io/lakehouse-engine-docs/
data-quality,An RDF Unit Testing Suite
Organization: aksw
Home Page: http://RDFUnit.aksw.org
data-quality,FeatHub - A stream-batch unified feature store for real-time machine learning
Organization: alibaba
data-quality,Great Expectations Airflow operator
Organization: astronomer
Home Page: http://greatexpectations.io
data-quality,Automated data quality suggestions and analysis with Deequ on AWS Glue
Organization: aws-samples
data-quality,CSV Lint plug-in for Notepad++ for syntax highlighting, csv validation, automatic column and datatype detecting, fixed width datasets, change datetime format, decimal separator, sort data, count unique values, convert to xml, json, sql etc. A plugin for data cleaning and working with messy data files.
User: bdr76
data-quality,Possibly the fastest DataFrame-agnostic quality check library in town.
User: canimus
Home Page: https://canimus.github.io/cuallee/
data-quality,The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
Organization: cleanlab
Home Page: https://cleanlab.ai
data-quality,Automatically find issues in image datasets and practice data-centric computer vision.
Organization: cleanlab
Home Page: https://cleanvision.readthedocs.io/
data-quality,A curated, but incomplete, list of data-centric AI resources.
User: daochenzha
data-quality,Open-Source Software, Tutorials, and Research on Data-Centric AI 🤖
Organization: data-centric-ai-community
Home Page: https://datacentricai.community
data-quality,Metrics Observability & Troubleshooting
Organization: data-drift
Home Page: https://www.data-drift.io/
data-quality,Compare tables within or across databases
Organization: datafold
Home Page: https://docs.datafold.com
data-quality,Data Quality and Observability platform for the whole data lifecycle, from profiling new data sources to full automation with Data Observability. Configure data quality checks from the UI or in YAML files, let DQOps run the data quality checks daily to detect data quality issues.
User: dqops
Home Page: https://dqops.com/docs/
data-quality,The toolkit to test, validate, and evaluate your models and surface, curate, and prioritize the most valuable data for labeling.
Organization: encord-team
Home Page: https://encord.com/active
data-quality,📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.
User: eugeneyan
data-quality,Evidently is an open-source ML and LLM observability framework. Evaluate, test, and monitor any AI-powered system or data pipeline. From tabular data to Gen AI. 100+ metrics.
Organization: evidentlyai
Home Page: https://www.evidentlyai.com/evidently-oss
data-quality,The Open Source Feature Store for Machine Learning
Organization: feast-dev
Home Page: https://feast.dev
data-quality,Feathr – A scalable, unified data and AI engineering platform for enterprise
Organization: feathr-ai
Home Page: https://join.slack.com/t/feathrai/shared_invite/zt-1ffva5u6v-voq0Us7bbKAw873cEzHOSg
data-quality,The Virtual Feature Store. Turn your existing data infrastructure into a feature store.
Organization: featureform
Home Page: https://www.featureform.com
data-quality,Implementation of Estimating Training Data Influence by Tracing Gradient Descent (NeurIPS 2020)
User: frederick0329
data-quality,Define, govern, and model event data for warehouse-first product analytics.
User: gclunies
data-quality,Learn how to design, develop, deploy and iterate on production-grade ML applications.
User: gokumohandas
Home Page: https://madewithml.com
data-quality,Always know what to expect from your data.
Organization: great-expectations
Home Page: https://docs.greatexpectations.io/
data-quality,A GitHub Action that makes it easy to use Great Expectations to validate your data pipelines in your CI workflows.
Organization: great-expectations
data-quality,Data governance and data quality checking/monitoring platform (Django+jQuery+MySQL)
User: hyhyhyhyhyhyh
Home Page: http://data.sghen.cn
data-quality,Code review for data in dbt
Organization: infuseai
Home Page: https://www.piperider.io/
data-quality,Compilation of high-profile real-world examples of failed machine learning projects
User: kennethleungty
data-quality,Infinitely scalable, event-driven, language-agnostic orchestration and scheduling platform to manage millions of workflows declaratively in code.
Organization: kestra-io
Home Page: https://kestra.io
data-quality,Scalable data preprocessing and curation toolkit for LLMs
Organization: nvidia
data-quality,A tool to help improve data quality standards in observational data science.
Organization: ohdsi
Home Page: https://ohdsi.github.io/DataQualityDashboard
data-quality,OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team collaboration.
Organization: open-metadata
Home Page: https://open-metadata.org
data-quality,📙 Awesome Data Catalogs and Observability Platforms.
Organization: opendatadiscovery
data-quality,First open-source data discovery and observability platform. We make life easy for data practitioners so you can focus on your business.
Organization: opendatadiscovery
Home Page: https://opendatadiscovery.org
data-quality,Engine for ML/Data tracking, visualization, explainability, drift detection, and dashboards for Polyaxon.
Organization: polyaxon
data-quality,re_data - fix data issues before your users & CEO would discover them 😊
Organization: re-data
Home Page: https://getre.io
data-quality,re_data - fix data issues before your users & CEO would discover them 😊
Organization: re-data
Home Page: https://docs.getre.io/latest/docs/start_here
data-quality,Data quality assessment and metadata reporting for data frames and database tables
Organization: rstudio
Home Page: https://rstudio.github.io/pointblank/
data-quality,NBi is a testing framework (add-on to NUnit) for Business Intelligence and Data Access. The main goal of this framework is to let users create tests with a declarative approach based on an XML syntax. With NBi, you don't need to write C# or Java code to specify your tests, nor do you need Visual Studio or Eclipse to compile your test suite. Just create an XML file and let the framework interpret it and run your tests. The framework is designed as an add-on to NUnit, but it can be ported easily to other testing frameworks.
User: seddryck
Home Page: http://www.nbi.io
data-quality,⚡ Data quality testing for the modern data stack (SQL, Spark, and Pandas) https://www.soda.io
Organization: sodadata
Home Page: https://go.soda.io/core-docs
data-quality,Swiple enables you to easily observe, understand, validate and improve the quality of your data
Organization: swiple
Home Page: https://swiple.io
data-quality,lakeFS - Data version control for your data lake | Git for data
Organization: treeverse
Home Page: https://docs.lakefs.io
data-quality,🐳 Tool to automate data quality checks on data pipelines
Organization: ubisoft
Home Page: https://ubisoft.github.io/mobydq/
data-quality,The open-source tool for building high-quality datasets and computer vision models
Organization: voxel51
Home Page: https://fiftyone.ai
data-quality,Qualitis is a one-stop data quality management platform that supports quality verification, notification, and management for various data sources. It is used to solve data quality problems caused by data processing. https://github.com/WeBankFinTech/Qualitis
Organization: webankfintech
data-quality,An open-source data logging library for machine learning models and data pipelines. 📚 Provides visibility into data quality & model performance over time. 🛡️ Supports privacy-preserving data collection, ensuring safety & robustness. 📈
Organization: whylabs
Home Page: https://whylogs.readthedocs.io/
data-quality,Profile and monitor your ML data pipeline end-to-end
Organization: whylabs
data-quality,1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.
Organization: ydataai
Home Page: https://docs.profiling.ydata.ai