Ajantha Bhat's Projects
Config files for my GitHub profile.
Mirror of Apache CarbonData
Mirror of Apache CarbonData Site
Apache CarbonData code flow documentation
Nessie Helm Charts Repo
This is a docker compose environment to quickly get up and running with a Spark environment and a local Nessie catalog, and MinIO as a storage backend.
this is downloadings of all educative.io free student subscription courses as pdf from GitHub student pack
GeoMesa is a suite of tools for working with big geo-spatial data in a distributed fashion.
A Cluster Computing System for Processing Large-Scale Spatial Data
Gradle plugin that uses Revapi to check whether you have introduced API/ABI breaks in your Java public API
The OBS SDK for Java, which is used for accessing Object Storage Service
Apache Iceberg
CLI tool to bulk migrate the tables from one catalog another without a data copy
Apache Iceberg Documentation Site
A simple integer compression library in Java
ModelArts开发者案例交流互动平台,@ModelArts服务官网:https://www.huaweicloud.com/product/modelarts.html
Nessie provides Git-like capabilities for your Data Lake
Apache Parquet
Petastorm library enables single machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet format. It supports ML frameworks such as Tensorflow, Pytorch, and PySpark and can be used from pure Python code.
Enhancement on top of original IISC picasso tool [https://dsl.cds.iisc.ac.in/projects/PICASSO/]
A common library of MapReduce utilities and code
The official home of the Presto distributed SQL query engine for big data
Optimized data access for AI based on CarbonData files
Revapi is an API analysis and change tracking tool written in Java. Its focus is mainly on Java language itself but it has been specifically designed to not be limited to just Java. API is much more than just java classes - also various configuration files, schemas, etc. can contribute to it and users can become reliant on them.
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)