Name: Alessandro Solimando
Type: User
Bio: Apache Hive/Calcite committer, PhD. Main interests are data management and distributed systems, in both theory and practice, but always open to exploring other topics!
Location: France
Blog: www.linkedin.com/in/alessandro-solimando-6263ba75
Alessandro Solimando's Projects
Sampling CPU and HEAP profiler for Java featuring AsyncGetCallTrace + perf_events
Apache Beam is a unified programming model for Batch and Streaming
Mirror of Apache Calcite
Mirror of Apache Calcite - Avatica
Examples and experimentation around Apache Calcite
Dremio - the missing link in modern data
Given a weighted set of 2D points, it computes the Heaviest Increasing Point Subset
Apache Hive
Automated TPC-DS and TPC-H benchmark for Apache Hive LLAP
Example of reading from a Kafka topic via Spark Streaming and writing into Druid via Tranquility library
Utility to deserialize and print KLL data sketches from their binary representation
The database purpose-built for stream processing applications.
A slightly moist lipstick-on-pig clone for Apache Hive
LogMap extension for conservativity principle
Playing around with Map datatype in Spark
Open source platform for the machine learning lifecycle
A simple particle simulator
A generator of random data for HDFS, HBase, Hive, Kafka, Kudu, Ozone, and Solr in CDP (Cloudera Data Platform)
Mirror of Apache Spark
Streaming Frameworks Examples
Apache Tez
TPC-DS benchmark kit with some modifications/fixes
Analysis of TRAP2017 dataset using Spark
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
XQuery query processing optimization based on XML projection