Name: Fokko Driesprong
Type: User
Company: @databricks
Bio: Open-source software engineer at Tabular. Committer & PMC on Apache {Avro, Airflow, Druid, Iceberg}, Committer on Apache Parquet. Open-source advocate
Location: Netherlands
Blog: https://www.linkedin.com/in/fokkodriesprong/
Fokko Driesprong's Projects
4mc - splittable lz4 and zstd in hadoop/spark/flink
Updates API sample using Actions SDK, Java and Cloud Functions for Firebase
A Slick template for Typesafe Activator
dbt adwords models
Some examples for how to develop airflow dags with testing.
asyncio support for botocore library using aiohttp
Base POM for Airlift
A port of Snappy, LZO, LZ4, and Zstandard to Java
An example dag of Airflow
A template repo for building and releasing Airflow provider packages.
Apache Airflow Website
Skeleton project for Airflow training participants to work on.
Apache Airflow in Docker Compose (for both versions 1.10.* and 2.*)
metrics support for akka-http apps
Example of (micro)service written in Scala & akka-http
Alluxio, data orchestration for analytics and machine learning in the cloud
Amundsen is a metadata driven application for improving the productivity of data analysts, data scientists and engineers when interacting with data.
Application Insights SDK for Java
Apache Arrow is a cross-language development platform for in-memory data. It specifies a standardized language-independent columnar memory format for flat and hierarchical data, organized for efficient analytic operations on modern hardware. It also provides computational libraries and zero-copy streaming messaging and interprocess communication. Languages currently supported include C, C++, Java, JavaScript, Python, and Ruby.
A utility tool to automate certain tasks with Jupyter notebooks.
Mirror of Apache Avro
Contains Dockerfiles for the Azure Cosmos DB Emulator: https://docs.microsoft.com/azure/documentdb/documentdb-nosql-local-emulator
Apache Spark Connector for Azure Cosmos DB
☁️ Cloud-scale telemetry ingestion from any stream of data with Azure Event Hubs
☁️ Java client library for Azure Event Hubs
Enabling Continuous Data Processing with Apache Spark and Azure Event Hubs
Apache Spark Connector for Azure Kusto