Name: Abhishek Choudhary
Type: User
Bio:
Data Engineer and Data Scientist
Working on several Big Data technologies such as Apache Spark, Hadoop, Kafka, Impala, Apache Beam, Hive, and others
Twitter: ubunta
Location: Berlin, Germany
Blog: https://www.linkedin.com/in/iamabhishekchoudhary/
Abhishek Choudhary's Projects
Simple example of reading from and writing to Kafka
One-click docker-compose deployment with Kafka, Spark Streaming, Zeppelin UI, and monitoring (Grafana + Kafka Manager)
Code for the Kaggle competition at https://www.kaggle.com/c/facebook-recruiting-iv-human-or-bot
Kubernetes CLI (kubectl) powered by GPT
Codes: Machine Learning source files
Open source platform for the machine learning lifecycle
Explore the capabilities of the MLX library and leverage the genAI stack on macOS to interact with any video.
Parse, validate, manipulate, and display dates in JavaScript.
Monger is an idiomatic Clojure MongoDB driver for a more civilized age: with sane defaults, batteries included, well documented, very fast
edX: Introduction to Big Data with Apache Spark
A Python wrapper for the OpenSubtitles API
An experiment to develop an app that displays news based on user preferences
An Android-based smile detection project using OpenCV and JavaCV. It works well on Android; to make it work, you need to install the OpenCV libraries on your Android phone.
A Snowflake GPT demo using SQLAlchemy
Mirror of Apache Spark
Coursework for CS190.1x, Scalable Machine Learning
CSV data source for Spark SQL and DataFrames
Spark Docker environment for testing purposes
Hungarian Method using Apache Spark
Spark barrier scheduling with respect to MPI
Quick and dirty code to generate SQL using ChatGPT
Streamlit example showing scikit-learn and PySpark ML over healthcare data. It's simple!
Apache Superset is a Data Visualization and Data Exploration Platform
Visualizing our technology choices
Distributed SQL database in Rust, written as a learning project
Setup for running Trino with Hive Metastore on Kubernetes
Code to reproduce the simple sentiment analysis from my presentation
Explore multiple vector databases and chat with documents using multiple LLMs, including private LLMs
Samples for VS Code Python extension