Jerome's Projects
This repo is meant for people using Amazon workspaces. Though the code targets a specific environment, the bot can be tweaked to suit your organization
This repository contains code to programmatically retrieve PGNs from chess.com servers and parse them into a relational format with a move per row. The goal is to build a dashboard on top of the dataset
Cloud Dataproc: Samples and Utils
This tool creates a custom search engine using VertexAI, Langchain and Streamlit. It allows users to input the URL of a website's sitemap XML file, which will serve as the knowledge base. The app then crawls the entire website, refreshes vector embeddings, and uses the information as a knowledge base to answer user queries.
This application extracts autoscaler metrics and dumps them into a CSV file
Config files for my GitHub profile.
A DataStage wrapper script written in bash
A framework to create a standardized feedback mechanism for ETL processes while keeping the developer free from implementation details of the alert system
This repository contains all the source code I've written to setup infrastructure, generate test data, perform benchmarks and other repetetive actions that are required through the course of learning a technology or while working with a customer.
Run in all nodes of your cluster before the cluster starts - lets you customize your cluster
Common solutions and tools developed by Google Cloud's Professional Services team. This repository and its contents are not an officially supported Google product.
Code samples used on cloud.google.com
A Curated List of Sample Redis Datasets
The Spark Configuration Tool is a Streamlit-based application designed to assist users in optimizing Apache Spark configurations. It allows users to input various parameters related to cluster, node, and executor configurations, providing recommended Spark configurations based on those inputs.