Topic: big-data-processing Goto Github
Some thing interesting about big-data-processing
Some thing interesting about big-data-processing
big-data-processing,The 2022 Big Data Bowl data contains Next Gen Stats player tracking, play, game, player, and PFF scouting data for all 2018-2020 Special Teams play. Here, you'll find a summary of each data set in the 2022 Data Bowl, a list of key variables to join on, and a description of each variable.
User: adnanrahin
big-data-processing,This repository contains an Apache Flink application for real-time sales analytics built using Docker Compose to orchestrate the necessary infrastructure components, including Apache Flink, Elasticsearch, and Postgres
User: airscholar
Home Page: https://youtu.be/deepQRXnniM
big-data-processing,This Git repo showcases my analysis of Sparkify dataset with PySpark on Apache Spark cluster mode and JupyterLab on Docker. The goal was to identify at-risk customers and develop retention strategies. The analysis tested multiple machine learning models and uncovered insights into customer behavior and churn patterns.
User: alessiococchieri
big-data-processing,Welcome, feel free to navigate through my project. Detail information about each project can be found inside specified directory.
User: almersesunan
big-data-processing,Analysis, organization and querying of large genomic datasets using C++, Monsoon and various data structures.
User: anirban166
big-data-processing,GCP_Data_Enginner
User: anjijava16
big-data-processing,MapReduce Job Development, RDDs Programming, Medical Data Management, Sales Analysis, And Efficient Data Integration For Big Data Analysis. Spark: Big Data Processing, SQOOP Integration, And Spark Structured Streaming For Real-Time Data.
User: ayoub-etoullali
big-data-processing,Data modeling with Cassandra, building Data Warehouse using Redshift and creation of Data Lake using Spark and Airflow
User: bdnf
big-data-processing,Standard Hadoop MapReduce Tasks using Java
User: bennyhwanggggg
big-data-processing,A pipeline that consumes twitter data to extract meaningful insights about a variety of topics using the following technologies: twitter API, Kafka, MongoDB, and Tableau.
User: chandnii7
big-data-processing,Tech blog / notes from my various endeavours and exploits
User: christopherliew
Home Page: https://chrisliew.gitbook.io/chrisliew-and-tech/
big-data-processing,Course covers big data fundamentals, processes, technologies, platform ecosystem, and management for practical application development.
User: drshahizan
big-data-processing,Eskimo is a state of the art Big Data Infrastructure and Management Web Console to build, manage and operate Big Data 2.0 Analytics clusters on Kubernetes. This is the git repository of Eskimo Community Edition.
Organization: eskimo-sh
Home Page: https://www.eskimo.sh
big-data-processing,Building Data Lake and ETL pipelines using Amazon EMR, S3, and Apache Spark
User: faisal-aldhuwayhi
big-data-processing,This code creates a Kinesis Firehose in AWS to send CloudWatch log data to S3.
User: felipefrizzo
big-data-processing,datasets-toolbox are some scripts usefull to generate, transfom and valid large dataset files, not openable with editor because too large. datasets-toolbox provide also a ping script.
User: franck-mahieu
big-data-processing,Yet Another SPark Framework
User: giucris
big-data-processing,Python library to import OCR data in various formats into the canonical JSON format defined by the Impresso project.
Organization: impresso
Home Page: https://impresso.github.io/impresso-text-acquisition/
big-data-processing,rock-solid pillars for enterprise-grade solutions
User: incredibleprogress
big-data-processing,excel, markdown, csv, sql 数据源批量/单独格式互相转换
User: jameshanzhang
Home Page: https://github.com/JamesHanZhang/table-data-format-transform-app
big-data-processing,SUTD 2021 50.043 Database and Big Data Systems Code Dump
User: jamestiotio
big-data-processing,A movie recommender written in Go that suggests movies considering various factors within a particular dataset, encompassing users, movies, and movie ratings.
User: john-fotis
big-data-processing,"Provides tools for parallel pipeline processing of large data structures
User: jpmorgen
Home Page: https://bigmultipipe.readthedocs.io/en/latest/index.html
big-data-processing,Implementation of algorithms for big data using python, numpy, pandas.
User: kochlisgit
big-data-processing,Analyzing classified ads data from the used motorcycles market. Tasks involve utilizing Redis Bitmaps for analytics on seller actions and MongoDB for analyzing bike listings. Includes data installation, cleaning, and analysis.
User: lefteris-souflas
big-data-processing,Collection of homework (mostly Spark-based) from the course "Big Data Computing" - University of Padua.
User: leonardogemin
big-data-processing,Simple CSV parser for huge volumes of data with the use of the library Pandas for Python for getting specific columns of a CSV file and putting the extracted data into one or more files (each column in a separated file or all of them in the same output) in a short amount of time.
User: levindoneto
Home Page: https://pandas.pydata.org
big-data-processing,Sentiment-Analysis-API
User: louiecai
big-data-processing,Collection of homework (mostly Spark-based) from the course "Big Data Computing" - University of Padua.
User: lucamoroz
big-data-processing,Experiment to record as much data as possible in a given amount of time using a distributed timeseries database.
User: matthewdowns
big-data-processing,Solved tasks of the master's degree courses of speciality "Algorithms and Systems for Big Data Processing".
User: mikhail-kukuyev
big-data-processing,Introduction to Spark Batch processing.
User: mtumilowicz
big-data-processing,Degree diploma project
User: neri-kun
big-data-processing,big data processing and machine learning platform,just like useing sql
Organization: pyajs
big-data-processing,Big Data and AI Engineering bootcamp 2nd capstone project. Using Big Data Tools to predict the probability of university enrollment for Egypt's High School students. :school: :books: :microscope:
User: rghde
big-data-processing,Project using Python, Hive and MapReduce to compare various techniques to find the top K words in a very large file i.e. different techniques to process Big Data.
User: ridakn
big-data-processing,Software basati su metodi di intelligenza artificiale per l'automazione dell'analisi di big data.
User: scratchycode
big-data-processing,A Docker Compose Template to deploy Airflow with sync from a remote repository
User: siddharths067
big-data-processing,Github Repository for a versatile usable Big Data infrastructure (AVUBDI)
Organization: software-competence-center-hagenberg
big-data-processing,A lightweight helper utility which allows developers to do interactive pipeline development by having a unified source code for both DLT run and Non-DLT interactive notebook run.
User: souvik-databricks
Home Page: https://pypi.org/project/dlt-with-debug/
big-data-processing,Flink SQL 实战 -中文博客专栏
User: starplatinumstudio
Home Page: https://blog.csdn.net/qq_35815527/category_9634641.html
big-data-processing,A curated selection of tools, libraries and services that help tame your dataflow to productively build ambitious, data driven & reactive applications on a streaming lakehouse
Organization: tabletop-labs
big-data-processing,
User: talalzone
Home Page: http://talal.zone
big-data-processing,
User: theguywithblacktie
big-data-processing,Study of French hospital production. (2021)
User: vincianedesbois
big-data-processing,BigQuery data pipeline with dbt, Spark, Docker, Airflow, Terraform, GCP
User: vishu-tyagi
big-data-processing,Here I demonstrate the performance difference between the Poisson and the classic bootstrap by estimating the confidence interval for the difference of CTRs of the two user groups
User: vladonmyown
big-data-processing,Reservoir Sampling for Group-By Queries in Flink Platform. Answering effectively Single Aggregate.
User: vvittis
big-data-processing,Crack Detection model using yolov7
User: zaid-24
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.