kvasagiri Goto Github PK
Type: User
Type: User
Apache Accumulo
ADAM is a genomics analysis platform with specialized file formats built using Apache Avro, Apache Spark and Parquet. Apache 2 licensed.
A Bulk Data Pipeline out of Cassandra
Apache Airflow tutorial
Web UI for PrestoDB.
A collection of commonly asked about data structures and algorithms for technical interviews
Amazon Redshift Utils contains utilities, scripts and view which are useful in a Redshift environment
Example notebooks that show how to apply machine learning, deep learning and reinforcement learning in Amazon SageMaker
Data Challenge Amazon 2017 Interview
Mirror of Apache Ambari
Repository for the Amundsen project
Quickly deploy Hadoop with the help of Ansible and Apache Ambari
Source code for the post, 'Managing AWS Infrastructure as Code using Ansible, CloudFormation, and CodeBuild'
Manage your AWS infrastructure and ECS tasks with two separate ansible playbooks
A Project Based Learning
Amazon Athena, a serverless, interactive query service, is used to easily analyze big data using standard SQL in Amazon S3. Apache Drill, a schema-free, low-latency SQL query engine, enables self-service data exploration on big data. Let us compare data partitioning in Apache Drill & AWS Athena and the distinct features of both.
Code samples for YouTube APIs, including the YouTube Data API, YouTube Analytics API, and YouTube Live Streaming API. The repo contains language-specific directories that contain the samples.
A shared pipeline for building ETLs and batch jobs that we run at the City of LA for Data Science Projects. Built on Apache Airflow & Civis Platform
Cassandra Java Client
In-memory dimensional time series database.
:atom: The hackable text editor
automating AWS with Python using boto3 library
:snowflake: :whale: Awesome tools and libs for AI, Deep Learning, Machine Learning, Computer Vision, Data Science, Data Analytics and Cognitive Computing that are baked in the oven to be Native on Kubernetes and Docker with Python, R, Scala, Java, C#, Go, Julia, C++ etc
A curated list of awesome big data frameworks, ressources and other awesomeness.
:memo: An awesome Data Science repository to learn and apply for real world problems.
A curated list of awesome ETL frameworks, libraries and software.
An awesome list of high-quality open datasets in public domains (on-going). By everyone, for everyone!
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.