jcool12 Goto Github PK
Type: User
Type: User
This contains a tutorial and a sample projects developed using spark, hive, sqoop and flume technologies.
Data and code behind the stories and interactives at FiveThirtyEight
Data Science at the Command Line
Big Data Projects
Data Science in 30 Minutes #5: Spark
Following along with the Hive tutorial at StrataConf / HadoopWorld
Project Tutorial that serves as an introduction to Data Analysis using Hive and raw data download from IMDB.
Studying data science enables individuals to bring these techniques to bear on their work, their scientific endeavors, and their personal decisions. Critical thinking has long been a hallmark of a rigorous education, but critiques are often most effective when supported by data. A critical analysis of any aspect of the world, may it be business or social science, involves inductive reasoning; conclusions can rarely been proven outright, but only supported by the available evidence. Data science provides the means to make precise, reliable, and quantitative arguments about any set of observations. With unprecedented access to information and computing, critical thinking about any aspect of the world that can be measured would be incomplete without effective inferential techniques.
Code repository for Learning PySpark by Packt
Example lesson using Software Carpentry template.
Student downloads for the Master the Tidyverse Workshop
A collection of templates for use with Apache NiFi.
An R data package containing all out-bound flights from NYC in 2013 + useful metdata
PySpark Code for Hands-on Learners
PySpark-Tutorial provides basic algorithms using PySpark
Learn Spark using Python
Code snippets and tutorials for working with social science data in PySpark
Software Carpentry introduction to Python for novices using inflammation data.
100+ Python challenging programming exercises
Exercise solutions to "R for Data Science"
An example repo using Git Large File Storage (LFS)
Mining Opinions, Exploring Trends and More with Twitter
Two-day introduction to the tidyverse workshop
Code to reproduce the simple sentiment analysis from my presentation
Sentiment analysis on Tweets containing '$TWTR' and the relation to Twitter's stock price.
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.