Giter Club home page Giter Club logo

data's Introduction

These data files correspond to the Foundations of Applied Mathematics lab curriculum. Instead of downloading or cloning this repository directly, use the download_data.sh script included in the Student-Materials repository.

$ cd /path/to/folder
$ bash download_data.sh

The download requires git, which you can download at https://git-scm.com/downloads.

Below the data files are listed by volume (in the order given by each table of contents), then alphabetically by lab folder.

Labs by Volume

Python Essentials

Lab Title Folder/Data File Source
Introduction to NumPy NumpyIntro/grid.npy https://projecteuler.net/problem=11
Introduction to Matplotlib MatplotlibIntro/FARS.npy Gathered from https://www.nhtsa.gov/FARS
Exceptions and File Input/Output Exceptions_FileIO/hello_world.txt Written by hand
Exceptions and File Input/Output Exceptions_FileIO/cf_example1.txt Written by hand
Exceptions and File Input/Output Exceptions_FileIO/cf_example2.txt Written by hand
Profiling Profiling/names.txt https://projecteuler.net/problem=22
Profiling Profiling/triangle.txt https://projecteuler.net/problem=18
Profiling Profiling/triangle_large.txt https://projecteuler.net/problem=67
Data Visualization DataVisualization/anscombe.npy https://en.wikipedia.org/wiki/Anscombe's_quartet. Original citation: Anscombe, F. J. (1973). "Graphs in Statistical Analysis". American Statistician. 27 (1): 17โ€“21. JSTOR 2682899
Data Visualization DataVisualization/MLB.npy http://wiki.stat.ucla.edu/socr/index.php/SOCR_Data_MLB_HeightsWeights (modified)
Data Visualization DataVisualization/earthquakes.npy Gathered from https://earthquake.usgs.gov/earthquakes/search/
Data Visualization DataVisualization/countries.npy Combined from https://en.wikipedia.org/wiki/List_of_countries_by_GDP_(nominal), http://www.averageheight.co/, and https://en.wikipedia.org/wiki/List_of_countries_and_dependencies_by_population

Data Science Essentials

Lab Title Folder/Data File Source
Regular Expressions RegularExpressions/fake_contacts.txt Generated by http://www.Generatedata.com/
SQL 1: Introduction SQL1/student_info.csv Written by hand
SQL 1: Introduction SQL1/student_grades.csv Written by hand
SQL 1: Introduction SQL1/us_earthquakes.csv
SQL 2 (The Sequel) SQL2/students.db Combined from student_info.csv, student_grades.csv, and the other tables in SQL 1
Web Technologies WebTechnologies/nyc_traffic.json Modified from https://data.cityofnewyork.us/Public-Safety/NYPD-Motor-Vehicle-Collisions/h9gi-nx95, gathered August 2017
Introduction to Beautiful Soup WebScraping1/example.html https://www.example.com
Introduction to Beautiful Soup WebScraping1/san_diego_weather.html
Introduction to Beautiful Soup WebScraping1/large_banks_index.html
Introduction to Beautiful Soup WebScraping1/large_banks_data.html
Pandas 1: Introduction Data Science Essentials Pandas1/crime_data.csv
Pandas 1: Introduction Pandas1/final_accidents2.pickle Data Science Essentials
Pandas 1: Introduction Pandas1/final_drivers.pickle Data Science Essentials
Pandas 2: Plotting Pandas2/final_accidents2.pickle Data Science Essentials
Pandas 2: Plotting Pandas2/final_drivers.pickle Data Science Essentials
Pandas 2: Plotting Pandas2/new_york_crime_clean.csv Data Science Essentials
Pandas 3: Grouping Pandas3/Ohio_1999.csv Data Science Essentials
Pandas 3: Grouping Pandas3/time_usage.txt Data Science Essentials
Pandas 4: Time Series Pandas4/DJIA.csv Data Science Essentials
Pandas 4: Time Series Pandas4/finances.csv Data Science Essentials
Pandas 4: Time Series Pandas4/paychecks.csv Data Science Essentials
Pandas 4: Time Series Pandas4/website_traffic.csv Data Science Essentials
Pandas 5: GeoPandas Pandas5/airports.csv Data Science Essentials

Volume 1

Lab Title Folder/Data File Source
Linear Transformations LinearTransformations/horse.npy Generated
Least Squares and Computing Eigenvalues LeastSquares_Eigenvalues/circle.npy Generated
Least Squares and Computing Eigenvalues LeastSquares_Eigenvalues/ellipse.npy Generated
Least Squares and Computing Eigenvalues LeastSquares_Eigenvalues/housing.npy Gathered from https://www.fhfa.gov/DataTools/Downloads/Pages/House-Price-Index.aspx
Image Segmentation ImageSegmentation/dream.png
Image Segmentation ImageSegmentation/dream_gray.png
The SVD and Image Compression SVD_ImageCompression/hubble.jpg https://www.nasa.gov/multimedia/imagegallery/image_feature_2099.html
The SVD and Image Compression SVD_ImageCompression/hubble_gray.jpg Modification of hubble.jpg
Facial Recognition FacialRecognition/faces94.zip http://cswww.essex.ac.uk/mv/allfaces/faces94.html
Differentiation Differentiation/plane.npy Generated
Conditioning and Stability Conditioning_Stability/stability_data.npy Generated
The PageRank Algorithm PageRank/web_stanford.txt Subset of web-Stanford.txt from http://snap.stanford.edu/data/web-Stanford.html
The PageRank Algorithm PageRank/ncaa2010.csv Scraped from https://www.sports-reference.com
The PageRank Algorithm PageRank/ncaa2011.csv Scraped from https://www.sports-reference.com
The PageRank Algorithm PageRank/ncaa2012.csv Scraped from https://www.sports-reference.com
The PageRank Algorithm PageRank/ncaa2013.csv Scraped from https://www.sports-reference.com
The PageRank Algorithm PageRank/ncaa2014.csv Scraped from https://www.sports-reference.com
The PageRank Algorithm PageRank/ncaa2015.csv Scraped from https://www.sports-reference.com
The PageRank Algorithm PageRank/ncaa2016.csv Scraped from https://www.sports-reference.com
The PageRank Algorithm PageRank/ncaa2017.csv Scraped from https://www.sports-reference.com
The PageRank Algorithm PageRank/top250movies.txt Subset of movie_data.txt, scraped with imdbpy (https://imdbpy.sourceforge.io/)
The Drazin Inverse DrazinInverse/social_network.csv Adapted from https://en.wikipedia.org/wiki/Zachary%27s_karate_club

Volume 2

Lab Title Folder/Data File Source
Linked Lists LinkedLists/english.txt Generated
Binary Search Trees BinaryTrees/english.txt Generated
Nearest Neighbor Search NearestNeighbor/mnist_subset.npz Subset of the MNIST database from http://yann.lecun.com/exdb/mnist/
Breadth-first Search BreadthFirstSearch/movie_data.txt Scraped with imdbpy (https://imdbpy.sourceforge.io/)
Breadth-first Search BreadthFirstSearch/movie_data_small.txt Subset of movie_data.txt.
Markov Chains MarkovChains/yoda.txt Gathered from http://www.imsdb.com/scripts/Star-Wars-The-Empire-Strikes-Back.html, http://www.imsdb.com/scripts/Star-Wars-Return-of-the-Jedi.html, http://www.imsdb.com/scripts/Star-Wars-The-Phantom-Menace.html, http://www.imsdb.com/scripts/Star-Wars-Attack-of-the-Clones.html, and http://www.imsdb.com/scripts/Star-Wars-Revenge-of-the-Sith.html
The Discrete Fourier Transform FourierTransform/tada.wav https://www.youtube.com/watch?v=bjxf-eQWKoo
The Discrete Fourier Transform FourierTransform/mystery_chord.wav Generated
The Discrete Fourier Transform FourierTransform/CGC.wav Generated
The Discrete Fourier Transform FourierTransform/GCG.wav Generated
The Discrete Fourier Transform FourierTransform/balloon.wav Recorded at BYU
The Discrete Fourier Transform FourierTransform/chopin.wav
The Discrete Fourier Transform FourierTransform/noisy1.wav
The Discrete Fourier Transform FourierTransform/noisy2.wav
The Discrete Fourier Transform FourierTransform/vuvuzela.wav Part of https://www.youtube.com/watch?v=g_0NoBKWCT8
The Discrete Fourier Transform FourierTransform/noisy_face.png Sample from faces94.zip
The Discrete Fourier Transform FourierTransform/license_plate.png
Introduction to Wavelets Wavelets/mandrill.png http://sipi.usc.edu/database/
Introduction to Wavelets Wavelets/woman_darkhair.png
Introduction to Wavelets Wavelets/noisy_darkhair.png
Introduction to Wavelets Wavelets/uncompressed_finger.png
Polynomial Interpolation PolynomialInterpolation/airdata.npy
Gradient Descent Methods GradientMethods/linregression.txt
Gradient Descent Methods GradientMethods/challenger.npy
Simplex Simplex/productMix.npy
CVXOPT CVXOPT_Intro/ForestData.npy
Interior Point 1: Linear Programs InteriorPoint_Linear/simdata.txt
Interior Point 2: Quadratic Programs InteriorPoint_Quadratic/portfolio.txt

Labs by Folder Name

Folder/Data File Lab Title Volume Source
BinaryTrees/english.txt Binary Search Trees Volume 2 Generated
BreadthFirstSearch/movieData.txt Breadt-first Search Volume 2 Scraped with imdbpy (https://imdbpy.sourceforge.io/)
CVXOPT_Intro/ForestData.npy CVXOPT Volume 2
Conditioning_Stability/stability_data.npy Conditioning and Stability Volume 1 Generated
DataVisualization/anscombe.npy Data Visualization Python Essentials https://en.wikipedia.org/wiki/Anscombe's_quartet. Original citation: Anscombe, F. J. (1973). "Graphs in Statistical Analysis". American Statistician. 27 (1): 17โ€“21. JSTOR 2682899
DataVisualization/MLB.npy Data Visualization Python Essentials http://wiki.stat.ucla.edu/socr/index.php/SOCR_Data_MLB_HeightsWeights (modified)
DataVisualization/earthquakes.npy Data Visualization Python Essentials Gathered from https://earthquake.usgs.gov/earthquakes/search/
DataVisualization/countries.npy Data Visualization Python Essentials Combined from https://en.wikipedia.org/wiki/List_of_countries_by_GDP_(nominal), http://www.averageheight.co/, and https://en.wikipedia.org/wiki/List_of_countries_and_dependencies_by_population
Differentiation/plane.npy Differentiation Volume 1 Generated
DrazinInverse/social_network.csv The Drazin Inverse Volume 1 Adapted from https://en.wikipedia.org/wiki/Zachary%27s_karate_club
Exceptions_FileIO/hello_world.txt Exceptions and File Input/Output Python Essentials Written by hand
Exceptions_FileIO/cf_example1.txt Exceptions and File Input/Output Python Essentials Written by hand
Exceptions_FileIO/cf_example2.txt Exceptions and File Input/Output Python Essentials Written by hand
FacialRecognition/faces94.zip Facial Recognition Volume 1 http://cswww.essex.ac.uk/mv/allfaces/faces94.html
FourierTransform/tada.wav The Discrete Fourier Transform Volume 2 https://www.youtube.com/watch?v=bjxf-eQWKoo
FourierTransform/mystery_chord.wav The Discrete Fourier Transform Volume 2 Generated
FourierTransform/CGC.wav The Discrete Fourier Transform Volume 2 Generated
FourierTransform/GCG.wav The Discrete Fourier Transform Volume 2 Generated
FourierTransform/balloon.wav The Discrete Fourier Transform Volume 2 Recorded at BYU
FourierTransform/chopin.wav The Discrete Fourier Transform Volume 2
FourierTransform/noisy1.wav The Discrete Fourier Transform Volume 2
FourierTransform/noisy2.wav The Discrete Fourier Transform Volume 2
FourierTransform/vuvuzela.wav The Discrete Fourier Transform Volume 2 Part of https://www.youtube.com/watch?v=g_0NoBKWCT8
FourierTransform/noisy_face.png The Discrete Fourier Transform Volume 2 Sample from faces94.zip
FourierTransform/license_plate.png The Discrete Fourier Transform Volume 2
GradientMethods/linregression.txt Gradient Descent Methods Volume 2
GradientMethods/challenger.npy Gradient Descent Methods Volume 2
ImageSegmentation/dream.png Image Segmentation Volume 1
ImageSegmentation/dream_gray.png Image Segmentation Volume 1
InteriorPoint_Linear/simdata.txt Interior Point 1: Linear Programs Volume 2
InteriorPoint_Quadratic/portfolio.txt Interior Point 2: Quadratic Programs Volume 2
LeastSquares_Eigenvalues/circle.npy Least Squares and Computing Eigenvalues Volume 1 Generated
LeastSquares_Eigenvalues/ellipse.npy Least Squares and Computing Eigenvalues Volume 1 Generated
LeastSquares_Eigenvalues/housing.npy Least Squares and Computing Eigenvalues Volume 1 Gathered from https://www.fhfa.gov/DataTools/Downloads/Pages/House-Price-Index.aspx
LinearTransformations/horse.npy Linear Transformations Volume 1 Generated
LinkedLists/english.txt Linked Lists Volume 2 Generated
MarkovChains/yoda.txt Markov Chains Volume 2 Gathered from http://www.imsdb.com/scripts/Star-Wars-The-Empire-Strikes-Back.html, http://www.imsdb.com/scripts/Star-Wars-Return-of-the-Jedi.html, http://www.imsdb.com/scripts/Star-Wars-The-Phantom-Menace.html, http://www.imsdb.com/scripts/Star-Wars-Attack-of-the-Clones.html, and http://www.imsdb.com/scripts/Star-Wars-Revenge-of-the-Sith.html
MatplotlibIntro/FARS.npy Introduction to Matplotlib Python Essentials Gathered from https://www.nhtsa.gov/FARS
NearestNeighbor/mnist_subset.npz Nearest Neighbor Search Volume 2 Subset of the MNIST database from http://yann.lecun.com/exdb/mnist/
NumpyIntro/grid.npy Introduction to NumPy Python Essentials https://projecteuler.net/problem=11
PageRank/web_stanford.txt The PageRank Algorithm Volume 1 Subset of web-Stanford.txt from http://snap.stanford.edu/data/web-Stanford.html
PageRank/ncaa2010.csv The PageRank Algorithm Volume 1 Scraped from https://www.sports-reference.com
PageRank/ncaa2011.csv The PageRank Algorithm Volume 1 Scraped from https://www.sports-reference.com
PageRank/ncaa2012.csv The PageRank Algorithm Volume 1 Scraped from https://www.sports-reference.com
PageRank/ncaa2013.csv The PageRank Algorithm Volume 1 Scraped from https://www.sports-reference.com
PageRank/ncaa2014.csv The PageRank Algorithm Volume 1 Scraped from https://www.sports-reference.com
PageRank/ncaa2015.csv The PageRank Algorithm Volume 1 Scraped from https://www.sports-reference.com
PageRank/ncaa2016.csv The PageRank Algorithm Volume 1 Scraped from https://www.sports-reference.com
PageRank/ncaa2017.csv The PageRank Algorithm Volume 1 Scraped from https://www.sports-reference.com
PageRank/top250movies.txt The PageRank Algorithm Volume 1 Subset of movie_data.txt, scraped with imdbpy (https://imdbpy.sourceforge.io/)
Pandas1/crime_data.csv Pandas 1: Introduction Data Science Essentials
Pandas1/final_accidents2.pickle Pandas 1: Introduction Data Science Essentials
Pandas1/final_drivers.pickle Pandas 1: Introduction Data Science Essentials
Pandas2/final_accidents2.pickle Pandas 2: Plotting Data Science Essentials
Pandas2/final_drivers.pickle Pandas 2: Plotting Data Science Essentials
Pandas2/new_york_crime_clean.csv Pandas 2: Plotting Data Science Essentials
Pandas3/Ohio_1999.csv Pandas 3: Grouping Data Science Essentials
Pandas3/time_usage.txt Pandas 3: Grouping Data Science Essentials
Pandas4/DJIA.csv Pandas 4: Time Series Data Science Essentials
Pandas4/finances.csv Pandas 4: Time Series Data Science Essentials
Pandas4/paychecks.csv Pandas 4: Time Series Data Science Essentials
Pandas4/website_traffic.csv Pandas 4: Time Series Data Science Essentials
Pandas5/airports.csv Pandas 5: GeoPandas Data Science Essentials
PolynomialInterpolation/airdata.npy Polynomial Interpolation Volume 2
Profiling/names.txt Profiling Python Essentials https://projecteuler.net/problem=22
Profiling/triangle.txt Profiling Python Essentials https://projecteuler.net/problem=18
Profiling/triangle_large.txt Profiling Python Essentials https://projecteuler.net/problem=67
QuasiNewtonMethods/population.npy Newton and Quasi-Newton Methods Volume 2
RegularExpressions/fake_contacts.txt Regular Expressions Data Science Essentials Generated by http://www.Generatedata.com/
Simplex/productMix.npy Simplex Volume 2
SQL1/student_info.csv SQL 1: Introduction Data Science Essentials Written by hand
SQL1/student_grades.csv SQL 1: Introduction Data Science Essentials Written by hand
SQL1/us_earthquakes.csv SQL 1: Introduction Data Science Essentials
SQL2/students.db SQL 2 (The Sequel) Data Science Essentials Combined from student_info.csv, student_grades.csv, and the other tables in SQL 1
SVD_ImageCompression/hubble.jpg The SVD and Image Compression Volume 1 https://www.nasa.gov/multimedia/imagegallery/image_feature_2099.html
SVD_ImageCompression/hubble_gray.jpg The SVD and Image Compression Volume 1 Modification of hubble.jpg
Wavelets/mandrill.png Introduction to Wavelets Volume 2 http://sipi.usc.edu/database/
Wavelets/woman_darkhair.png Introduction to Wavelets Volume 2
Wavelets/noisy_darkhair.png Introduction to Wavelets Volume 2
Wavelets/uncompressed_finger.png Introduction to Wavelets Volume 2
WebScraping1/example.html Introduction to Beautiful Soup Data Science Essentials https://www.example.com
WebScraping1/san_diego_weather.html Introduction to Beautiful Soup Data Science Essentials
WebScraping1/large_banks_index.html Introduction to Beautiful Soup Data Science Essentials
WebScraping1/large_banks_data.html Introduction to Beautiful Soup Data Science Essentials
WebTechnologies/nyc_traffic.json Web Technologies Data Science Essentials Modified from https://data.cityofnewyork.us/Public-Safety/NYPD-Motor-Vehicle-Collisions/h9gi-nx95, gathered August 2017

data's People

Contributors

rdorff avatar shanemcq18 avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.