Giter Club home page Giter Club logo

ds-sigcse's Introduction

Computing Infrastructure and Curriculum Design for Introductory Data Science

TMaterials for the “Computing Infrastructure and Curriculum Design for Introductory Data Science” workshop at SIGCSE 2019 in Minneapolis, MN.

See the GitHub repo here, and the RStudio Cloud space here.

When and where

7:00 PM - 10:00 PM
Wed Feb 27, 2019
Hyatt: Greenway F (2nd floor)

Blurb

Interested in teaching introductory data science? running your course on GitHub, and doing so efficiently? what first exposure to computing with R might look like? what the tidyverse is? If your answer is yes to any of these, this workshop is for you! We will showcase and discuss the pedagogical considerations behind the introductory data science curriculum presented in Data Science in a Box (datasciencebox.org), get hands on practice with tooling, and share a complete set of open source course materials, including teacher facing documentation and student facing learning resources and assessments.

Abstract

The goal of this workshop is to equip educators with concrete information on content and infrastructure for designing and painlessly running a modern data science course. This is a three-part workshop.

  • Part 1 will outline a curriculum for an introductory data science course and discuss pedagogical decisions that go into the choice of topics and concepts, programming language (R) and syntax (primarily tidyverse), emphasis on literate programming for reproducibility (with R Markdown).
  • Part 2 will discuss infrastructure choices around teaching data science with R: RStudio as an integrated development environment, cloud-based access with RStudio Cloud and Server, version control with Git, and collaboration with GitHub.
  • Part 3 will focus on classroom management on GitHub (with ghclass).

Workshop attendees will work through several exercises from the course and get first-hand experience with using the tool-chains and techniques described above. All workshop content, including teacher facing documentation and student facing course materials, will also be available to participants via datasciencebox.org.

Slides

Pre-workshop instructions

In this workshop we will be coding in R via the RStudio IDE. If the conference WiFi cooperates we will do this in the cloud via rstudio.cloud. I will purposefully hold off instructions for this so that you can experience students’ day one experience in the class.

However, the backup option, in case WiFi doesn’t cooperate, is running RStudio locally. Below are the instructions for the required local setup. Note that these instructions are a lot more detailed than what the students in the intro data science course are exposed to.

  • Install R: A recent version of R (>= 3.5.2 “Eggshell Igloo”) is recommended. If you already have R installed, you can check your version from the R Console:
getRversion()
## [1] '3.5.2'
  • Install RStudio: A recent version of RStudio (>= v1.2.1293-1) is recommended.

  • Install packages: For this workshop, you’ll need to install several R packages. To do so, please run the following in the your R console:

workshop_pkgs <- c("tidyverse", "openintro",  "infer", "broom")
install.packages(workshop_pkgs)

ds-sigcse's People

Contributors

mine-cetinkaya-rundel avatar

Stargazers

Karl R. B. Schmitt avatar Nicholas Viau avatar

Watchers

James Cloos avatar  avatar  avatar

ds-sigcse's Issues

Finalize links

and once they're finalized, update

  • welcome deck
  • footer on all decks

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.