CSE4/587 Spring 2017

Collecting Data

Objectives for today's lectures

  • Define and understand data science process (Chapter 1)
  • Self-assess where you stand in the DS competencies
  • Learn simple data collection methods
  • Exploratory data analsyis (Chapter 2)
  • Gentle introduction to our data environment--Jupyter

Chapter 1: Data Science Process



What is the skill set needed for a data-computer scientist? Lets look at the various competencies defining DS using this self-assessment.

Data Scale and Exploratory Data Analysis (EDA)

Lets examine some standard approaches to examining data.
We will discuss some noteworthy points discussed in Chapter 2 on statistical inference. Data approaches for SI

Jupyter Notebook

We will demo some of the features of Jupyter, the environment we will use for most of our labs. Please install Jupyter and RStudio using the instructions given in this handout.