There are many ways to organize data. Today we learn one way: the "tidy data" format. It is rather simple, but deceptively powerful.
Mon Sep 19, 2016
There are many ways to organize data. Today we learn one way: the "tidy data" format. It is rather simple, but deceptively powerful.
The third point means we don't mix apples and oranges, keep apples together with apples.
This slide added on Tue 10/4:
This is just a spreadsheet!
But we are very strict about the previous three rules.
The nycflights13
package contains information about all flights that departed from NYC (e.g. EWR, JFK and LGA) in 2013: 336,776 flights in total. To help understand what causes delays, it also includes a number of other useful datasets:
weather
: hourly meterological data for each airportplanes
: construction information about each planeairports
: airport names and locationsairlines
: translation between two letter carrier codes and names