} </style>
<div class="fluid-row" id="header">
<h1 class="title toc-ignore">A first glance at the Divvy data</h1>
<h4 class="author"><em>Peter Carbonetto</em></h4>
</div>
<!-- Define defaults shared by all workflowr files. -->
<!-- Update knitr chunk options -->
<!-- Insert the date the file was last updated -->
<p><strong>Last updated:</strong> 2018-01-18</p>
<!-- Insert the code version (Git commit SHA1) if Git repository exists and R package git2r is installed -->
<p><strong>Code version:</strong> 93a3a86</p>
<p>Here, we will take a brief look at the data provided by Divvy.</p>
<p>I begin by loading a few packages, as well as some additional functions I wrote for this project.</p>
<pre class="r"><code>library(data.table)
source("../code/functions.R")</code></pre>
<div id="reading-the-data" class="section level2">
<h2>Reading the data</h2>
<p>I wrote a function, <code>read.divvy.data</code>, that reads in the trip and station data from the <a href="https://www.divvybikes.com/system-data">Divvy CSV files</a>. This function uses <code>fread</code> from the <code>data.table</code> package to quickly read in the data (it is much faster than <code>read.table</code>). This function also prepares the data, including the departure dates and times, so that they are easier to work with.</p>
<pre class="r"><code>divvy <- read.divvy.data()
# Reading station data from ../data/Divvy_Stations_2016_Q4.csv.
# Reading trip data from ../data/Divvy_Trips_2016_Q1.csv.
# Reading trip data from ../data/Divvy_Trips_2016_04.csv.
# Reading trip data from ../data/Divvy_Trips_2016_05.csv.
# Reading trip data from ../data/Divvy_Trips_2016_06.csv.
# Reading trip data from ../data/Divvy_Trips_2016_Q3.csv.
# Reading trip data from ../data/Divvy_Trips_2016_Q4.csv.
# Preparing Divvy data for analysis in R.
# Converting dates and times.</code></pre>
</div>
<div id="a-first-glance-at-the-divvy-data" class="section level2">
<h2>A first glance at the Divvy data</h2>
<p>We have data on 581 Divvy stations across the city.</p>
<pre class="r"><code>nrow(divvy$stations)
# [1] 581
print(head(divvy$stations),row.names = FALSE)
#                       name latitude longitude dpcapacity online_date
#        2112 W Peterson Ave 41.99118 -87.68359         15   5/12/2015
#              63rd St Beach 41.78102 -87.57612         23   4/20/2015
#          900 W Harrison St 41.87468 -87.65002         19    8/6/2013
# Aberdeen St & Jackson Blvd 41.87773 -87.65479         15   6/21/2013
#    Aberdeen St & Monroe St 41.88042 -87.65560         19   6/26/2013
#   Ada St & Washington Blvd 41.88283 -87.66121         15  10/10/2013</code></pre>
<p>We also have information about the roughly 3.6 million trips taken on Divvy bikes in 2016.</p>
<pre class="r"><code>nrow(divvy$trips)
# [1] 3595383
print(head(divvy$trips),row.names = FALSE)
# trip_id           starttime bikeid tripduration from_station_id
# 9080551 2016-03-31 23:53:00    155          841             344
# 9080550 2016-03-31 23:46:00   4831          649             128
# 9080549 2016-03-31 23:42:00   4232          210             350
# 9080548 2016-03-31 23:37:00   3464         1045             303
# 9080547 2016-03-31 23:33:00   1750          202             334
# 9080546 2016-03-31 23:31:00   4302          638              67
#             from_station_name to_station_id               to_station_name
# Ravenswood Ave & Lawrence Ave           458      Broadway & Thorndale Ave
#       Damen Ave & Chicago Ave           213        Leavitt St & North Ave
#     Ashland Ave & Chicago Ave           210     Ashland Ave & Division St
#       Broadway & Cornelia Ave           458      Broadway & Thorndale Ave
#   Lake Shore Dr & Belmont Ave           329 Lake Shore Dr & Diversey Pkwy
# Sheffield Ave & Fullerton Ave           304       Broadway & Waveland Ave
#   usertype gender birthyear start.week start.day start.hour
# Subscriber   Male      1986         13  Thursday         23
# Subscriber   Male      1980         13  Thursday         23
# Subscriber   Male      1979         13  Thursday         23
# Subscriber   Male      1980         13  Thursday         23
# Subscriber   Male      1969         13  Thursday         23
# Subscriber   Male      1991         13  Thursday         23</code></pre>
<p>Out of all the Divvy stations in Chicago, the one on Navy Pier (near the corner of
Streeter and Grand) had by far the most departures.</p>
<pre class="r"><code>departures <- table(divvy$trips$from_station_name)
as.matrix(head(sort(departures,decreasing = TRUE)))
#                               [,1]
# Streeter Dr & Grand Ave      90042
# Lake Shore Dr & Monroe St    51090
# Theater on the Lake          47927
# Clinton St & Washington Blvd 47125
# Lake Shore Dr & North Blvd   45754
# Clinton St & Madison St      41744</code></pre>
</div>
<div id="divvy-bikes-at-the-university-of-chicago" class="section level2">
<h2>Divvy bikes at the University of Chicago</h2>
<p>I would also like to take a closer look at the trip data for the main Divvy station on the University of Chicago campus.