A work-in-progress project to analyze Capital Bikeshare data. My research angle looks at any potential changes in aggregate usage patterns due to COVID-19. This is an independent research project; my views and conclusions here are my own.
Please note that while the code here is available publicly, all system data should be downloaded directly from the capital bikeshare website, as per their license agreement. The full usage data is quite large (~4gb), so please make the appropriate adjustments or time allotments when running the code.
The code is designed to pull the raw .csv files from folders named by year.
In pre-pandemic years, the late spring, summer, and early fall weeks see the highest monthly rides. In 2020, the first few weeks had weekly ride tallys in line with pre-pandemic years, but April of 2020 saw a drastic decrease in the number of rides compared to pre-pandemic years. Monthly rides stayed below pre-pandemic years, but nevertheless recovered after the spring dip in ridership.
Median ride durations appear higher on the weekends throughout the year. But at the start of the pandemic we begin to see a pattern where median ride durations are longer -- on both weekdays and weekends.
The changes in median duration from each station play out between the 10th and 12th weeks of 2020, where the distribution of ride lengths becomes more spread out quite quickly.
In 2020, we also see a more accentuated relationship between membership and metro-going percentages: the higher proportion of users that check out a bike that are members, the more likely the ride patterns from that station are headed to a another station near a metro.
note: main project currently on pause due to development of cycleR, which should hopefully make this project better :)
Technical
- Should the project architecture+code be tweaked to fit the
stplanr
package and workflow? - How to treat docked vs dockless bikes?
Research
- What indicators are most useful for tracking aggregate trends before and across different stages of the pandemic?
- Can the GINI coefficient be implemented to reveal anything, and if so what does it actually mean?
- Can we really think about trends in a pre-, during-, return-to-normal trajectory or is this "restoration" narrative less appropriate than a "change" narrative?
Data come from OpenStreetMap and from CapitalBikeShare (as described above). Upon the request of OSM, here's the attribution: "© OpenStreetMap contributors". The data derived from OpenStreetMap should be maintained under the Open Database License; I intend to do so.