building a bicycle data fusion model for oregon · key takeaways. 1. no single golden data goose 2....
TRANSCRIPT
![Page 1: Building a Bicycle Data Fusion Model for Oregon · Key Takeaways. 1. No single golden data goose 2. Swimming in multiple data sources • Some public, some private • Integrating](https://reader034.vdocument.in/reader034/viewer/2022042222/5ec82b9ddda961587573b820/html5/thumbnails/1.jpg)
Building a Bicycle Data Fusion Model for Oregon
Sirisha Kothuri, Ph.D.
3/11/2020
![Page 2: Building a Bicycle Data Fusion Model for Oregon · Key Takeaways. 1. No single golden data goose 2. Swimming in multiple data sources • Some public, some private • Integrating](https://reader034.vdocument.in/reader034/viewer/2022042222/5ec82b9ddda961587573b820/html5/thumbnails/2.jpg)
Agenda
• Introduction• Nonmotorized Traffic Monitoring Programs• Emerging Data Sources• Research
2
![Page 3: Building a Bicycle Data Fusion Model for Oregon · Key Takeaways. 1. No single golden data goose 2. Swimming in multiple data sources • Some public, some private • Integrating](https://reader034.vdocument.in/reader034/viewer/2022042222/5ec82b9ddda961587573b820/html5/thumbnails/3.jpg)
Why count?
If we don’t count it, it doesn’t count!!
3
![Page 4: Building a Bicycle Data Fusion Model for Oregon · Key Takeaways. 1. No single golden data goose 2. Swimming in multiple data sources • Some public, some private • Integrating](https://reader034.vdocument.in/reader034/viewer/2022042222/5ec82b9ddda961587573b820/html5/thumbnails/4.jpg)
Why measure walking and biking?
• Funding & policy decisions• To show change over time• Facility design• Planning (short-term, long-term, regional…)
• Economic impact• Public health• Safety
4
![Page 5: Building a Bicycle Data Fusion Model for Oregon · Key Takeaways. 1. No single golden data goose 2. Swimming in multiple data sources • Some public, some private • Integrating](https://reader034.vdocument.in/reader034/viewer/2022042222/5ec82b9ddda961587573b820/html5/thumbnails/5.jpg)
How do we measure?• Surveys
• National• Regional• Local
• GPS• Counts
• Permanent• Short duration
5
![Page 6: Building a Bicycle Data Fusion Model for Oregon · Key Takeaways. 1. No single golden data goose 2. Swimming in multiple data sources • Some public, some private • Integrating](https://reader034.vdocument.in/reader034/viewer/2022042222/5ec82b9ddda961587573b820/html5/thumbnails/6.jpg)
What?
People actually bike here?
Yes! 200 per day
Credit: K. Nordback6
![Page 7: Building a Bicycle Data Fusion Model for Oregon · Key Takeaways. 1. No single golden data goose 2. Swimming in multiple data sources • Some public, some private • Integrating](https://reader034.vdocument.in/reader034/viewer/2022042222/5ec82b9ddda961587573b820/html5/thumbnails/7.jpg)
What?People actually walk here?
Credit: K. Nordback7
![Page 8: Building a Bicycle Data Fusion Model for Oregon · Key Takeaways. 1. No single golden data goose 2. Swimming in multiple data sources • Some public, some private • Integrating](https://reader034.vdocument.in/reader034/viewer/2022042222/5ec82b9ddda961587573b820/html5/thumbnails/8.jpg)
What are counts not good for?
• Studying trip purpose • Demographics
8
Credit: K. Nordback
![Page 9: Building a Bicycle Data Fusion Model for Oregon · Key Takeaways. 1. No single golden data goose 2. Swimming in multiple data sources • Some public, some private • Integrating](https://reader034.vdocument.in/reader034/viewer/2022042222/5ec82b9ddda961587573b820/html5/thumbnails/9.jpg)
Traffic Monitoring Programs• Required by FHWA (MAP21):
• all urban and rural principal arterial roadways• all intermodal connector roadways • the strategic defense highway network
• Historically used to allocate federal funds to state DOTs.
• Municipalities • Planning• Signal timing
9Credit: K. Nordback
![Page 10: Building a Bicycle Data Fusion Model for Oregon · Key Takeaways. 1. No single golden data goose 2. Swimming in multiple data sources • Some public, some private • Integrating](https://reader034.vdocument.in/reader034/viewer/2022042222/5ec82b9ddda961587573b820/html5/thumbnails/10.jpg)
State Traffic Monitoring• Permanent counters
• Inductive loops
• Short duration counters• Pneumatic tubes
10Pic Credit: MetroCount
![Page 11: Building a Bicycle Data Fusion Model for Oregon · Key Takeaways. 1. No single golden data goose 2. Swimming in multiple data sources • Some public, some private • Integrating](https://reader034.vdocument.in/reader034/viewer/2022042222/5ec82b9ddda961587573b820/html5/thumbnails/11.jpg)
Oregon’s Traffic Monitoring Program
11Source: ODOT
![Page 12: Building a Bicycle Data Fusion Model for Oregon · Key Takeaways. 1. No single golden data goose 2. Swimming in multiple data sources • Some public, some private • Integrating](https://reader034.vdocument.in/reader034/viewer/2022042222/5ec82b9ddda961587573b820/html5/thumbnails/12.jpg)
Oregon Short Duration Counts
12Credit: K. Nordback
![Page 13: Building a Bicycle Data Fusion Model for Oregon · Key Takeaways. 1. No single golden data goose 2. Swimming in multiple data sources • Some public, some private • Integrating](https://reader034.vdocument.in/reader034/viewer/2022042222/5ec82b9ddda961587573b820/html5/thumbnails/13.jpg)
AADT
Credit: K. Nordback13
![Page 14: Building a Bicycle Data Fusion Model for Oregon · Key Takeaways. 1. No single golden data goose 2. Swimming in multiple data sources • Some public, some private • Integrating](https://reader034.vdocument.in/reader034/viewer/2022042222/5ec82b9ddda961587573b820/html5/thumbnails/14.jpg)
AADT to Estimate VMT
14
Sum (AADT X Segment Length) over network to compute Vehicle Miles Traveled (VMT)
Source: ODOT
![Page 15: Building a Bicycle Data Fusion Model for Oregon · Key Takeaways. 1. No single golden data goose 2. Swimming in multiple data sources • Some public, some private • Integrating](https://reader034.vdocument.in/reader034/viewer/2022042222/5ec82b9ddda961587573b820/html5/thumbnails/15.jpg)
15
Resources
![Page 16: Building a Bicycle Data Fusion Model for Oregon · Key Takeaways. 1. No single golden data goose 2. Swimming in multiple data sources • Some public, some private • Integrating](https://reader034.vdocument.in/reader034/viewer/2022042222/5ec82b9ddda961587573b820/html5/thumbnails/16.jpg)
AADBT Estimation
16
• Day-of-week, month-of-year (19 factors)• Day-of-week-of-month• Disaggregate, day-of-year• Seasonal factors• K (design-hour) factors• Statistical modeling approaches (weather, correction
factors)
![Page 17: Building a Bicycle Data Fusion Model for Oregon · Key Takeaways. 1. No single golden data goose 2. Swimming in multiple data sources • Some public, some private • Integrating](https://reader034.vdocument.in/reader034/viewer/2022042222/5ec82b9ddda961587573b820/html5/thumbnails/17.jpg)
Minimizing Error
• Using daily factors that are disaggregated by weather conditions;
• Conducting short-duration counts during months with higher traffic volumes (e.g., April–October;) for at least seven days;
• If counts must be limited to 24 h, plan for monitoring Tuesdays through Thursdays, even at sites with high weekend volumes.
• Using monthly rather than seasonal factors;
• Using weather-based regression equations to correct factors;
• Using day-of-year rather than traditional factors; and
• Imputing missing values using methods that account for weather and information from similar, nearby sites.
• Plan for at least four counters per factor group for bicycles and five or more counters for pedestrians.
17
![Page 18: Building a Bicycle Data Fusion Model for Oregon · Key Takeaways. 1. No single golden data goose 2. Swimming in multiple data sources • Some public, some private • Integrating](https://reader034.vdocument.in/reader034/viewer/2022042222/5ec82b9ddda961587573b820/html5/thumbnails/18.jpg)
Trends
• Innovation in Counting Technologies
18
![Page 19: Building a Bicycle Data Fusion Model for Oregon · Key Takeaways. 1. No single golden data goose 2. Swimming in multiple data sources • Some public, some private • Integrating](https://reader034.vdocument.in/reader034/viewer/2022042222/5ec82b9ddda961587573b820/html5/thumbnails/19.jpg)
Multiple Emerging Data Sources
• Mobile device crowdsensing • Micro-mobility services • Smarter signalized intersection sensors• Ped/bike exposure and near-miss from connected
cars
• How to make sense?? What is “good enough”?
19Slide Credit: S. Turner
![Page 20: Building a Bicycle Data Fusion Model for Oregon · Key Takeaways. 1. No single golden data goose 2. Swimming in multiple data sources • Some public, some private • Integrating](https://reader034.vdocument.in/reader034/viewer/2022042222/5ec82b9ddda961587573b820/html5/thumbnails/20.jpg)
Key Takeaways
1. No single golden data goose 2. Swimming in multiple data sources
• Some public, some private• Integrating disparate sources• Reconciling differences and bias • Understand what is being measured and how • Ask vendors questions, independent eval
3. Keep installing and maintaining accurate permanent counters • Foundation for testing or calibrating any crowdsource method
4. Understand privacy issues and be proactivehttps://www.nytimes.com/interactive/2019/12/19/opinion/
location-tracking-cell-phone.html
20Slide Credit: S. Turner
![Page 21: Building a Bicycle Data Fusion Model for Oregon · Key Takeaways. 1. No single golden data goose 2. Swimming in multiple data sources • Some public, some private • Integrating](https://reader034.vdocument.in/reader034/viewer/2022042222/5ec82b9ddda961587573b820/html5/thumbnails/21.jpg)
Trends
21Source: Proulx and Pozdnukhov, 2017
• Data Fusion
![Page 22: Building a Bicycle Data Fusion Model for Oregon · Key Takeaways. 1. No single golden data goose 2. Swimming in multiple data sources • Some public, some private • Integrating](https://reader034.vdocument.in/reader034/viewer/2022042222/5ec82b9ddda961587573b820/html5/thumbnails/22.jpg)
22
Bike Counts
Automated Manual
+
Third-Party Data
Opt-in / Apps Passive/Background Sharing
+ +
Exploring Data Fusion Techniques to Derive Bicycle Volumes on a Network
![Page 23: Building a Bicycle Data Fusion Model for Oregon · Key Takeaways. 1. No single golden data goose 2. Swimming in multiple data sources • Some public, some private • Integrating](https://reader034.vdocument.in/reader034/viewer/2022042222/5ec82b9ddda961587573b820/html5/thumbnails/23.jpg)
Scope• What third-party bicycle volume and route data are
available to augment conventional count data, and how have they been used to date?
• How can we evaluate these emerging data sources for factors such as accuracy, coverage, completeness, and representativeness?
• How can we combine emerging third-party and conventional data sources to estimate bicycle volumes across entire networks?
• What relative marginal value do different data sources and model forms add in predicting bicycle volumes relative to acquisition and processing costs?
23
![Page 24: Building a Bicycle Data Fusion Model for Oregon · Key Takeaways. 1. No single golden data goose 2. Swimming in multiple data sources • Some public, some private • Integrating](https://reader034.vdocument.in/reader034/viewer/2022042222/5ec82b9ddda961587573b820/html5/thumbnails/24.jpg)
Third-party Bicycle User Data: Overview
24
Study Location (Country)
User data Demographic(vs. actual)
Count correspondence1
(count data)Peer-reviewed journal articles
Boss et al. (2018) Ottawa (CA) Strava
78% male (68%)Noted age 25-44 over-represented
(11 perm., single month 1 yr apart)1-30% SRr=0.76-0.96
Sample row from full table in Lit Review
22 studies reviewed using 3rd party user data• 11 academic, 11 agency/report/white papers• 13 US, 9 international• Vast majority included Strava Metro data
• Others: Ride Report & other smartphone apps• Not all focused on evaluation, but most reported basic
validation stats
![Page 25: Building a Bicycle Data Fusion Model for Oregon · Key Takeaways. 1. No single golden data goose 2. Swimming in multiple data sources • Some public, some private • Integrating](https://reader034.vdocument.in/reader034/viewer/2022042222/5ec82b9ddda961587573b820/html5/thumbnails/25.jpg)
• User data demographics generally limited to age, gender, and type of cycling activity (e.g., commuting vs. recreational riding)
• Information on key equity-related variables such as income and race/ethnicity lacking
• Even where user data available, lack of comparison data on the composition of cyclists in a city or region
• Strava (from reviewed US/Canadian studies): 75-84% male, 25-44 over-represented; 31% 35-44 (single study, Atlanta, 71% recreational (single study, Atlanta)
• National cyclist comparisons: 51%-60% male, 18-21% 35-44
Third-party Bicycle User Data: Demographics
25
![Page 26: Building a Bicycle Data Fusion Model for Oregon · Key Takeaways. 1. No single golden data goose 2. Swimming in multiple data sources • Some public, some private • Integrating](https://reader034.vdocument.in/reader034/viewer/2022042222/5ec82b9ddda961587573b820/html5/thumbnails/26.jpg)
• Sampling rate is often reported as: third party count / full count
• Most often calculated from automated counters, but sometimes from annualized short-duration counts
• Strava sampling rates typically reported as • < 10% of all cyclists• Most commonly ~1-5%• Maximum 30%, typically along busy trails or at urban
chokepoints (e.g. bridges)• Variable spatially, even within region/city
Third-party Bicycle User Data: Sampling rate
26
![Page 27: Building a Bicycle Data Fusion Model for Oregon · Key Takeaways. 1. No single golden data goose 2. Swimming in multiple data sources • Some public, some private • Integrating](https://reader034.vdocument.in/reader034/viewer/2022042222/5ec82b9ddda961587573b820/html5/thumbnails/27.jpg)
• Usually reported as either a bi-variate, linear correlation (r) or as an R^2 from a simple multiple regression function
• Overall correlation across all sites typically high (r=0.6-0.9)• Wide variation by site (some sites have even shown negative
correlation with third-party data)• Sampling rate does not necessarily predict correlation; e.g.,
in Austin Ride Report demonstrated comparable correlation results despite a much lower sampling rate
• Specific correlation not always well-described (e.g. across sites or over time at same site), and simple correlation sensitive to outliers such as high volume count locations, so high r’s should be interpreted with caution
Third-party Bicycle User Data: Correlation with Counts
27
![Page 28: Building a Bicycle Data Fusion Model for Oregon · Key Takeaways. 1. No single golden data goose 2. Swimming in multiple data sources • Some public, some private • Integrating](https://reader034.vdocument.in/reader034/viewer/2022042222/5ec82b9ddda961587573b820/html5/thumbnails/28.jpg)
Anticipated Outcomes• A review of existing bicycle volume estimation methods
and a catalog of potential third-party data sources• Evaluation of emerging third-party bicycle data sources
as inputs into network-wide volume estimation• Demonstration of the development, application, and
validation of models that combine multiple data sources• Comparison of the relative value of different data
sources and modeling techniques in bicycle volume estimation
• Documentation and packaging of methods developed including making scripts available on GitHub
28
![Page 29: Building a Bicycle Data Fusion Model for Oregon · Key Takeaways. 1. No single golden data goose 2. Swimming in multiple data sources • Some public, some private • Integrating](https://reader034.vdocument.in/reader034/viewer/2022042222/5ec82b9ddda961587573b820/html5/thumbnails/29.jpg)
Contact Information
Sirisha Kothuri, Ph.D.Portland State UniversityEmail: [email protected]: 503.725.4208
29
Thank You!