TY - JOUR
T1 - CIAM
T2 - A data-driven approach for classifying long-term engagement of public transport riders at multiple temporal scales
AU - Cardell-Oliver, Rachel
AU - Olaru, Doina
PY - 2022/11
Y1 - 2022/11
N2 - Many human activities, including daily travel, show a mix of stable, intermittent and changing patterns in demand by individuals over time. However, the lack of continuous, long-term, passenger-linked data for public transport (PT) journeys means that we do not know how passenger ridership evolves in real-world networks. This paper proposes the CIAM model for classifying long-term passenger engagement with PT. CIAM is a data-driven model combining year-on-year churn (C), monthly intensity (I), annual (A) and multi-year (M) engagement. Parameter search algorithms are used to ensure that the learned features are distinctive and robust. We evaluated CIAM using a 5-year dataset from a PT network with over 300 million journeys. CIAM identified distinct patterns of long-term ridership at multiple time scales. Although the total number of annual journeys was relatively stable over the five years, we found long-term differences between passenger subgroups. Churn of passengers was a major factor in ridership with only 55% of passengers retained from year to year. Patterns of annual engagement are often intermittent, so short-term snapshots of a few weeks are typically not good indicators for longer term engagement. Only 27% of high-frequency, full-fare riders still have the same level of engagement four years later, compared with 55% who continue high-frequency engagement after only one year.
AB - Many human activities, including daily travel, show a mix of stable, intermittent and changing patterns in demand by individuals over time. However, the lack of continuous, long-term, passenger-linked data for public transport (PT) journeys means that we do not know how passenger ridership evolves in real-world networks. This paper proposes the CIAM model for classifying long-term passenger engagement with PT. CIAM is a data-driven model combining year-on-year churn (C), monthly intensity (I), annual (A) and multi-year (M) engagement. Parameter search algorithms are used to ensure that the learned features are distinctive and robust. We evaluated CIAM using a 5-year dataset from a PT network with over 300 million journeys. CIAM identified distinct patterns of long-term ridership at multiple time scales. Although the total number of annual journeys was relatively stable over the five years, we found long-term differences between passenger subgroups. Churn of passengers was a major factor in ridership with only 55% of passengers retained from year to year. Patterns of annual engagement are often intermittent, so short-term snapshots of a few weeks are typically not good indicators for longer term engagement. Only 27% of high-frequency, full-fare riders still have the same level of engagement four years later, compared with 55% who continue high-frequency engagement after only one year.
KW - Annual engagement
KW - CIAM
KW - Customer segmentation
KW - Monthly intensity
KW - Multi-year engagement
KW - Public transport
KW - Smart card data
KW - Urban computing
KW - Year-on-year churn
UR - http://www.scopus.com/inward/record.url?scp=85139333814&partnerID=8YFLogxK
U2 - 10.1016/j.tra.2022.09.002
DO - 10.1016/j.tra.2022.09.002
M3 - Article
AN - SCOPUS:85139333814
SN - 0965-8564
VL - 165
SP - 321
EP - 336
JO - Transportation Research Part A: Policy and Practice
JF - Transportation Research Part A: Policy and Practice
ER -