Measuring Divergence and Convergence in Sequence Analysis: A Spell-Based Extension of Longest Common Prefixes
Yuqi Liang and
Jan Meyerhoff-Liang
No 3pyhr_v1, SocArXiv from Center for Open Science
Abstract:
Measuring sequence dissimilarity is central to sequence analysis, yet empirical applications remain dominated by optimal matching (OM), a widely used but contested approach. Longest common prefix (LCP) measures offer an alternative by capturing shared beginnings and endings, making them useful for studying divergence and convergence, important concepts in social science. However, existing LCP variants are position-wise: small shifts in transition timing can break the common prefix between trajectories that share the same spell sequence, making them appear to diverge early. This study develops a spell-based perspective on understanding sequence comparison. We introduce OMspellUnitFree (a modified spell-based OM) and propose LCPspell, a spell-based extension of LCP. Using simulations and an empirical demonstration, we show that whole-trajectory measures such as OM family capture broad trajectory profiles, whereas LCP-type measures identify where similarity breaks down or reappears. The paper provides practical guidance for choosing distance measures according to research questions and data characteristics.
Date: 2026-05-05
References: Add references at CitEc
Citations:
Downloads: (external link)
https://osf.io/download/69f9c6d86245f64c90df8acf/
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:osf:socarx:3pyhr_v1
DOI: 10.31219/osf.io/3pyhr_v1
Access Statistics for this paper
More papers in SocArXiv from Center for Open Science
Bibliographic data for series maintained by OSF ().