Improvement on the association strength: implementing a probabilistic measure based on combinations without repetition
Mathieu Steijn
No 2043, Papers in Evolutionary Economic Geography (PEEG) from Utrecht University, Department of Human Geography and Spatial Planning, Group Economic Geography
Abstract:
The use of co-occurrence data is common in various domains. Co-occurrence data often needs to be normalised to correct for the size-effect. To this end, van Eck and Waltman (2009) recommend a probabilistic measure known as the association strength. However, this formula is based on combinations with repetition, even though in most uses self-co-occurrences are non-existent or irrelevant. A more accurate measure based on combinations without repetition is introduced here and compared to the original formula through mathematical derivations, simulations, and patent data. The comparison shows that the original formula overestimates the relation between a pair and that some pairs are disproportionately more overestimated than others. The new measure is available in the EconGeo package for R by Balland (2016).
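The distinction between the two counting schemes can be illustrated with a minimal Python sketch. It is an illustration only: it is not the EconGeo implementation, and the formulas are an assumed reading of the abstract rather than equations quoted from the paper. The assumption is that a pair is formed by drawing two occurrences in sequence. With repetition, the second draw comes from the full pool of size T. Without repetition, an occurrence of the item already drawn is excluded, so the pool shrinks to T - s_i. The function names and the toy matrix are hypothetical.

```python
import numpy as np

def expected_with_repetition(c):
    """Expected co-occurrences when self-pairs are (implicitly) allowed."""
    c = np.asarray(c, dtype=float)
    s = c.sum(axis=1)              # s_i: total occurrences of item i
    total = s.sum()                # T: sum of all row totals
    m = total / 2.0                # each co-occurrence is counted twice in c
    p = 2.0 * np.outer(s, s) / total**2   # P(i then j) + P(j then i)
    return m * p

def expected_without_repetition(c):
    """Expected co-occurrences when an item cannot pair with itself (assumed form)."""
    c = np.asarray(c, dtype=float)
    s = c.sum(axis=1)
    total = s.sum()
    m = total / 2.0
    # ordered[i, j] = P(draw i first) * P(draw j second | i excluded)
    ordered = (s / total)[:, None] * (s[None, :] / (total - s)[:, None])
    p = ordered + ordered.T        # either order yields the same unordered pair
    return m * p

def association(c, expected):
    """Observed over expected co-occurrences; the diagonal is set to zero."""
    ratio = np.asarray(c, dtype=float) / expected
    np.fill_diagonal(ratio, 0.0)
    return ratio

# Toy symmetric co-occurrence matrix with an empty diagonal (hypothetical data).
c = np.array([[0, 2, 1],
              [2, 0, 3],
              [1, 3, 0]])
print(association(c, expected_with_repetition(c)))
print(association(c, expected_without_repetition(c)))
```

Under these assumptions the pair probabilities without repetition sum to one over all distinct pairs, whereas the with-repetition probabilities sum to less than one because part of the probability mass goes to self-pairs. The with-repetition expected values are therefore smaller and the resulting ratios larger, which is consistent with the overestimation described in the abstract.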
Keywords: co-occurrence; network analysis; similarity measure; probabilistic measures
Date: 2020-09, Revised 2020-09
Downloads: http://econ.geo.uu.nl/peeg/peeg2043.pdf Version September 2020 (application/pdf)
Persistent link: https://EconPapers.repec.org/RePEc:egu:wpaper:2043