Fitting Chinese syllable-to-character mapping spectrum by the beta rank function
Wentian Li
Physica A: Statistical Mechanics and its Applications, 2012, vol. 391, issue 4, 1515-1518
Abstract:
We define the syllable-to-character mapping spectrum in Chinese as the normalized number of characters per syllable ranked from high to low. This spectrum provides a statistical characterization of the relationship between spoken and written Chinese. We have shown that two functions, the logarithmic function and the beta rank function, fit the syllable-to-character mapping spectrum well. The beta rank function is even better than the logarithmic function judged by two measures of data-fitting performance: the sum of square errors, and Akaike information criterion. We comment on why the beta rank function is a good fitting function for many range-limited ranking data, whereas for range-open data it may be out-performed by other functions, such as a power-law function in the case of Zipf’s law.
Keywords: Zipf’s law; Beta rank function; Akaike information criterion; Chinese language (search for similar items in EconPapers)
Date: 2012
References: View references in EconPapers View complete reference list from CitEc
Citations: View citations in EconPapers (2)
Downloads: (external link)
http://www.sciencedirect.com/science/article/pii/S0378437111006522
Full text for ScienceDirect subscribers only. Journal offers the option of making the article available online on Science direct for a fee of $3,000
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:eee:phsmap:v:391:y:2012:i:4:p:1515-1518
DOI: 10.1016/j.physa.2011.08.024
Access Statistics for this article
Physica A: Statistical Mechanics and its Applications is currently edited by K. A. Dawson, J. O. Indekeu, H.E. Stanley and C. Tsallis
More articles in Physica A: Statistical Mechanics and its Applications from Elsevier
Bibliographic data for series maintained by Catherine Liu ().