A Language and Its Dimensions: Intrinsic Dimensions of Language Fractal Structures
Vasilii A. Gromov,
Nikita S. Borodin,
Asel S. Yerbolova and
Hiroki Sayama
Complexity, 2024, vol. 2024, 1-24
Abstract:
The present paper introduces a novel object of study, a language fractal structure; we hypothesize that a set of embeddings of all n-grams of a natural language constitutes a representative sample of this fractal set. (We use the term Hailonakea to refer to the sum total of all language fractal structures, over all n). The paper estimates intrinsic (genuine) dimensions of language fractal structures for the Russian and English languages. To this end, we employ methods based on (1) topological data analysis and (2) a minimum spanning tree of a data graph for a cloud of points considered (Steele theorem). For both languages, for all n, the intrinsic dimensions appear to be noninteger values (typical for fractal sets), close to 9 for both of the Russian and English language.
Date: 2024
References: Add references at CitEc
Citations:
Downloads: (external link)
http://downloads.hindawi.com/journals/complexity/2024/8863360.pdf (application/pdf)
http://downloads.hindawi.com/journals/complexity/2024/8863360.xml (application/xml)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:hin:complx:8863360
DOI: 10.1155/2024/8863360
Access Statistics for this article
More articles in Complexity from Hindawi
Bibliographic data for series maintained by Mohamed Abdelhakeem ().