Quantitative linguistic study of DNA sequences
S.P. Li, Ka-Lok Ng and M.C. Chung
Physica A: Statistical Mechanics and its Applications, 2003, vol. 321, issue 1, 189-192
Abstract:
A new family of compound Poisson distribution functions from quantitative linguistics is used to study linguistic features of DNA sequences that go beyond Zipf's law. Both the relative frequency distribution of n-tuples and the results of a compositional segmentation study can be fitted reasonably well with this family of distribution functions. Moreover, the absolute values of the relative frequencies emerge naturally from the linguistic model without ambiguity. This suggests that DNA sequences have features resembling those of natural language and may be modeled with linguistic methodology.
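The two quantities named in the abstract and keywords can be illustrated with a minimal Python sketch. It is not the authors' code: the function names, the use of overlapping windows for n-tuples, and the length-weighted form of the Jensen–Shannon divergence over the four nucleotides are all assumptions made for illustration. It also does not fit the compound Poisson family itself, whose functional form is not given in this record.

```python
from collections import Counter
from math import log2


def ntuple_relative_frequencies(seq, n):
    """Return (n-tuple, relative frequency) pairs, most frequent first.

    Assumption: n-tuples are counted over overlapping windows of length n.
    """
    counts = Counter(seq[i:i + n] for i in range(len(seq) - n + 1))
    total = sum(counts.values())
    # Sort by decreasing count, breaking ties alphabetically for determinism.
    ranked = sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))
    return [(word, count / total) for word, count in ranked]


def shannon_entropy(probs):
    """Shannon entropy in bits; zero-probability terms contribute nothing."""
    return -sum(p * log2(p) for p in probs if p > 0)


def jensen_shannon_divergence(left, right, alphabet="ACGT"):
    """Length-weighted Jensen-Shannon divergence between two segments.

    Assumption: each segment is non-empty and is summarized by its
    single-nucleotide composition over `alphabet`.
    """
    n_left, n_right = len(left), len(right)
    n_total = n_left + n_right
    c_left, c_right = Counter(left), Counter(right)
    p_left = [c_left[a] / n_left for a in alphabet]
    p_right = [c_right[a] / n_right for a in alphabet]
    p_mix = [(c_left[a] + c_right[a]) / n_total for a in alphabet]
    return (shannon_entropy(p_mix)
            - (n_left / n_total) * shannon_entropy(p_left)
            - (n_right / n_total) * shannon_entropy(p_right))
```

Calling `ntuple_relative_frequencies(seq, n)` gives the rank-ordered relative frequencies to which a candidate distribution could be compared. In a segmentation setting, `jensen_shannon_divergence(seq[:k], seq[k:])` could be evaluated over candidate cut points `k` to find the position where the two sides differ most in composition.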
Keywords: DNA segmentation; Statistical linguistic; Compound Poisson distribution; Jensen–Shannon divergence measure
Date: 2003
Downloads: http://www.sciencedirect.com/science/article/pii/S0378437102017879
Full text for ScienceDirect subscribers only. The journal offers the option of making the article available online on ScienceDirect for a fee of $3,000.
Persistent link: https://EconPapers.repec.org/RePEc:eee:phsmap:v:321:y:2003:i:1:p:189-192
DOI: 10.1016/S0378-4371(02)01787-9
Physica A: Statistical Mechanics and its Applications is currently edited by K. A. Dawson, J. O. Indekeu, H.E. Stanley and C. Tsallis