Identifying the “Ghost City” of domain topics in a keyword semantic space combining citations
Kai Hu (),
Kunlun Qi,
Siluo Yang,
Shengyu Shen,
Xiaoqiang Cheng,
Huayi Wu (),
Jie Zheng,
Stephen McClure and
Tianxing Yu
Additional contact information
Kai Hu: Wuhan University
Kunlun Qi: China University of Geosciences (Wuhan)
Siluo Yang: Wuhan University
Shengyu Shen: Yangtze River Scientific Research Institute
Xiaoqiang Cheng: Hubei University
Huayi Wu: Wuhan University
Jie Zheng: Wuhan University
Stephen McClure: Wuhan University
Tianxing Yu: Wuhan University
Scientometrics, 2018, vol. 114, issue 3, No 19, 1157 pages
Abstract:
Abstract As an increasing number of scientific literature dataset are open access, more attention has gravitated to keyword analysis in many scientific fields. Traditional keyword analyses include the frequency based and the network based methods, both providing efficient mining techniques for identifying the representative keywords. The semantic meanings behind the keywords are important for understanding the research content. However, traditional keyword analysis methods pay scant attention to semantic meanings; the network based or frequency based methods as traditionally used, present limited semantic associations among the keywords. Moreover, the ways in which the semantic meanings behind the keywords are associated to the citations are not clear. Thus, we use the Google Word2Vec model to build word vectors and reduce them to a two-dimensional plane in a Voronoi diagram using the t-SNE algorithm, to link meanings with citations. The distance between semantic meanings of keywords in two-dimensional plane are similar to distances in geographical space, thus we introduce a geographic metaphor, “Ghost City” to describe the relationship between semantics and citations for hot topics that have recently become not so hot. Along with “Ghost City” zones, “Always Hot”, “Newly Emerging Hot”, and “Always Silent” areas are classified and mapped, describing the spatial heterogeneity and homogeneity of the semantic distribution of keywords cited in a domain database. Using a collection of “geographical natural hazard” literature datasets, we demonstrate that the proposed method and classification scheme can efficiently provide a unique viewpoint for interpreting the interaction between semantics and the citations, as “Ghost City”, “Always Hot”, “Newly Emerging Hot”, and “Always Silent” areas.
Keywords: Ghost City; Semantic space; Keyword analysis; Word2Vec; t-SNE; Spatial analysis (search for similar items in EconPapers)
Date: 2018
References: View references in EconPapers View complete reference list from CitEc
Citations:
Downloads: (external link)
http://link.springer.com/10.1007/s11192-017-2604-7 Abstract (text/html)
Access to the full text of the articles in this series is restricted.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:spr:scient:v:114:y:2018:i:3:d:10.1007_s11192-017-2604-7
Ordering information: This journal article can be ordered from
http://www.springer.com/economics/journal/11192
DOI: 10.1007/s11192-017-2604-7
Access Statistics for this article
Scientometrics is currently edited by Wolfgang Glänzel
More articles in Scientometrics from Springer, Akadémiai Kiadó
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().