Analysis of Enterprise Social Media Intelligence Acquisition Based on Data Crawler Technology
Yu Lehe and Gui Zhengxiu
Additional contact information
Yu Lehe: School of Economics, Huazhong University of Science and Technology, Wuhan 430074, China
Gui Zhengxiu: WuChang Housing Security and Management Bureau, Wuhan 430000, China
Entrepreneurship Research Journal, 2021, vol. 11, issue 2, 3-23
Abstract:
Social media platforms typically contain hundreds of millions of nodes, which are linked into a huge social network through following and follower relationships, and news spreads through this network. This paper studies techniques for acquiring social media topic data and enterprise data. It introduces a topic positioning technique based on Sina meta search and topic-related keywords, and analyzes the crawling efficiency of topic crawlers. To cope with the diverse and changeable structure of web pages on the Internet, the paper studies the general regularities in web page structure and proposes a new Web information extraction algorithm that combines the DOM (Document Object Model) tree with the DBSCAN (Density-Based Spatial Clustering of Applications with Noise) algorithm. The main steps of the algorithm are described in detail: web page processing, DOM tree construction, segmented text content acquisition, and web content extraction based on DBSCAN. The simulation results show that, with information extracted using the DOM tree and DBSCAN, a collaborative strategy spanning intelligence culture, intelligence system, technology platform and intelligence organization ecology can raise the level of intelligence participation of all employees, and that the level of participation is significantly and positively correlated with the level of the intelligence environment for all employees. According to these results, the DOM tree and DBSCAN extraction method proposed in this paper can extract an enterprise's employee intelligence and support the effective implementation of the relevant collaborative strategies, which provides guidance for implementing employee intelligence effectively.
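The abstract names the pipeline steps (page processing, DOM tree construction, segmented text acquisition, DBSCAN-based extraction) but not their details. The sketch below is one possible reading of that pipeline and is not the authors' algorithm. Several things in it are assumptions made only for illustration: the feature vector (DOM depth, log of text length), the parameters `eps` and `min_pts`, the rule that the densest cluster by total text length is the main content, and all names such as `TextBlockParser`, `extract_main_text` and `SAMPLE_HTML`. It uses only the Python standard library.

```python
import math
from html.parser import HTMLParser

SKIP_TAGS = {"script", "style"}                       # text inside these is never content
VOID_TAGS = {"br", "img", "hr", "meta", "link", "input"}  # tags with no closing tag


class TextBlockParser(HTMLParser):
    """Track open tags (a stand-in for DOM depth) and collect text segments."""

    def __init__(self):
        super().__init__()
        self.stack = []   # currently open tags; its length is the depth
        self.blocks = []  # (depth, text length, text) per text segment

    def handle_starttag(self, tag, attrs):
        if tag not in VOID_TAGS:
            self.stack.append(tag)

    def handle_endtag(self, tag):
        # Pop back to the matching open tag, tolerating unclosed children.
        if tag in self.stack:
            while self.stack.pop() != tag:
                pass

    def handle_data(self, data):
        text = " ".join(data.split())
        if text and not SKIP_TAGS.intersection(self.stack):
            self.blocks.append((len(self.stack), len(text), text))


def dbscan(points, eps, min_pts):
    """Plain DBSCAN. Returns one label per point; -1 marks noise."""
    labels = [None] * len(points)

    def neighbours(i):
        return [j for j, q in enumerate(points) if math.dist(points[i], q) <= eps]

    cluster = -1
    for i in range(len(points)):
        if labels[i] is not None:
            continue
        seeds = neighbours(i)
        if len(seeds) < min_pts:
            labels[i] = -1          # provisional noise, may become a border point
            continue
        cluster += 1
        labels[i] = cluster
        queue = [j for j in seeds if j != i]
        while queue:
            j = queue.pop()
            if labels[j] == -1:
                labels[j] = cluster  # noise reached from a core point: border point
            if labels[j] is not None:
                continue
            labels[j] = cluster
            nbrs = neighbours(j)
            if len(nbrs) >= min_pts:  # j is a core point, keep expanding
                queue.extend(nbrs)
    return labels


def extract_main_text(html, eps=1.0, min_pts=2):
    """Cluster text segments and return those in the cluster with the most text."""
    parser = TextBlockParser()
    parser.feed(html)
    parser.close()
    blocks = parser.blocks
    # Assumed features: DOM depth and log text length.
    points = [(float(depth), math.log(length)) for depth, length, _ in blocks]
    labels = dbscan(points, eps, min_pts)
    totals = {}
    for label, (_, length, _) in zip(labels, blocks):
        if label != -1:
            totals[label] = totals.get(label, 0) + length
    if not totals:
        return []
    best = max(totals, key=totals.get)
    return [text for label, (_, _, text) in zip(labels, blocks) if label == best]


SAMPLE_HTML = """
<html><body>
<div id="nav"><ul>
  <li><a href="/">Home</a></li><li><a href="/n">News</a></li><li><a href="/a">About</a></li>
</ul></div>
<div id="main">
  <p>Social media platforms connect hundreds of millions of accounts.</p>
  <p>A topic crawler collects pages that match a set of keywords.</p>
  <p>The extractor then separates article text from page chrome.</p>
</div>
<div id="footer"><span>Copyright 2021</span></div>
</body></html>
"""
```

Calling `extract_main_text(SAMPLE_HTML)` on this toy page returns the three paragraph strings: the short navigation links form their own small cluster, and the footer line is labelled noise. Real pages would need more careful features and tuned parameters than this sketch assumes.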
Keywords: corporate social media intelligence; data crawler; DBSCAN; DOM tree; theme crawler
Date: 2021
Downloads: (external link)
https://doi.org/10.1515/erj-2020-0267 (text/html)
For access to full text, subscription to the journal or payment for the individual article is required.
Persistent link: https://EconPapers.repec.org/RePEc:bpj:erjour:v:11:y:2021:i:2:p:3-23:n:1
Ordering information: This journal article can be ordered from
https://www.degruyter.com/journal/key/erj/html
DOI: 10.1515/erj-2020-0267
Entrepreneurship Research Journal is currently edited by Chandra S. Mishra and Ramona K. Zachary
More articles in Entrepreneurship Research Journal from De Gruyter
Bibliographic data for series maintained by Peter Golla.