A RESEARCH ON RETRIEVING AND PARSING OF MULTIPLE WEB PAGES FOR STORING THEM IN LARGE DATABASES
Cristian Bucur and
Bogdan George Tudorica
Revista Economica, 2012, vol. Supplement, issue 5, 23-30
Abstract:
This paper presents one of the studies we carried out jointly during the research for our Ph.D. theses. Cristian Bucur's thesis aims to study how the knowledge stored in web pages from various sources can be retrieved and classified. Bogdan Tudorica's thesis aims to study ways of managing large quantities of data for various purposes, especially through the use of new technologies such as NoSQL databases. As such, the application described in this paper is a mixed one, combining web page crawling and parsing with data storage in a commonly used NoSQL database.
Date: 2012
Downloads: (external link)
http://economice.ulbsibiu.ro/revista.economica/arc ... nte/Volume5-2012.pdf (application/pdf)
Persistent link: https://EconPapers.repec.org/RePEc:blg:reveco:v:supplement:y:2012:i:5:p:23-30
More articles in Revista Economica from Lucian Blaga University of Sibiu, Faculty of Economic Sciences, Dumbravii Avenue, No. 17, postal code 550324, Sibiu, Romania. Contact information at EDIRC.
Bibliographic data for series maintained by Eduard Alexandru Stoica.