On an Algorithm for Identifying Sessions from Web Logs
Claudia Elena Dinuca () and
Dumitru Ciobanu ()
Additional contact information
Claudia Elena Dinuca: University of Craiova, Romania
Dumitru Ciobanu: University of Craiova, Romania
Acta Universitatis Danubius. OEconomica, 2011, issue 4(4), 10-15
Abstract:
The quality of decisions is based on the quality of processed data. So it is important that at the beginning of the data mining process to provide correct and quality data. The preprocessing data is a necessity for avoiding the failure of the data analysis. The idea that the data mining process can be done without human supervision has proved to be wrong. Even so, the humans are trying to automate as much as possible the process. From here are resulting many algorithms and techniques that are implemented using various programming language. In this work is presented an algorithm for identifying the sessions from a web logs file. It uses a value of 30 minutes to mark the end of a session and start another. We compute the average time for visiting the pages and using this we show that the presented algorithm produces errors in identifying sessions. We consider that the correct way to identify the session is to take into account the average time for visiting the pages.
Keywords: clickstream analysis; preprocessing data; sessions’ identification (search for similar items in EconPapers)
Date: 2011
References: View references in EconPapers View complete reference list from CitEc
Citations:
Downloads: (external link)
http://journals.univ-danubius.ro/index.php/oeconomica/article/view/942/923 (application/pdf)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:dug:actaec:y:2011:i:4:p:10-15
Access Statistics for this article
More articles in Acta Universitatis Danubius. OEconomica from Danubius University of Galati Contact information at EDIRC.
Bibliographic data for series maintained by Daniela Robu ().