|
Application specific web log pre-processingAbstract: Web Usage Mining discovers interesting patterns in accesses to various Web pages within the Web space associated with a particular server. The Web Usage Mining architecture divides the process into two main parts- the first part includes pre-processing, transaction identification, and data integration components. The second part includes the largely domain independent application of generic data mining and pattern matching. Nearly 80% of mining efforts often spend to improve the quality of data. All application removes error log, css file log etc., but pre-processing also depends on application. This paper presents customized web log pre-processing which reduces size of logs to be mined
|