%0 Journal Article %T An On-line Density-based Clustering Algorithm for Spatial Data Stream
一种基于密度的空间数据流在线聚类算法 %A YU Yan-Wei %A WANG Qin %A KUANG Jun %A HE Jie %A
于彦伟 %A 王沁 %A 邝俊 %A 何杰 %J 自动化学报 %D 2012 %I %X We propose an efficient online density-based clustering algorithm (On-line density-based clustering algorithm for spatial data stream, OLDStream), which is designed for online discovering clusters in spatial data stream. In OLDStream, only the new spatial point and its adjunct points which satisfy core point are processed in clustering update. And the overall clusters results can be accessed instantaneously. The developed algorithm has exhibited many advantages such as its high scalability to online process incremental large-scale spatial data, its capability to discover overall clusters with arbitrary shape instantaneously, its insensitivity to the input sequence of data stream, and its capability to detect all isolated points. An experimental evaluation of the effectiveness, efficiency and scalability of our algorithm was performed by using real data and large synthetic data from Matlab and Thomas Brinkhoff's network-based generator. Experimental results vividly demonstrated that our algorithm can fast and efficiently cluster new points based on the previous points. The statistics of the results showed that only 4% of the points take the worst case running time, and the average running time is about 0.033 ms for each point process. %K Spatial data mining %K clustering data stream %K density-based clustering %K on-line algorithm %K handling noise
空间数据挖掘 %K 聚类数据流 %K 基于密度的聚类 %K 在线算法 %K 噪声处理 %U http://www.alljournals.cn/get_abstract_url.aspx?pcid=5B3AB970F71A803DEACDC0559115BFCF0A068CD97DD29835&cid=8240383F08CE46C8B05036380D75B607&jid=E76622685B64B2AA896A7F777B64EB3A&aid=B39D47F72CB13A763CC0452CE070845A&yid=99E9153A83D4CB11&vid=16D8618C6164A3ED&iid=B31275AF3241DB2D&sid=14B92F3C984CB5EA&eid=A326383D2F3B3AB0&journal_id=0254-4156&journal_name=自动化学报&referenced_num=0&reference_num=21