ABSTRACT
This paper presents the BBS - Biased Box Sampling algorithm, a technique that combines dimensionality reduction with biased sampling, which aims at keeping the skewed clustering from the original data.
- M. Ester, H.-P. Kriegel, J. Sander, and X. Xu. A density-based algorithm for discovering clusters in large spatial databases with noise. In Proceedings of the Second International Conference on KDD-96, pages 226--231. AAAI Press, 1996.Google Scholar
- E. P. M. d. Sousa, C. Traina Jr., A. J. M. Traina, and C. Faloutsos. How to use fractal dimension to find correlations between attributes. In First Workshop on Fractals and Self-similarity in Data Mining: Issues and Approaches (in conjunction with 8th ACM SIGKDD), pages 26--30, Edmonton, Alberta, Canada, 2002. ACM Press.Google Scholar
Index Terms
- Biased box sampling - a density-biased sampling for clustering
Recommendations
Density biased sampling: an improved method for data mining and clustering
SIGMOD '00: Proceedings of the 2000 ACM SIGMOD international conference on Management of dataData mining in large data sets often requires a sampling or summarization step to form an in-core representation of the data that can be processed more efficiently. Uniform random sampling is frequently used in practice and also frequently criticized ...
Density biased sampling: an improved method for data mining and clustering
Data mining in large data sets often requires a sampling or summarization step to form an in-core representation of the data that can be processed more efficiently. Uniform random sampling is frequently used in practice and also frequently criticized ...
Indexed-based density biased sampling for clustering applications
Density biased sampling (DBS) has been proposed to address the limitations of Uniform sampling, by producing the desired probability distribution in the sample. The ease of producing a random sample depends on the available mechanism for accessing the ...
Comments