skip to main content
10.1145/2640087.2644161acmotherconferencesArticle/Chapter ViewAbstractPublication PagesbigdatascienceConference Proceedingsconference-collections
research-article

Clustering Experiments on Big Transaction Data for Market Segmentation

Published:04 August 2014Publication History

ABSTRACT

This paper addresses the Volume dimension of Big Data. It presents a preliminary work on finding segments of retailers from a large amount of Electronic Funds Transfer at Point Of Sale (EFTPOS) transaction data. To the best of our knowledge, this is the first time a work on Big EFTPOS Data problem has been reported. A data reduction technique using the RFM (Recency, Frequency, Monetary) analysis as applied to a large data set is presented. Ways to optimise clustering techniques used to segment the big data set through data partitioning and parallelization are explained. Preliminary analysis on the segments of the retailers output from the clustering experiments demonstrates that further drilling down into the retailer segments to find more insights into their business behaviours is warranted.

References

  1. W. R. Smith, "Product differentiation and market segmentation as alternative marketing strategies," The Journal of Marketing, vol. 21, no. 1, pp. 3--8, 1956.Google ScholarGoogle ScholarCross RefCross Ref
  2. C. Doyle, A dictionary of marketing. Oxford University Press, 2011.Google ScholarGoogle Scholar
  3. M. J. Brusco, J. D. Cradit, and A. Tashchian, "Multicriterion clusterwise regression for joint segmentation settings: An application to customer value," Journal of Marketing Research, pp. 225--234, 2003.Google ScholarGoogle ScholarCross RefCross Ref
  4. D. Chen, S. L. Sain, and K. Guo, "Data mining for the online retail industry: A case study of RFM model-based customer segmentation using data mining," Journal of Database Marketing & Customer Strategy Management, vol. 19, no. 3, pp. 197--208, 2012.Google ScholarGoogle ScholarCross RefCross Ref
  5. G. T. Ho, W. Ip, C. Lee, and W. Mou, "Customer grouping for better resources allocation using GA based clustering technique," Expert Systems with Applications, vol. 39, no. 2, pp. 1979--1987, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. M. Namvar, M. R. Gholamian, and S. KhakAbi, "A two phase clustering method for intelligent customer segmentation," in Intelligent Systems, Modelling and Simulation (ISMS), 2010 International Conference on, 2010, pp. 215--219. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. C. Hung and C.-F. Tsai, "Market segmentation based on hierarchical self-organizing map for markets of multimedia on demand," Expert Systems with Applications, vol. 34, no. 1, pp. 780--787, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. S. Alam, G. Dobbie, P. Riddle, and M. A. Naeem, "Particle swarm optimization based hierarchical agglomerative clustering," in Web Intelligence and Intelligent Agent Technology (WI-IAT), 2010 IEEE/WIC/ACM International Conference on, vol. 2, 2010, pp. 64--68. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. A. M. Hughes, Strategic database marketing. McGraw-Hill, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. N.-C. Hsieh, "An integrated data mining and behavioral scoring model for analyzing bank customers," Expert Systems with Applications, vol. 27, no. 4, pp. 623--633, 2004.Google ScholarGoogle ScholarCross RefCross Ref
  11. H.-C. Chang and H.-P. Tsai, "Group RFM analysis as a novel framework to discover better customer consumption behavior," Expert Systems with Applications, vol. 38, no. 12, pp. 14499--14513, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. D. L. Olson, Q. Cao, C. Gu, and D. Lee, "Comparison of customer response models," Service Business, vol. 3, no. 2, pp. 117--130, 2009.Google ScholarGoogle ScholarCross RefCross Ref
  13. G. Lefait and T. Kechadi, "Customer Segmentation Architecture Based on Clustering Techniques," in Digital Society, 2010. ICDS'10. Fourth International Conference on, 2010, pp. 243--248. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Y.-S. Chen, C.-H. Cheng, C.-J. Lai, C.-Y. Hsu, and H.-J. Syu, "Identifying patients in target customer segments using a two-stage clustering-classification approach: A hospital-based assessment," Computers in Biology and Medicine, vol. 42, no. 2, pp. 213--221, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. B. Baesens, S. Viaene, D. Van den Poel, J. Vanthienen, and G. Dedene, "Bayesian neural network learning for repeat purchase modelling in direct marketing," European Journal of Operational Research, vol. 138, no. 1, pp. 191--211, 2002.Google ScholarGoogle ScholarCross RefCross Ref
  16. M. Bizhani and M. J. Tarokh, "Behavioral rules of bank's point-of-sale for segments description and scoring prediction," Int. J. Industrial Eng. Comput, vol. 2, pp. 337--350, 2011.Google ScholarGoogle ScholarCross RefCross Ref
  17. A. Singh, G. Rumantir, and A. South, "Market Segmentation of EFTPOS Retailers", in the Proceedings of the Australasian Data Mining Conference (AusDM 2014), Brisbane, 27--28 November 2014. Conferences in Research and Practice in Information Technology, Vol. 158. (in press).Google ScholarGoogle Scholar
  18. J. Wu, J. Chen, H. Xiong, and M. Xie, "External validation measures for K-means clustering: A data distribution perspective," Expert Systems with Applications, vol. 36, no. 3, pp. 6050--6061, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. C. F. Olson, "Parallel algorithms for hierarchical clustering," Parallel computing, vol. 21, no. 8, pp. 1313--1325, 1995. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Clustering Experiments on Big Transaction Data for Market Segmentation

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Other conferences
        BigDataScience '14: Proceedings of the 2014 International Conference on Big Data Science and Computing
        August 2014
        162 pages
        ISBN:9781450328913
        DOI:10.1145/2640087

        Copyright © 2014 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 4 August 2014

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article
        • Research
        • Refereed limited

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader