Abstract
This article presents an analysis of five days of workload data from a large Web-based shopping system. The multitier environment of this Web-based shopping system includes Web servers, application servers, database servers, and an assortment of load-balancing and firewall appliances. We characterize user requests and sessions and determine their impact on system performance scalability. The purpose of our study is to assess scalability and support capacity planning exercises for the multitier system. We find that horizontal scalability is not always an adequate mechanism for supporting increased workloads and that personalization and robots can have a significant impact on system scalability.
- ABDELZAHER,T.AND BHATTI, N. 1999. Web server QoS management by adaptive content delivery. Tech. Rep. HPL-1999-161. Hewlett-Packard Laboratories, Palo Alto, CA.Google Scholar
- ALMEIDA, V., RIEDI, R., MENASC~, D., MEIRA, W., RIBEIRO, F., AND FONSECA, R. 2001. Characterizing and modeling robot workload on e-business sites.Google Scholar
- ARLITT,M.AND JIN, T. 2000. Workload characterization of the 1998 World Cup Web site. IEEE Network 14, 3 (May-June), 30-37. Google ScholarDigital Library
- ARLITT,M.AND WILLIAMSON, C. 1997. Internet Web servers: workload characterization and performance implications. IEEE/ACM Trans. Netw. 5, 5, 631-645. Google ScholarDigital Library
- BARFORD, P., BESTAVROS, A., BRADLEY, A., AND CROVELLA, M. 1999. Changes in Web client access patterns. World Wide Web J. 2, 15-28. Google ScholarDigital Library
- BRESLAU, L., CAO, P., FAN, L., PHILLIPS, G., AND SHENKER, S. 1999. Web caching and Zipf-like distributions: Evidence and implications. In Proceedings of the IEEE INFOCOM Conference (New York, NY, Mar.). IEEE Computer Society Press, Los Alamitos, CA.Google ScholarCross Ref
- CAO,P.AND LIU, C. 1998. Maintaining strong cache consistency in the World Wide Web. IEEE Trans. Comput. 47, 4, 445-457. Google ScholarDigital Library
- CHALLENGER, J., IYENGAR, A., AND DANTZIG, P. 1999. A scalable system for consistently caching dynamic Web data. In Proceedings of the IEEE INFOCOM Conference (New York, NY, Mar.). IEEE Computer Society Press, Los Alamitos, CA.Google ScholarCross Ref
- CUNHA, C., BESTAVROS, A., AND CROVELLA, M. 1995. Characteristics of WWW client-based traces. Tech. Rep. TR-95-010. Boston University, Boston, MA. Google Scholar
- DILLEY, J., ARLITT, M., PERRET, S., AND JIN, T. 1999. The distributed object consistency protocol: Version 1.0. Tech. Rep. HPL-1999-109. Hewlett-Packard Laboratories, Palo Alto, CA.Google Scholar
- HARTIGAN, J. 1975. Clustering Algorithms. John Wiley and Sons, Inc., New York, NY. Google ScholarDigital Library
- JAIN, R. 1991. The Art of Computer Systems Performance Analysis: Techniques for Experimen-tal Design, Measurement, Simulation, and Modeling. John Wiley and Sons, Inc., New York, NY.Google Scholar
- KAUFMAN,L.AND ROUSSEEUW, P. J. 1990. Finding Groups in Data. John Wiley and Sons, Inc., New York, NY.Google Scholar
- KOSTER, M. 1994. A standard for robot exclusion. Tech. Rep.Google Scholar
- LEE,J.AND PODLASECK, M. 2000. Visualization and analysis of clickstream data of online stores for understanding Web merchandising. Int. J. Data Mining Knowl. Discovery. Google ScholarDigital Library
- MENASC~,D.AND ALMEIDA, V. 2000. Scaling for E-Business. Prentice-Hall, Inc., Englewood Cliffs, NJ. Google ScholarDigital Library
- MENASC~, D., ALMEIDA, V., FONSECA, R., AND MENDES, M. 1999. A methodology for workload characterization of e-commerce sites. In Proceedings of the ACM Conference on Electronic Commerce (Denver, CO, Nov.). ACM Press, New York, NY. Google ScholarDigital Library
- MENASC~, D., ALMEIDA, V., RIEDI, R., RIBEIRO, F., FONSECA, R., AND MERIA, W. 2000. In search of invariants for e-business workloads. In Proceedings of the ACM Conference on Electronic Commerce (Minneapolis, MN, Oct.). ACM Press, New York, NY. Google ScholarDigital Library
- MOGUL, J., DOUGLIS, F., FELDMANN, A., AND KRISHNAMURTHY, B. 1997. Potential benefits of delta encoding and data compression for HTTP. SIGCOMM Comput. Commun. Rev. 27,4, 181-194. Google ScholarDigital Library
- PADMANABHAN,V.AND QUI, L. 2000. The content and access dynamics of a busy Web site: Findings and implications. In Proceedings of the ACM SIGCOMM Conference (Stockholm, Sweden, Aug.). ACM Press, New York, NY, 111-123. Google ScholarDigital Library
- REICHHELD,F.AND SASSER, W. 1990. Zero defections: Quality comes to services. Harvard Bus. Rev. (Sept.-Oct.).Google Scholar
- VANDERMEER, D., DUTTA, K., DATTA, A., RAMAMRITHAM, K., AND NAVATHE, S. 2000. Enabling scalable online personalization on the Web. In Proceedings of the ACM Conference on Electronic Commerce (Minneapolis, MN, Oct.). ACM Press, New York, NY. Google ScholarDigital Library
- WANG, J. 1999. A survey of Web caching schemes for the Internet. SIGCOMM Comput. Commun. Rev. 29, 5 (Oct.), 36-46. Google ScholarDigital Library
- YIN, J., ALVISI, L., DAHLIN, M., AND LIN, C. 1999. Hierarchical cache consistency in WAN. In Proceedings of the Second USENIX Symposium on Internet Technologies and Systems (Boulder, CO, Oct.). USENIX Assoc., Berkeley, CA, 13-24. Google ScholarDigital Library
- YU, H., BRESLAU, L., AND SCHENKER, S. 1999. A scalable Web cache consistency architecture. In Proceedings of the ACM SIGCOMM Conference (Cambridge, MA, Sept.). ACM Press, New York, NY, 163-174. Google ScholarDigital Library
Index Terms
- Characterizing the scalability of a large web-based shopping system
Recommendations
Feature-based recommendations for one-to-one marketing
Most recommendation systems face challenges from products that change with time, such as popular or seasonal products, since traditional market basket analysis or collaborative filtering analysis are unable to recommend new products to customers due to ...
A Study on the Factors Affecting the User Intention of Omnichannel Shopping Based on Information Technology
ICEBA 2019: Proceedings of the 2019 5th International Conference on E-Business and ApplicationsTechnical advance and its spread brought diversification of distribution channels. It is not unusual that retailers deploy omnichannel shopping environment. Retailing companies are responding to markets with new strategies which are caused by appearance ...
Thin-client Web access patterns: Measurements from a cache-busting proxy
This paper describes a new technique for measuring Web client request patterns and analyzes a large client trace collected using the new method. In this approach, a modified proxy intercepts requests and serves all responses to clients marked ...
Comments