Abstract The explosion of WWW traffic necessitates an accurate picture of WWW use, and in particular requires a good understanding of client requests for WWW documents. To address this need, we have collected traces of actual executions of NCSA Mosaic, reflecting over half a million user requests for WWW documents. In this paper we present a descriptive statistical summary of the traces we collected, which identifies a number of trends and reference patterns in WWW use. In particular, we show that many characteristics of WWW use can be modelled using power-law distributions, including the distribution of document sizes, the popularity of documents as a function of size, the distribution of user requests for documents, and the number of references to documents as a function of their overall rank in popularity (Zipf''s law). In addition, we show how the power-law distributions derived from our traces can be used to guide system designers interested in caching WWW documents. --- Our client-based traces are available via FTP from http://www.cs.bu.edu/techreports/1995-010-www-client-traces.tar.gz http://www.cs.bu.edu/techreports/1995-010-www-client-traces.a.tar.gz
Cited By
- Novakovic S, Daglis A, Ustiugov D, Bugnion E, Falsafi B and Grot B (2019). Mitigating Load Imbalance in Distributed Data Serving with Rack-Scale Memory Pooling, ACM Transactions on Computer Systems, 36:2, (1-37), Online publication date: 31-May-2018.
- Beckmann N, Chen H and Cidon A LHD Proceedings of the 15th USENIX Conference on Networked Systems Design and Implementation, (389-403)
- Liu M, Zhang M, Chen K, Qian X, Wu Y, Zheng W and Ren J (2018). DudeTx, ACM Transactions on Storage, 14:1, (1-28), Online publication date: 4-Apr-2018.
- Gellert A (2017). Web access mining through dynamic decision trees with markovian features, Journal of Web Engineering, 16:5-6, (524-536), Online publication date: 1-Sep-2017.
- Lin Y and Shen H (2017). EAFR, IEEE Transactions on Parallel and Distributed Systems, 28:4, (1017-1030), Online publication date: 1-Apr-2017.
- Novakovic S, Daglis A, Bugnion E, Falsafi B and Grot B The Case for RackOut Proceedings of the Seventh ACM Symposium on Cloud Computing, (182-195)
- Gellert A and Florea A (2016). Web prefetching through efficient prediction by partial matching, World Wide Web, 19:5, (921-932), Online publication date: 1-Sep-2016.
- Liu Y and Williamson C Workload Study of a Media-Rich Educational Web Site Proceedings of the 25th International Conference Companion on World Wide Web, (495-500)
- Simkin M and Roychowdhury V (2015). Why does attention to web articles fall with Time?, Journal of the Association for Information Science and Technology, 66:9, (1847-1856), Online publication date: 1-Sep-2015.
- Gellert A and Florea A (2014). Web page prediction enhanced with confidence mechanism, Journal of Web Engineering, 13:5-6, (507-524), Online publication date: 1-Nov-2014.
- Papastavrou S, Chrysanthis P and Samaras G (2014). Performance vs. freshness in web database applications, World Wide Web, 17:5, (969-995), Online publication date: 1-Sep-2014.
- Al-Arnaout Z, Fu Q and Frean M Exploiting graph partitioning for hierarchical replica placement in WMNs Proceedings of the 16th ACM international conference on Modeling, analysis & simulation of wireless and mobile systems, (5-14)
- Abad C, Yuan M, Cai C, Lu Y, Roberts N and Campbell R (2013). Generating request streams on Big Data using clustered renewal processes, Performance Evaluation, 70:10, (704-719), Online publication date: 1-Oct-2013.
- Papastavrou S, Chrysanthis P and Samaras G Exploring content dependencies to better balance performance and freshness in web database applications Proceedings of the 13th international conference on Web Information Systems Engineering, (512-525)
- Wan M, Jönsson A, Wang C, Li L and Yang Y (2012). Web user clustering and Web prefetching using Random Indexing with weight functions, Knowledge and Information Systems, 33:1, (89-115), Online publication date: 1-Oct-2012.
- Zhang Y and Årvidsson A (2012). Understanding the characteristics of cellular data traffic, ACM SIGCOMM Computer Communication Review, 42:4, (461-466), Online publication date: 24-Sep-2012.
- Zhang Y and Årvidsson A Understanding the characteristics of cellular data traffic Proceedings of the 2012 ACM SIGCOMM workshop on Cellular networks: operations, challenges, and future design, (13-18)
- Atikoglu B, Xu Y, Frachtenberg E, Jiang S and Paleczny M Workload analysis of a large-scale key-value store Proceedings of the 12th ACM SIGMETRICS/PERFORMANCE joint international conference on Measurement and Modeling of Computer Systems, (53-64)
- Atikoglu B, Xu Y, Frachtenberg E, Jiang S and Paleczny M (2012). Workload analysis of a large-scale key-value store, ACM SIGMETRICS Performance Evaluation Review, 40:1, (53-64), Online publication date: 7-Jun-2012.
- Al-Arnaout Z, Fu Q and Frean M A content replication scheme for wireless mesh networks Proceedings of the 22nd international workshop on Network and Operating System Support for Digital Audio and Video, (39-44)
- Lymberopoulos D, Riva O, Strauss K, Mittal A and Ntoulas A (2012). PocketWeb, ACM SIGPLAN Notices, 47:4, (1-12), Online publication date: 1-Jun-2012.
- Lymberopoulos D, Riva O, Strauss K, Mittal A and Ntoulas A (2012). PocketWeb, ACM SIGARCH Computer Architecture News, 40:1, (1-12), Online publication date: 18-Apr-2012.
- Lymberopoulos D, Riva O, Strauss K, Mittal A and Ntoulas A PocketWeb Proceedings of the seventeenth international conference on Architectural Support for Programming Languages and Operating Systems, (1-12)
- Kastaniotis G, Maragos E, Douligeris C and Despotis D (2012). Using data envelopment analysis to evaluate the efficiency of web caching object replacement strategies, Journal of Network and Computer Applications, 35:2, (803-817), Online publication date: 1-Mar-2012.
- Gill P, Arlitt M, Carlsson N, Mahanti A and Williamson C (2011). Characterizing Organizational Use of Web-Based Services, ACM Transactions on the Web, 5:4, (1-23), Online publication date: 1-Oct-2011.
- Wan M, Jönsson A, Wang C, Li L and Yang Y A random indexing approach for web user clustering and web prefetching Proceedings of the 15th international conference on New Frontiers in Applied Data Mining, (40-52)
- Xie T and Sun Y (2011). Understanding the relationship between energy conservation and reliability in parallel disk arrays, Journal of Parallel and Distributed Computing, 71:2, (198-210), Online publication date: 1-Feb-2011.
- Liao G, Bhuyan L, Wu W, Yu H and King S A new TCB cache to efficiently manage TCP sessions for web servers Proceedings of the 6th ACM/IEEE Symposium on Architectures for Networking and Communications Systems, (1-10)
- Otoo E, Rotem D and Tsao S Dynamic data reorganization for energy savings in disk storage systems Proceedings of the 22nd international conference on Scientific and statistical database management, (322-341)
- Lee M, Kompella R and Singh S AjaxTracker Proceedings of the 2010 USENIX conference on Web application development, (2-2)
- Reda A, Noble B and Haile Y Distributing private data in challenged network environments Proceedings of the 19th international conference on World wide web, (801-810)
- Xie T and Sun Y (2009). A file assignment strategy independent of workload characteristic assumptions, ACM Transactions on Storage, 5:3, (1-24), Online publication date: 1-Nov-2009.
- Beauvisage T (2009). The dynamics of personal territories on the web, ACM SIGWEB Newsletter, 2009:Autumn, (1-10), Online publication date: 1-Sep-2009.
- Beauvisage T The dynamics of personal territories on the web Proceedings of the 20th ACM conference on Hypertext and hypermedia, (25-34)
- Freire E, Ziviani A and Salles R (2009). On Metrics to Distinguish Skype Flows from HTTP Traffic, Journal of Network and Systems Management, 17:1-2, (53-72), Online publication date: 1-Jun-2009.
- Iamnitchi A, Doraimani S and Garzoglio G (2009). Workload characterization in a high-energy data grid and impact on resource management, Cluster Computing, 12:2, (153-173), Online publication date: 1-Jun-2009.
- Zeng Z and Veeravalli B (2009). A novel distributed architecture of large-scale multimedia storage system using autonomous object-based storage devices, Journal of Parallel and Distributed Computing, 69:4, (349-359), Online publication date: 1-Apr-2009.
- Jónsson K HttpTools Proceedings of the 2nd International Conference on Simulation Tools and Techniques, (1-8)
- Kaya C, Zhang G, Tan Y and Mookerjee V (2009). An admission-control technique for delay reduction in proxy caching, Decision Support Systems, 46:2, (594-603), Online publication date: 1-Jan-2009.
- Kotera I, Egawa R, Takizawa H and Kobayashi H Modeling of cache access behavior based on Zipf's law Proceedings of the 9th workshop on MEmory performance: DEaling with Applications, systems and architecture, (9-15)
- Otoom M and Paul J Holistic design and caching in mobile computing Proceedings of the 6th IEEE/ACM/IFIP international conference on Hardware/Software codesign and system synthesis, (115-120)
- Sieminski A (2008). Usefulness of local buffer data for WWW objects prefetching, International Journal of Intelligent Information and Database Systems, 2:3, (320-335), Online publication date: 1-Sep-2008.
- Siemiński A Mining Local Buffer Data Proceedings of the 2008 conference on New Trends in Multimedia and Network Information Systems, (81-94)
- Doraimani S and Iamnitchi A File grouping for scientific data management Proceedings of the 17th international symposium on High performance distributed computing, (153-164)
- Liu Q and Solis-Oba R Web prefetching with high accuracy and low memory cost Proceedings of the WSEAS International Conference on Applied Computing Conference, (114-119)
- Weinreich H, Obendorf H, Herder E and Mayer M (2008). Not quite the average, ACM Transactions on the Web, 2:1, (1-31), Online publication date: 1-Feb-2008.
- Patil J and Pawar B Trace driven simulation of GDSF# and existing caching algorithms for web proxy servers Proceedings of the 9th WSEAS International Conference on Data Networks, Communications, Computers, (378-384)
- Gill P, Arlitt M, Li Z and Mahanti A Youtube traffic characterization Proceedings of the 7th ACM SIGCOMM conference on Internet measurement, (15-28)
- Kim J, Choi G and Das C (2007). An SSL Back-End Forwarding Scheme in Cluster-Based Web Servers, IEEE Transactions on Parallel and Distributed Systems, 18:7, (946-957), Online publication date: 1-Jul-2007.
- Aweya J, Ouellette M and Montuno D (2007). A simple mechanism for stabilizing network queues in TCP/IP networks, International Journal of Network Management, 17:4, (275-286), Online publication date: 1-Jul-2007.
- Keogh-Brown M and Bogacka B (2007). WWW traffic measure and its properties, Intelligent Data Analysis, 11:2, (137-154), Online publication date: 1-Apr-2007.
- Quintero A, Li D and Castro H (2007). A location routing protocol based on smart antennas for ad hoc networks, Journal of Network and Computer Applications, 30:2, (614-636), Online publication date: 1-Apr-2007.
- Busson A, Kofman D and Rougier J (2007). A new service overlays dimensioning approach based on stochastic geometry, Performance Evaluation, 64:1, (76-92), Online publication date: 1-Jan-2007.
- Branch P and Armitage G Extrapolating server to client IP traffic from empirical measurements of first person shooter games Proceedings of 5th ACM SIGCOMM workshop on Network and system support for games, (24-es)
- Kules B, Kustanowitz J and Shneiderman B Categorizing web search results into meaningful and stable categories using fast-feature techniques Proceedings of the 6th ACM/IEEE-CS joint conference on Digital libraries, (210-219)
- Du B, Demmer M and Brewer E Analysis of WWW traffic in Cambodia and Ghana Proceedings of the 15th international conference on World Wide Web, (771-780)
- Weinreich H, Obendorf H, Herder E and Mayer M Off the beaten tracks Proceedings of the 15th international conference on World Wide Web, (133-142)
- Shi W and Mao Y (2006). Performance evaluation of peer-to-peer Web caching systems, Journal of Systems and Software, 79:5, (714-726), Online publication date: 1-May-2006.
- Li K, Shen H, Tajima K and Huang L (2006). An effective cache replacement algorithm in transcoding-enabled proxies, The Journal of Supercomputing, 35:2, (165-184), Online publication date: 1-Feb-2006.
- Wu B and Kshemkalyani A (2006). Objective-Optimal Algorithms for Long-Term Web Prefetching, IEEE Transactions on Computers, 55:1, (2-17), Online publication date: 1-Jan-2006.
- de Lara E, Chopra Y, Kumar R, Vaghela N, Wallach D and Zwaenepoel W (2005). Iterative Adaptation for Mobile Clients Using Existing APIs, IEEE Transactions on Parallel and Distributed Systems, 16:10, (966-981), Online publication date: 1-Oct-2005.
- Li K, Tajima K and Shen H Cache Replacement for Transcoding Proxy Caching Proceedings of the 2005 IEEE/WIC/ACM International Conference on Web Intelligence, (500-507)
- Santhanakrishnan G, Amer A and Chrysanthis P Towards universal mobile caching Proceedings of the 4th ACM international workshop on Data engineering for wireless and mobile access, (73-80)
- Beauvisage T and Assadi H From user-centric web traffic data to usage data Special interest tracks and posters of the 14th international conference on World Wide Web, (1086-1087)
- Teng W, Chang C and Chen M (2005). Integrating Web Caching and Web Prefetching in Client-Side Proxies, IEEE Transactions on Parallel and Distributed Systems, 16:5, (444-455), Online publication date: 1-May-2005.
- Yin L, Cao G and Cai Y (2005). A generalized target-driven cache replacement policy for mobile environments, Journal of Parallel and Distributed Computing, 65:5, (583-594), Online publication date: 1-May-2005.
- Downey A (2005). Lognormal and Pareto distributions in the Internet, Computer Communications, 28:7, (790-801), Online publication date: 1-May-2005.
- Song H and Cao G (2005). Cache-miss-initiated prefetch in mobile environments, Computer Communications, 28:7, (741-753), Online publication date: 1-May-2005.
- Park J and Chong K An implementation of the client-based distributed web caching system Proceedings of the 7th Asia-Pacific web conference on Web Technologies Research and Development, (759-770)
- Marques Neto H, Almeida J, Rocha L, Meira W, Guerra P and Almeida V (2004). A characterization of broadband user behavior and their e-business activities, ACM SIGMETRICS Performance Evaluation Review, 32:3, (3-13), Online publication date: 1-Dec-2004.
- Sripanidkulchai K, Maggs B and Zhang H An analysis of live streaming workloads on the internet Proceedings of the 4th ACM SIGCOMM conference on Internet measurement, (41-54)
- Marques H, Rocha L, Guerra P, Almeida J, Meira W and Almeida V Characterizing broadband user behavior Proceedings of the 2004 ACM workshop on Next-generation residential broadband challenges, (11-18)
- Datta A, Dutta K, Thomas H, Vandermeer D and Ramamritham K (2004). Proxy-based acceleration of dynamically generated content on the world wide web, ACM Transactions on Database Systems, 29:2, (403-443), Online publication date: 1-Jun-2004.
- Liu J, Zhang S and Yang J (2004). Characterizing Web Usage Regularities with Information Foraging Agents, IEEE Transactions on Knowledge and Data Engineering, 16:5, (566-584), Online publication date: 1-May-2004.
- Shu W and Wu M (2004). Resource Requirements of Closed-Loop Video Delivery Services, IEEE MultiMedia, 11:2, (24-37), Online publication date: 1-Apr-2004.
- Santhanakrishnan G, Amer A, Chrysanthis P and Li D GD-GhOST Proceedings of the 2004 ACM symposium on Applied computing, (1141-1145)
- González-Arévalo B (2004). Performance of a Leaky Bucket System with Long-Range Dependent Input Traffic, Queueing Systems: Theory and Applications, 46:3/4, (439-459), Online publication date: 1-Mar-2004.
- Selvakumar S, Kumar Sahoo S and Venkatasubramani V (2004). Delay sensitive least frequently used algorithm for replacement in web caches, Computer Communications, 27:3, (322-326), Online publication date: 1-Feb-2004.
- Manivel V, Ahamad M and Venkateswaran H Attack resistant cache replacement for survivable services Proceedings of the 2003 ACM workshop on Survivable and self-regenerative systems: in association with 10th ACM Conference on Computer and Communications Security, (64-71)
- Wong A, Ip M and Wu R (2003). A novel dynamic cache size adjustment approach for better data retrieval performance over the internet, Computer Communications, 26:14, (1709-1720), Online publication date: 1-Sep-2003.
- Hadjiefthymiades S and Merakos L (2003). Proxies + path prediction, Mobile Networks and Applications, 8:4, (389-399), Online publication date: 1-Aug-2003.
- Anastasi G, Conti M, Gregori E and Passarella A (2003). Performance comparison of power-saving strategies for mobile web access, Performance Evaluation, 53:3-4, (273-294), Online publication date: 1-Aug-2003.
- Chang C and Chen M (2003). On Exploring Aggregate Effect for Efficient Cache Replacement in Transcoding Proxies, IEEE Transactions on Parallel and Distributed Systems, 14:6, (611-624), Online publication date: 1-Jun-2003.
- Pandey S, Ramamritham K and Chakrabarti S Monitoring the dynamic web to respond to continuous queries Proceedings of the 12th international conference on World Wide Web, (659-668)
- Prabhakaran B, Tu Y and Wu Y (2003). Experiences with an object-level scalable web framework, Journal of Network and Computer Applications, 26:2, (163-196), Online publication date: 1-Apr-2003.
- Floyd S and Kohler E (2003). Internet research needs better models, ACM SIGCOMM Computer Communication Review, 33:1, (29-34), Online publication date: 1-Jan-2003.
- Yang Q, Huang J and Ng M (2003). A data cube model for prediction-based web prefetching, Journal of Intelligent Information Systems, 20:1, (11-30), Online publication date: 1-Jan-2003.
- Chang S (2002). Caching Strategy and Service Policy Optimization in a Cache-Satellite Distribution Service, Telecommunications Systems, 21:2-4, (261-281), Online publication date: 1-Dec-2002.
- Veloso E, Almeida V, Meira W, Bestavros A and Jin S A hierarchical characterization of a live streaming media workload Proceedings of the 2nd ACM SIGCOMM Workshop on Internet measurment, (117-130)
- Iheagwara C and Blyth A (2002). The impact of security layering on end-to-end latency and system performance in switched and distributed e-business environments, Computer Networks: The International Journal of Computer and Telecommunications Networking, 39:6, (827-840), Online publication date: 21-Aug-2002.
- Psounis K and Prabhakar B (2002). Efficient randomized web-cache replacement schemes using samples from past eviction times, IEEE/ACM Transactions on Networking, 10:4, (441-455), Online publication date: 1-Aug-2002.
- Davison B Predicting web actions from HTML content Proceedings of the thirteenth ACM conference on Hypertext and hypermedia, (159-168)
- Adya A, Bahl P and Qiu L Characterizing Alert and Browse Services of Mobile Clients Proceedings of the General Track of the annual conference on USENIX Annual Technical Conference, (343-356)
- Datta A, Dutta K, Thomas H, VanderMeer D, Suresha and Ramamritham K Proxy-based acceleration of dynamically generated content on the world wide web Proceedings of the 2002 ACM SIGMOD international conference on Management of data, (97-108)
- Wu M, Ma S and Shu W Scheduled video delivery for scalable on-demand service Proceedings of the 12th international workshop on Network and operating systems support for digital audio and video, (167-175)
- Kelly T and Mogul J Aliasing on the world wide web Proceedings of the 11th international conference on World Wide Web, (281-292)
- Busari M and Williamson C (2002). ProWGen, Computer Networks: The International Journal of Computer and Telecommunications Networking, 38:6, (779-794), Online publication date: 22-Apr-2002.
- Venkataramani A, Yalagandula P, Kokku R, Sharif S and Dahlin M (2002). The potential costs and benefits of long-term prefetching for content distribution, Computer Communications, 25:4, (367-375), Online publication date: 1-Mar-2002.
- Kelly T (2002). Thin-client Web access patterns, Computer Communications, 25:4, (357-366), Online publication date: 1-Mar-2002.
- Doyle R, Chase J, Gadde S and Vahdat A (2002). The Trickle-Down Effect, Computer Communications, 25:4, (345-356), Online publication date: 1-Mar-2002.
- Cothey V (2002). A longitudinal study of World Wide Web users' information-searching behavior, Journal of the American Society for Information Science and Technology, 53:2, (67-78), Online publication date: 15-Jan-2002.
- BUDZIK J, BRADSHAW S, FU X and HAMMOND K (2002). Supporting on-line resource discovery in the context of ongoing tasks with proactive software assistants, International Journal of Human-Computer Studies, 56:1, (47-74), Online publication date: 1-Jan-2002.
- Jin S and Bestavros A (2001). GISMO, ACM SIGMETRICS Performance Evaluation Review, 29:3, (2-10), Online publication date: 1-Dec-2001.
- Mao Y, Chen K, Wang D and Zheng W Cluster-based online monitoring system of web traffic Proceedings of the 3rd international workshop on Web information and data management, (47-53)
- Berfield A, Simons B, Chrysanthis P and Pruhs K Better client OFF time prediction to improve performance in web information systems Proceedings of the 3rd international workshop on Web information and data management, (39-46)
- Adya A, Bahl P and Qiu L Analyzing the browse patterns of mobile clients Proceedings of the 1st ACM SIGCOMM Workshop on Internet measurement, (189-194)
- Xiao J and Zhang Y Clustering of Web Users Using Session-Based Similarity Measures Proceedings of the 2001 International Conference on Computer Networks and Mobile Computing (ICCNMC'01)
- Bi Z, Faloutsos C and Korn F The "DGX" distribution for mining massive, skewed data Proceedings of the seventh ACM SIGKDD international conference on Knowledge discovery and data mining, (17-26)
- Datta A, Dutta K, VanderMeer D, Ramamritham K and Navathe S (2001). An architecture to support scalable online personalization on the Web, The VLDB Journal — The International Journal on Very Large Data Bases, 10:1, (104-117), Online publication date: 1-Aug-2001.
- Staehle D, Leibnitz K and Tsipotis K QoS of internet access with GPRS Proceedings of the 4th ACM international workshop on Modeling, analysis and simulation of wireless and mobile systems, (57-64)
- Carpenter T, Carter R, Cochinwala M and Eiger M (2001). Client-Server Caching with Expiration Timestamps, Distributed and Parallel Databases, 10:1, (5-22), Online publication date: 1-Jul-2001.
- Smith F, Campos F, Jeffay K and Ott D (2001). What TCP/IP protocol headers can tell us about the web, ACM SIGMETRICS Performance Evaluation Review, 29:1, (245-256), Online publication date: 1-Jun-2001.
- Smith F, Campos F, Jeffay K and Ott D What TCP/IP protocol headers can tell us about the web Proceedings of the 2001 ACM SIGMETRICS international conference on Measurement and modeling of computer systems, (245-256)
- Schroeder M, Boro L and McCann J Efficiency improvements for interactions of web-agents Proceedings of the fifth international conference on Autonomous agents, (112-113)
- Hadjiefthymiades S and Merakos L Using proxy cache relocation to accelerate Web browsing in wireless/mobile communications Proceedings of the 10th international conference on World Wide Web, (26-35)
- Paul S and Fei Z (2001). Distributed caching with centralized control, Computer Communications, 24:2, (256-268), Online publication date: 1-Feb-2001.
- Jin S and Bestavros A (2001). GreedyDual* Web caching algorithm, Computer Communications, 24:2, (174-183), Online publication date: 1-Feb-2001.
- Almeida J, Krueger J, Eager D and Vernon M Analysis of educational media server workloads Proceedings of the 11th international workshop on Network and operating systems support for digital audio and video, (21-30)
- Menascé D (2000). Web Performance Modeling Issues, International Journal of High Performance Computing Applications, 14:4, (292-303), Online publication date: 1-Nov-2000.
- VanderMeer D, Dutta K, Datta A, Ramamritham K and Navanthe S Enabling scalable online personalization on the Web Proceedings of the 2nd ACM conference on Electronic commerce, (185-196)
- Padmanabhan V and Qiu L (2000). The content and access dynamics of a busy Web site, ACM SIGCOMM Computer Communication Review, 30:4, (111-123), Online publication date: 1-Oct-2000.
- Padmanabhan V and Qiu L The content and access dynamics of a busy Web site Proceedings of the conference on Applications, Technologies, Architectures, and Protocols for Computer Communication, (111-123)
- de Lara E, Wallach D and Zwaenepoel W Opportunities for bandwidth adaptation in microsoft office documents Proceedings of the 4th conference on USENIX Windows Systems Symposium - Volume 4, (9-9)
- Tuah N, Kumar M and Venkatesh S Performance modelling of speculative prefetching for compound requests in low bandwidth networks Proceedings of the 3rd ACM international workshop on Wireless mobile multimedia, (83-92)
- Wolman A, Voelker M, Sharma N, Cardwell N, Karlin A and Levy H (1999). On the scale and performance of cooperative Web proxy caching, ACM SIGOPS Operating Systems Review, 33:5, (16-31), Online publication date: 12-Dec-1999.
- Wolman A, Voelker M, Sharma N, Cardwell N, Karlin A and Levy H On the scale and performance of cooperative Web proxy caching Proceedings of the seventeenth ACM symposium on Operating systems principles, (16-31)
- Wolman A, Voelker G, Sharma N, Cardwell N, Brown M, Landray T, Pinnel D, Karlin A and Levy H Organization-based analysis of web-object sharing and caching Proceedings of the 2nd conference on USENIX Symposium on Internet Technologies and Systems - Volume 2, (3-3)
- Hadjiefthymiades S and Merakos L (1999). ESW4, ACM SIGCOMM Computer Communication Review, 29:5, (24-35), Online publication date: 5-Oct-1999.
- Saul L and Jordan M (1999). Mixed Memory Markov Models, Machine Language, 37:1, (75-87), Online publication date: 1-Oct-1999.
- Yin J, Alvisi L, Dahlin M and Lin C (1999). Volume Leases for Consistency in Large-Scale Systems, IEEE Transactions on Knowledge and Data Engineering, 11:4, (563-576), Online publication date: 1-Jul-1999.
- Shim J, Scheuermann P and Vingralek R (1999). Proxy Cache Algorithms, IEEE Transactions on Knowledge and Data Engineering, 11:4, (549-562), Online publication date: 1-Jul-1999.
- Maltzahn C, Richardson K and Grunwald D Reducing the disk I/O of web proxy server caches Proceedings of the annual conference on USENIX Annual Technical Conference, (17-17)
- Banga G and Druschel P (1999). Measuring the capacity of a Web server under realistic loads, World Wide Web, 2:1-2, (69-83), Online publication date: 15-Jan-1999.
- Barford P, Bestavros A, Bradley A and Crovella M (1999). Changes in Web client access patterns, World Wide Web, 2:1-2, (15-28), Online publication date: 15-Jan-1999.
- Aggarwal C, Wolf J and Yu P (1999). Caching on the World Wide Web, IEEE Transactions on Knowledge and Data Engineering, 11:1, (94-107), Online publication date: 1-Jan-1999.
- Lukose R and Huberman B Surfing as a real option Proceedings of the first international conference on Information and computation economies, (45-51)
- Barford P and Crovella M (1998). Generating representative Web workloads for network and server performance evaluation, ACM SIGMETRICS Performance Evaluation Review, 26:1, (151-160), Online publication date: 1-Jun-1998.
- Barford P and Crovella M Generating representative Web workloads for network and server performance evaluation Proceedings of the 1998 ACM SIGMETRICS joint international conference on Measurement and modeling of computer systems, (151-160)
- Colajanni M, Yu P and Dias D (1998). Analysis of Task Assignment Policies in Scalable Distributed Web-Server Systems, IEEE Transactions on Parallel and Distributed Systems, 9:6, (585-600), Online publication date: 1-Jun-1998.
- Loon T and Bharghavan V Alleviating the latency and bandwidth problems in WWW browsing Proceedings of the USENIX Symposium on Internet Technologies and Systems on USENIX Symposium on Internet Technologies and Systems, (20-20)
- Gribble S and Brewer E System design issues for internet middleware services Proceedings of the USENIX Symposium on Internet Technologies and Systems on USENIX Symposium on Internet Technologies and Systems, (19-19)
- Banga G and Druschel P Measuring the capacity of a web server Proceedings of the USENIX Symposium on Internet Technologies and Systems on USENIX Symposium on Internet Technologies and Systems, (6-6)
- Shi Y, Watson E and Chen Y Model-driven simulation of World-Wide-Web cache policies Proceedings of the 29th conference on Winter simulation, (1045-1052)
- Paxson V and Floyd S Why we don't know how to simulate the Internet Proceedings of the 29th conference on Winter simulation, (1037-1044)
- Cunha C and Jaccoud C Determining WWW User's Next Access and Its Application to Pre-fetching Proceedings of the 2nd IEEE Symposium on Computers and Communications (ISCC '97)
- Mah B An Empirical Model of HTTP Network Traffic Proceedings of the INFOCOM '97. Sixteenth Annual Joint Conference of the IEEE Computer and Communications Societies. Driving the Information Revolution
- Bestavros A (1997). WWW Traffic Reduction and Load Balancing through Server-Based Caching, IEEE Parallel & Distributed Technology: Systems & Technology, 5:1, (56-67), Online publication date: 1-Jan-1997.
- Aggarwal C and Yu P On disk caching of Web objects in proxy servers Proceedings of the sixth international conference on Information and knowledge management, (238-245)
- Crovella M and Bestavros A Self-similarity in World Wide Web traffic Proceedings of the 1996 ACM SIGMETRICS international conference on Measurement and modeling of computer systems, (160-169)
- Arlitt M and Williamson C Web server workload characterization Proceedings of the 1996 ACM SIGMETRICS international conference on Measurement and modeling of computer systems, (126-137)
- Crovella M and Bestavros A (1996). Self-similarity in World Wide Web traffic, ACM SIGMETRICS Performance Evaluation Review, 24:1, (160-169), Online publication date: 15-May-1996.
- Arlitt M and Williamson C (1996). Web server workload characterization, ACM SIGMETRICS Performance Evaluation Review, 24:1, (126-137), Online publication date: 15-May-1996.
- Bestavros A Using speculation to reduce server load and service time on the WWW Proceedings of the fourth international conference on Information and knowledge management, (403-410)
Recommendations
DNS-based Internet client clustering and characterization
WWC '01: Proceedings of the Workload Characterization, 2001. WWC-4. 2001 IEEE International WorkshopThis paper proposes a novel protocol which uses the Internet domain name system (DNS) to partition Web clients into disjoint sets, each of which is associated with a single DNS server. We define an L-DNS cluster to be a grouping of Web clients that use ...
Thin-client Web access patterns: Measurements from a cache-busting proxy
This paper describes a new technique for measuring Web client request patterns and analyzes a large client trace collected using the new method. In this approach, a modified proxy intercepts requests and serves all responses to clients marked ...