In Search of: Reliable Usage Data on the World Wide Web

Show full item record

Please use this identifier to cite or link to this item:

Title: In Search of: Reliable Usage Data on the World Wide Web
Author: Pitkow, James Edward
Abstract: The WWW is currently the hottest testbed for future interactive digital systems. While much is understood technically about how the WWW functions, substantially less is known about how this technology is used collectively and on an individual basis. This disparity of knowledge exists largely as a direct consequence of the decentralized nature of Web. Since each user of the Web is not uniquely identifiable across the system and the system employs various levels of caching, measurement of actual usage is problematic. This paper establishes terminology to frame the problem of reliably determining usage of WWW resources while reviewing current practice and their shortcomings. A review of the various metrics and analyses that can be performed to determine usage is then presented. This is followed by a discussion of the strengths and weaknesses of the hit-metering proposal [Mogul and Leach 1997] currently in consideration by the HTTP working group. Lastly, new proposals, based upon server-side sampling are introduced and assessed against the other proposal. It is argued that server-side sampling provides more reliable and useful usage data while requiring no change to the current HTTP protocol and enhancing user privacy.
Type: Technical Report
Date: 1997
Relation: GVU Technical Report;GIT-GVU-97-13
Publisher: Georgia Institute of Technology
Subject: World Wide Web
Statistical analysis
Path analysis
Log file analysis

All materials in SMARTech are protected under U.S. Copyright Law and all rights are reserved, unless otherwise specifically indicated on or in the materials.

Files in this item

Files Size Format View
97-13.pdf 91.51Kb PDF View/ Open

This item appears in the following Collection(s)

Show full item record