Searching for Heavy Tails in Web Robot Traffic

Derek Doran, Swapna S. Gokhale

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

This paper presents a study on whether the heavy-tailed trends reported in Web traffic are present in the traffic generated by Web robots. The study is motivated by three factors: (i) a significant volume of Web server traffic can now be attributed to Web robots, (ii) the Web is continuing to evolve into a semantic and service-oriented environment where Web robots will play a central role, and (iii) there are fundamental differences in the way robots and humans visit a site and search for information and these differences may lead to contrasts in the statistical patterns of the robots' requests compared to humans. We analyze Web robot traffic from a two-year access log from a Web server in the academic domain and study whether the response sizes, request inter-arrival times, and inter-session times exhibit heavy-tailed properties. In a multi-faceted analysis of the data we find that the response sizes and request inter-arrival times of robot requests do not exhibit heavy-tailed characteristics, contrasting the trends in these metrics in human traffic. However, we find that inter-session times of robots follow heavy-tailed characteristics similar to that of humans.

Original languageAmerican English
Title of host publication2010 Seventh International Conference on the Quantitative Evaluation of Systems
PublisherIEEE
Pages282-291
Number of pages10
ISBN (Print)978-1-4244-8082-1
DOIs
StatePublished - 2010
Event7th International Conference on the Quantitative Evaluation of Systems, QEST 2010 - Williamsburg, VA, United States
Duration: Sep 15 2010Sep 18 2010

Conference

Conference7th International Conference on the Quantitative Evaluation of Systems, QEST 2010
Country/TerritoryUnited States
CityWilliamsburg, VA
Period9/15/109/18/10

ASJC Scopus Subject Areas

  • Control and Systems Engineering

Keywords

  • Heavy-tailed distributions
  • Web robot
  • Web traffic

Disciplines

  • Computer Sciences
  • Engineering

Cite this