A Comparison of Web Robot and Human Requests

Derek Doran, Kevin Morillo, Swapna S. Gokhale

Research output: Contribution to conferencePaperpeer-review

Abstract

Sophisticated Web robots sport a wide variety of functionality and visiting characteristics, constituting a significant percentage of the requests serviced by a Web server. Unlike human clients that retrieve information off a site by navigating links and ignoring irrelevant information, Web robots may collect many different types of resources, and employ varying navigation strategies to find the knowledge on the site they desire. Thus, the resource request patterns of their visits are unpredictable and cannot be inferred based on our knowledge of human request patterns. In this paper, we perform an analysis on the types of resources requested by Web robots using recent Web logs from an academic Web server. We study the distribution of response sizes and response codes, the types of resources requested, and popularity of resources for requests from Web robots. Throughout, we contrast our findings against human resource request patterns. We find reasons to suggest that robots severely handicaps the ability of Web server caches to operate with high performance.
Original languageEnglish
Pages1374-1380
Number of pages7
DOIs
StatePublished - Aug 2013
Event2013 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining - Niagara Falls, Canada
Duration: Aug 25 2013Aug 28 2013
https://dl.acm.org/doi/proceedings/10.1145/2492517

Conference

Conference2013 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining
Abbreviated titleASONAM 2013
Country/TerritoryCanada
CityNiagara Falls
Period8/25/138/28/13
OtherThis is the first time the ASONAM conference has been organized in North America. I am so delighted to watch the conference moving up so fast with consistent and hopefully sustainable success. Achieving acceptance rate of 13% this year is a new record which brings a challenge for the new organizers. I am sure they are aware of the mission and will be able to maintain the same quality for the coming years. We need to show the success as permanent for ASONAM to continue its mission as the leading venue in the area of social networks analysis and mining. The number of quality submissions is rapidly increasing every year demonstrating the visibility of the conference and making it harder to select the papers to accommodate in the program. The quality of workshops co-located with ASONAM has been considerably improved this year. In addition, I am happy to witness the success of the two symposiums which have been integrated into the organization to have more specialized coverage of two key areas related to network based modeling and analysis. The symposium on the Foundations of Open Source Intelligence and Security Informatics (FOSINT-SI) is serving mostly researchers and practitioners interested in terror and criminal data analysis. On the other hand, the symposium on Network Enabled Health Informatics, Biomedicine and Bioinformatics (HI-BI-BI) covers the network applications in the health domain from the wet-lab to the clinic.
Internet address

ASJC Scopus Subject Areas

  • Computer Networks and Communications
  • Information Systems

Keywords

  • Human-centered computing
  • Human Computer Interaction
  • Interaction paradigms
  • Web-based interaction
  • Information systems
  • Information retrieval
  • Documented representation
  • information systems applications
  • data mining

Disciplines

  • Artificial Intelligence and Robotics
  • Databases and Information Systems

Cite this