Detecting Web Robots Using Resource Request Patterns

Derek Doran, Swapna S. Gokhale

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

A significant proportion of Web traffic is now attributed to Web robots, and this proportion is likely to grow over time. These robots may threaten the security, privacy, functionality, and performance of a Web server due to their unregulated crawling behavior. Therefore, to assess their impact, it must be possible to accurately detect Web robot requests. Contemporary detection approaches, however, may cease to be effective as the behavior of both robots and humans evolves. In this paper, we present a novel detection approach that is based on the contrasts in the resource request patterns of robots and humans. The proposed scheme, which relies on an invariant behavioral difference between humans and robots, builds on the lessons from contemporary approaches. We demonstrate that the proposed approach can accurately detect Web robots and argue that it is expected to remain effective even as they continue their rapid evolution.

Original languageAmerican English
Title of host publication2012 11th International Conference on Machine Learning and Applications
PublisherIEEE
Pages7-12
Number of pages6
ISBN (Electronic)978-0-7695-4913-2
ISBN (Print)978-1-4673-4651-1
DOIs
StatePublished - Jan 10 2013
Event11th IEEE International Conference on Machine Learning and Applications, ICMLA 2012 - Boca Raton, FL, United States
Duration: Dec 12 2012Dec 15 2012

Conference

Conference11th IEEE International Conference on Machine Learning and Applications, ICMLA 2012
Country/TerritoryUnited States
CityBoca Raton, FL
Period12/12/1212/15/12

ASJC Scopus Subject Areas

  • Human-Computer Interaction
  • Education

Keywords

  • Detection
  • User Classification
  • Web crawler
  • Web Log Analysis
  • Web Mining
  • Web robot

Disciplines

  • Computer Sciences
  • Engineering

Cite this