Can We Learn What People Are Doing From Raw DNS Queries?

Jianfeng Li, Xiaobo Ma, Li Guodong, Xiapu Luo, Junjie Zhang, Wei Li, Xiaohong Guan

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Domain Name System (DNS) is one of the pillars of today's Internet. Due to its appealing properties such as low data volume, wide-ranging applications and encryption free, DNS traffic has been extensively utilized for network monitoring. Most existing studies of DNS traffic, however, focus on domain name reputation. Little attention has been paid to understanding and profiling what people are doing from DNS traffic, a fundamental problem in the areas including Internet demographics and network behavior analysis. Consequently, simple questions like 'How to determine whether a DNS query for www.google.com means searching or any other behaviors?' cannot be answered by existing studies. In this paper, we take the first step to identify user activities from raw DNS queries. We advance a multiscale hierarchical framework to tackle two practical challenges, i.e., behavior ambiguity and behavior polymorphism. Under this framework, a series of novel methods, such as pattern upward mapping and multi-scale random forest classifier, are proposed to characterize and identify user activities of interest. Evaluation using both synthetic and real-world DNS traces demonstrates the effectiveness of our method.

Original languageAmerican English
Title of host publicationIEEE INFOCOM 2018 - IEEE Conference on Computer Communications
PublisherIEEE
Pages2240-2248
Number of pages9
ISBN (Electronic)978-1-5386-4128-6
ISBN (Print)978-1-5386-4129-3
DOIs
StatePublished - Oct 11 2018
Event2018 IEEE Conference on Computer Communications, INFOCOM 2018 - Honolulu, United States
Duration: Apr 15 2018Apr 19 2018

Conference

Conference2018 IEEE Conference on Computer Communications, INFOCOM 2018
Country/TerritoryUnited States
CityHonolulu
Period4/15/184/19/18

ASJC Scopus Subject Areas

  • General Computer Science
  • Electrical and Electronic Engineering

Keywords

  • Conferences
  • Monitoring
  • Forestry
  • Google
  • Training
  • Internet
  • Encryption

Disciplines

  • Computer Sciences
  • Engineering

Cite this