Repository logo
Communities & Collections
Research Outputs
Fundings & Projects
People
Statistics
User Manual
Have you forgotten your password?
  1. Home
  2. Faculty of Computer Science and Engineering
  3. Faculty of Computer Science and Engineering: Journal Articles
  4. Malware distributions and graph structure of the Web
Details

Malware distributions and graph structure of the Web

Journal
arXiv preprint arXiv:1707.06071
Date Issued
2017-07-19
Author(s)
Šćepanović, Sanja
Ruohonen, Jukka
Ayala-Gómez, Frederick
Aura, Tuomas
Hyrynsalmi, Sami
Abstract
Knowledge about the graph structure of the Web is important for understanding this complex socio-technical system and for devising proper policies supporting its future development. Knowledge about the differences between clean and malicious parts of the Web is important for understanding potential treats to its users and for devising protection mechanisms. In this study, we conduct data science methods on a large crawl of surface and deep Web pages with
the aim to increase such knowledge. To accomplish this, we answer the following questions. Which theoretical distributions explain important local characteristics and network properties of websites? How are these characteristics and properties different between clean and malicious (malware-affected) websites? What is the prediction power of local characteristics and network properties to classify malware websites? To the best of our knowledge, this is the
first large-scale study describing the differences in global properties between malicious and clean parts of the Web. In other words, our work is building on and bridging the gap between Web science that tackles large-scale graph representations and Web cyber security that is concerned with malicious activities on the Web. The results presented herein can also help antivirus vendors in devising approaches to improve their detection algorithms.
File(s)
Loading...
Thumbnail Image
Name

1707.06071.pdf

Size

3.49 MB

Format

Adobe PDF

Checksum

(MD5):f1a195b52bafbf38381b07874456c5df

⠀

Built with DSpace-CRIS software - Extension maintained and optimized by 4Science

  • Accessibility settings
  • Privacy policy
  • End User Agreement
  • Send Feedback
Repository logo COAR Notify