Repository logo
Communities & Collections
Research Outputs
Fundings & Projects
People
Statistics
User Manual
Have you forgotten your password?
  1. Home
  2. Faculty of Computer Science and Engineering
  3. Faculty of Computer Science and Engineering: Conference papers
  4. Web genre classification via hierarchical multi-label classification
Details

Web genre classification via hierarchical multi-label classification

Date Issued
2015-10-14
Author(s)
Madjarov, Gjorgji
Vidulin, Vedrana
Kocev, Dragi
Abstract
The increase of the number of web pages prompts for
improvement of the search engines. One such improvement can be by
specifying the desired web genre of the result web pages. This opens
the need for web genre prediction based on the information on the web
page. Typically, this task is addressed as multi-class classification, with
some recent studies advocating the use of multi-label classification. In
this paper, we propose to exploit the web genres labels by constructing a hierarchy of web genres and then use methods for hierarchical
multi-label classification to boost the predictive performance. We use
two methods for hierarchy construction: expert-based and data-driven.
The evaluation on a benchmark dataset (20-Genre collection corpus)
reveals that using a hierarchy of web genres significantly improves the
predictive performance of the classifiers and that the data-driven hierarchy yields similar performance as the expert-driven with the added value
that it was obtained automatically and fast.
Subjects

Web genre classificat...

File(s)
Loading...
Thumbnail Image
Name

978-3-319-24834-9_2.pdf

Size

610.62 KB

Format

Adobe PDF

Checksum

(MD5):78cc5251302bd18a2b1c9aff6d5df833

⠀

Built with DSpace-CRIS software - Extension maintained and optimized by 4Science

  • Accessibility settings
  • Privacy policy
  • End User Agreement
  • Send Feedback
Repository logo COAR Notify