Learning Structural Classification Rules for Web-page Categorization. Stuckenschmidt, H., Hartmann, J., & Van Harmelen, F. 2002.
Content-related metadata plays an important role in the effort of developing intelligent web applications. One of the most established forms of providing content-related metadata is the assignment of web-pages to content categories. We describe the Spectacle system for classifying individual web pages on the basis of their syntactic structure. This classification requires the specification of classification rules associating common page structures with predefined classes. In this paper, we propose an approach for the automatic acquisition of these classification rules using techniques from inductive logic programming and describe experiments in applying the approach to an existing web-based information system.
@misc{1ed1d738b5ab4c4b9192563f10370569,
  title    = "Learning Structural Classification Rules for Web-page Categorization",
  abstract = "Content-related metadata plays an important role in the effort of developing intelligent web applications. One of the most established forms of providing content-related metadata is the assignment of web-pages to content categories. We describe the Spectacle system for classifying individual web pages on the basis of their syntactic structure. This classification requires the specification of classification rules associating common page structures with predefined classes. In this paper, we propose an approach for the automatic acquisition of these classification rules using techniques from inductive logic programming and describe experiments in applying the approach to an existing web-based information system.",
  author   = "Heiner Stuckenschmidt and Jens Hartmann and {Van Harmelen}, Frank",
  year     = "2002",
}
