Number and Type: 181130 VU WS 2006/07

Lecturer: Robert Baumgartner
Keywords: Information Extraction, Approaches and methods for Wrapper Generation, Web Querying, Integration, XML.
Contents:
  • Information Extraction: Setting, History, IE vs. IR
  • Structured Data Extraction and Wrapping
  • XML Transformation and Query Languages (in particular XPath and XSLT, with a brief look at XQuery)
  • Web Wrapper Languages
  • Wrapper Generation Approaches
  • Inductive Wrapper Generation: Machine Learning on Strings/Trees, Tree Edit Distances
  • Automatic Data Extraction / Web Data Mining
  • Supervised Wrapper Generation
  • Deep Web Navigation Approaches
  • Data Extraction from PDF documents
  • Mediation and Integration Approaches
  • Web Data Cleaning
  • Lixto Visual Wrapper and Transformation Server
Fields of Study: This VU is a compulsory or compulsory elective course in several bachelor's and master's programs. It is also part of the redesigned KfK Semantic Web and of the European Master's Program in Computational Logic.


Robert Baumgartner, last modified on 8/2/2007