A project on web information extraction Lixto is a fully visual interactive system for the generation of wrappers based on our developed algorithms and our declarative language for the definition of HTML/XML wrappers. Lixto is portable (implemented with Java), offers a capacious interactive visual interface, allows for expressive and flexible data extraction and uses intuitive hierarchical extraction, as well as string extraction techniques. Lixto translates relevant parts of web-pages into XML. It can be used to create an XML Companion for a HTML web page with changing content, containing the continually updated XML translation of the relevant information. Lixto wrappers can be embedded into personalisable information pipes.
Patent pending. |
|
This page currently under construction.
More screenshots and an interactive demo will be offered later.
baumgart@dbai.tuwien.ac.at
Last modified 12.6.2002 |