GAMES logo Nodes: Vienna
Aachen Bordeaux Edinburgh Paris Rice Uppsala Warsaw Vienna

Task 8: Automata and query languages for semistructured data

Objectives

Main goal: A new approach to data on the web based on a new notion of automata, similar to tree automata, will be developed. On the practical side, automata concepts will be combined with database methods for analysing and improving existing XML query languages and automatic information extractors. We will also use verification techniques based on automata and logics for validating and comparing Document Type Definitions (DTDs). aspect and the practical impact of the network.

Background literature

[1] G. Gottlob and C. Koch, Monadic datalog and the expressive power of languages for Web Information Extraction, in Proc. of PODS, 2002.
[ BibTeX ]
[2] R. Baumgartner, S. Flesca, and G. Gottlob, Visual web information extraction with Lixto, in Proc. of VLDB, 2001.
[ BibTeX ]
[3] G. Gottlob and C. Koch, Monadic queries over tree-structured data, in Proc. of LICS, 2002.
[ BibTeX ]
[4] G. Gottlob, N. Leone, and F. Scarcello, Hypertree decompositions: A survey, in Proc. of MFCS, 2001.
[ BibTeX ]