Abstract: We present new techniques for supervised wrapper generation
and automated web information extraction, and a system
called Lixto implementing these techniques [6]. Our system
can generate wrappers which translate relevant pieces of
HTML pages into XML. Lixto, of which a working prototype
has been implemented, assists the user in semi-automatically
creating wrapper programs by providing a fully visual and interactive
user interface. In this convenient user-interface
very expressive extraction...
Cited by:
COMPASS: A Concept-based Web Search Engine - For Html Xml (2004)
Learning Node Selecting Tree Transducer - From Completely Annotated
Intl. Conf. RIVF'04 - February Hanoi Vietnam (2004)
Active bibliography (related documents):
2.6: Visual Web Information Extraction with Lixto - Baumgartner, Flesca, Gottlob (2001)
1.1: Declarative Information Extraction, Web Crawling, and.. - Baumgartner, Flesca.. (2001)
0.4: Visual Programming of Web Data Aggregation Applications - Robert Baumgartner Technische (2003)
Similar documents based on text:
1.0: Monadic Datalog and the Expressive Power of Languages for Web.. - Gottlob, Koch (2002)
0.6: The Lixto Data Extraction Project -- Back and Forth - Between Theory And (2004)
0.4: An Approach for Synergically Carrying out Intensional.. - Pontier, Ursino, Zumpano (2002)
Related documents from co-citation:
16: XWRAP: An XML-enabled wrapper construction system for web information sources - Liu, Pu et al. (2000)
12: Roadrunner: Towards automatic data extraction from large web sites - Crescenzi, Mecca et al. (2001)
10: Building Intelligent Web Applications Using Lightweight Wrappers - Sahuguet, Azavant (2000)
BibTeX entry:
R. Baumgartner, S. Flesca, G. Gottlob. Visual Web Information Extraction with Lixto, Proc. of VLDB, 2001. http://citeseer.ist.psu.edu/baumgartner01visual.html
@inproceedings{ baumgartner01visual,
author = "Robert Baumgartner and Sergio Flesca and Georg Gottlob",
title = "Visual Web Information Extraction with Lixto",
booktitle = "Proc. of {VLDB}",
pages = "119--128",
year = "2001",
url = "citeseer.ist.psu.edu/baumgartner01visual.html" }
Citations (may not include all citations):
212: Wrapper induction for information extraction - Kushmerick, Weld et al. (1997)
188: Data on the Web - From Relations to Semistructured Data and .. - Abiteboul, Buneman et al. (2000)
152: A query language for XML - Florescu, Deutsch et al. (1999)
144: Extracting semistructured information from the web - Hammer, Garcia-Molina et al. (1997)
58: A hierarchical approach to wrapper induction - Muslea, Minton et al. (1999)
50: XQuery: A query language for XML - Chamberlin, Eds (2001)
47: Cut and paste - Atzeni, Mecca (1997)
43: Generating finite-state transducers for semistructured data .. - Hsu, Dung (1998)
33: Visual web information extraction with Lixto - Baumgartner, Flesca et al. (2001)
25: JEDI: Extracting and synthesizing information from the web - Huck, Fankhauser et al. (1998)
24: NoDoSE - a tool for semi-automatically extracting semi-struc.. - Adelberg (1998)
19
Building light weight wrapper legacy web data source using W.. - Sahuguet, Azavant (1999)
16: World Wide Web Journal - Kushmerick (2000)
16: Intelligent internet systems - Levy, Weld (2000)
13: mediating and restructuring information from the web - May, Himmeroder et al. (1999)
11: Anti-unification based learning of T-wrappers for informatio.. - Thomas (1999)
7: Natural language processing with ThoughtTreasure - Mueller (1998)
5: Content integration for e-business - Stonebraker, Hellerstein (2001)
5: Supervised wrapper generation with Lixto - Baumgartner, Flesca et al. (2001)
4: XML-GL: a graphical query language for querying and restruct.. - Ceri, Comai et al. (1999)
3: XWrap: An extensible wrapper construction system for interne.. - Liu, Pu et al. (2000)
3: web crawling and recursive wrapping with Lixto - Baumgartner, Flesca et al. (2001)
3: aspects of resilient data extract - Davulcu, Yang et al. (2000)
2: Distributed WWW programming using - Cabeza, Hermenegildo (2001)
http://www.starlab.vub.ac.be/research/
http://www.savarese.org/oro/
http://www.jdom.org/
Documents on the same site (http://semannot2001.aifb.uni-karlsruhe.de/schedule_new.html):
Large Scale Acquisition and Maintenance From the Web Without.. - Leonard, Glaser (2001)
Towards Annotation using DAML+OIL - Sean Bechhofer Carole (2001)
Incremental Development of Domain-Specific Document Retrieval.. - Kim, Compton (2001)