Visual Web Information Extraction with Lixto (2001)  (Make Corrections)  (33 citations)
Robert Baumgartner, Sergio Flesca, Georg Gottlob
The VLDB Journal

  Home/Search   Context   Related

 
View or download:
semannot2001.aifb.unikar...6_lixto.pdf
Cached:  PS.gz  PS  PDF  Image  Update  Help

From:  semannot2001.aifb....schedule_new (more)
(Enter author homepages)

Rate this article: (best)
  Comment on this article  
(Enter summary)

Abstract: We present new techniques for supervised wrapper generation and automated web information extraction, and a system called Lixto implementing these techniques [6]. Our system can generate wrappers which translate relevant pieces of HTML pages into XML. Lixto, of which a working prototype has been implemented, assists the user to semi-automatically create wrapper programs by providing a fully visual and interactive user interface. In this convenient user-interface very expressive extraction... (Update)

Cited by:   More
COMPASS: A Concept-based Web Search Engine - For Html Xml (2004)   (Correct)
Learning Node Selecting Tree Transducer - From Completely Annotated   (Correct)
Intl. Conf. RIVF'04 - February Hanoi Vietnam (2004)   (Correct)

Active bibliography (related documents):   More   All
2.6:   Visual Web Information Extraction with Lixto - Baumgartner, Flesca, Gottlob (2001)   (Correct)
1.1:   Declarative Information Extraction, Web Crawling, and.. - Baumgartner, Flesca.. (2001)   (Correct)
0.4:   Visual Programming of Web Data Aggregation Applications - Robert Baumgartner Technische (2003)   (Correct)

Similar documents based on text:   More   All
1.0:   Monadic Datalog and the Expressive Power of Languages for Web.. - Gottlob, Koch (2002)   (Correct)
0.6:   The Lixto Data Extraction Project -- Back and Forth - Between Theory And (2004)   (Correct)
0.4:   An Approach for Synergically Carrying out Intensional.. - Pontier, Ursino, Zumpano (2002)   (Correct)

Related documents from co-citation:   More   All
16:   XWRAP: An XML-enabled wrapper construction system for web information sources - Liu, Pu et al. - 2000
12:   Roadrunner: Towards automatic data extraction from large web sites - Crescenzi, Mecca et al. - 2001
10:   Building Intelligent Web Applications Using Lightweight Wrappers - Sahuguet, Azavant - 2000

BibTeX entry:   (Update)

R. Baumgartner, S. Flesca, G. Gottlob. Visual Web Information Extraction with Lixto, Proc. of VLDB, 2001 http://citeseer.ist.psu.edu/baumgartner01visual.html   More

@inproceedings{ baumgartner01visual,
    author = "Robert Baumgartner and Sergio Flesca and Georg Gottlob",
    title = "Visual Web Information Extraction with Lixto",
    booktitle = "The {VLDB} Journal",
    pages = "119-128",
    year = "2001",
    url = "citeseer.ist.psu.edu/baumgartner01visual.html" }
Citations (may not include all citations):
212   Wrapper induction for information extraction - Kushmerick, Weld et al. - 1997
188   Data on the Web - From Relations to Semistructured Data and .. (context) - Abiteboul, Buneman et al. - 2000
152   A query language for XML - Florescu, Deutsch et al. - 1999
144   Extracting semistructured information from the web - Hammer, Garcia-Molina et al. - 1997
58   A hierarchical approach to wrapper induction - Muslea, Minton et al. - 1999
50   XQuery: A query language for XML (context) - Chamberlin, Eds - 2001
47   Cut and paste - Atzeni, Mecca - 1997
43   Generating finite-state transducers for semistructured data .. - Hsu, Dung - 1998
33   Visual web information extraction with Lixto - Baumgartner, Flesca et al. - 2001
25   JEDI: Extracting and synthesizing information from the web - Huck, Fankhauser et al. - 1998
24   NoDoSE - a tool for semi-automatically extracting semi-struc.. (context) - Adelberg - 1998
19   Building light weight wrapper legacy web data source using W.. - Azavant, weight et al. - 1999
16   World Wide Web Journal (context) - Kushmerick - 2000
16   Intelligent internet systems - Levy, Weld - 2000
13   mediating and restructuring information from the web (context) - May, Himmeroder et al. - 1999
11   Anti-unification based learning of T-wrappers for informatio.. - Thomas - 1999
7   Natural language processing with ThoughtTreasure - Mueller - 1998
5   Content integration for e-business - Stonebraker, Hellerstein - 2001
5   Supervised wrapper generation with Lixto - Baumgartner, Flesca et al. - 2001
4   XML-GL: a graphical query language for querying and restruct.. (context) - Ceri, Comai et al. - 1999
3   XWrap: An extensible wrapper construction system for interne.. (context) - Liu, Pu et al. - 2000
3   web crawling and recursive wrapping with Lixto (context) - Baumgartner, Flesca et al. - 2001
3   aspects of resilient data extract (context) - Davulcu, Yang et al. - 2000
2   Distributed WWW programming using (context) - Cabeza, Hermenegildo - 2001
http://www.starlab.vub.ac.be/research/
http://www.savarese.org/oro/
http://www.jdom.org/



The graph only includes citing articles where the year of publication is known.


Documents on the same site (http://semannot2001.aifb.uni-karlsruhe.de/schedule_new.html):   More
Large Scale Acquisition and Maintenance From the Web Without.. - Leonard, Glaser (2001)   (Correct)
Towards Annotation using DAML+OIL - Sean Bechhofer Carole (2001)   (Correct)
Incremental Development of Domain-Specific Document Retrieval.. - Kim, Compton (2001)   (Correct)

Online articles have much greater impact   More about CiteSeer.IST   Add search form to your site   Submit documents   Feedback  

CiteSeer.IST - Copyright Penn State and NEC