Information Integration from the Web
The Web provides access to large data sources which are not explicitly
organized as databases. Instead, the information is presented as
semistructured data. In contrast to integrating classical
distributed databases, integration of such data raises several new
problems such as schema discovery, wrapping and reorganizing the data
sources and coping which changes in autonomous sources.
Information integration has been investigated by members of the
group since 1997 at
Freiburg
University using the the FLORID system for
extraction and integration of semistructured data from the
Web.
The Experiences with F-Logic and FLORID have been incorporated
into the LoPiX (Logic Programming in XML)
project, dealing with integration of XML data (2000 - 2003).
The work on information integration in XML is now continued with the
LinXIS project.
|