D6.2, Methods for data acquisition and transformation

Short Description:

This document describes methods for homogeneous access to the data residing in the information sources to be integrated. It also selects and adapts available research results from the fields of information integration and agent technology for realizing these methods. The data in the sources are given in the raw data formats specified in Project Report D6.1 and have to be transformed into a format suitable for internal integration use. This transformation has to take into account the INFOMIX Source Data Format (ISDF), which presents a uniform logical format of the source data to the user, as well as the internal integration data format used by the internal integration algorithms. Furthermore, methods and techniques for extracting information from implicit representations are considered within this framework.