System TARTAR

(Transforming ARbitrary TAbles into fRames)

 

Description: Efficient automatic information handling has become increasingly important in the information society. Most information on the Web is presented in the form of semi-structured or unstructured documents, encoded as a mixture of loosely structured natural language text and template units. The lack of metadata that would precisely annotate the structure and semantics of documents, together with the ambiguity of natural language, makes automatic computer processing very complex. The Semantic Web aims to overcome this bottleneck.

This software package implements a novel method for the automatic generation of knowledge models, such as ontologies, from arbitrary tabular structures found on the Web [1,2]. The method is based on the grounded cognitive table model introduced by Hurst [3] and proceeds in four steps. In the first step, a table is transformed into a regular matrix form. In the following two steps, the table is handled from a structural and a functional point of view, and in the last step from a semantic point of view. The outcome of the method is threefold: a knowledge frame, an ontology, and a knowledge base, all encoded in the F-Logic representation language. The system is implemented in Java and consists of approximately 15,000 lines of code. To fully exploit the outcomes produced by the system (i.e. for query answering), one can use an inference engine that supports F-Logic, such as OntoBroker [4].
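To make the first step more concrete, the following is a minimal sketch of how a table with spanning cells might be expanded into a regular matrix. It is not taken from the TARTAR sources: the class `TableNormalizer`, the `Cell` type, and the choice to copy a spanning cell's text into every position it covers are all assumptions made for illustration.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

/** Illustrative only: expands row/column spans into a rectangular matrix. */
public class TableNormalizer {

    /** Hypothetical source cell: text plus HTML-style rowspan and colspan. */
    public static final class Cell {
        final String text;
        final int rowSpan;
        final int colSpan;

        public Cell(String text, int rowSpan, int colSpan) {
            this.text = text;
            this.rowSpan = Math.max(1, rowSpan);
            this.colSpan = Math.max(1, colSpan);
        }

        public Cell(String text) {
            this(text, 1, 1);
        }
    }

    /** Returns a rectangular matrix; positions no cell covers become "". */
    public static String[][] toMatrix(List<List<Cell>> rows) {
        int height = rows.size();
        List<List<String>> grid = new ArrayList<>();
        for (int r = 0; r < height; r++) {
            grid.add(new ArrayList<>());
        }

        for (int r = 0; r < height; r++) {
            int c = 0;
            for (Cell cell : rows.get(r)) {
                // Skip positions already filled by a cell spanning down from above.
                while (get(grid.get(r), c) != null) {
                    c++;
                }
                // Assumption: the spanning cell's text is duplicated in each covered slot.
                int lastRow = Math.min(height, r + cell.rowSpan);
                for (int rr = r; rr < lastRow; rr++) {
                    for (int cc = c; cc < c + cell.colSpan; cc++) {
                        put(grid.get(rr), cc, cell.text);
                    }
                }
                c += cell.colSpan;
            }
        }

        // The matrix width is the length of the longest expanded row.
        int width = 0;
        for (List<String> row : grid) {
            width = Math.max(width, row.size());
        }

        String[][] matrix = new String[height][width];
        for (int r = 0; r < height; r++) {
            for (int c = 0; c < width; c++) {
                String value = get(grid.get(r), c);
                matrix[r][c] = (value == null) ? "" : value;
            }
        }
        return matrix;
    }

    private static String get(List<String> row, int index) {
        return index < row.size() ? row.get(index) : null;
    }

    private static void put(List<String> row, int index, String value) {
        while (row.size() <= index) {
            row.add(null);
        }
        row.set(index, value);
    }

    public static void main(String[] args) {
        List<List<Cell>> table = Arrays.asList(
            Arrays.asList(new Cell("Tour", 2, 1), new Cell("Price", 1, 2)),
            Arrays.asList(new Cell("Single"), new Cell("Double")),
            Arrays.asList(new Cell("A"), new Cell("100"), new Cell("150")));
        for (String[] row : toMatrix(table)) {
            System.out.println(Arrays.toString(row));
        }
    }
}
```

In this sketch every row of the result has the same number of columns, so later steps could address cells purely by row and column index. How TARTAR itself treats spanning cells is described in [1,2].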

More information about the system and related topics can be found in the following resources:

[1] A. Pivk, P. Cimiano, Y. Sure. From Tables to Frames. Technical Report, Institute AIFB, University of Karlsruhe, July 2004. [PDF]

[2] A. Pivk, Y. Sure, P. Cimiano, M. Gams, V. Rajkovič, and R. Studer. Transforming Arbitrary Tables into F-Logic Frames with TARTAR. Data & Knowledge Engineering, submitted, 2005. [PDF]

[3] M. Hurst. Layout and Language: Beyond Simple Text for Information Interaction - Modelling the Table. In Proceedings of the 2nd International Conference on Multimodal Interfaces, Hong Kong, 1999.

[4] S. Decker, M. Erdmann, D. Fensel, and R. Studer. OntoBroker: Ontology Based Access to Distributed and Semi-Structured Information. In Database Semantics: Semantic Issues in Multimedia Systems, R. Meersman, S. Stevens, and Z. Tari (editors), Kluwer, pp. 351-369, 1999.

 

Download TARTAR:

 

Required packages and installations: