The Semantic Puzzle

Tassilo Pellegrini

Thoughts on KOS (Part1): Getting to grips with “semantic” interoperability

Enabling and managing interoperability at the data and the service level is one of the strategic key issues in networked knowledge organization systems (KOSs) and a growing issue in effective data management. But why do we need “semantic” interoperability and how can we achieve it?

Interoperability vs. Integration

The concept of (data) interoperability can best be understood in contrast to (data) integration. While integration refers to a process, where formerly distinct data sources and their representation models are being merged into one newly consolidated data source, the concept of interoperability is defined by a structural separation of knowledge sources and their representation models, but that allows connectivity and interactivity between these sources by deliberately defined overlaps in the representation model. Under circumstances of interoperability data sources are being designed to provide interfaces for connectivity to share and integrate data on top of a common data model, while leaving the original principles of data and knowledge representation intact. Thus, interoperability is an efficient means to improve and ease integration of data and knowledge sources.

Three levels of interoperability

When designing interoperable KOSs it is important to distinguish between structural, syntactic and semantic interoperability (Galinski 2006):

  • Structural interoperability is achieved by representing metadata using a shared data model like the Dublin Core Abstraction Model or RDF (Resource Description Framework).
  • Syntactic interoperability if achieved by serializing data in a shared mark-up language like XML, Turtle or N3Notation3 language, of internet Media Type text/n3. Compact and readable alternative to RDF's XML syntax. (, or N3 as it is more commonly known, is a shorthand non-XML serialization of Resource Description Framework models, designed with ....
  • Semantic interoperability is achieved by using a shared terminology or controlled vocabularyControlled vocabularies provide a way to organize knowledge for subsequent retrieval. They are used in subject indexing schemes, subject headings, thesauri and taxonomies. Controlled vocabulary schemes mandate the use of predefined, authorised terms that have been preselected by the designer of ... to label and classify metadata terms and relations.

Given the fact that metadata standards carry a lot of intrinsic legacy, it is sometimes very difficult to achieve interoperability at all three levels mentioned above. Metadata formats and models are historically grown, they are most of the time a result of community decision processes, often highly formalized for specific functional purposes and most of the time deliberately rigid and difficult to change. Hence it is important to have a clear understanding and documentation of the application profile of a metadata format as a precondition for enabling interoperability at all three levels mentioned above. Semantic Web standards do a really good job in this respect!!

In the next post, we will take a look at various KOSs and how they differ with respect to expressivity, scope and target group.

This entry was posted in Linked Data & Open Data, Vocabularies & Languages and tagged , , by Tassilo Pellegrini. Bookmark the permalink.
Tassilo Pellegrini

About Tassilo Pellegrini

From Wikipedia, the free encyclopedia: Prof. (FH) Dr. Tassilo Pellegrini (born 1974) studied International Trade, Communication Science and Political Science at the University of Salzburg and University of Málaga. Since end of 2007 he is running the New Media Division at the University of Applied Sciences in St. Pölten. He obtained his master degree in 1999 from the University of Salzburg on the topic of telecommunications policy in the European Union, which was followed by a PhD in 2010 on the topic of bounded policy-learning in the European Union with a focus on intellectual property policies. His current research encompasses economic effects of internet regulation with respect to market structure and basic civil rights. He is member of the International Network for Information Ethics (INIE), the African Network of Information Ethics (ANIE) and the Deutsche Gesellschaft für Publizistik und Kommunikationswissenschaft (DGPUK). Beside his specialisation in policy research and media economics Tassilo Pellegrini has worked on semantic technologies and the Semantic Web. He is co-founder and Head of Division Research and Development of the Semantic Web Company in Vienna, co-editor of the first German textbook on Semantic Web and Conference Chair of the annual I-SEMANTICS conference series founded in 2005.