The Semantic Puzzle

Jana Herwig

Integrating Information Extraction into the KiWi-System: a proposal from Brno

Semantic technology isn’t about technology: It’s noble concern is to make the life and work of people easier. Yesterday, Marek Schmidt and Petr Knoth, both working on PhD project within Natural Language Processing (NLP) at Brno University, introduced their vision of how Information ExtractionInformation extraction (IE) is a type of information retrieval whose goal is to automatically extract structured information from unstructured machine-readable documents, generally human language texts by means of natural language processing (NLP). Due to the difficulty of the problem, current ... could be integrated into the KiWiEU-funded project combining the wiki philosophy with methods of the Semantic Web, aiming to develop a new approach to knowledge management. (http://www.kiwi-project.eu/index.php/about)-System.

First off: What is Information Extraction? In natural language processingNatural Language processing (NLP) is a field of computer science and linguistics concerned with the interactions between computers and human (natural) languages. In theory, natural-language processing is a very attractive method of human-computer interaction. Natural-language understanding is ..., “information extraction (IE) is a type of information retrieval whose goal is to automatically extract structured information, i.e. categorized and contextually and semantically well-defined data from a certain domain, from unstructured machine-readable documents” (Wikipedia). Marek and Petr’s vision for using IE in KiWi is to support the user in the creation of semantic annotations.

Annotations and Ontology

The image above illustrates their vision: If a user, for instance, enters the text “Hello, I am the best expert in Java around the Sun” into the content editor, structured information is extracted, analyzed on the fly and returned as suggestions. Through the application of reasoning on existing annotations and on further information that is available on the system – e.g. relevant domain ontologies, but also information about the user himself – the system will be able to infer new statements: E.g. the system will be able to infer that Bill Rodgers has Java programming skills, even though this information has never been explicitly stated in the knowledge base.

One thought on “Integrating Information Extraction into the KiWi-System: a proposal from Brno

Leave a Reply

Your email address will not be published. Required fields are marked *

You may use these HTML tags and attributes: <a href="" title=""> <abbr title=""> <acronym title=""> <b> <blockquote cite=""> <cite> <code> <del datetime=""> <em> <i> <q cite=""> <strike> <strong>