Thomas Schandl

I-Semantics: The Review in a Car – 2011 Edition

Continuing the tradition of last year’s review in a car, the Semantic Web Company’s participants of the I-KNOW / I-SEMANTICS talked about their impressions of the conference while on their way back to Vienna.

Image based on work by Paolo Mañalac

Image based on work by Paolo Mañalac

 

Thomas Schandl: An especially nice thing about this conference is that it’s co-location attracts people from two separate communities: Knowledge Management and Semantic Web. This serves as a natural facilitator for looking beyond the boundaries of one’s own domain and getting more than a glimpse of what’s currently happening in related fields.

That being said one of the most interesting talks I attended was a talk from KM expert Prof. Martin Eppler and his take on “Sketching at Work“, which introduced loads of sketching methods which can help to solve problems, inspire creativity and support communication.

From the Semantic Web side I enjoyed the innovative approach taken by Hasso Plattner Insitute‘s DBpedia powered quiz game Risq!. It is a Jeopardy-like Facebook game, that (besides being fun) sheds insights as to which facts are especially important to characterize a Linked Data resource. E.g. when the system wants you to guess a specific “female politician” would it help you more to know that she is part of the category yago:LivingPeople or would you rather get the hint that she is dbpedia:Chancellor_of_Germany?
By analyzing the logs of the played games, the researchers can find out which triples have more discriminative power than others.

Through the many personal encounters I also got a lot of input on which new features would be especially interesting for future versions of PoolParty and what we should concentrate on in the LOD interlinking project LASSO that Bernhard Schandl (Gnowsis / Refinder), Stefan Wunder (Neurovation) and me presented at the I-Praxis track.

Andreas Blumauer: Again, this year was absolutely worth coming to Graz also from a business perspective. For me it was the 10th time going to Graz. When I went to the second edition of I-KNOW in 2001 I remember that nearly nobody has ever heard of “semantics”. When I-SEMANTICS came to Graz the first time, this was in 2007, it was still unclear for most visitors how semantic technologies could contribute to a more efficient enterprise knowledge management. Nowadays, 10 years later, there is another question most prominent:

Which kind of semantic technology is solving my problem?

Being most of the time at our exhibition booth I enjoyed talking to visitors who had very concrete plans & ideas about how to use linked data, text mining or knowledge models for their business. The time when we had to explain what the “semantic web” is all about is over.

Christian Dirschl´s (Wolters Kluwer) keynote on Friday was exactly reflecting this fact: It´s good to see how big players have started to integrate the idea of linked data into their processes already. The days when we had to explain the difference between RDF and XML seem to be over. Or at least almost.

Florian Kondert: It was a vibrant atmosphere for me, since I didn’t make it to participate to just one track, but talking to interesting and interested persons at the booth without one break, instead.

From the participant’s perspective the conference as a networking platform was a huge success – and it definitely didn’t stop at dusk! It is worth pointing out the diverse needs and ideas on semantic use cases, that allow us to learn more with every discussion. The bottom line is that semantic solutions are badly needed for many organisations – and they start to realize, that there are no working alternatives at the moment.

On the other hand it is crucial to show up with real life examples, not just with prototypes that might work tentatively! As providers for semantic solutions we face decision makers on the highest level and they demand high level remedies – so, no time to take a break!

Tassilo Pellegrini: As the conference chair I really had an intense, but all in all very positive time at the conference. Interesting people, inspiring talks and a really good time at the socializing events (greetings to Leo Sauermann & Co. – I enjoyed the drinks!). For a general conference overview read my post from a few days ago.

But there is more to such a diverse conference as just talking about semantics. As some of you might know, beside my interest in Semantic Web, I have been involved in some policy consulting lately concerning the topic of net neutrality. At the conference I took the opportunity to talk to some telecommunications-savvy people and had some really great conversations (Harald … I really enjoyed our discussion!). But to my surprise I had to find out that – especially among the engineering guys – there seems to be very little awareness about the pressing social, cultural and economic consequences that an abandoning of net neutrality will have on the Internet as we know it today. For those readers who are into semantic web but not into the net neutrality discourse I want to reduce it to a very simple formula: without net neutrality you can say goodbye to linked open data. And this should really make us think and act!!

Enhanced by Zemanta
Thomas Schandl

PoolParty 3.0 and its all new Linked Data framework

The new major release of PoolParty boasts with new Linked Data capabilities that further unlock the potential that the Semantic Web can bring to improve your metadata management, to enhance your data with external knowledge and to ease data integration efforts within your organization and with your partners.

In PoolParty 3.0 we created a Linked Data interlinking editor, making it easier than ever to add your own lookup and interlinking services (even for non-RDF sources) and made the Linked Data publishing front-end fully customizable in design, layout and regards to which parts of your content will be displayed.

But let’s start at the beginning:

Step 1 – Hook into the Linked Data Cloud!

In the era of the rapidly growing Linked Data Cloud your knowledge models don’t need to stay isolated from the outside world anymore. Simply use PoolParty’s new and improved lookup service to find matching resources from the Linked Open Data Cloud (e.g. from DBpedia).

Imagine having different data models that all refer to the same product categories and world regions. Once you have them represented in PoolParty you can use its lookup service to find matching resources from the Linked Data Cloud. In this way you will get globally used identifiers for your product categories and regions, usually in the form of a URI like http://dbpedia.org/resource/Berlin. This eases your internal data integration efforts, and it can aid the data exchange with partners or customers and enables hassle-free distributed management of knowledge models.

Image 1: Lookup of concept ‘Austria’ and selection of properties and values to be imported

 

With PoolParty 3.0 we increased the number of included lookup services: DBpedia, Geonames, Wordnet, Umbel, Yago, Freebase, Sindice, dmoz and LCSH – BBC Wildlife, Enis and Gemet are available on request.

Step 2 – Pull in Semantic Data!

There is a vast amount of Linked Data out there just waiting to be leveraged for thesaurus creation and extension. To meet that end we had a close look at our interlinking module and decided to enhance it a way that it becomes more of a Linked Data editor.

Once you have a base thesaurus in PoolParty and hooked a couple of your concepts into the cloud as described above, you can proceed to pull in the good stuff that comes with the Linked Data resources you have found.

Image 2: Imported Linked Data for concept ‘London’

 

As you can see in the image above, you can extend your local thesaurus with labels, definitions and all kinds of other information like e.g. in the case of countries their population, GDP, spoken languages, famous people born there, newspaper articles related to the political situation, and so on.

Now PoolParty 3.0 takes this approach a couple of steps further. You can not only specify which of your local concepts corresponds to which Linked Data resource and grab all semantic information that comes with this resource, but now you are able to selectively pick out the data items you are interested in and even transform the predicates they originally came with. Just switch them to whatever custom properties you created or want to re-use from any ontology (see an example in Image 1).

In this way you can easily enrich your own knowledge models with external information – which in turn can be utilized for better content recommendation, easier data integration and improved search services.

Step 3 – Publish your Linked Data in Style

Previous PoolParty versions already offered the possibility to instantly publish your thesauri, taxonomies or vocabularies and display their concepts as HTML while additionally providing machine-readable RDF versions for them. This means that anyone using PoolParty intuitive GUI can become a W3C standards compliant Linked Data publisher without having to know anything about Semantic Web technicalities.
Of course you don’t need to publish all your valuable models, just choose the parts that safely can be shared with the public and keep everything else behind your firewall, available only to you and trusted partners!

In this new release of PoolParty the design of all pages on the Linked Data front-end is now under your full control. You can use your own style sheets and create views on your data with velocity templates. It is even possible to develop project- and thesaurus-specific templates and layouts, so they can have an individual look and display different predicates and their values.

Take a look at PoolParty´s standard linked data frontend!

The following images show a PoolParty default Linked Data page and a custom-made Linked Data page of a PoolParty concept that has some DBpedia info imported.

Image 3: PoolParty default Linked Data page

PoolParty Linked Data page of ScOT thesaurus courtesy of Educational Services Australia
Image 4: Custom Linked Data page of ScOT thesaurus (courtesy of Educational Services Australia)

 

Step 4 – Unlock new Linked Data Sources

With PoolParty 3.0 you are in no way limited to DBpedia, Freebase, Geonames and the other lookup services that PoolParty provides out of the box: you can add your own non-Semantic Web data sources to the mix, thereby enabling you to boldly go where no Linked Data tool has gone before.

Maybe you have a product thesaurus and want to specify which products are related to patents that can be found with Google Patents?
Or you want to interlink concepts from a company taxonomy with related articles from the Guardian’s search service or any other newspaper that provides a search API?

All those sources are not available as RDF, so how can you re-use them easily as data sources for Linked Data style interlinking? For such cases PoolParty introduces the Unified Lookup API, which makes it easy to turn almost any third party Web API into a source for interlinking your concepts with third party resources as described above.

This makes it possible to interlink your concepts with many kinds of data out there, be it New York Times articles, UN data, synonym services, abbreviations, press releases, juridical information – or any web API important for your knowledge domain.

That being said, if you have suggestions for additional lookup services that you think are interesting, let us know!

To gain a first hand impression of the new PoolParty just apply for a demo account!

Thomas Schandl

Interview on Enhancing Semantic Web applications with Linguistic Information

John McCrae (Uni Bielefeld), Elena Montiel-Ponsoda (Universidad Politécnica de Madrid) and Tobias Wunner (DERI Galway) will hold a tutorial at the ESWC 2011 with the title “Enriching the Semantic Web with Linguistic Information“. We had a chance to talk to them beforehand:

Can you please tell us about the aims and purpose of your tutorial and the importance of incorporating linguistic information in the Semantic Web?

With the continuing growth of linked data and semantic technologies the incorporation of linguistic descriptions into Semantic Web resources has become a challenging issue. The integration of linguistic information especially on a multilingual level could greatly benefit Natural Language Processing (NLP) applications. Furthermore, the continuing growth of ontologies for semantic modeling and the use of terminological resources to add human language descriptions has raised the issue of how to add linguistic information to ontologies and linked data vocabularies and to represent models of lexical and terminological information in a way which is compatible with Semantic Web standards. Prominent examples here are, for instance, multilingual language tags in RDF Schema or SKOS’s success in bringing terminological information to the Semantic Web.

In the Tutorial we would like to discuss trends and novel models such as Lemon – the lexicon model for ontologies – to show possible future directions. The tutorial is targeted at researchers and practitioners interested in learning how to enrich ontologies with linguistic information in one or several natural languages and NLP tool developers interested in understanding how Semantic Web resources can be leveraged fro NLP. There will be two hands-on sessions in this tutorial.

Why did you choose to use PoolParty thesaurus management system in your tutorial?

To create terminology models on the web there are only few tools available which are often very technical and not straightforward to use for non-experts. We found that PoolParty in contrast to other SKOS editors has an attractive and usable interface. In addition the web based interface was preferable, as it did not require the participants to download software, the immediate publishing of linked data is more compatible with linked data principles and the tool has similarities to our own tools for working with lemon.

Thank you for this interview!

Thomas Schandl

Which kind of controlled vocabularies matter?

Looking at intermediate results of the Controlled Vocabularies Survey an interesting finding concerns the question which types of knowledge models are currently best fit for actual use in applications.

So far 143 people whose organization already make use of controlled vocabularies answered the question “Which kind of controlled vocabulary do you use or plan to use in your applications?”.
The results so far show that lightweight models like taxonomies and thesauri are somewhat preferred over ontologies:

Taxonomies are the favorite, as 73.6% of participants use or plan to use them, followed by thesauri (62%) and ontologies (61.2%), while simple glossaries lag considerably behind with a usage of 31.4%.

This survey will close in about a week, so please take this chance to make your opinions on this topic count! You can find the questions here, it will take 5-10 minutes to answer them.

All participants will gain access to a report with the results within the following month. The most interesting results will be made public on this blog.