Thomas Thurner

The review in a car

Imagine the following: A car full of Semantic Web Experts is on it’s way back from Graz. They hand around an iPhone to record some first impressions about the just ended 6th International Conference on Semantic Systems, I-SEMANTICS. So, the car was a Volvo, occupied by Thomas Schandl, Helmut Nagy, Tassilo Pellegrini and Andreas Blumauer.

Andreas Blumauer: “I think this year’s I-Semantics was a big step forward. I had the impression that a lot of industry representatives are looking again for serious solutions there, after they have had already “burned their fingers” with the first-generation semantics. The now presented 2nd generation is much more about running applications and less unproven concepts.”

Tassilo Pellegini: “This is what I also noticed this year. People build now on a solid common knowledge on the topic and are much more aware of the possibilities of the existing technologies and methods. And as this conference also where visited by a quite international crowd, a very homogeneous discussion incorporating a lot of the international trends was possible. So the developed sight on the topic was quite clear. In this respect, the keynote of Peter A. Gloor was a notable and impressive look into the very next future. It seems that the powerful technique of Cool Farming will be on our agenda in the next years, when we talk about prognosis tools, sentiment analysis, aggregated expert’s data, etc.”

Andreas Blumauer: “In terms of a look into the very next trends, also the Keynote of Rafael Sidi was impressive to me, as he draw a real amazing picture how his company Elsevier is on the way to transform their whole business model into a new paradigm. And this gives a glue that LOD has now arrived in real industry environments.”

Tom Schandl: “I think this real-live-aspect of the Semantic Web was one of the unspoken focal points of the conference. In this respect Richard Cyganiak had a brilliant talk about how corporate data integration can benefit from RDF-Solutions, because a RDF based data concept can be developed step-by-step in contradiction to a “conservative corporate data integration” which always goes with a general redesign of the whole data-structure of a company. Richard calls this “pay as you go” – and I think this is what the industry looks for.”

Helmut Nagy: “This is also my impression, standing a lot on our booth. The industry looks for very concrete semantic solutions – and some of them are already there and ready to use. So – to carry some house advertising – our PoolParty demozone was very well recognised and commented. And this is not only because I served Tropical Banana Cocktail there.”

So the talk went on, in the car, at the blogosphere in the Semantic Web Community.

Tassilo Pellegrini

NYT, Wolters Kluwer Germany and Semantic Universe sponsor Triplification Challenge 2010

3.000 € prize money for the  most promising Linked Data applications


The New York Times, Wolters Kluwer Germany and Semantic Universe sponsor the Triplification Challenge 2010 taking place at the I-SEMANTICS Conference from 1 – 3 September 2010 in Graz / Austria. Together they have provided 3.000 Euro in prize money which will be given to the  most promising application demonstrations and approaches built upon Linked Data.

The Challenge is organized by an international consortium consisting of Juan Sequeda (University of Texas at Austin), Bernhard Schandl (University of Vienna), Sören Auer (University of Leipzig), Ivan Herman (World Wide Web Consortium), Tassilo Pellegrini (Semantic Web Company Vienna) and patroned by Sir Tim Berners-Lee.

Participants can choose between an open track and a special NYT track.

The open track sponsored by Wolters Kluwer Germany and Semantic Universe encourages submissions that fit into one or more of the following categories:

  • novel data sets that are published as part of the Web of Data, according to Linked Data principles, and demonstrating potential benefit of use within applications;
  • novel generic mechanisms, approaches, and technologies that convert certain types and formats of information into triples, interlink them to other data sets, and expose them as Linked Data;
  • applications showcasing the benefits of Linked Data to end-users such as for information syndication, specialized search, browsing, or augmentation of content.

The NYT track invites submissions that make use of the Linked Data published at data.nytimes.com and one or more government datasets that relate to politics. Any dataset qualifies that is produced by any government in the world that would be of interest to a constituent of that government. These can range from the demographics of election districts to campaign finance to corporate spending on political messaging.

The submission deadline for both tracks is 18 May 2010.

All submissions will be reviewed by an international program committee from industry and academia electing two winners in the open track and one winner in the NYT track.

For detailed information on the Triplification Challenge visit

http://i-semantics.tugraz.at/triplification-challenge or

http://triplify.org/Challenge/2010

Cordial thanks to our sponsors:

Thomas Schandl

Using Triplify to expose the semantics of a site

Recently the SWC took a thorough look at Triplify, a tool for mapping the contents of a relational DB to RDF, in the course of which we could convince ourselves of Triplify’s ease of use and its potent capabilities.
We take this opportunity to given an account of the philosophy behind Triplify, how it is used and also had the chance to interview the creator Sören Auer.

Triplify Logo

A common objection from critics of the semantic web is that regular users or webmasters won’t go to the trouble of marking up their content or whole web sites with RDF.
While it is obvious that nobody is going to decorate their web pages with hand-carved RDF triples, it is also apparent that a lot of the current web’s pages are generated by transforming information from relational databases to HTML pages, which are perfectly suited for human consumption, but which suffer from a big loss of machine-readable semantics.

As the information in the relational databases is highly structured and contains rich semantics, it is only natural to also use the already existing structured data to generate RDF representations of the same information.

Triplify is all about this approach of bootstrapping data for the semantic web. It does this for web applications which are built on PHP and MySQL.
Triplify consists of a lightweight PHP script and a configuration file. The latter is used to do the mapping of the columns of an application’s relational database to appropriate RDF classes and properties.

In many cases a site administrator who wants to export her site’s content as RDF, only has to save Triplify with a premade configuration file for her site’s application into the right folder, as for many popular applications like WordPress, Joomla! or phpBB all the work has already been done.
Once installed, Triplify can be used to generate a dump of the site’s complete RDF graph, or to generate Linked Data, as each of the site’s main concepts’ RDF graph is provided under its own URL, e.g. the semantic description of a user with the ID 123 can be accessed under http://yoursite.com/triplify/user/123.

If no configuration for an application exits, it is fairly easy to create one by yourself.
All one has to do is to look at the app’s database schema, find appropriate classes and properties from well known ontologies and create MySQL queries that grab the data from the relational database and map them to RDF classes or properties.
An example for a query that takes the data from a table describing the user of a CMS:
"SELECT id, name AS 'foaf:name', url AS 'foaf:homepage', short_description AS 'dc:abstract' FROM user_table",

Triplify’s creator Sören Auer kindly gave us the opportunity for an interview:

Triplify is very easy to configure for web developers. For which scenarios would you recommend to use Triplify, and in which situations other approaches of semantifying your data might be more suitable?

As you already mentioned Triplify was primarily developed for Web applications developed in PHP. These usually have a relatively small and simple set of tables. Triplify creates complete RDF exports, Linked Data or JSON, but does not include SPARQL endpoint functionality. When SPARQL is required you are better off with D2R Server or Virtuosos RDF views.

Triplify creates semantic representations of the data in relational databases. Do you think there would also be benefit in the inverse approach i. e. creating an application that parses triples and writes it to a relational DB according to a mapping file?

In certain scenarios this might make sense, but for the most cases I think the database schema has to be developed separately. Database schemata contain more storage and retrieval oriented information, such as for example about data indexing. Vocabularies and ontologies on the other hand represent information on a conceptually higher level and are more flexible with regard to evolution of the information structures than databases.

Are there plans for further development of Triplify?

Sure. We want to add SPARQL support and possibly port Triplify to other scripting languages such as Ruby and Python.

Thank you Sören, we will stay tuned about the news from your great application and look forward to the Triplification Challenge 2009!

Andreas Blumauer

Open now: LOD Triplification Challenge 2009

The yearly organized Linking Open Data Triplification Challenge (as part of this year´s I-Semantics conference, 2 – 4 September 2009, Graz/Austria) awards prizes to the most promising triplifications of existing Web applications, Websites and data sets.

The challenge (Patron: Sir Tim Berners-Lee) is open to anyone interested in applying Semantic Web and Linked Data technologies. We envision submissions such as following:

  • Applications of Linked Data tools and techniques such as for example Triplify, Virtuoso or D2RQ on custom Web applications and data sets exposing a large quantity and variety of content.
  • Implementations of exporters and mappers from existing content repository formats (such as mbox mailing list archives, Bib Te X, XML-Schemes etc.) into RDF and Linked Data.
  • Adoptions / configurations of Triplify for standard Web applications, such as for example Wikis, Weblogs, Webshops, Forums, Web-Gallery, ERP/CRM systems and Web-calendar software. You can find popular Web applications for example at Source Forge.
  • Portings of the Triplify script into other Web application programming languages such as Python, Ruby, Perl, ASP. The Triplify script is very small (<300 lines of code) however, the port should be as compatible as possible with the current reference implementation but integrate well with the environment given by the programming language.
  • Applications showcasing the benefits of Linked Data to end-users such as for information syndication, specialized search, browsing or augmentation of content.

Submissions should consist of a two page description in JUCS format of the application, accompanied by (a link to) the software source code and a link to an online demo. The descriptions should be submitted electronically via email to Michael Hausenblas with the subject Triplification Challenge Submission by May 30th, 2009.