<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>The Semantic Puzzle&#187; Sören Auer</title>
	<atom:link href="http://blog.semantic-web.at/tag/soren-auer/feed/" rel="self" type="application/rss+xml" />
	<link>http://blog.semantic-web.at</link>
	<description>Open World Assumptions</description>
	<lastBuildDate>Thu, 02 Feb 2012 14:26:25 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
		<item>
		<title>The hype, the hope and the LOD2: Sören Auer engaged in the next generation LOD</title>
		<link>http://blog.semantic-web.at/2011/04/20/the-hype-the-hope-and-the-lod2-soren-auer-engaded-in-the-next-generation-lod/</link>
		<comments>http://blog.semantic-web.at/2011/04/20/the-hype-the-hope-and-the-lod2-soren-auer-engaded-in-the-next-generation-lod/#comments</comments>
		<pubDate>Wed, 20 Apr 2011 07:23:29 +0000</pubDate>
		<dc:creator>Thomas Thurner</dc:creator>
				<category><![CDATA[Linked Data & Open Data]]></category>
		<category><![CDATA[Tools & Software]]></category>
		<category><![CDATA[Linked Data]]></category>
		<category><![CDATA[lod2]]></category>
		<category><![CDATA[Sören Auer]]></category>

		<guid isPermaLink="false">http://blog.semantic-web.at/?p=2090</guid>
		<description><![CDATA[The paneuropean Project LOD2 is one of the biggest projects dealing with linked data. Scientists, programmers and software architects in various european countries are working on the next generation of linked open data. In a series of interviews i&#8217;m presenting &#8230; <a href="http://blog.semantic-web.at/2011/04/20/the-hype-the-hope-and-the-lod2-soren-auer-engaded-in-the-next-generation-lod/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p><em>The paneuropean Project LOD2 is one of the biggest projects dealing with linked data. Scientists, programmers and software architects in various european countries are working on the next generation of linked open data. In a series of interviews i&#8217;m presenting people working on and with LOD2. As a start, i had the change to talk to <a href="http://www.informatik.uni-leipzig.de/~auer/">Sören Auer</a>, head of the <a href="http://lod2.eu/">LOD2 project</a>.</em></p>
<p><strong>Thomas Thurner: </strong>Over the recent years the LOD movement gained tremendous momentum. As one of the key players in this area how do you perceive this development? Hype or hope?</p>
<p><strong><img title="Sören Auer" src="http://lod2.eu/BlogPost/wp-content/uploads/2011/04/Bild-1-198x300.png" alt="" width="111" height="169" align="left" />Sören Auer: </strong>From my point of view the momentum LOD gained is deserved. We should strive for a Web, which is more decentralized, democratic, participatory, transparent and inclusive. <a class="zem_slink" title="Linked Data" rel="homepage" href="http://linkeddata.org/">Linked Open Data</a> is from my point a key technological building block on this road. However, a lot of work is ahead of us. LOD has to find its way directly into mainstream technology such as CMSes, Search Engines, Web Applications, Mash-Ups and we have to show users and stakeholders the direct added-value of this technology.</p>
<p><strong>Thomas Thurner: </strong>What is the current state of the LOD cloud from a technological point of view? Where do you see room for improvement?</p>
<p><strong>Sören Auer: </strong>Currently, the technological state of LOD seems to be comparable to the early days of the Web. We are still able to draw maps/clouds of the LOD datasets and data links are still sparse and difficult to maintain. This reminds me a lot of the early days of the Web, where we also had problems with broken links (the infamous 404). Later, after content management systems and Web applications automatized the link generation and maintenance this improved a lot and I hope we are on the same road with LOD technologies finding its way into more and more Web systems.</p>
<p><strong>Thomas Thurner: </strong>How is the LOD2 project addressing theses issues? What are the project&#8217;s key objectives?</p>
<p><strong>Sören Auer: </strong>LOD2 is addressing in three ways: First, we develop new research approaches highly relevant for LOD, for example, for <a class="zem_slink" title="Linked Data" rel="wikipedia" href="http://en.wikipedia.org/wiki/Linked_Data">Linked Data</a> management, automatic data linking as well as Linked Data enrichment andquality improvement. Second, we implement and integrate these approaches into specialized tools (e.g. SILK, <a class="zem_slink" title="OntoWiki" rel="wikipedia" href="http://en.wikipedia.org/wiki/OntoWiki">OntoWiki</a>, Virtuoso and DL-Learner) forming together the integrated LOD2 stack. The LOD2 stack can be used by data publishers for the whole life-cycle of Linked Data management ranging from extraction over linking, authoring, enrichment to exploration &amp; search.</p>
<p><strong>Thomas Thurner: </strong>What do you think are the most important factors to bring LOD to the masses?</p>
<p><strong><img title="Sören Auer" src="http://lod2.eu/BlogPost/wp-content/uploads/2011/04/Bild-2-217x300.png" alt="" hspace="5" width="114" height="158" align="right" />Sören Auer: </strong>From my point of view the key factor here is that we manage to integrate the large number of tools and approaches for supporting the Linked Datalife-cycle stages in a synergistic way, where each aspect adds value and triggers a number of other improvements. For example, the establishing of a new data link has a direct effect on search &amp; exploration of Linked Data. We have to directly show these kind of benefits to users so they receive and instant gratification for contributions to the Web of Data. Semantic Wikis, such as <a class="zem_slink" title="Semantic MediaWiki" rel="wikipedia" href="http://en.wikipedia.org/wiki/Semantic_MediaWiki">Semantic MediaWiki</a> and OntoWiki, are already nicely working in this direction. An application with an enormous potential to bring LOD to the masses would be the creation of a distributed, social semantic network. With <a class="zem_slink" title="OpenID Foundation" rel="homepage" href="http://openid.net">OpenId</a>, WebId, FOAF, Semantic Pingback most of the building blocks are available, but the final step integrating these into an easy-to-use social networking application still has to be done.</p>
<p><strong>Thomas Thurner: </strong>Compared to other semantic web approaches linked data principles seem to be rather easy to understand. On the other hand some argue that the &#8220;linked data cloud&#8221; is a big heap of data which cannot be used for professional purposes. What is your point of view?</p>
<p><strong>Sören Auer: </strong>Of course the currently available data is not useful for all potential usage scenarios. However, already now Linked Data can be used for many interesting applications: For example, we just completed the development of a prototype for a large search engine, where users searching are assisted with comprehensive background information obtained from the Linked Data Web. For this use case, information available as Linked Data is already very valuable and useful. The criticism of LOD being a &#8220;heap of data&#8221; also reminds me a lot of the early days of the Web, where people raised similar criticisms for the Web being a medium of un-professionalism. Later it turned out that, of course there is a lot of amateurism, but as Wikipedia impressively demonstrates the working together of many amateurs with the right tools can in the end outperform few professionals.</p>
<p><strong>Thomas Thurner: </strong>Linked Data could also become a new paradigm for light-weight enterprise data integration. What are the biggest obstacles today for linked data to being accepted by the business community?</p>
<p><strong>Sören Auer: </strong>Using Linked Data for data integration in large enterprises has an enormous potential. Just last week I was invited for a workshop with the IT department of one of the top car makers and the people responsible there for data integration were extremely excited about the opportunities of Linked Data in the large heterogeneous enterprise with more than 3000 different backend systems. Linked Data technologies can easily fill the gap between unstructured Intranet search and expensive &amp; complicated Service-oriented Architectures. Compared to SOA, Linked Data is a pay-as-you-go strategy, where data integration can be performed incementally and in sync with the requirements  and evolution of the data structures in the enterprise. In order to realize this vision, we need to continue the maturation of enterprise Linked Data tools &#8211; the availability of <a href="http://poolparty.punkt.at">PoolParty</a>, Sindice Enterprise Edition, <a href="http://virtuoso.openlinksw.com/">Virtuoso</a>, <a href="http://www.topquadrant.com/products/TB_Composer.html">TopBraid</a> are already important steps in that direction.</p>
<p><strong>Thomas Thurner: </strong>Automatic mechanisms to curate linked data and to make alignments between datasets possible play a crucial role for the next phase of linked data economics. Which technologies will play a central role? What will be the most critical point &#8211; do you see a &#8220;wisdom of the crowd&#8221; playing a role in this game?</p>
<p><strong><img title="Sören Auer" src="http://lod2.eu/BlogPost/wp-content/uploads/2011/04/3452_tmpphpVThg6k.jpg" alt="" hspace="5" width="110" height="133" align="left" />Sören Auer: </strong>Definitely! Tapping the wisdom of the crowd for mapping &amp; linking has a huge potential, which is currently unused. We started working in that direction with <a href="http://dbpedia-live.openlinksw.com/live/"><span class="zem_slink">DBpedia</span> Live</a> and the <a href="http://mappings.dbpedia.org/">DBpedia mapping Wiki</a>. In order, to make it really easy for people to contribute we have to dramatically lower the barrier to contributing to the alignment process. In LOD2 we also plan to enable users to create mapping and links between dataset by simply giving examples of correct links and evaluating some automatically generated ones.</p>
<p><strong>Thomas Thurner: </strong>At the moment governments all around the world start to publish open data, more and more stakeholders start to understand the benefit of open linked data. On the other hand enterprises haven´t even started with this topic. What could be the dynamics which will trigger projects in industry sectors like financial industries which will make use of open data principles?</p>
<p><strong>Sören Auer: </strong>Making statistical and financial information available in structured form and as Linked Data could have a enormous impact in this regard. With the DataCube vocabulary effort a first step in this direction was made, but it would be nice if this vocabulary would get an official stamp of a standardization organization such as W3C. Since the benefit of publishing statistical and financial data in structured form, e.g. as Linked Data, is visible most when done by many, this could be also facilitated by government regulations and industry best-practices.</p>
<p><strong>About INFAI</strong></p>
<p>The <a href="http://infai.org/">Institute for Applied Computer Science</a> (InfAI) at Universität Leipzig hosts research groups in service sciences, knowledge engineering and management as well as natural language processing. The approximately 20 researchers of the <a href="http://aksw.org/">Agile Knowledge Engineering and Semantic Web</a> (AKSW) research group at InfAI headed by Dr. Sören Auer are establishing theoretical results and scalable implementations for the field. Particular emphasis is given to areas such as ontology creation and<br />
manipulation, knowledge extraction, ontology learning and information &amp; data integration on the Semantic Data Web. The implemented tools and services (such as DBpedia, OntoWiki, DL-Learner and LinkedGeoData) developed by the group enjoy considerable popularity.</p>
<p><strong>About Sören Auer</strong></p>
<p>Dr. Sören Auer leads the research group Agile Knowledge Engineering and  Semantic Web (AKSW) at Universität Leipzig. His research interests  include semantic data web technologies, knowledge representation,  engineering &amp; management, usability, agile methodologies as well as  databases and information systems. He aims to combine strong theoretical  results with high-impact practical applications. Sören is author of  over 50 peer-reviewed scientific publications resulting in a Hirsch  index of 15. Sören is leading the large-scale integrated EU-FP7-ICT  research project &#8220;LOD2 &#8211; Creating Knowledge out of Interlinked Data&#8221;.  Sören is founder (respectively co-founder) of several high-impact  research and community projects such as the Wikipedia semantification  project DBpedia or the social Semantic Web toolkit OntoWiki. He is  co-organiser of several workshops, programme chair of I-Semantics 2008,  OKCON 2010, ESWC 2010 and ICWE 2011, area editor of the Semantic Web  Journal, serves as an expert for industry, the European Commission, the  W3C and is member of the advisory board of the Open Knowledge  Foundation.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.semantic-web.at/2011/04/20/the-hype-the-hope-and-the-lod2-soren-auer-engaded-in-the-next-generation-lod/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Using Triplify to expose the semantics of a site</title>
		<link>http://blog.semantic-web.at/2009/04/20/using-triplify-to-expose-the-semantics-of-a-site/</link>
		<comments>http://blog.semantic-web.at/2009/04/20/using-triplify-to-expose-the-semantics-of-a-site/#comments</comments>
		<pubDate>Mon, 20 Apr 2009 17:32:58 +0000</pubDate>
		<dc:creator>Thomas Schandl</dc:creator>
				<category><![CDATA[Calls & Competitions]]></category>
		<category><![CDATA[Linked Data & Open Data]]></category>
		<category><![CDATA[mysql]]></category>
		<category><![CDATA[php]]></category>
		<category><![CDATA[Sören Auer]]></category>
		<category><![CDATA[SPARQL]]></category>
		<category><![CDATA[triplification]]></category>
		<category><![CDATA[Triplify]]></category>

		<guid isPermaLink="false">http://blog.semantic-web.at/?p=858</guid>
		<description><![CDATA[Recently the SWC took a thorough look at Triplify, a tool for mapping the contents of a relational DB to RDF, in the course of which we could convince ourselves of Triplify&#8217;s ease of use and its potent capabilities. We &#8230; <a href="http://blog.semantic-web.at/2009/04/20/using-triplify-to-expose-the-semantics-of-a-site/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p>Recently the SWC took a thorough look at <strong><a href="http://triplify.org/Overview">Triplify</a></strong>, a tool for mapping the contents of a relational DB to RDF, in the course of which we could convince ourselves of Triplify&#8217;s ease of use and its potent capabilities.<br />
We take this opportunity to given an account of the philosophy behind Triplify, how it is used and also had the chance to <strong>interview the creator Sören Auer</strong>.</p>
<p><img src="http://blog.semantic-web.at/wp-content/uploads/2009/04/logo-triplify.png" alt="Triplify Logo" /></p>
<p>A common objection from critics of the semantic web is that regular users or webmasters won&#8217;t go to the trouble of marking up their content or whole web sites with RDF.<br />
While it is obvious that nobody is going to decorate their web pages with hand-carved RDF triples, it is also apparent that a lot of the current web&#8217;s pages are generated by <strong>transforming information from relational databases to HTML pages</strong>, which are perfectly suited for human consumption, but which suffer from a big loss of machine-readable semantics.</p>
<p>As the information in the relational databases is highly structured and contains rich semantics, it is only natural to <strong>also use the already existing structured data to generate RDF representations</strong> of the same information.</p>
<p>Triplify is all about this approach of <strong>bootstrapping data for the semantic web</strong>. It does this for web applications which are built on PHP and MySQL.<br />
Triplify consists of a lightweight PHP script and a configuration file. The latter is used to do the mapping of the columns of an application’s relational database to appropriate RDF classes and properties.</p>
<p>In many cases a site administrator who wants to export her site&#8217;s content as RDF, only has to save Triplify with a premade configuration file for her site&#8217;s application into the right folder, as for <a href="http://triplify.org/Documentation">many popular applications</a> like WordPress, Joomla! or phpBB all the work has already been done.<br />
Once installed, Triplify can be used to generate a dump of the site&#8217;s complete RDF graph, or to generate <strong><a href="http://linkeddata.org/">Linked Data</a></strong>, as each of the site&#8217;s main concepts&#8217; RDF graph is provided under its own URL, e.g. the semantic description of a user with the ID 123 can be accessed under http://yoursite.com/triplify/user/123.</p>
<p>If no configuration for an application exits, it is fairly <strong>easy to create one by yourself</strong>.<br />
All one has to do is to look at the app&#8217;s database schema, find appropriate classes and properties from well known ontologies and create MySQL queries that grab the data from the relational database and map them to RDF classes or properties.<br />
An example for a query that takes the data from a table describing the user of a CMS:<br />
<code>"SELECT id, name AS 'foaf:name', url AS 'foaf:homepage', short_description AS 'dc:abstract' FROM user_table",</code></p>
<p>Triplify&#8217;s creator Sören Auer kindly gave us the opportunity for an <strong>interview</strong>:</p>
<p><strong>Triplify is very easy to configure for web developers. For which scenarios would you recommend to use Triplify, and in which situations other approaches of semantifying your data might be more suitable?</strong></p>
<p>As you already mentioned Triplify was primarily developed for Web applications developed in PHP. These usually have a relatively small and simple set of tables. Triplify creates complete RDF exports, Linked Data or JSON, but does not include SPARQL endpoint functionality. When SPARQL is required you are better off with D2R Server or Virtuosos RDF views.</p>
<p><strong>Triplify creates semantic representations of the data in relational databases. Do you think there would also be benefit in the inverse approach i. e. creating an application that parses triples and writes it to a relational DB according to a mapping file?</strong></p>
<p>In certain scenarios this might make sense, but for the most cases I think the database schema has to be developed separately. Database schemata contain more storage and retrieval oriented information, such as for example about data indexing. Vocabularies and ontologies on the other hand represent information on a conceptually higher level and are more flexible with regard to evolution of the information structures than databases.</p>
<p><strong>Are there plans for further development of Triplify?</strong></p>
<p>Sure. We want to add SPARQL support and possibly port Triplify to other scripting languages such as Ruby and Python.</p>
<p><strong>Thank you Sören, we will stay tuned about the news from your great application and look forward to the <a href="http://blog.aksw.org/2009/triplification-challenge-2009/" target="_self">Triplification Challenge 2009</a>!</strong></p>
]]></content:encoded>
			<wfw:commentRss>http://blog.semantic-web.at/2009/04/20/using-triplify-to-expose-the-semantics-of-a-site/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>OntoWiki Workshop</title>
		<link>http://blog.semantic-web.at/2008/12/09/ontowiki-workshop/</link>
		<comments>http://blog.semantic-web.at/2008/12/09/ontowiki-workshop/#comments</comments>
		<pubDate>Tue, 09 Dec 2008 17:24:46 +0000</pubDate>
		<dc:creator>Thomas Schandl</dc:creator>
				<category><![CDATA[Software Development]]></category>
		<category><![CDATA[Tools & Software]]></category>
		<category><![CDATA[conX]]></category>
		<category><![CDATA[OntoWiki]]></category>
		<category><![CDATA[Sebastian Dietzold]]></category>
		<category><![CDATA[Semantic MediaWiki]]></category>
		<category><![CDATA[Sören Auer]]></category>
		<category><![CDATA[University of Leipzig]]></category>
		<category><![CDATA[zend framework]]></category>

		<guid isPermaLink="false">http://blog.semantic-web.at/?p=400</guid>
		<description><![CDATA[Days 3 and 4 of the OntoWiki KickOff Meeting in Leipzig were comprised of semantic technologies and OntoWiki development workshops. Just like the overall organization of the project meeting was very good, so Sebastian Dietzold, Sebastian Hellmann, Michael Martin and &#8230; <a href="http://blog.semantic-web.at/2008/12/09/ontowiki-workshop/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p>Days 3 and 4 of the <a href="http://blog.semantic-web.at/2008/12/03/ontowiki-kick-off-in-leipzig/"><span><span>OntoWiki</span> <span>KickOff</span> Meeting</span></a><span> in Leipzig were comprised of semantic technologies and <span>OntoWiki</span> development workshops.</span></p>
<p>Just like the overall organization of the project meeting was very good, so <em><span>Sebastian <span>Dietzold</span></span></em>, <em><span>Sebastian <span>Hellmann</span></span></em>, <em>Michael Martin</em> and <em><span><span>Jörg</span> <span>Unbehauen</span></span></em> did a real good job at putting the ideas behind key concepts of the semantic web across in several <strong><span>introductory <span>SemWeb</span> presentations</span></strong>. Their talks about various technologies from the <a href="http://en.wikipedia.org/wiki/Semantic_Web_Stack">semantic web stack</a><span> like <span>URIs</span>, RDF and its serialisations, RDFS, SPARQL and some related tools were well suited to bring people who are relatively new to the semantic web up to speed. Links to the presentation slides can be found at the </span><a href="http://aksw.org/Events/2009/OntoWikiKickOff">project page</a> in the coming days.</p>
<p>Later <em><span>Jens <span>Lehmann</span></span></em> outlined the new things <strong>OWL 2</strong> brings, e. g. <a href="http://www.w3.org/TR/owl2-profiles/">profiles</a><span>, which are subsets of OWL 2 and which provide different degrees of <span>expressivity</span> and reasoning efficiency.</span></p>
<p>The last day started with <em><span><span>Sören</span> <span>Auer&#8217;s</span></span></em> presentation of their <strong>semantic wiki</strong> <a href="http://openresearch.org/wiki/Main_Page"><span><span>OpenResearch</span></span></a><span>, a site where information on conferences, journals and scientists is pooled. <span>OpenResearch</span> is built with Semantic <span>MediaWiki</span> (SMW), just like our </span><a href="http://social.semantic-web.at/index.php/Main_Page">Social Semantic Web</a> wiki.</p>
<p><span>While SMW is a very useful tool as it lowers the entry barriers for using semantic <span>wikis</span>, <span>Sören</span> also pointed out  that in comparison </span><strong><span><span>OntoWiki</span> provides some important features</span></strong> that SMW doesn&#8217;t have:</p>
<ul>
<li>SMW doesn&#8217;t use <strong>SPARQL </strong><span>for its queries, but a less powerful custom query language, whereas <span>OntoWiki</span> has full SPARQL support.</span></li>
<li><strong><span><span>OntoWiki&#8217;s</span> UI</span></strong><span> has many widgets <span>that</span> support the user when entering data or new properties on a page (e. g. there is an <span>autocomplete</span> feature for suggesting properties)</span></li>
<li>With SMW <strong><span>changes to the <span>wiki&#8217;s</span> semantic structure</span></strong><span> often entail manual changes to many, many pages. With <span>OntoWiki</span> it is easy to e.g. change <span>poperties</span> at any time.</span></li>
</ul>
<p><span>For the new version of <span>OntoWiki</span> <span>Sören</span> and his team use the </span><strong><span><span>Zend</span> framework</span></strong> and develop the <strong><a href="http://aksw.org/Events/2009/OntoWikiKickOff/files?get=erfurt_api.pdf"><span><span>Erfurt</span> API</span></a> to store and access RDF data</strong><span>. The <span>Erfurt</span> API supports SPARQL, <span>versioning</span>, caching and RDF based <span>authentification</span>/access control. It </span><strong>abstracts different stores</strong><span> using the adapter pattern, so it can be used with Virtuoso and any other store which has an interface provided by <span>Zend</span>_Db (MySQL, Oracle, <span>PostgreSQL</span>, etc.) plus they are working on an interface for <span>Redland</span>. Find the slides for </span><em><span><span>Philipp</span> <span>Frischmuth&#8217;s</span></span></em><span> <span>Erfurt</span> API presentation </span><a href="http://aksw.org/Events/2009/OntoWikiKickOff/files?get=erfurt_api.pdf">here</a>, the API documentation <a href="http://docs.ontowiki.net/erfurt/doc/">here</a> and <em><span>Norman <span>Heino&#8217;s</span></span></em><span> <span>Zend</span> &amp; <span>OntoWiki</span> Application Framework presentation </span><a href="http://aksw.org/files/ontowikiextensiondevelopment.pdf">here</a>.</p>
<p><em><span>Julian <span>Jöris</span></span></em> demonstrated how <a href="http://seleniumhq.org/">Selenium</a> is used for <strong>acceptance testing</strong>. This is a very promising testing framework for web applications, where one can e.g. record interactions with different browsers and automatically run them as tests. Selenium has a <a href="https://addons.mozilla.org/de/firefox/addon/1157"><span><span>Firefox</span> extension</span></a> to record macros and is integrated with <a href="http://www.phpunit.de/"><span><span>PHPUnit</span></span></a>.</p>
<p>Finally we had a very good discussion about our <strong><span><span>conX</span>-<span>OntoWiki</span> integration use case</span></strong> and application ideas, so we left Leipzig with a pleasant anticipation of the coming co-operation in the project.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.semantic-web.at/2008/12/09/ontowiki-workshop/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>The Semantic Web becomes mainstream, again.</title>
		<link>http://blog.semantic-web.at/2008/12/05/the-semantic-web-becomes-mainstream-again/</link>
		<comments>http://blog.semantic-web.at/2008/12/05/the-semantic-web-becomes-mainstream-again/#comments</comments>
		<pubDate>Fri, 05 Dec 2008 11:09:08 +0000</pubDate>
		<dc:creator>Andreas Blumauer</dc:creator>
				<category><![CDATA[Enterprise 2.0]]></category>
		<category><![CDATA[Literature & Publications]]></category>
		<category><![CDATA[Open Calais]]></category>
		<category><![CDATA[Open source]]></category>
		<category><![CDATA[Paul Miller]]></category>
		<category><![CDATA[Sören Auer]]></category>
		<category><![CDATA[t3n]]></category>

		<guid isPermaLink="false">http://blog.semantic-web.at/?p=366</guid>
		<description><![CDATA[The roll-out of semantic web technologies seems to enter the next stage. And it will be a quiet (r)evolution like the open source movement was. Two examples: Next year´s JAX in Mainz/Germany will have its first Semantic Web track. Organisers &#8230; <a href="http://blog.semantic-web.at/2008/12/05/the-semantic-web-becomes-mainstream-again/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p>The roll-out of semantic web technologies seems to enter the next stage. And it will be a quiet (r)evolution like the open source movement was. Two examples: Next year´s JAX in Mainz/Germany will have its first <a href="http://it-republik.de/jaxenter/jax/specialdays/" target="_blank">Semantic Web track</a>. Organisers say that &#8220;the Semantic Web is going to conquer the business market soon&#8221; &#8211; we will see if it will be that martial.</p>
<p><a href="http://blog.semantic-web.at/wp-content/uploads/2008/12/t3n1.jpg"><img class="alignleft size-full wp-image-372" title="t3n1" src="http://blog.semantic-web.at/wp-content/uploads/2008/12/t3n1.jpg" alt="" width="255" height="257" /></a>Another example: One of the biggest Open Source Magazines in Germany, <a href="http://t3n.yeebase.com/" target="_blank">t3n</a>, has recently published its new magazine with many stories around the Semantic Web. Editor in chief, Jan Christe says: &#8220;We have constantly stumbled upon semantic web related stuff  when we scanned the news, so we decided to set a focus on this topic.&#8221;</p>
<p>The Semantic Web is tangible now &#8211; Christe says: &#8220;Applications like <a href="http://opencalais.com/" target="_blank">OpenCalais</a>, <a href="http://www.zemanta.com/" target="_blank">Zemanta</a> or <a href="http://tagaroo.opencalais.com/" target="_blank">Tagaroo</a> show the end-users what´s really in for them.&#8221; And it is also nice to see, that the semantic web won´t be reduced down to &#8220;search&#8221; anymore: <a href="http://t3n.yeebase.com/magazin/ausgaben/t3n-pageflip" target="_blank">t3n´s new issue</a> has also interesting articles about Linked Data, for instance Sören Auer´s &#8220;How to develop Semantic Web Applications&#8221;.</p>
<p>So, as a conclusion: <a href="http://blogs.zdnet.com/semantic-web/?p=237" target="_blank">Paul Miller´s waiting</a> for the &#8220;Semantic Web in Business&#8221; (a great blog post!) has an end. It won´t be found in heavy books, rather in the open source community and sometimes in light-weight magazines.</p>
<p>Yes, we can!</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.semantic-web.at/2008/12/05/the-semantic-web-becomes-mainstream-again/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>DBpedia, UMBEL &amp; the Future Web&#8217;s Ecology &#8211; interview with Mike Bergman &amp; Sören Auer</title>
		<link>http://blog.semantic-web.at/2008/11/10/umbel-dbpedia-futureweb-ecology-interview/</link>
		<comments>http://blog.semantic-web.at/2008/11/10/umbel-dbpedia-futureweb-ecology-interview/#comments</comments>
		<pubDate>Mon, 10 Nov 2008 11:09:01 +0000</pubDate>
		<dc:creator>Andreas Blumauer</dc:creator>
				<category><![CDATA[Linked Data & Open Data]]></category>
		<category><![CDATA[Mashups & Web services]]></category>
		<category><![CDATA[Ontology Engineering]]></category>
		<category><![CDATA[dbpedia]]></category>
		<category><![CDATA[Linked Data]]></category>
		<category><![CDATA[Mike Bergman]]></category>
		<category><![CDATA[OpenLink Software]]></category>
		<category><![CDATA[Semantic Web]]></category>
		<category><![CDATA[Sören Auer]]></category>
		<category><![CDATA[UMBEL]]></category>
		<category><![CDATA[Wikipedia]]></category>
		<category><![CDATA[Zitgist]]></category>

		<guid isPermaLink="false">http://blog.semantic-web.at/?p=339</guid>
		<description><![CDATA[The Linked Open Data infrastructure is in a tremendous process of maturing &#8211; the recent release of UMBEL&#8217;s webservice AND the incorporation of UMBEL classes in DBpedia are yet another confirmation of this exciting process. Knowing and having met DBpedia &#8230; <a href="http://blog.semantic-web.at/2008/11/10/umbel-dbpedia-futureweb-ecology-interview/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p><img src="http://www.semantic-web.at/file_upload/3452_tmpphpVThg6k.jpg" alt="Sören Auer" align="right" border="0" width="150" height="180">The Linked Open Data infrastructure is in a tremendous process of maturing &#8211; the recent release of <a href="http://umbel.zitgist.org">UMBEL&#8217;s webservice</a>   AND the incorporation of <a href="http://lists.w3.org/Archives/Public/public-lod/2008Sep/0071.html">UMBEL classes in DBpedia</a>  are yet another confirmation of this exciting process. Knowing and having met DBpedia co-initiator, Triplify main developer and head of the AKSW research group <a href="http://www.informatik.uni-leipzig.de/%7Eauer/">Sören Auer</a> and UMBEL editor and Zitgist CEO <a href="http://www.mkbergman.com/?page_id=4%20">Mike Bergman</a> in various contexts, I felt it was time to talk to and pick the brains of both these key players in a dialog situation. The (first) result is the interview you can find below. As not everyone can expected to be familiar with both projects, here is some backgrond to get you started (you can also <a href="#interview-dbpedia-umbel">go directly to the interview</a>):</p>
<div align="right"><small>Sören Auer (image above), Mike Bergman (image below)</small></div>
<p><a href="http://dbpedia.org">DBpedia</a> has become the largest RDF repository for encyclopaedic knowledge, extracting structured information from Wikipedia and making it available on the Web of Data. <a href="http://umbel.org/">UMBEL</a>, on the other hand, provides an OpenCYC-based, light-weight ontology structure for relating Web content and data to a standard set of subject concepts, with a number of 20,000 concepts currently reached. In the Linked Data Cloud, DBpedia and UMBEL map and cross-reference each other.</p>
<p><img src="http://www.semantic-web.at/file_upload/3452_tmpphpLUz7Hq.jpg" alt="Mike Bergman" align="right" border="0" width="150" height="180">In practice this means that UMBEL provides classes to describe the concepts to which “things” are members. For instance, named entities from Wikipedia such as “<a href="http://umbel.org/umbel/ne/wikipedia/John_F._Kennedy">John F. Kennedy</a>”  are mapped with subject concepts such as <a href="http://umbel.org/umbel/sc/Leader">Leader</a>, <a href="http://umbel.org/umbel/sc/Person">Person</a>, <a href="http://umbel.org/umbel/sc/Administrator">Administrator</a> and <a href="http://umbel.org/umbel/sc/Graduate">Graduate</a>, with broader and equivalent classes in CYC and FOAF and broader subject concepts within UMBEL. A link is set to Wikipedia, as well as a ‘same as’ reference to DBpedia.  A class structure enables faceted browsing and extraction, inferencing, and navigation and discovery for all datasets linked to that structure.</p>
<p>DBpedia, in turn, returns <a href="http://dbpedia.org/resource/John_F._Kennedy">properties of &#8216;John J. Kennedy&#8217;</a> (e.g. abstracts in available Wikipedia languages, demographic information such as birth date and place, alma mater, predecessors and successors), and ‘same as’ references, e.g., to the <a href="http://rdf.freebase.com/ns/guid.9202a8c04000641f8000000000c1c424">JFK entry in Freebase</a>  (who recently <a href="http://blog.semantic-web.at/2008/10/30/the-day-after-freebase-went-rdf">released their RDF service</a>) and the aforementioned page in UMBEL. Furthermore, DBpedia maps the URI with available RDF types, for instance <a href="http://xmlns.com/foaf/spec/#term_Person">foaf:person</a>  or <a href="http://dbpedia.org/class/yago/AssassinatedAmericanPoliticians">yago:AssassinatedAmericanPoliticians</a> and, once again, with UMBEL’s subject concepts Person, Administrator, Graduate and Leader.</p>
<p>Due to its reliance on Wikipedia, DBpedia does a great job at covering a bandwidth of knowledge as broad as the spectrum of the interest of people participating in Wikipedia; it’s within the area of named entities, i.e. entities such as persons, organizations, locations, which have a proper name, but are not necessarily and specifically part of a particular, acknowledged domain or discipline. UMBEL, on the other hand, has as its most apparent advantage its reliance on <a href="http://www.opencyc.org">OpenCyc</a>  and with that the strong inferencing and logic capabilities of the <a href="http://www.cyc.com/cyc/technology/whatiscyc">CYC knowledge-base</a>  which are thus also brought to the Web of Data. DBpedia is a community project started by the <a href="http://aksw.org/">University of Leipzig</a>, <a href="http://www.fu-berlin.de/en/">Free University Berlin</a> and <a href="http://www.openlinksw.com/">OpenLink Software</a>, while the open and free UMBEL is developed and hosted by <a href="LLC%20http://www.zitgist.com">Zitgist</a> with support from, again, OpenLink Software.</p>
<p>Now, and in particular with the recent release of <a href="http://umbel.zitgist.com/">Zitgist’s web service endpoints</a> and with the <a href="http://lists.w3.org/Archives/Public/public-lod/2008Sep/0071.html">incorporation of UMBEL classes in DBpedia</a>, questions arises as to the relationship of the two projects, and regarding the role of OpenLink Software in the further process. To draw a distinction:</p>
<p><strong><a name="interview-dbpedia-umbel"></a>One could say that DBpedia’s goal is to lower the barrier for web developers and end-users in the actual use of the semantic web, while UMBEL aims at bringing &#8220;order to the chaos&#8221; that is inherent to user-generated, collective knowledge. </p>
<p>Would you agree with this description – and is it a contradiction at all or the kind of dynamic the Semantic Web community has been waiting for?</strong></p>
<p><strong>Mike Bergman</strong>: Yes, I would agree with this description, though we have tried many others.  For example, <a href="http://umbel.org/intro.html">in various writings in the past</a>, we have described UMBEL as a roadmap, or middleware, or a backbone, or a concept ontology, or an &#8216;infocline&#8217;, or a meta layer for metadata, and others.  Today, what I tend to use, particularly in reference to DBpedia, is the TBox-ABox distinction  in computer science and description logics.  UMBEL is more of a class or structural and concept relationships schema &#8212; a <a href="http://en.wikipedia.org/wiki/TBox">TBox</a> &#8212; while DBpedia is more of an an instance and entity layer with attributes &#8212; an <a href="http://en.wikipedia.org/wiki/ABox">ABox</a>.  I think they are pretty complementary&#8230;<br />
<span id="more-339"></span><br />
<strong>Sören Auer</strong>: I very much agree with Mike, but would like to add that Wikipedia authors do not have in mind to create a coherent and consistent knowledge base when working on Wikipedia. I think the more we demonstrate the benefits of the semantic representations in DBpedia to the Wikipedia community, these people will start to organize and rearrange content to enable the use of Wikipedia as a knowledge base. Right now, Wikipedia authors just have not yet been confronted with the problem of synonymous infobox properties or the uncleanliness of the category system, for example. I think with a few small and non-invasive changes to Wikipedia, much of the current chaos can be already resolved.</p>
<p><strong>Mike Bergman</strong>: I agree, too, with Sören&#8217;s adder:  I think it is difficult for Wikipedia authors to be consistent or coherent across the entire Wikipedia knowledge base.  I think, then, the real question is where does that coherence or structural consistency come from?  I think the nature of that task is quite different than creating or editing instance articles.</p>
<p>As for the dynamics and drivers of the community, the role of DBpedia for practical, linked data can not be overstressed.  It was the first, remains the biggest, and has brought much visibility and awareness to linked data.  I think I was one of the first <a href="http://www.mkbergman.com/?p=354">to give DBpedia some press</a>  shortly after its release nearly two years ago, which is still one of my most popular blog posts.  I&#8217;d like to think that UMBEL is now timely and well-positioned to help provide a complementary resource of concept classes, but that is not proven.  DBpedia is. </p>
<p><strong>DBpedia relies on user-generated content, UMBEL, with CYC, is expert-driven. How will a system that combines these divergent approaches continue to grow?</strong></p>
<p><strong>Mike Bergman</strong>: DBpedia is in the enviable position of being able to leverage two things:  1) the phenomenal success and growth of its source content, Wikipedia; and 2) the growing sophistication of information and structure extraction techniques built around that phenomenon.  In fact, my most recent <a href="http://www.mkbergman.com/?page_id=461">SWEETpedia listing</a> of research projects based on Wikipedia now exceeds 170 or something projects and we discover and learn about more daily.  We now see major research efforts in Germany, Austria, Japan, New Zealand, the US and England (among those I know) aggressively mining and learning from Wikipedia.  I&#8217;m sure DBpedia will learn more, but others will increasingly contribute as well.  This is unprecedented and very exciting. Wikipedia may have as strong a heritage in contributing to research and language and structure understanding as it does as a reference encyclopedia. </p>
<p>But, that is largely instance and attribute data, the ABox to use my earlier terminology.  For structural and conceptual relationships &#8212; the coherent way to organize and relate things in the world; that is, the TBox &#8212; I think there is much subtlety and thinking required.  I&#8217;ve used the phrase before that creating structural schema is &#8220;not like flinging hash&#8221;.  Efforts such as Cyc, with nearly 1,000 person years of consistent testing and effort behind it, or perhaps others such as SUMO or what is coming out of the biology community with OBO (<a href="http://www.obofoundry.org">Open Biomedical Ontologies</a>), offer better coherency and the ability to interoperate across diverse datasets and domains.  Perhaps Wikipedia and its data extraction offshoots may someday get to this point &#8212; and I truly hope so &#8212; but are not anywhere near that today in my opinion.</p>
<p><strong>Sören Auer</strong>: I&#8217;m actually not so concerned about the lack of structure and coherency as Mike. If we look at the current mostly textual Web and the search engines making it accessible to us humans, there is almost no data, few structure and even less coherency, but search engines still manage to provide an enormous added-value. If we add more data to the Webm much more sophisticated browsing, searching and data integration interfaces can be built. Structure and coherency will then emerge automatically, once people see how their content is indexed and can be easily found (or not). </p>
<p>The same happened by the way with the traditional Web 1.0 &#8211; in the beginning nobody used HTML&#8217;s meta-data tags. Once search engines started to interpret those for ranking results, meta-tags shifted to the center of attention of every Web content manager. Applied to the Semantic Data Web: once search engines understand <a href="http://xmlns.com/foaf/spec/#term_Person">foaf:person</a>,  everybody will use this concept for describing people.</p>
<p><strong>A little experiment – <a href="http://en.wikipedia.org/wiki/Web_3.0">Web 3.0</a>,  <a href="http://en.wikipedia.org/wiki/Semantic_Web">Semantic Web</a>,  Web of Data, <a href="http://en.wikipedia.org/wiki/Linked_Data">Linked Data</a>: Can you think of an ontology that is able to connect these terms and reveal the concepts behind them?</strong></p>
<p><strong>Mike Bergman</strong>: Well, I&#8217;d like to think UMBEL is that ontology (smile).  That is certainly our intent, though truthfully we are still working out structural details and have not added all of this nice SemWeb terminology.  But it is coming shortly (smile).</p>
<p><strong>Sören Auer</strong>: A precise definition of these terms in the mathematical sense does not (and probably will never) exist, so articles such as those in Wikipedia (or many other publications) about the terms are from my point of view completely sufficient to reveal the concepts behind them to us. Of course it&#8217;s nice to have pretty and world-wide unique identifiers (such as provided by UMBEL) to annotate articles about these terms.</p>
<p><strong>Mike Bergman</strong>: Well, UMBEL is about linking to concepts, though we welcome anyone thinking our identifiers are pretty (smile).  One key aspect we will see moving forward is how we can translate those concepts into one of the 250 languages now used by Wikipedia while retaining existing structure.  That is a real exciting prospect.</p>
<p><strong>What are the concepts you would personally want to employ to explain the over-arching idea of these terms to a newbie?</strong></p>
<p><strong>Mike Bergman</strong>: Structured data on the Web is becoming like newly visible stars as nighttime darkens in the desert.  Structured data are points of light in a global information space.  We need fixed reference points in that sky to find specific stars.  We need linkages to extract meanings and constellations from them.  So, we both need to expose those stars &#8212; as linked data &#8212; and to provide fixed references to find them again and connections to draw meaning from them.  Objects, references and connections all work in concert to expose the wonder.</p>
<p><strong>Sören Auer</strong>: Its difficult for me to add something after Mike&#8217;s truly poetic description of pretty technical terms (smile). I see Linked Data or the Web of Data as the next milestone on the road of realizing the vision of the Semantic Web. In this regard, I&#8217;m Marxist (smile) and think <a href="http://www2.cddc.vt.edu/marxists/archive/marx/works/1867/letters/67_06_22.htm#n5">Marx&#8217;s Law of transformation of Quantity into Quality</a> applies: once we have a sufficient quantity of data out there on the Web, a new quality will emerge. Unfortunately, we are still far away from reaching a critical mass, since the Semantic or Data Web as we recently found out  (cf.: <a href="http://www.informatik.uni-leipzig.de/%7Eauer/publication/triplify.pdf">Triplify – Lightweight Linked Data Publication from Relational Databases</a>, PDF, 332 KB) is effectively shrinking if compared with the growth of the traditional Web.</p>
<p><strong><a href="http://www.openlinksw.com/blog/%7Ekidehen/">Kingsley Idehen </a> from OpenLink Software was the <a href="http://lists.w3.org/Archives/Public/public-lod/2008Sep/0071.html">first to announce on the W3C mailing list</a> that DBpedia &amp; UMBEL are now “fully connected.” Is Kingsley the bridge-builder between the two projects?</strong></p>
<p><strong>Mike Bergman</strong>: Without question.  Kingsley has backed both efforts in a big way with people and resources.  His company actually did the first RDF linkage between the two projects. What is more remarkable, however, is that DBpedia and UMBEL are but a small slice of the things Kingsley has been backing. He has been a leader in middleware, scalable clusters and cloud computing, RDFizing all data forms, converting relational legacy data to linked RDF data, and providing demos and teaching to newbies on mailing lists. I&#8217;m glad you asked this question because Kingsley is a real catalyst and visionary.</p>
<p><strong>Sören Auer</strong>: I agree, Kingsley is a mover and shaker in many areas of technology and innovation and in particular the Data Web. However, we should not forget his marvelous team at OpenLink with the database mastermind <a href="http://www.openlinksw.com/weblog/oerling/">Orri Erling</a>,  <a href="http://www.linkedin.com/companies/openlink-software">Hugh Williams</a>,  <a href="http://www.linkedin.com/in/ivanmikhailov">Ivan Mikhailov</a>,  <a href="http://www.twine.com/user/ghard">Yrjäna Rankka</a>  and all the others.</p>
<p><strong>In the same vein: What are roles that are vital for the LOD-engineering process? Are there also &#8220;gardeners&#8221; or &#8220;<a href="http://en.wikipedia.org/wiki/Wikipedia:Bureaucrats">bureaucrats</a>&#8220;, as Wikipedians would put it?</strong></p>
<p><strong>Mike Bergman</strong>: I think the LOD (linking open data) and DBpedia mailing lists have been very effective, and there continues to be good community and organization around those efforts.  We know that Wikipedia is not a free for all, with a kind of self-policing plus type of governance.  I think that works well at the instance level.</p>
<p>However, structure decisions at the level of conceptual schema such as UMBEL or OpenCyc or even the mapping of classes between ontologies or datasets requires more skill and care.  Others may not agree, but I think the schema aspects essential to UMBEL&#8217;s purpose &#8212; while definitely needing to be open and participatory at the suggestion or input level &#8212; possibly require roles more like &#8220;priest&#8221; or &#8220;professional&#8221; or &#8220;authority&#8221; at the actual roll-out level.  Without quality, structure is nothing, and all of this is just an elaborate toy.</p>
<p><strong>Sören Auer</strong>: Again my philosophy here differs a little from Mike&#8217;s: I&#8217;m pretty skeptic there will be one ontology or organization scheme for the Semantic Web. Rather, I think structure and homogeneity will be achieved on a peer-to-peer basis first and a community consensus will emerge later.</p>
<p><strong>Mike Bergman</strong>: Yeah, this is an excellent point and I&#8217;m glad Sören raised it.  While it is true we have put much effort into creating a lightweight structure of concept classes for linking disparate datasets, we too do not see &#8220;one ontology to rule them all.&#8221;  I suspect there will often be none, and then many times other frameworks chosen. It really depends on the use case and purpose. UMBEL&#8217;s specific purpose is to provide a coherent framework for serious knowledge engineers looking to federate data.  After that, other frameworks with a different purposes may then need to do the heavy lifting of actual data interoperability.</p>
<p><strong>With its recent RDF service release, <a href="http://www.freebase.com">Freebase</a>  has risen to the level of a major SemWeb knowledge base, too. Where do you see its role in the future SemWeb ecology, also in relation to your own projects?</strong></p>
<p><strong>Sören Auer</strong>: To be honest, I was not very convinced of Freebase from the very beginning although their technology in particular the user interfaces are impressive. From my point of view the Freebase approach was too centralized and proprietary. A better strategy would have been to develop an open (source) technology, which people can deploy on their own Websites combined with server side crawling and search facilities. The first part of this equation is by the way exactly what we are aiming for with our <a href="http://ontowiki.net/">OntoWiki</a>  and <a href="http://triplify.org">Triplify</a>  projects. However, if Freebase now moves towards more openness and interoperability, this can be only applauded.</p>
<p><strong>Mike Bergman</strong>:  I think early on that Freebase was needing to get its own feet set and did not do much with regard to open standards or external interoperability.  These are good signs we are now seeing.  However, I think the revenue model around user-supplied data remains highly suspect.  Heck, Wikipedia with all of its tremendous success is daily soliciting contributions from users.  It&#8217;s hard to get traction without being open and free, and its hard to make money when you are open and free even with traction.  But these new announcements now make it much easier for us to use Freebase should our customers request it.</p>
<p><em>About DBpedia:</em></p>
<p>DBpedia is a community effort to extract structured information from Wikipedia and to make this information available on the Web. Wikipedia is the by far largest publicly available encyclopedia on the Web. Wikipedia editions are available in over 250 languages with the English one accounting for more than 2.49 million articles. Unfortunately, Wikipedia&#8217;s search capabilities are limited to full-text search, which allows very limited access to this valuable knowledge-base. Semantic Web technologies enable expressive queries against structured and interlinked information on the Web. DBpedia allows you to make sophisticated queries against Wikipedia, and to link other data sets on the Web to Wikipedia data. The DBpedia data set currently provides information about more than 2.49 million “things”, including at least 108,000 persons, 392,000 places, 57,000 music albums, and 36,000 films. Altogether, the DBpedia data set consists of 218 million pieces of information (RDF triples).</p>
<p><a href="http://dbpedia.org/">dbpedia.org</a> (general website)</p>
<p><a href="http://wiki.dbpedia.org/">wiki.dbpedia.org/OnlineAccess</a> (DBpedia Wiki &#8211; Online Access)</p>
<p><em>About UMBEL:</em></p>
<p>UMBEL (Upper Mapping and Binding Exchange Layer) is a lightweight ontology structure for relating Web content and data to a standard set of 20,000 subject concepts. Its purpose is to provide a fixed set of reference points in a global knowledge space. These subject concepts have defined relationships between them, and can act as binding or attachment points for any Web content or data. UMBEL is like a map of an interstate highway system, a set of roadsigns to help find related content and a way of getting from one big place to another. Once in the right vicinity, other maps (or ontologies) — more akin to detailed street maps — are then necessary to get to specific locations or street addresses. By definition, these more fine-grained maps are beyond UMBEL&#8217;s scope. But UMBEL can help provide the context for placing such detailed maps in relation to one another and in relation to the Big Picture of what related content is about.</p>
<p><a href="http://www.umbel.org">umbel.org</a> (project website)</p>
<p><a href="http://umbel.zitgist.com">umbel.zitgist.com</a> (UMBEL webservice &#8211; sandbox)</p>
<p><small>The interview was led by Andreas Blumauer, <a href="http://semantic-web.at/">SWC</a></small></p>
<div style="margin-top: 10px; height: 15px;" class="zemanta-pixie"><a class="zemanta-pixie-a" href="http://reblog.zemanta.com/zemified/920fa430-b077-4dbc-9e94-5fc23481dd87/" title="Zemified by Zemanta"><img style="border: medium none ; float: right;" class="zemanta-pixie-img" src="http://img.zemanta.com/reblog_e.png?x-id=920fa430-b077-4dbc-9e94-5fc23481dd87" alt="Reblog this post [with Zemanta]"></a></div>
]]></content:encoded>
			<wfw:commentRss>http://blog.semantic-web.at/2008/11/10/umbel-dbpedia-futureweb-ecology-interview/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
		<item>
		<title>Bringing (Legacy) Data to the Web [WOD-PD]</title>
		<link>http://blog.semantic-web.at/2008/10/22/bringing-legacy-data-to-the-web-wod-pd/</link>
		<comments>http://blog.semantic-web.at/2008/10/22/bringing-legacy-data-to-the-web-wod-pd/#comments</comments>
		<pubDate>Wed, 22 Oct 2008 15:44:37 +0000</pubDate>
		<dc:creator>Jana Herwig</dc:creator>
				<category><![CDATA[Conferences & Events]]></category>
		<category><![CDATA[Linked Data & Open Data]]></category>
		<category><![CDATA[AKSW]]></category>
		<category><![CDATA[Billion Triples Challenge]]></category>
		<category><![CDATA[OpenLink Software]]></category>
		<category><![CDATA[OpenLink Virtuoso]]></category>
		<category><![CDATA[Orri Erling]]></category>
		<category><![CDATA[Relational to RDF Mapping]]></category>
		<category><![CDATA[Semantic Proxy]]></category>
		<category><![CDATA[Sören Auer]]></category>
		<category><![CDATA[Triplify]]></category>
		<category><![CDATA[University of Leipzig]]></category>
		<category><![CDATA[Virtuoso Universal Server]]></category>
		<category><![CDATA[WOD-PD]]></category>

		<guid isPermaLink="false">http://blog.semantic-web.at/?p=318</guid>
		<description><![CDATA[The third session at WOD-PD was dedicated to &#8220;Bringing (Legacy) Data on the Web&#8220;, and led by Sören Auer (University of Leipzig, Germany) and Orri Erling (OpenLink Software) . Sören Auer described the difference between the Web 1.0, 2.0 and &#8230; <a href="http://blog.semantic-web.at/2008/10/22/bringing-legacy-data-to-the-web-wod-pd/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p>The <a href="http://webofdata.info/sessions/#session3">third session at WOD-PD</a> was dedicated to &#8220;<a href="http://webofdata.info/sessions/#session3">Bringing (Legacy) Data on the Web</a>&#8220;, and led by <a href="http://aksw.org/SoerenAuer">Sören Auer</a> (University of Leipzig, Germany) and <a href="http://www.openlinksw.com/weblog/oerling/">Orri Erling</a> (OpenLink Software) .</p>
<p><a href="http://blog.semantic-web.at/wp-content/uploads/2008/10/_talksoerenauer.jpg"><img src="http://blog.semantic-web.at/wp-content/uploads/2008/10/_talksoerenauer.jpg" alt="Sören Auer giving a talk" title="Sören Auer giving a talk" align="right" height="246" width="239"></a>Sören Auer described the difference between the Web 1.0, 2.0 and 3.0 as follows: On the Web 1.0, you had many websites that provided unstructured, mainly textual content. On the Web 2.0, you have a few large websites that are specialised on specific content types. And, finally, on the Web 3.0, there are many websites which contain, and are able to semantically syndicate, arbitrarily structured content.</p>
<p>So why would we need another web? What you cannot do with the current web is finding answers to seemingly complex, yet in reality pretty mundane question such as: Where in Leipzig do I find an apartment that is close to bilingual, German-French child care facilities? Are there any ERP service providers which have offices in Vienna and Berlin? Who are the researchers in South-East Asia currently working on database related topics?</p>
<p>Sören further discussed three of the present means of bringing relation data to the web: <a href="http://triplify.org">Triplify</a> (a web application plugin that exposes data from relational databases in RDF), <a href="http://www4.wiwiss.fu-berlin.de/bizer/d2rq/">D2RQ</a> (a declarative language to describe mappings between relational database schemata and OWL/RDFS ontologies, developed at Free University Berlin), and <a href="http://virtuoso.openlinksw.com/">Virtuoso Universal Server</a> (a middleware and database engine hybrid delivering for instance data integration for SQL, RDF, XML, Web Services). With respect to Triplify, Sören &#8211; who is Triplify&#8217;s founder and main developer at <a href="http://aksw.org/">AKSW</a> Uni Leipzig &#8211; showed and discussed the configuration for WordPress 2.1., <a href="http://triplify.org/Configuration/WordPress?v=49d">which can be found here</a> (<a href="http://triplify.org/Configuration?v=s2s">click here for more configurations</a>, e.g. for Joomla, OpenConf and Drupal). The next aim for Triplify is to become an integral part in enduser web app distibutions.</p>
<p>And important question raised by Sören was: How do next generation search engines know that something has changed on the web of data? He suggested three approaches:</p>
<ol>
<li>Always try to crawl everything (this may sound silly &#8211; but that&#8217;s actually what is happening on the current web)</li>
<li>Ping a central update notification service &#8211; e.g. <a href="http://www.pingthesemanticweb.com/">PingTheSemanticWeb.com</a> &#8211; which works as a showcase, but will probably not scale if the data web gets really deployed.</li>
<li>Each linked data endpoint publishes an update log &#8211; e.g. with Triplify, as a special folder inside the Triplify namespace, e.g. http://example.com/Triplify/update</li>
</ol>
<p>Also discussed by Sören and worth checking out is Reuters&#8217; <a href="http://www.semanticproxy.com/demo.html">Semantic proxy &#8211; the demo</a> went live in late September.</p>
<p>Orri Erling, as the lead developer of the Virtuoso Team, addressed the issue of mapping relational databases to RDF with OpenLink Virtuoso. In his talk, he addressed the pros and cons of RDF data warehouse:</p>
<blockquote><p>
Pros</p>
<ul>
<li>Even query performance across all data</li>
<li>Possibility of forward-chaining inference</li>
<li>Some SPARQL features may be better supported, e.g. Unspecified predicates</li>
</ul>
<p>Cons</p>
<ul>
<li>Keeping data up-to-date</li>
<li>Complex set up, needs dedicated servers: you don&#8217;t build them on a whim</li>
</ul>
</blockquote>
<p><img src="http://blog.semantic-web.at/wp-content/uploads/2008/10/_talkorrierling.jpg" alt="Orri Erling giving a talk" title="Talk Orri Erling" align="right" width="350">What Virtuoso delivers is mapping of SPARQL to SQL against any existing schema (whether stored in Virtuoso or elsewhere); a physical quad-store (quad as in quadruple; not as in <a href="http://en.wikipedia.org/wiki/All-terrain_vehicle">quad-bike</a> <img src='http://blog.semantic-web.at/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> ; and Federated/local Relational Data Base Management Systems (RDBMS).</p>
<p>A more detailed discussion of the <a href="http://www.openlinksw.com/dataspace/oerling/weblog/Orri%20Erling%27s%20Blog/1434">requirements for Relational-to-RDF Mapping is available on Orri&#8217;s blog</a>, where he discusses it in the light of his own experience. A power point presentation of a previous talk he gave to the <a href="http://www.w3.org/2005/Incubator/rdb2rdf/">W3C RDB2RDF Incubator Group</a> can be downloaded here: <a href="http://virtuoso.openlinksw.com/wiki/main/Main/VirtPresentations/Relational2RDF.ppt">Mapping Relational Databases to RDF with OpenLink Virtuoso</a> (PPT, 115KB). His summary of the group discussions around the same topic,<a href="http://esw.w3.org/topic/Rdb2RdfXG/ReqForMappingByOErling"> Requirements for Relational to RDF Mapping, can be found here</a>.</p>
<p>Orri also showed the Virtuoso billion triples demo which,  <a href="http://www.openlinksw.com/dataspace/oerling/weblog/Orri%20Erling%27s%20Blog/1445">according to the corresponding blogpost</a>, &#8220;is being worked on at the time of submission and may be shown online by appointment.&#8221; The demo was a submission to the <a href="http://challenge.semanticweb.org/">Billion Triples Challenge</a>.</p>
<h6 class="zemanta-related-title" style="font-size: 1em;">Related articles by Zemanta</h6>
<ul class="zemanta-article-ul">
<li class="zemanta-article-ul-li"><a href="http://blog.semantic-web.at/?p=69">Linking Open Data Triplification Challenge at I-Semantics 2008</a></li>
<li class="zemanta-article-ul-li"><a href="http://vanirsystems.com/danielsblog/2008/08/04/virtuoso-universal-server/">Virtuoso Universal Server</a></li>
</ul>
<div style="margin-top: 10px; height: 15px;" class="zemanta-pixie"><a class="zemanta-pixie-a" href="http://reblog.zemanta.com/zemified/36c33cc9-edd2-4e3e-ae01-79846b3ec02b/" title="Zemified by Zemanta"><img style="border: medium none ; float: right;" class="zemanta-pixie-img" src="http://img.zemanta.com/reblog_e.png?x-id=36c33cc9-edd2-4e3e-ae01-79846b3ec02b" alt="Reblog this post [with Zemanta]"></a></div>
]]></content:encoded>
			<wfw:commentRss>http://blog.semantic-web.at/2008/10/22/bringing-legacy-data-to-the-web-wod-pd/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
	</channel>
</rss>

