<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>The Semantic Puzzle&#187; Google</title>
	<atom:link href="http://blog.semantic-web.at/tag/google/feed/" rel="self" type="application/rss+xml" />
	<link>http://blog.semantic-web.at</link>
	<description>Open World Assumptions</description>
	<lastBuildDate>Thu, 02 Feb 2012 14:26:25 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
		<item>
		<title>Transforming spreadsheets into SKOS with Google Refine</title>
		<link>http://blog.semantic-web.at/2011/02/17/transforming-spreadsheets-into-skos-with-google-refine/</link>
		<comments>http://blog.semantic-web.at/2011/02/17/transforming-spreadsheets-into-skos-with-google-refine/#comments</comments>
		<pubDate>Thu, 17 Feb 2011 14:08:41 +0000</pubDate>
		<dc:creator>Thomas Schandl</dc:creator>
				<category><![CDATA[Tools & Software]]></category>
		<category><![CDATA[Vocabularies & Languages]]></category>
		<category><![CDATA[gics]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[SKOS]]></category>
		<category><![CDATA[taxonomy]]></category>
		<category><![CDATA[Thesaurus]]></category>

		<guid isPermaLink="false">http://blog.semantic-web.at/?p=1928</guid>
		<description><![CDATA[Looking for high quality enterprise vocabularies we recently turned our attention to the Global Industry Classification Standard (GICS), which is an industry taxonomy designed to categorize any private company. It was developed by Morgan Stanley Capital International and Standard &#038; &#8230; <a href="http://blog.semantic-web.at/2011/02/17/transforming-spreadsheets-into-skos-with-google-refine/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p>Looking for high quality enterprise vocabularies we recently turned our attention to the <a href="http://www.mscibarra.com/products/indices/gics/">Global Industry Classification Standard</a> (GICS), which is an industry taxonomy designed to categorize any private company. It was developed by <a href="http://www.msci.com/">Morgan Stanley Capital International</a> and <a href="http://www.standardandpoors.com/home/en/us/">Standard &#038; Poor&#8217;s</a> and is mainly used by the global financial community to aid in the investment research process. </p>
<p>It is <a href="http://www.mscibarra.com/products/indices/gics/gics_structure.html">available for download</a> as .xls spreadsheet files in several languages. Of course it would be much better to have this valuable taxonomy in a standard and machine-readable format. The Simple Knowledge Organization System <a href="http://www.w3.org/TR/skos-reference/">SKOS</a> is a perfect fit for a taxonomy like GICS. But how to turn a spreadsheet into SKOS with minimal manual effort? </p>
<p>I chose to try <a href="http://code.google.com/p/google-refine/">Google Refine</a> for this task, as recently a promising <a href="http://lab.linkeddata.deri.ie/2010/grefine-rdf-extension/">RDF extension</a> had been released by <a href="http://www.deri.ie/">DERI</a>&#8216;s <a href="http://www.deri.ie/about/team/member/fadi_maali/">Fadi Maali</a> and <a href="http://richard.cyganiak.de/">Richard Cyganiak</a>.</p>
<p>Google Refine is &#8220;a power tool for working with messy data, cleaning it up, transforming it from one format into another, extending it with web services, and linking it to databases&#8221;. Previously it was known as Freebase Gridworks which is now further developed by Google since its <a href="http://blog.semantic-web.at/2010/07/17/what-if-the-biggest-web-company-bought-one-of-the-central-semantic-web-players/">acquisition of Metaweb</a>. </p>
<div id="attachment_1946" class="wp-caption aligncenter" style="width: 660px"><a href="http://blog.semantic-web.at/wp-content/uploads/2011/02/google_refine.jpg"><img src="http://blog.semantic-web.at/wp-content/uploads/2011/02/google_refine_650.jpg" alt="Refine" title="google_refine_650" width="650" height="189" class="size-full wp-image-1946" /></a><p class="wp-caption-text">Google Refine UI</p></div>
<p>Refine is a very useful tool to filter and consequently transform rows, colums and cells according to customizable patterns.</p>
<p>After applying all necessary transformations to the spreadsheet one can edit the &#8220;RDF Skeleton&#8221;, where the columns can be mapped to literals, RDF properties and RDF classes (which can be imported from their namespaces). </p>
<div id="attachment_1943" class="wp-caption aligncenter" style="width: 660px"><a href="http://blog.semantic-web.at/wp-content/uploads/2011/02/refine_rdf_sekeleton.jpg"><img src="http://blog.semantic-web.at/wp-content/uploads/2011/02/refine_rdf_sekeleton650.jpg" alt="RDF Sekeleton" title="refine_rdf_sekeleton650" width="650" height="521" class="size-full wp-image-1943" /></a><p class="wp-caption-text">Editing the RDF Sekeleton</p></div>
<p>Once you got your valid SKOS model ready you can export it in RDF/XML or Turtle format. Then you may want to load it into an ontology editor like <a href="http://protege.stanford.edu/">Protégé</a> or a thesaurus management tool like <a href="http://poolparty.punkt.at/">PoolParty</a> in order to build upon it or connect it to other knowledge models. With PoolParty the GICS taxonomy can also be utilized to tag and categorize documents, provide semantic search and facetted navigation and it can be published as Linked Data without further effort. </p>
<div id="attachment_1968" class="wp-caption aligncenter" style="width: 610px"><a href="http://blog.semantic-web.at/wp-content/uploads/2011/02/GICS_PP.jpg"><img src="http://blog.semantic-web.at/wp-content/uploads/2011/02/GICS_PP_600.jpg" alt="GICS in PoolParty screenshot" title="GICS_PP_600" width="600" height="387" class="size-full wp-image-1968" /></a><p class="wp-caption-text">GICS loaded in PoolParty</p></div>
<p>Working with Refine and its RDF extension was easy and fun. It&#8217;s even possible to isolate and save the transformation steps done with Refine, so one can re-apply them on similar structured spreadsheets. This came in very handy as GICS is published in nine languages and as many separate, identically structured spreadsheets.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.semantic-web.at/2011/02/17/transforming-spreadsheets-into-skos-with-google-refine/feed/</wfw:commentRss>
		<slash:comments>6</slash:comments>
		</item>
		<item>
		<title>What if the biggest web company bought one of the central semantic web players?</title>
		<link>http://blog.semantic-web.at/2010/07/17/what-if-the-biggest-web-company-bought-one-of-the-central-semantic-web-players/</link>
		<comments>http://blog.semantic-web.at/2010/07/17/what-if-the-biggest-web-company-bought-one-of-the-central-semantic-web-players/#comments</comments>
		<pubDate>Sat, 17 Jul 2010 10:47:38 +0000</pubDate>
		<dc:creator>Andreas Blumauer</dc:creator>
				<category><![CDATA[Companies & Institutions]]></category>
		<category><![CDATA[Search Engines]]></category>
		<category><![CDATA[Freebase]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[Metaweb Technologies]]></category>

		<guid isPermaLink="false">http://blog.semantic-web.at/?p=1656</guid>
		<description><![CDATA[Well, exactly this happened yesterday: Google bought Metaweb &#8211; provider of Freebase. Freebase is an important hub in the linked data cloud providing 12 million entities with uniform resource identifiers most of them linked to other semantic web datasets like &#8230; <a href="http://blog.semantic-web.at/2010/07/17/what-if-the-biggest-web-company-bought-one-of-the-central-semantic-web-players/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p>Well, exactly this happened yesterday: <a href="http://googleblog.blogspot.com/2010/07/deeper-understanding-with-metaweb.html">Google bought Metaweb</a> &#8211; provider of <a href="http://www.freebase.com/">Freebase</a>. Freebase is an important hub in the linked data cloud providing 12 million entities with uniform resource identifiers most of them linked to other semantic web datasets like <a href="http://dbpedia.org">DBpedia</a> or <a href="http://data.nytimes.com/" target="_blank">New York Times</a>. For example: <a href="http://www.freebase.com/view/en/google" target="_blank">Google´s page on Freebase</a> offers a rich source for <a href="http://rdf.freebase.com/rdf/en.google" target="_blank">machine-readable facts</a> around this company.</p>
<p><em>What does this mean to the Semantic Web Community which has  been working on a smarter web in the last decade?</em><br />
Well, a lot&#8230; First of all, it´s good to hear that Google will continue to develop Freebase as a free and open database to everyone, saying &#8220;&#8230; we would be delighted if other web companies use and contribute to the data.&#8221;</p>
<p>Until yesterday still a lot of companies were not fully convinced if the Semantic Web will play a central role in the further development of the Internet. Now the game has changed. The entity-driven approach to develop web applications has just started now:</p>
<p><object width="500" height="306"><param name="movie" value="http://www.youtube.com/v/TJfrNo3Z-DU&amp;hl=en_US&amp;fs=1?rel=0&amp;hd=1"></param><param name="allowFullScreen" value="true"></param><param name="allowscriptaccess" value="always"></param><embed src="http://www.youtube.com/v/TJfrNo3Z-DU&amp;hl=en_US&amp;fs=1?rel=0&amp;hd=1" type="application/x-shockwave-flash" allowscriptaccess="always" allowfullscreen="true" width="500" height="306"></embed></object></p>
<p>We will keep on reporting and discussing how Google will influence the development of the Semantic Web &#8211; and if I had a wish for free: Please add RDF(a) to the Freebase widgets!</p>
<div class="fb-widget" id="fbtb-8700f09960b8433d88673b1d2d83be08" style="border:0; outline:0; padding:0; margin:0; position:relative;" itemscope="" itemid="http://www.freebase.com/id/en/semantic_web" itemtype="http://www.freebase.com/id/business/industry">
<form class="fb-widget-placeholder" style="border:0; outline:0; padding:0; margin:0;">
<input name="src" value="http://www.freebase.com/widget/topic?track=topicblocks_embedthis&amp;mode=content&amp;id=%2Fen%2Fsemantic_web" type="hidden" />
<input name="width" value="413" type="hidden" />
<input name="height" value="285" type="hidden" /> <span style="line-height:1; border:0; outline:0; padding:0; margin:0; display:inline-block; padding:5px; background:#eee; border-radius:5px; -moz-border-radius:5px; -webkit-border-radius:5px;">
<div style="text-align:left; vertical-align:baseline; line-height:1; border:0; outline:0; margin:0 0 5px 5px;"> <a style="text-align:left; vertical-align:baseline; font-family:'Helvetica Neue', Arial, sans-serif; font-size:13px; font-weight:bold; line-height:1.6; text-decoration:none; color:#17b; border:0; outline:0; padding:0; margin:0;" href="http://www.freebase.com/view/en/semantic_web" target="_blank" > Semantic Web </a> </div>
<div style="vertical-align:top; border:1px solid #ddd; outline:0; padding:0; margin:0; position: relative; width:400px; height:220px; overflow:auto; background-color:#fff"> <img src="http://img.freebase.com/api/trans/image_thumb/en/semantic_web?pad=1&amp;errorid=%2Ffreebase%2Fno_image_png&amp;maxheight=150&amp;mode=fillcropmid&amp;maxwidth=150" title="Semantic Web" style="border:0; outline:0; padding: 0; margin: 28px auto; display: block;"> </div>
<p> </span> </form>
<p> <script src="http://freebaselibs.com/static/widgets/2/widget.js" type="text/javascript" defer=""></script> </div>
]]></content:encoded>
			<wfw:commentRss>http://blog.semantic-web.at/2010/07/17/what-if-the-biggest-web-company-bought-one-of-the-central-semantic-web-players/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
		<item>
		<title>Metaweb´s Jamie Taylor: &#8220;Freebase provides a large and user extensible vocabulary for RDF/RDFa&#8221;</title>
		<link>http://blog.semantic-web.at/2009/05/18/metawebs-jamie/</link>
		<comments>http://blog.semantic-web.at/2009/05/18/metawebs-jamie/#comments</comments>
		<pubDate>Mon, 18 May 2009 12:41:20 +0000</pubDate>
		<dc:creator>Andreas Blumauer</dc:creator>
				<category><![CDATA[Linked Data & Open Data]]></category>
		<category><![CDATA[Semantic Web Applications]]></category>
		<category><![CDATA[Tools & Software]]></category>
		<category><![CDATA[dbpedia]]></category>
		<category><![CDATA[Freebase]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[Linked Data]]></category>
		<category><![CDATA[MQL]]></category>
		<category><![CDATA[RDFa]]></category>
		<category><![CDATA[Wikipedia]]></category>

		<guid isPermaLink="false">http://blog.semantic-web.at/?p=978</guid>
		<description><![CDATA[Andreas Blumauer from Semantic Web Company (SWC) talked with Jamie Taylor, Minister of Information at Metaweb Technologies Inc. about Freebase &#38; Linked Data and Google´s announcement to use RDFa. SWC: At ISWC 2008 Freebase became &#8220;officially&#8221; part of the LOD &#8230; <a href="http://blog.semantic-web.at/2009/05/18/metawebs-jamie/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<div id="attachment_984" class="wp-caption alignright" style="width: 210px"><img class="size-full wp-image-984" title="jamie_taylor" src="http://blog.semantic-web.at/wp-content/uploads/2009/05/jamie_taylor.jpg" alt="Jamie Taylor, Metaweb" width="200" height="261" /><p class="wp-caption-text">Jamie Taylor, Metaweb</p></div>
<p>Andreas Blumauer from Semantic Web Company (SWC) talked with <a href="http://www.freebase.com/view/en/jamie_taylor" target="_blank">Jamie Taylor</a>, Minister of Information at <a class="zem_slink" title="Metaweb" rel="homepage" href="http://www.metaweb.com">Metaweb Technologies Inc.</a> about Freebase &amp; Linked Data and Google´s announcement to use <a class="zem_slink" title="RDFa" rel="wikipedia" href="http://en.wikipedia.org/wiki/RDFa">RDFa</a>.</p>
<p><strong>SWC</strong>: <em>At ISWC 2008 <a class="zem_slink" title="freebase" rel="homepage" href="http://www.freebase.com/">Freebase</a> became &#8220;officially&#8221; <a href="http://blog.semantic-web.at/2008/10/30/the-day-after-freebase-went-rdf/" target="_self">part of the LOD Cloud</a>. What exactly has changed since that time?</em></p>
<p><strong>Jamie</strong>: Since Freebase is a community writable semantic database, the addition of the RDF interface allows anyone to publish data into the LOD cloud. LOD Applications can access any Freebase Topic through the RDF interface by constructing a URI from the Freebase identifier.  But perhaps more importantly, because entities in Freebase can be annotated with multiple identifiers, Freebase Topics can be retrieved by constructed URIs using the identifiers used by other systems and data sets.<br />
For instance, the movie Blade Runner can be referred to as <span id="OBJ_PREFIX_DWT1594" class="Object"><span id="OBJ_PREFIX_DWT1595" class="Object"><a href="http://rdf.freebase.com/ns/en.blade_runner" target="_blank">http://rdf.freebase.com/ns/en.blade_runner</a></span></span>, but it can also be referenced as <span id="OBJ_PREFIX_DWT1596" class="Object"><span id="OBJ_PREFIX_DWT1597" class="Object"><a href="http://rdf.freebase.com/ns/authority.netflix.movie.70053131" target="_blank">http://rdf.freebase.com/ns/authority.netflix.movie.70053131</a></span></span> using the Netflix identifier, <span id="OBJ_PREFIX_DWT1598" class="Object"><span id="OBJ_PREFIX_DWT1599" class="Object"><a href="http://rdf.freebase.com/ns/authority.imdb.title.tt0083658" target="_blank">http://rdf.freebase.com/ns/authority.imdb.title.tt0083658</a></span></span> using the IMDB identifier, or as <span id="OBJ_PREFIX_DWT1600" class="Object"><span id="OBJ_PREFIX_DWT1601" class="Object"><a href="http://rdf.freebase.com/ns/wikipedia.en.Dangerous_Days" target="_blank">http://rdf.freebase.com/ns/wikipedia.en.Dangerous_Days</a></span></span> using a Wikipedia wikiword (which in this case is a Wikipedia redirect to the wikiword Blade_Runner).<br />
Freebase also provides a user maintained mapping of how these identifiers can be used to address resources in other LOD systems. The sameas.freebase.com schema can tell an LOD user that the Freebase Blade Runner Topic can also be found in <a href="http://dbpedia.org/" target="_blank">DBpedia</a> using Wikipedia identifiers or how musical artists can be found at the <a href="http://www.bbc.co.uk/music/" target="_blank">BBC</a> using <a href="http://musicbrainz.org/" target="_blank">Musicbrainz</a> identifiers.  In fact, the Freebase RDF interface uses the sameas.freebase.com schema to create the owl:sameAs links in the RDF output allowing the user community to expand the interconnections between Freebase and the LOD Cloud.<br />
<a href="http://linkeddata.org/data-sets" target="_blank">Linked Data providers</a> are also using the strong identifiers in Freebase to identify entities such as companies and locations in their own data sets.  When they find an entity that is not represented in Freebase, they simply add the entity to Freebase and use the newly minted Freebase identifier.  This permits anyone using their data to understand how their entities relates to any of the more than 5 million things interconnected within Freebase.</p>
<p>The RDF interface can also be used to reference the Freebase type system, giving LOD data set providers vocabularies across a wide range of subject areas.  And because anyone can expand Freebase&#8217;s data model, data providers can use our schema development tools to build and extend these vocabularies to suite their needs.<br />
Freebase was not designed for ephemeral or fast changing data, like weather conditions or stock ticks.  But this type of information is well suited for publication as Linked Data.  Freebase entities representing a location or company can be annotated with references to LOD services that provide these types of volatile data.  Similarly, Linked Data provides a great way to disseminate very fined grained information that might be associated with a scientific study or financial report.  Linked Data provides a seemless transition from Freebase, where a user (or application) can run a query with constraints that run across a wide range of types to find entities of interest along with the LOD services that provide access to temporal or high resolution data not available in Freebase.<br />
We recently demonstrated MQL Extensions which allows the <a href="http://www.freebase.com/view/en/mql" target="_blank">Metaweb Query Language</a> to use data from other systems as a part of the query constraint and result set.  While MQL Extensions are user extensible and work with a wide array of systems,  this capability makes the connection between Freebase and the LOD Cloud even more transparent.<br />
For example, because US companies that are registered with the SEC are annotated CIK code in Freebase and the sameas.freebase.com schema indicates that the CIK annotation can be used to create a URI that is dereferencable at rdfabout.com, it is possible to write a MQL query that asks who is on the board of financial services companies that trade on NASDAQ and are  headquartered in California (and using another MQL Extension, you can ask for their stock price as well!)</p>
<p><strong>SWC</strong>: <em>Many organisations are very interested in Linking Open Data now but they are still not sure if they can benefit from publishing data on the web &#8211; what´s your experience <span id="OBJ_PREFIX_DWT1877" class="Object"><span id="OBJ_PREFIX_DWT1878" class="Object">so</span></span> far?</em></p>
<p><strong>Jamie</strong>: Linked Open Data provides a simple, standard way for organizations to distribute structured data.  For most organizations, providing access to data is another important outlet to announce the availability of higher value services.  For organizations involved in building or selling physical goods, the bits representing what they provide are not the goods themselves, but a way of attracting potential customers.  Making catalogs and specification sheets available in electronic form, <span id="OBJ_PREFIX_DWT1879" class="Object"><span id="OBJ_PREFIX_DWT1880" class="Object">so</span></span> other applications can connect buyers to their physical goods is simply an effective marketing system.  Even for firms involved in electronic services, providing access to open structured data is generally a lead-in to value added services.  For instance, if I ran a service collecting hard-to-find information about manufacturing relationships between medium sized businesses, I would publish open company profiles covering things like market size, industry, location for the medium-sized businesses I tracked, <span id="OBJ_PREFIX_DWT1881" class="Object"><span id="OBJ_PREFIX_DWT1882" class="Object">so</span></span> potential users the premium data would know I had the coverage they were looking for.</p>
<p><strong>SWC</strong>: <em>Just recently Google has announced to use RDFa to enhance their search results. What do you think?</em></p>
<p><strong>Jamie</strong>: We are excited about <a href="http://googlewebmastercentral.blogspot.com/2009/05/introducing-rich-snippets.html" target="_blank">Google&#8217;s announcement</a>. Yahoo&#8217;s use of RDFa for Search Monkey and Google&#8217;s announcement gives RDFa users tangible benefits. The Search Monkey team was very quick to realize that because users can create data models in Freebase, and because the elements of those models all have strong RDF identifiers, Freebase provides a large and user extensible vocabulary for RDF/RDFa (see the <a href="http://developer.yahoo.com/searchmonkey/smguide/profile_vocab.html" target="_blank">list of vocabularies</a>). When a user wants to create a Search Monkey application that works with their film review site, they need not invent a new vocabulary (that will probably be used only once),  they can use the Freebase Film Domain vocabulary which supports over 63,000 instances in Freebase alone.<br />
Similarly, with over 5 Million well described Topics in Freebase and over 14,000,000 Named Objects (Topics, images, musical tracks and documents) when a user wants to unambiguously identify a subject or object in RDF/RDFa, Freebase has an extremely large collection of identifiers to draw from.  These cover people, places, companies, movies, music, books and wide variety of other subjects.  If Freebase doesn&#8217;t have the entity the user is looking for, they can of course add it themselves and make use of the identifier immediately. I think this is why Google used some Freebase identifiers in their examples. We hope that with Yahoo and Google&#8217;s support for RDFa the web will become a strongly annotated source of data which can support a wide range of user applications.</p>
<p><strong>SWC</strong>: <em>Thank you, Jamie!</em></p>
<h6 class="zemanta-related-title" style="font-size: 1em;">Related articles by Zemanta</h6>
<ul class="zemanta-article-ul">
<li class="zemanta-article-ul-li"><a href="http://gen5.info/q/2009/02/25/putting-freebase-in-a-star-schema/">Putting Freebase in a Star Schema</a> (gen5.info)</li>
<li class="zemanta-article-ul-li"><a href="http://ivan-herman.name/2009/05/13/rdfa-google/"> RDFa, Google </a> (ivan-herman.name)</li>
</ul>
<div class="zemanta-pixie" style="margin-top: 10px; height: 15px;"><a class="zemanta-pixie-a" title="Reblog this post [with Zemanta]" href="http://reblog.zemanta.com/zemified/46c45d34-6d03-4b52-948a-638c993e5aad/"><img class="zemanta-pixie-img" style="border: medium none; float: right;" src="http://img.zemanta.com/reblog_e.png?x-id=46c45d34-6d03-4b52-948a-638c993e5aad" alt="Reblog this post [with Zemanta]" /></a><span class="zem-script more-related pretty-attribution"><script src="http://static.zemanta.com/readside/loader.js" type="text/javascript"></script></span></div>
]]></content:encoded>
			<wfw:commentRss>http://blog.semantic-web.at/2009/05/18/metawebs-jamie/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Linked Data is not owl:sameAs Semantic Web</title>
		<link>http://blog.semantic-web.at/2009/03/30/linked-data-is-not-owlsameas-semantic-web/</link>
		<comments>http://blog.semantic-web.at/2009/03/30/linked-data-is-not-owlsameas-semantic-web/#comments</comments>
		<pubDate>Mon, 30 Mar 2009 08:21:59 +0000</pubDate>
		<dc:creator>Andreas Blumauer</dc:creator>
				<category><![CDATA[Linked Data & Open Data]]></category>
		<category><![CDATA[Search Engines]]></category>
		<category><![CDATA[cloudlet]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[linking open data]]></category>
		<category><![CDATA[OpenLink Software]]></category>
		<category><![CDATA[Tagcloud]]></category>
		<category><![CDATA[Talis]]></category>
		<category><![CDATA[Tim Berners-Lee]]></category>
		<category><![CDATA[wonder wheel]]></category>

		<guid isPermaLink="false">http://blog.semantic-web.at/?p=759</guid>
		<description><![CDATA[While some people work heavily on the extension of the semantic web infrastructure, like Talis Connected Commons or OpenLink´s Amazon EC2 Instantiation others have started to bring the semantic web closer to the developers and therefore to a much broader &#8230; <a href="http://blog.semantic-web.at/2009/03/30/linked-data-is-not-owlsameas-semantic-web/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p><img class="alignright size-full wp-image-762" title="twitter_cloudlet" src="http://blog.semantic-web.at/wp-content/uploads/2009/03/twitter_cloudlet.jpg" alt="twitter_cloudlet" width="251" height="244" />While some people work heavily on the extension of the semantic web infrastructure, like <a href="http://blogs.talis.com/n2/cc" target="_blank">Talis Connected Commons</a> or <a href="http://virtuoso.openlinksw.com/wiki/main/Main/VirtInstallationEC2" target="_blank">OpenLink´s Amazon EC2 Instantiation</a> others have started to bring the semantic web closer to the developers and therefore to a much broader audience: They offer search facilities or Linked Data Navigators like <a href="http://lod.openlinksw.com/" target="_blank">OpenLink´s Entity Finder</a> or <a href="http://visinav.deri.org/" target="_blank">DERI´s VisiNav</a>.</p>
<p>Those kind of applications should not be confused with &#8220;semantic web&#8221; end-user-applications like <a href="http://searchengineland.com/google-wonder-wheel-17093" target="_blank"> Google´s Wonderwheel</a> or <a href="http://www.crunchbase.com/company/intspei" target="_blank">INTSPEI´s</a> <a href="http://www.intspei.com/Products/SearchCloudlet.aspx" target="_blank">Cloudlet</a>: To add some semantics to existing user-interfaces can be helpful and obviously users are ready for such experiments, but of course this is NOT the innovation which the semantic web will bring but it is a very important step to be taken in parallel with the <a href="http://esw.w3.org/topic/SweoIG/TaskForces/CommunityProjects/LinkingOpenData" target="_blank">linked data initiative</a>.</p>
<p>Let´s take a look at Cloudlet: This tool is an easy-to-use <a href="http://www.getcloudlet.com" target="_blank">free Firefox extension</a> that adds context-sensitive tag clouds to the most popular search engines and helps people more efficiently navigate through their search results. The previous version of Search Cloudlet worked with Google and Yahoo; the new version also works with Twitter. It adds Tag Clouds, Author Clouds, Recipient Clouds and Hashtag Clouds to Twitter search, Twitter user profiles and home pages. See <a href="http://www.getcloudlet.com/swm.php?page=reviews" target="_blank">some reviews</a> on this popular tool.</p>
<p>Cloudlet is a child of the Web. INTSPEI has learned all lessons from Web 2.0 especially how to promote ideas using the blogosphere and how to identify market trends as early as possible, and it generates some added value for the users which is obvious. Sure, it doesn´t make use of linked data yet, but as a typical representative of the fast growing &#8220;semantic search evolution&#8221; it reminds me on <a href="http://domino.research.ibm.com/comm/research_people.nsf/pages/welty.index.html" target="_blank">Chris Welty</a>´s famous insight: &#8220;In the <em>Semantic Web</em>, it is not the <em>Semantic</em> which is new, it is the <em>Web</em> which is new.&#8221;</p>
<p>Web 1.0 was the WWW without tons of network effects. Web 2.0 changed that a lot.</p>
<p>Linked Data is not the Semantic Web, it´s the basement for it. From a software developer´s and an IT archictect´s perspective it might seem as those two concepts were the same. But this community represents a very small percentage of all web-users.</p>
<p>So where is the User´s Web in the Linked Data architecture? If you´re looking at <a href="http://www.w3.org/DesignIssues/LinkedData.html" target="_blank">TimBL´s Linked Data principles</a> one can clearly see that this is a &#8220;Web&#8221; for developers.</p>
<p>But things evolve. And some Web companies will jump on the bandwagon and will, for instance, improve their tagclouds, their semantic search, their recommender systems (Twine?) or their similarity search a lot by making use of linked data.</p>
<p>Like semantic search becomes mainstream (or call it &#8220;semantic search 2.0&#8243;) right now, then (in about three years, I guess) linked data will become part of a lot of mainstream applications. Linked data will generate tons of new network effects, maybe even new business models, it won´t be avant-garde anymore. It will be part of the Semantic <em>Web</em>.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.semantic-web.at/2009/03/30/linked-data-is-not-owlsameas-semantic-web/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>the next google</title>
		<link>http://blog.semantic-web.at/2009/03/25/the-next-google/</link>
		<comments>http://blog.semantic-web.at/2009/03/25/the-next-google/#comments</comments>
		<pubDate>Wed, 25 Mar 2009 15:05:35 +0000</pubDate>
		<dc:creator>Thomas Thurner</dc:creator>
				<category><![CDATA[Search Engines]]></category>
		<category><![CDATA[Software Development]]></category>
		<category><![CDATA[Tools & Software]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[Search Engine]]></category>
		<category><![CDATA[Web search engine]]></category>

		<guid isPermaLink="false">http://blog.semantic-web.at/?p=751</guid>
		<description><![CDATA[Image via Wikipedia Maybe you have noticed it already; today in the morning something new appeared at Google&#8217;s search engine interface: A bunch of corresponding search-suggestions based on your search query. Google spoke about this enhancement: Starting today, we&#8217;re deploying &#8230; <a href="http://blog.semantic-web.at/2009/03/25/the-next-google/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<div class="zemanta-img" style="margin: 1em; display: block;">
<div>
<dl class="wp-caption alignright" style="width: 212px;">
<dt class="wp-caption-dt"><a href="http://en.wikipedia.org/wiki/Image:Google1998.png"><img title="Google in 1998" src="http://upload.wikimedia.org/wikipedia/en/thumb/b/b7/Google1998.png/202px-Google1998.png" alt="Google in 1998" width="202" height="112" /></a></dt>
<dd class="wp-caption-dd zemanta-img-attribution" style="font-size: 0.8em;">Image via <a href="http://en.wikipedia.org/wiki/Image:Google1998.png">Wikipedia</a></dd>
</dl>
</div>
</div>
<p>Maybe you have noticed it already; today in the morning something new appeared at Google&#8217;s search engine interface: A bunch of corresponding search-suggestions based on your search query. <span><span class="IL_SPAN"><a title="NASDAQ: GOOG" rel="stockexchange" href="http://news.techwhack.com/10098-google-search-enhancements">Google</a></span> spoke about this enhancement: </span></p>
<blockquote><p>Starting today, we&#8217;re deploying a new technology that can better understand associations and concepts related to your search, and one of its first applications lets us offer you even more useful related searches (the terms found at the bottom, and sometimes at the top, of the search results page).</p></blockquote>
<p>I tried it. So, if you type in &#8220;time travel&#8221; you also get search proposals like &#8220;theory of relativity time travel&#8221; or &#8220;wormhole time travel&#8221;. Google annouced, that the service is available in various languages. The direct test with German is a little disillusioning: Searching for &#8220;zeit reise&#8221; (which is the same concept as above, in german) leads to alternative searches like &#8220;reisen 50er jahren&#8221; (travel 50ies) and &#8220;reisen im mittelalter&#8221; (travel in the medieval).</p>
<p>Even if this semantic-like extension of the  basis search function still needs some tuning, the point  is getting clearer: Also Google is doing developments to get more meaningful results into their search algorithms. And parts of the semantic methodology are finding their way into mainstream services like <a class="zem_slink" title="Web search engine" rel="wikipedia" href="http://en.wikipedia.org/wiki/Web_search_engine">search engines</a> &#8211; as we have seen with <a href="www.wolframalpha.com">Wolfram Alpha</a> some days ago. So keep your eyes open &#8211; maybe next morning you&#8217;ll find another piece of the semantic puzzle embedded into one of your favorite web-apps.</p>
<div class="zemanta-pixie" style="margin-top: 10px; height: 15px;"><a class="zemanta-pixie-a" title="Zemified by Zemanta" href="http://reblog.zemanta.com/zemified/d3fd0ad0-cf57-41fb-a32c-3d04848dafc5/"><img class="zemanta-pixie-img" style="border: medium none; float: right;" src="http://img.zemanta.com/reblog_e.png?x-id=d3fd0ad0-cf57-41fb-a32c-3d04848dafc5" alt="Reblog this post [with Zemanta]" /></a><span class="zem-script more-related"><script src="http://static.zemanta.com/readside/loader.js" type="text/javascript"></script></span></div>
]]></content:encoded>
			<wfw:commentRss>http://blog.semantic-web.at/2009/03/25/the-next-google/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Google and the Semantic Web: About Quad Stores and URIs</title>
		<link>http://blog.semantic-web.at/2009/03/20/google-and-the-semantic-web-about-quad-stores-and-uris/</link>
		<comments>http://blog.semantic-web.at/2009/03/20/google-and-the-semantic-web-about-quad-stores-and-uris/#comments</comments>
		<pubDate>Fri, 20 Mar 2009 09:26:19 +0000</pubDate>
		<dc:creator>Andreas Blumauer</dc:creator>
				<category><![CDATA[Internet & Media]]></category>
		<category><![CDATA[Search Engines]]></category>
		<category><![CDATA[Vocabularies & Languages]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[quad store]]></category>
		<category><![CDATA[URI]]></category>

		<guid isPermaLink="false">http://blog.semantic-web.at/?p=729</guid>
		<description><![CDATA[Just recently Google launched another interesting service called &#8220;In Quotes&#8221;. It delivers quotes from stories linked to from Google News and users can compare opinions of e.g. politicians in a very comfortable way. If  a closer look is taken at &#8230; <a href="http://blog.semantic-web.at/2009/03/20/google-and-the-semantic-web-about-quad-stores-and-uris/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p>Just recently Google launched another interesting service called <a href="http://labs.google.com/inquotes/" target="_blank">&#8220;In Quotes&#8221;</a>. It delivers quotes from stories linked to from Google News and users can compare opinions of e.g. politicians in a very comfortable way.</p>
<p>If  a closer look is taken at the system, one can see that any person whose quotes are listed has got a URI: Barack Obama has got the <a href="http://news.google.com/news?qsid=tPjE5CDNzMicmM" target="_blank">uniform &#8220;qsid&#8221; tPjE5CDNzMicmM</a>.</p>
<p>It seems like &#8220;qsid&#8221; stands for &#8220;Quad Store ID&#8221; which would perfectly support such a URI based system.</p>
<p>Does Google slowly approximate to the Semantic Web?</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.semantic-web.at/2009/03/20/google-and-the-semantic-web-about-quad-stores-and-uris/feed/</wfw:commentRss>
		<slash:comments>6</slash:comments>
		</item>
		<item>
		<title>Why Wolfram Alpha won´t replace Google</title>
		<link>http://blog.semantic-web.at/2009/03/12/why-wolfram-alpha-won%c2%b4t-replace-google/</link>
		<comments>http://blog.semantic-web.at/2009/03/12/why-wolfram-alpha-won%c2%b4t-replace-google/#comments</comments>
		<pubDate>Thu, 12 Mar 2009 10:30:43 +0000</pubDate>
		<dc:creator>Andreas Blumauer</dc:creator>
				<category><![CDATA[Search Engines]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[qa]]></category>
		<category><![CDATA[question answering]]></category>
		<category><![CDATA[Semantic Search]]></category>
		<category><![CDATA[start]]></category>
		<category><![CDATA[wolfram alpha]]></category>

		<guid isPermaLink="false">http://blog.semantic-web.at/?p=677</guid>
		<description><![CDATA[If Nova Spivack and Doug Lenat are positive with what they have seen from Wolfram Alpha, I am also close of being convinced that the internet community won´t be dissapointed by Alpha´s first release. Just remember, which hype was caused &#8230; <a href="http://blog.semantic-web.at/2009/03/12/why-wolfram-alpha-won%c2%b4t-replace-google/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p>If <a href="http://www.twine.com/item/122mz8lz9-4c/wolfram-alpha-is-coming-and-it-could-be-as-important-as-google" target="_blank">Nova Spivack</a> <em>and</em> <a href="http://www.semanticuniverse.com/blogs-i-was-positively-impressed-wolfram-alpha.html" target="_blank">Doug Lenat</a> are positive with what they have seen from <a href="http://blogsearch.google.com/blogsearch?hl=en&amp;ie=UTF-8&amp;q=%22wolfram+alpha%22&amp;btnG=Search+Blogs" target="_blank">Wolfram Alpha</a>, I am also close of being convinced that the internet community won´t be dissapointed by <a href="http://blog.wolfram.com/2009/03/05/wolframalpha-is-coming/" target="_blank">Alpha´s first release</a>. Just remember, which hype was caused by <a href="http://www.google.com/trends?q=cuil" target="_blank">Cuil´s PR-strategy</a> of spreading news about their first release throughout the blogosphere, and scarcely anybody would talk about this engine anymore.</p>
<p>After all what I have read about Wolfram Alpha, one thing obviously can be stated: Wolfram Alpha will be a perfect addition to traditional <em>search </em>engines like Google, but will never replace it. For example: In the first paragraph of this blog I have used Google Services like &#8220;Google Blog Search&#8221; or &#8220;Google Trends&#8221; to prove some of my statements (in a broader sense: to give answers to those, who want to know, why this is my opinion). Such services Alpha won´t deliver, but it will do other things much better than Google. Doug Lenat:</p>
<blockquote><p>At one extreme is, say, Google, which responds to almost anything like a faithful puppy bringing in the morning newspaper without understanding much of anything it&#8217;s fetching (recognizing words in what it returns, often leading to amusing or hair-raising inappropriate &#8220;ads&#8221; being displayed, and leading to tons of false positives and false negatives).  At the other extreme is, say, Cyc, which only can answer a small fraction of user queries, but can answer ones that require common sense (not just common sense queries like &#8220;Do surgeons often operate on themselves?&#8221;, but ones where the logical application of such knowledge is required to correctly disambiguate and parse the user&#8217;s query containing pronouns, elisions, ambiguous words, ellipsis, and so on) and where every piece of the query and every piece of the answer is as deeply understood as, say, arithmetic.  Wolfram Alpha is somewhere around the geometric mean of those two extremes.</p></blockquote>
<p>Search engines or question answering machines (QA) which understand the meaning of the query and/or of the result are not completely new and some of them are really useful like good old <a href="http://ablvienna.wordpress.com/2008/01/" target="_blank">START</a>.</p>
<p><strong>But the point is: In many cases of information demand people can´t <em>express </em>the right question.</strong></p>
<p>Why didn´t START become the default browser if it can <em>even </em>answer questions? I think the USP of Alpha will be, that it can give the right answer to more questions than any other QA machine before. But still, the real &#8220;search engine revolution&#8221; won´t happen, until engines will be able to help users to formulate the proper questions and will help to interprate the right results. Therefore we need to rethink some search paradigms from scratch.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.semantic-web.at/2009/03/12/why-wolfram-alpha-won%c2%b4t-replace-google/feed/</wfw:commentRss>
		<slash:comments>7</slash:comments>
		</item>
		<item>
		<title>Pimp your Google</title>
		<link>http://blog.semantic-web.at/2009/02/04/pimp-your-google/</link>
		<comments>http://blog.semantic-web.at/2009/02/04/pimp-your-google/#comments</comments>
		<pubDate>Wed, 04 Feb 2009 16:51:07 +0000</pubDate>
		<dc:creator>Andreas Blumauer</dc:creator>
				<category><![CDATA[Mashups & Web services]]></category>
		<category><![CDATA[Search Engines]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[Semantic Search]]></category>
		<category><![CDATA[Tagcloud]]></category>
		<category><![CDATA[Wikipedia]]></category>

		<guid isPermaLink="false">http://blog.semantic-web.at/?p=547</guid>
		<description><![CDATA[Sure, that´s not the end of the flagpole &#8211; but &#8220;a little semantics goes a long way&#8221; (Jim Hendler): With two Firefox add-ons, you can pimp your Google and you will get (1) a better overview over the search results, &#8230; <a href="http://blog.semantic-web.at/2009/02/04/pimp-your-google/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p>Sure, that´s not the end of the flagpole &#8211; but &#8220;a little semantics goes a long way&#8221; (Jim Hendler): With two Firefox add-ons, you can pimp your Google and you will get (1) a better overview over the search results, (2) kind of a moderated search and (3) information from Wikipedia along with the results.</p>
<p>Install <a href="http://www.getcloudlet.com/" target="_blank">Cloudlet</a> and <a href="https://addons.mozilla.org/de/firefox/addon/2517" target="_blank">Googlepedia</a> (Don´t forget to donate!) and you will see something like this:</p>
<p><img class="alignnone size-full wp-image-548" title="pimp_your_google" src="http://blog.semantic-web.at/wp-content/uploads/2009/02/pimp_your_google.jpg" alt="pimp_your_google" width="509" height="253" /></p>
<p>Sure, both &#8220;mashups&#8221; are not based on RDF, and the &#8220;TagCloud&#8221; is not as accurate as we wished, but let us be patient again. At least this picture makes end-users yearning for a bit more semantics (which goes a long way&#8230;) on top of the usual lists of search results.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.semantic-web.at/2009/02/04/pimp-your-google/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Yahoo vs Google &#8211; Technology vs Advertising</title>
		<link>http://blog.semantic-web.at/2008/11/19/yahoo-vs-google-technology-vs-advertising/</link>
		<comments>http://blog.semantic-web.at/2008/11/19/yahoo-vs-google-technology-vs-advertising/#comments</comments>
		<pubDate>Wed, 19 Nov 2008 09:28:03 +0000</pubDate>
		<dc:creator>Jana Herwig</dc:creator>
				<category><![CDATA[Search Engines]]></category>
		<category><![CDATA[advertising]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[Search-Based Keyword Tool]]></category>
		<category><![CDATA[technology]]></category>
		<category><![CDATA[Yahoo!]]></category>
		<category><![CDATA[Yahoo! BOSS]]></category>
		<category><![CDATA[Yahoo! Search BOSS]]></category>

		<guid isPermaLink="false">http://blog.semantic-web.at/?p=344</guid>
		<description><![CDATA[Just stumbled upon this observation in a blog post by Daniel Tunkelang where he compares Yahoo&#8217;s and Google&#8217;s latest key word tools, and chuckled. The occasion was Yahoo&#8217;s release of a new BOSS features called Key Terms, and Google&#8217;s announcement &#8230; <a href="http://blog.semantic-web.at/2008/11/19/yahoo-vs-google-technology-vs-advertising/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p>Just stumbled upon this observation in a <a href="http://thenoisychannel.com/2008/11/18/the-difference-between-google-and-yahoo/">blog post</a> by <a href="http://thenoisychannel.com">Daniel Tunkelang</a> where he compares Yahoo&#8217;s and Google&#8217;s latest key word tools, and chuckled. The occasion was <a href="http://www.ysearchblog.com/archives/000654.html">Yahoo&#8217;s release</a> of a new BOSS features called Key Terms, and Google&#8217;s announcement <a href="http://searchengineland.com/googles-new-search-based-keyword-tool-15525.php"> of the release of a new tool</a> that tells you which keyterms you&#8217;re missing (i.e. should potentially buy): </p>
<blockquote><p>I imagine that the technology behind both tools isn&#8217;t all that different&#8211;or at least doesn&#8217;t have to be. But, while Yahoo makes friends in the technology community (especially among researchers), Google makes friends in the advertising community&#8211;and makes itself oodles of money.
</p></blockquote>
<p>Nice analogy, Daniel!</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.semantic-web.at/2008/11/19/yahoo-vs-google-technology-vs-advertising/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Extending Google: First Look at SemantiFind</title>
		<link>http://blog.semantic-web.at/2008/09/23/extending-google-first-look-at-semantifind/</link>
		<comments>http://blog.semantic-web.at/2008/09/23/extending-google-first-look-at-semantifind/#comments</comments>
		<pubDate>Tue, 23 Sep 2008 13:30:02 +0000</pubDate>
		<dc:creator>Jana Herwig</dc:creator>
				<category><![CDATA[Collective Intelligence]]></category>
		<category><![CDATA[Search Engines]]></category>
		<category><![CDATA[Tools & Software]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[Java Coffee]]></category>
		<category><![CDATA[Labels]]></category>
		<category><![CDATA[search]]></category>
		<category><![CDATA[Search Engine]]></category>
		<category><![CDATA[SemantiFind]]></category>
		<category><![CDATA[Tagging]]></category>
		<category><![CDATA[Web search engine]]></category>

		<guid isPermaLink="false">http://blog.semantic-web.at/?p=276</guid>
		<description><![CDATA[Just stumbled upon SemantiFind via T3N, and then upon the review on ReadWriteWeb from last week Thursday. What&#8217;s it about? Semantifind is an IE and FF browser plug-in that extends Google&#8217;s search functionalities, most notably through a typeahead functionality that &#8230; <a href="http://blog.semantic-web.at/2008/09/23/extending-google-first-look-at-semantifind/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p>Just stumbled upon <a href="http://www.semantifind.com/">SemantiFind</a> via <a href="http://t3n.yeebase.com/aktuell/news/newspost/die-t3n-links-der-woche-vom-19092008/1974/">T3N</a>, and then upon the <a href="http://www.readwriteweb.com/archives/semantics_google_semantifind.php">review on ReadWriteWeb</a> from last week Thursday. </p>
<p align="center"><a href="http://www.semantifind.com"><img src="http://blog.semantic-web.at/wp-content/uploads/2008/09/logo_semantifind_250x65.png" alt="" title="Semantifind" height="65" width="250"></a></p>
<p>What&#8217;s it about? Semantifind is an IE and FF browser plug-in that extends Google&#8217;s search functionalities, most notably through a <a href="http://www.readwriteweb.com/archives/semantics_google_semantifind.php">typeahead</a> functionality that allows you to refine your search results before hitting &#8216;enter&#8217;. <a href="http://www.readwriteweb.com/archives/semantics_google_semantifind.php">ReadWriteWeb</a> wasn&#8217;t too impressed though:</p>
<blockquote><p>Unfortunately, SemantiFind is one of those tools that&#8217;s good in theory, but not so good in practice. When performing some test searches, results were not as precise as they should have been. For example, in the above-mentioned search for &#8220;Georgia,&#8221; a search for the U.S. state returned Google results for the country as well.</p></blockquote>
<p>Ambiguities due to <a href="http://en.wikipedia.org/wiki/Homonym">homonyms</a> such as <a href="http://en.wikipedia.org/wiki/Georgia_%28country%29">Georgia</a> <a href="http://en.wikipedia.org/wiki/Georgia_%28disambiguation%29">vs</a> <a href="http://en.wikipedia.org/wiki/Georgia_%28U.S._state%29">Georgia</a>, or <a href="http://en.wikipedia.org/wiki/Java_%28Sun%29">Java</a> <a href="http://en.wikipedia.org/wiki/Java_%28disambiguation%29">vs</a> <a href="http://en.wikipedia.org/wiki/Java">Java</a> are among the faves of people who are trying to pitch a semantic tool to you &#8211; but I really wonder whether the effects of homonyms aren&#8217;t highly overrated? How often do people really search for these, and in particular search for these without context, i.e. further search terms such as in &#8216;Georgia Tech&#8217;, &#8216;Georgia war&#8217;, &#8216;Java Coffee&#8217; or &#8216;Java bugs&#8217;?</p>
<p>I must say I was quite impressed by the choice of search terms offered, and if you (like me) are easy prey for the <a href="http://en.wikipedia.org/wiki/Serendipity">serendipity effect</a>, then SemantiFind can please and distract you endlessly. Here is a preview of what appears if you enter &#8216;serendipity&#8217; &#8211; please note the preview of possible descriptions and definitions which you get on the Google homepage with the plugin <a href="http://blog.semantic-web.at/wp-content/uploads/2008/09/scr_semifind_typeahead.gif">(click &gt; big)</a>:</p>
<p><a href="http://blog.semantic-web.at/wp-content/uploads/2008/09/scr_semifind_typeahead.gif"><img src="http://blog.semantic-web.at/wp-content/uploads/2008/09/scr_semifind_typeahead-300x290.gif" alt="" title="SemiFind Screenshot" width="500"></a></p>
<p>Once you pick a term it turns into a kind of button (just slightly annoying: you cannot edit a term after it&#8217;s turned into a button, but would have to delete the whole thing and type again if you want to change your search query): </p>
<p><img src="http://blog.semantic-web.at/wp-content/uploads/2008/09/scr_semifind_button.gif" alt="" title="Semifind Button" class="alignnone size-full wp-image-278" height="85" width="500"></p>
<p>And then, what happens? On the search results page, you see results filtered by SemantiFind&#8217;s user-generated, user-approved labels on top of the other search results &#8211; which irritated me at first as it comes across as a search engine within the search engine. Admittedly: I&#8217;d rather sift through 13 results than through 10,900,00 search results (even though I never make it to the end of Google&#8217;s search list anyway; does anybody?) &#8211; but does the article about trees doing their best work with thermostats at 70Â° really deserve the second rank in SemantiFind&#8217;s list of recommended search results?</p>
<p><a href="http://blog.semantic-web.at/wp-content/uploads/2008/09/scr_semifind_results.gif"><img src="http://blog.semantic-web.at/wp-content/uploads/2008/09/scr_semifind_results-300x218.gif" alt="" title="SemantiFind Results" height="218" width="500"></a> </p>
<p>So while I agree with RWW that this &#8220;just goes to show why search engines that rely on people to filter the results might not work. Human error shouldn&#8217;t be a factor in web searches&#8221;, I am still quite fond of the suggestions and definition previews. I would probably use SemantiFind regularly if they allowed me to configure the plugin in such a way that I&#8217;d get the suggestions on the input page, but not the recommended results on the results page.</p>
<p><strong>What&#8217;s the source of these results anway?</strong> SemantiFind&#8217;s recommended results seem to rely entirely on input generated by users &#8211; to add input, you need to install their toolbar and start adding labels to websites; if a website has been labeled before, you can confirm or reject existing labels. What&#8217;s nice: a label recommender (only presumably the same one that&#8217;s used for search queries) reduces ambiguity. What&#8217;s curious: You can also browse the pages you have already labeled in what they call your &#8220;catalogue&#8221; &#8211; which makes the service even more reminiscent of a bookmarking service, and which makes me wonder whether one shouldn&#8217;t possibly link this with a del.icio.us/Mr.Wong/Bibsonomy/Faviki account (<a href="http://faviki.com/">Faviki</a> would probably be the best, considering their tag recommendations are based on DBpedia, and considering that <a href="http://faviki.wordpress.com/2008/09/19/almost-1-million-new-tags-added/">Faviki just added 1 million new tags</a> and now holds more than 5 million tags across all languages)</p>
<p>Questions that remain: I&#8217;d really like to know how they maintain their list of suggested labels &#8211; ambiguity, typos, plurals forms, i.e. the usual folksonomy issues must be a big challenge. Also, I&#8217;d like to know where they get their definitions in the preview from &#8211; from Google? Or are these user-generated as well?  There must, after all, be some use for the &#8220;<a href="http://www.semantifind.com/requestDefinition.jsp">request a new definition</a>&#8221; form?</p>
<p>Too bad they don&#8217;t have a blog to which one could send a track back, and there is nothing much on their <a href="http://www.semantifind.com/about/">company page</a> either.</p>
<div style="margin-top: 10px; height: 15px;" class="zemanta-pixie"><a class="zemanta-pixie-a" href="http://reblog.zemanta.com/zemified/97519edf-da9d-4c37-9af3-57c77964e9fb/" title="Zemified by Zemanta"><img style="border: medium none ; float: right;" class="zemanta-pixie-img" src="http://img.zemanta.com/reblog_e.png?x-id=97519edf-da9d-4c37-9af3-57c77964e9fb" alt="Reblog this post [with Zemanta]"></a></div>
]]></content:encoded>
			<wfw:commentRss>http://blog.semantic-web.at/2008/09/23/extending-google-first-look-at-semantifind/feed/</wfw:commentRss>
		<slash:comments>6</slash:comments>
		</item>
	</channel>
</rss>

