The Semantic Puzzle

Tassilo Pellegrini

Is OpenCalais becoming a Search Engine?

Open Calais Logo

From the very beginning I was wondering, what Reuters is going to do with all that data generated by OpenCalais. So I took a moment and browsed through the Privacy Statement (formerly their Terms Of Use), stepping over an enlightning paragraph:

We may build a search capability in the future. This capability would allow users to search the metadata repository and receive back a list of entries that match that search criteria. Unless you have authorized it via an APIAn application programming interface (API) is an interface implemented by a software program to enable interaction with other software, similar to the way a user interface facilitates interaction between humans and computers. APIs are implemented by applications, libraries and operating systems ... parameter, this list would not include the original metadata contained in the document but would expose the URL and description of the original document if you have provided it to us. If you do not want your content included in the search functionality you should indicate so in the appropriate area of the API. If you want to maximize the exposure of your content on the web you should not opt out of inclusion in the search functionality.

Hypothetical in wording this paragraph states it very clear: engagement in the search market is definitely an option. But they even go one step further.

We may build a syndication capability in the future. This capability would allow us to generate feeds of content that match certain selection criteria based on the metadata. As with search, unless you have authorized it via an API parameter, these feeds will not expose the original metadata contained in the document but would expose the URL and description of the original document if you have provided it to us. If you do not want your content included in the syndication functionality you should indicate so in the appropriate area of the API. If you want to maximize the exposure of your content on the web you should not opt out of inclusion in the syndication functionality.

This sounds to me like content reselling business. In this regard it might be interesting to take a look at the latest developments from IPTC: a policy standard called ACAP, which stands for Automated Content Access Protocol. Its designed to express access policies for robots on content items. Coupling ACAP with (hypothetical) search capabilities of OpenCalais could result in a major commercial distribution engine especially for traditional media content owners. Especially with the following marketing capabilities in mind:

We may build other products in the future based on statistical or other analysis of the metadata, such as trend analysis, emerging topics or others. In no case will these products expose the original document’s metadata.

Finally a business model for the Semantic Web? Whatever … smart guys, great service!

This entry was posted in Mashups & Web services, Privacy & Information Ethics, Search Engines and tagged , , , , by Tassilo Pellegrini. Bookmark the permalink.
Tassilo Pellegrini

About Tassilo Pellegrini

From Wikipedia, the free encyclopedia: Prof. (FH) Dr. Tassilo Pellegrini (born 1974) studied International Trade, Communication Science and Political Science at the University of Salzburg and University of Málaga. Since end of 2007 he is running the New Media Division at the University of Applied Sciences in St. Pölten. He obtained his master degree in 1999 from the University of Salzburg on the topic of telecommunications policy in the European Union, which was followed by a PhD in 2010 on the topic of bounded policy-learning in the European Union with a focus on intellectual property policies. His current research encompasses economic effects of internet regulation with respect to market structure and basic civil rights. He is member of the International Network for Information Ethics (INIE), the African Network of Information Ethics (ANIE) and the Deutsche Gesellschaft für Publizistik und Kommunikationswissenschaft (DGPUK). Beside his specialisation in policy research and media economics Tassilo Pellegrini has worked on semantic technologies and the Semantic Web. He is co-founder and Head of Division Research and Development of the Semantic Web Company in Vienna, co-editor of the first German textbook on Semantic Web and Conference Chair of the annual I-SEMANTICS conference series founded in 2005.

6 thoughts on “Is OpenCalais becoming a Search Engine?

  1. Thanks for your interest in ACAP. Please note, however, that ACAP is the initiative of the European Publishers Council (EPC); the World Association of Newspapers (WAN); and the International Publishers Association (IPA). Please don’t hesitate to get in touch for any further information. http://www.the-acap.org

  2. Pingback: Ο Σημασιολογικός Ιστός εισβάλλει στη Wordpress, στο Yahoo, στο Digg, στο πρακτορείο Reuters!

  3. As a matter of fact, the terms you are quoting are gone, even the url is not valid anymore (just found general terms at http://opencalais.com/terms).

    So either a) you made that up or b) you lift the lid on something here … or c) they just happen to change their site structure in the meantime. I don’t know why but I think b) is right. Right?

  4. Yes, I realized that OpenCalais has updated it’s Terms of Service. They shifted the original references to their Privacy Statement (http://opencalais.com/privacy). Anyway the quoted text reveals a lot about the SemTech Business. The straightforward question simply is: What are they gonna do with all the metadata? The entry above is just one scenario based on what OpenCalais wrote themselves. But it’s plausible … isn’t it …

  5. Pingback: How Semantic is Linked Data? OpenCalais Launches Semantic Proxy « GrowthTimes

Leave a Reply

Your email address will not be published. Required fields are marked *

*

You may use these HTML tags and attributes: <a href="" title=""> <abbr title=""> <acronym title=""> <b> <blockquote cite=""> <cite> <code> <del datetime=""> <em> <i> <q cite=""> <strike> <strong>