Andreas Blumauer

Open Intranet

The following blog post was used by Andreas Blumauer as a basis for a talk at TEDxVienna on Monday, November 29, 2010:



Open Data, Open Government, Open Source, Open Innovation – “Open” everywhere. Today I want to talk about another “Open something”: The “Open Intranet”. This might sound a bit radical but it will also help to reflect a little bit on the term “open” in general.

“Open Intranet” – isn´t this a contradiction by definition? What is understood by “Intranet”? It means a network of computers and users “within” some organisational boundaries. But boundaries don´t necessarily have to be closed as nature teaches us: Organisms aren´t closed systems. A watch would be an example for a closed system but living organisms tend to be open – to survive. Of course they aren´t totally open, in systems theory we are talking about systems which are structurally coupled with their medium when we refer to this special kind of openness. As an example, an immune system, having learned to recognise a class of virus it will remain sensitive to that and similar viruses in future. In contrast to this, imagine a fly walking over a painting of Rembrandt: Since the fly isn´t structurally coupled to the cultural space of human aesthetics it is not “open” to the beauty of Rembrandt´s work.

When we think of today´s intranets, we can see that they tend to be isolated from the world wide web, they don´t seem to perceive the internet as their medium. From a user perspective, those two systems aren´t connected to each other. Typically, when working on the intranet we jump from time to time to be in the “internet mode” and start to Google something, we copy it, jump back and paste it into the intranet. It´s the user who is the only part of the whole system connecting the internet with the intranet. Isn´t this exhausting for us?

And now I start with the good news: Intranets all over the world start to open up, slowly – but they do. It seems like the “pressure” from “the outside” just became too huge. In the first instance it seems that it´s not the data and the information which will “break” in, it´s rather the “cool functions” which web apps offer and which we (as digital natives) would like to have in our intranets too. We want:

  • better search,
  • more possibilities to interact with information,
  • integrated views instead of jumping around,
  • and we want more possibilities to self-serve our extensive hunger for more and well structured information.

On the information level intranets are still rather conservative: Typical pieces of information already “injected” from the web into an average intranet would be:

  • weather forecasts,
  • stock exchange rates,
  • time zones and
  • jokes.

How could companies use the web to inspire their employees (without opening up totally), how could the web “inject” the right amount of information into an intranet to make an enterprise portal as vivid as the web is being perceived by today´s typical end-user. How could this tremendous amount of data and knowledge on the web be “structurally” coupled with intranet repositories and workflows? What are the advantages a company could gain from publishing (at least some) data on the web?

Let me give you a few examples for intranet apps which have started to consume other information than jokes from the web:

  • Enterprise Mashups: Combine CRM systems with social networks like LinkedIn
  • Open innovation: Let´s bring the knowledge of consumers and producers together and improve certain products and services. As an example, just recently after BP´s oil spill more than 40.000 people came up with ideas on how to clean up  the oil, more than two dozen were deployed to help clean up the oil
  • Content Augmentation: Enrich content which is being edited, let´s say in an enterprise wiki, automatically with some background knowledge from Wikipedia or with news from a news company

Finally I will also give you two examples for use cases where companies expose and publish internal data on the web (without violating privacy) and benefit from it.

  • Wisdom of the crowd: The Canadian gold mining group Goldcorp made 400 megabytes of geological survey data available to the public over the Internet. They offered over $500,000 to anyone who could analyze the data and suggest places where gold could be found. The company claims that the contest produced 110 targets, 8 million ounces of gold, worth more than $3 billion.
  • Prize economics: Netflix, a movie rental service in the US has published data for a contest to improve their recommender engine. One team out of 50.000 contestants after nearly 3 years has improved the existing recommender engine by more than 10% and won 1 Million dollar

To end with a conclusion: What Tim Berners-Lee has demanded in one of his famous TED talks was “raw data now!”. It has started to become reality. Just think of all the “Open Government Data Initiatives” around the globe which were initiated since then. Now companies with a “Web DNA” have started to understand the value of open data and to contribute their “5 cents” to the global “open data cloud”. I think this will not only be of value for many companies but also will increase tremendously the chances to resolve some global problems in the near future.

Tassilo Pellegrini

Jordan S. Hatcher: “Why we can’t use the same open licensing approach for databases as we do for content and software.”

jordanJordan S. Hatcher is, among other things, a lawyer, academic, and entrepreneur working on Intellectual Property and Internet law issues in the UK and worldwide. He is heavily involved in the Open Data Commons initiative. Last month he gave me an interview on IPR issues associated with data licensing. His brief answer to the question why data needs a seperate licensing framework:

The answer to me is that database and data are different.  They’re different legally and different practically in what consumers and producers of open data want to do with it.  They’re also different in what the future looks like in terms of things like linked data.

Read the details in the full interview.

Reblog this post [with Zemanta]
Thomas Schandl

Attending TopQuadrant’s SemWeb Technology Training

There’s a lot to know about semantic standards, languages, technologies and their application, so last week I attended TopQuadrant’s first European training from Oct 5th to 9th in Amsterdam.

We kicked off with Eddy Vanderlinden elaborating on the lessons he learned from 30 years of work in the financial sector. He outlined how improvements could be achieved by using data models relying on semantic web standards. You can read about his ideas in this essay.

TQ’s chief scientist Dean Allemang then continued with his talk “Enabling Creativity at the Edge”. “The edge” refers to the boundary between an information system and the real world, where the end users of a system work. As business needs change faster and faster, the people working at the edge need to be able to adapt the company’s applications on their own and shape them to their everyday needs.

Dean Allemang

Dean Allemang

Nowadays end user often achieve this kind of creativity on the edge by using self-made spreadsheets. The problem with that is their lack of interoperability. These data from different spreadsheets, databases, reports, etc. are often connected through business processes that rely on repetitive and error prone human processing, like copying things from a spreadsheet to a database, creating a report and pasting its result into another system, and so on.

The result is a complex system with many heterogenous parts and an organisation that cannot possibly know what it knows.

As a solution Dean proposed to “think outside the table” and go beyond the relational database way of orgranising data. This of course can be achieved by integrating the data using semantic technologies. TopQuadrant’s software offers possibilities to do just that, and makes it possible to create highly customizable dashboards and applications that all rely on the same data.

During the following days we learned about the ins and out of using semantic standards and languages and tried out TopBraid tools in several hands-on excercises. The TopBraid Suite is a very powerful, commercial toolkit. It includes TopBraid Composer, Live and Ensemble. Composer is a semantic web modeling and application developement tool, that uses the Eclipse framework. TopBraid Live is a server for semantic applications built with TopBraid Ensemble. Ensemble is a graphical application assembly toolkit, that enables end users to create custom apps that run in a browser and use RDF data and data models – thereby allowing for the above mentioned “creativity at the edge”.

I am very impressed with the capabilities of these tools, they enable the user to realize manifold possibilities that come with using semantic web standards – and that without programming. You can see some of these tools in action and learn about applying semantic standards in a series of webcasts from Semantic Universe. For the latter topic you might also attend one of our webinars.

On the last day Dean coverd several case studies, like connecting ontologies to legacy data sources (using e.g. D2RQ inside Composer), applying semantic technologies to the customer service management of a larger retailer or using ontologies in Federal Enterprise Architecture.

All in all I am very happy to have attended TopQuadrant’s training and hope they will establish a successful series of trainings in Europe just as they did in the US.

Jana Herwig

A. Shakya: “From hyperlinking to data linking”

[This article was written by guest author Aman Shakya and originally submitted as an entry in our LinkedData Vision Competition]

The current gigantic network of web documents could be realized by enabling any user to publish any document and link to other documents. If we want to see the network of Linked Open Data explode on a similar scale, we need to enable general users to publish “data” directly on the web and link to other “data”. We need to move the paradigm of web page publishing and hyperlinking towards data publishing and data linking. We should enable people to post structured data about anything rather than just unstructured text. We need the active participation and contribution of the billions of worldwide internet users. Recently, the web has seen enormous user participation with the rise of easy-to-use social software. We should exploit this trend of social web applications, however, for enabling people to create, share and link “data” on the global Linked Data Web.

With this vision, I am working on a social Semantic Web application called StYLiD (an acronym for Structure Your own Linked Data), now available at www.stylid.org. A basic demo video is also available at www.stylid.org/quickstart.php#video

It enables people to share a wide variety of structured data with the freedom to define their own structured concepts on the fly. Continue reading