Semantic Web Company

The Semantic Puzzle

Open World Assumptions

subscribe RSS

Archive for June, 2010

Report on developments at the European Semantic Technology Market

June 25, 2010 By: Thomas Thurner Category: Corporate Semantic Web, Enterprise 2.0, Literature & Publications No Comments →

The present state of development, future trends and expected market scenarios for Semantic Technologies are shown in the just published “Demand driven Mapping Report”. The report is part of the EU-funded project Value It, which is about bringing together the various stakeholders within the sector: Industry, Research and Government. VALUE-IT preliminary findings show that the STE potential market in Europe will size up to €1.44B for 2014. Scanning furthermore the executive summary of the report, some findings attract attention:

The survey results also show considerable variation by sector, both of policy and technology implementation. With respect to technologies, ICT companies are also the most willing to consider semantic approaches. The ICT sector has an unusually high interest in all ST components, with 20% or more being willing to consider all of them, and over half of IT respondents looking at Web 2.0 (social computing). [...]  The use of tagging technologies – which overall is the least mature approach in the survey – is most advanced in Life Sciences. The Life Sciences, Media & Entertainment, and ICT sectors all have a reasonably strong interest in Natural Language Processing (roughly 25% on average). Ontologies and RDF/OWL are the technologies least often considered, though the interest in these Semantic Technologies is not insignificant. Taxonomies are slightly more popular, perhaps indicating that companies are taking the first step to prepare for a more semantic approach to IT solutions. The ICT, Energy & Utilities, and Media & Entertainment sectors all have a reasonably strong interest in using taxonomies.

The 190 pages report gives an actual overview of the status quo on European Semantic Technology Market and is now available for download: Final demand driven mapping Report

Sphere: Related Content

Vienna 01.07.2010 – Panel discussion on the Future Internet

June 22, 2010 By: Thomas Thurner Category: Conferences & Events No Comments →

Within the last year the SWC’s team run the project called “ZukunftsWeb” (Future Internet). After ten month of in-deep discussion, expert panels, webinars and the becoming of a book on the topic, it’s time to celebrate the past efforts and have also a look into the future. So this is why we want invite friendly to our evening event on july the first. So if you are in vienna that day, join us – we promise a inspiring evening, with nice people and wise talks.

Venue: Filmmuseum Wien
Date/time: 01.07.2010 / 6pm

More about this event in german and english.

RSVP to
eMail FacebookYahoo Upcomingxing

Sphere: Related Content

Stella Dextre Clarke & Alan Gilchrist about the “Future of Knowledge Organization on the Web”

June 21, 2010 By: Andreas Blumauer Category: Linked Data & Open Data, Tools & Software, Vocabularies & Languages 1 Comment →

Semantic Web Company (SWC) had the pleasure and the opportunity to talk with two internationally recognised experts in the fields of information management and knowledge organization: Alan Gilchrist and Stella Dextre Clarke. SWC asked some questions about the “Future of Knowledge Organization on the Web & Linked Data” on the occasion of an event of the same name organised by ISKO UK which will take place on September 14, 2010 in London.

1. Alan, you are one of the leading experts in the field of thesaurus construction. Organising knowledge in a (worldwide) Semantic Web is a rather young discipline compared to your domain. What do you think can the Semantic Web community learn from “traditional” thesaurus management and vice versa?

You put inverted commas round the word traditional, but it might be more appropriate to put them round the word thesaurus! So long as words are used in information retrieval and in information sharing, different forms of structured vocabularies will be required, and many of the fundamental principles of thesaurus construction are still valid for their construction. Of course, the “traditional” thesaurus has mutated since the days when it was used only for controlled indexing and retrieval; and now, with the many enrichments possible it can be viewed as an ontology (in one of the definitions of this word). What remains a difficulty is to create a generalisable typology of associative relationships, though this is, of course, possible in relatively closed systems. In short, structured vocabularies with broadly thesaurus formats will be a necessary component in the web stack.

2. Stella, as a consultant you are specialized in the design and implementation of knowledge structures for information retrieval applications. In the last few months we have seen that SKOS can serve as a significant building block to link “traditional” thesaurus management to knowledge structures from the semantic web. Can you see that this development is market-driven, is there a significant growth of demand for solutions built around SKOS?

This question sounds surprisingly sceptical about the growth of SKOS. I guess the dizzying speed of phenomena like Facebook and Twitter has fuelled expectations of tools springing up overnight like mushrooms, fully formed and ready to eat. But actually it takes time, not just for the tools to be fashioned, but for the potential market to develop an understanding of what they can do and what will happen next when they are used.

Applications for SKOS are springing up all the time, as fast as people can grow the skills and vision to deploy them. At the moment the market, or shall we say the power-base, seems to be with the academic sector and allied not-for-profit organisations. This will spread progressively through the public to the private sector, as enterprises find ways of adapting their business models. The main hurdles to overcome could be intellectual property rights and the need for compilers of databases to keep earning their living.

3. Alan, constructing thesauri for the semantic web also means that one has to make the “open world assumption”. In which sense does this change the way to manage thesauri, keep them growing and assure quality? Can you see new, upcoming methodologies to do that?

Everything changes with the “open world assumption”! Following on from my answer to the previous question, it seems clear that one manifestation of the thesaurus will be found in those systems that support interoperability, such as federated searching or metadata registries. Even with simple thesaurus management software, it is possible to construct a “master vocabulary” or “word bank” to support different applications within an enterprise; thereby promoting interoperability. More sophisticated software is already available (though not very widely); more will be needed and, doubtless, will be created.

A more formal answer to both questions will be found in a new standard – ISO 25964, currently being prepared on the basis of BS 8723. The two fundamental features of these two standards are (1) the thesaurus as a theoretical and practical basis for the construction of structured vocabularies for information retieval and (2) the growing and vital need for interoperability between systems and the intelligent mapping of the vocabularies used by those systems.

4. Stella, just recently at ESWC 2010, Sean Bechhofer was asked during his keynote why there are so few SKOS tools on the market. What do you think are the reasons for this? Are there still shortcomings of the SKOS specification compared to other existing thesaurus standards? (see also: http://www.eswc2010.org/program-menu/keynote-speakers/155-sean-bechhofer & http://www.slideshare.net/seanb/skos-past-present-and-future )

Regarding the speed of development, see my reply above. As to shortcomings, did you note in one of Bechhofer’s slides: “Standardisation is necessarily a compromise: Everyone equally unhappy = success!” The SKOS development team took a conscious decision to keep the schema sufficiently simple that it could be applicable to as many different types of KOS as possible.  On the downside, this means SKOS is unsatisfactory for conveying sophisticated features of some thesauri and classification schemes. But by keeping the entry barrier low, more widespread use has been encouraged.

By way of illustration, compare SKOS with the data model and XML schema of BS 8723. This schema is comparatively specialized, with the aim of enabling exchange of any thesaurus carrying any or all of the features recommended in the standard. And incidentally, this data model and schema will have some further capabilities added when published in the forthcoming standard ISO 25964. SKOS does not provide for a number of features in these standards (such as compound equivalence). But the schemas in BS 8723 and ISO 25964 are designed for thesaurus developers to share their work, rather than for easy publication on the Web, and will never have so many users or associated tools as SKOS.

So I believe that SKOS has done well to accept compromises that encourage generalisation although they might not suit some specialists. That said, I do regret one of its weaknesses in the context of mapping. Compound equivalence mappings (that is to say, where Concept A in one vocabulary maps to a combination of Concepts  B and C in another) are very commonly needed when extending a search across multiple databases, and the SKOS mapping properties do not currently allow for them. Perhaps there will be some provision in future?

5. Stella, Alan, in September ISKO UK will organise an event on “The Future of Knowledge Organisation on the Web”. “Linked Data” seems to be a promising approach to organise knowledge in large scale environments.
Could you imagine that SKOS as a small subset of semantic web specifications will play a central role in this environment since it is quite intuitively comprehensible by virtually any knowledge worker or do you rather think SKOS is too simple (or too complex)? (see also: http://poolparty.punkt.at/using-skos-as-an-interface-to-the-linked-data-cloud )

Stella: Of course SKOS will have a central role (whether or not every knowledge worker finds it as intuitive as you suppose). “Linked Data” will find even wider applicability. ISKO-UK (the organiser of the meeting in London on 14 September) has a mission not just to spread the word about both these technologies, but to build bridges between the several communities who must share their expertise and data to build more exciting applications. We’re expecting an audience of over 100 at this low-cost event.

Alan: Yes, of course, just as all the tools in the web stack will be necessary if semantic web technologies are to be effective. But it is obvious that we are dealing with complexities of a higher order than ever before. Any structured vocabulary is an “artificial language” which, while acknowledging many aspects of theoretical linguistics is forced to be pragmatic in its construction. Consequently, it would not be surprising if SKOS is seen to be “catching up”, and this became apparent in the work of BS 8723 when thesaurus models using UML were being constructed. There remains much work to be done on all fronts.

Stella Dextre Clarke is an independent consultant specializing in the design and implementation of thesauri and other knowledge organization structures. She currently leads ISO NP 25964, the project to update and revise the international standards for thesauri. Previously she was the Convenor of the Working Group which developed BS 8723. In 2006 she won the Tony Kent Strix Award for outstanding achievement in information retrieval, in recognition for her development work on IPSV (Integrated Public Sector Vocabulary), as well as on the vocabulary standards. She is a Fellow of the Chartered Institute of Library and Information Professionals.

Alan Gilchrist has been a consultant for many years in the fields of information management and information architecture, specialising in the vocabulary aspects of information retrieval. He is co-author, with Jean Aitchison and David Bawden of Thesaurus Construction and Use, now in its fourth edition. In 1979 he founded and edited the Journal of Information Science, and is now Editor Emeritus. He has an Honorary Degree (D. Litt.) from the University of Brighton and is an Honorary Fellow of the Chartered Institute of Librarians and Information Professionals.

Sphere: Related Content

Kingsley Idehen: “By declaring its context, Linked Data can be made more easily reusable by others”

June 16, 2010 By: Andreas Blumauer Category: Corporate Semantic Web, Enterprise 2.0, Linked Data & Open Data, Tools & Software No Comments →

Semantic Web Company talked with Kingsley Idehen who is CEO of OpenLink Software and probably one of the most profound experts on data integration issues about “Linked Data”.

The interview covers questions like:

  • How can Linked Data help to make companies more productive?
  • Do you think that the Linked Data Initiative can build upon a stable architecture or will it face more and more problems the bigger the “cloud” will grow?
  • What´s the ultimate argument for an Enterprise Architect to use languages like SPARQL at least in addition to SQL?
  • How will a “Real Time Semantic Web” change the whole game?
  • How will the “Semantic Web” be called in 10 years? Will there still be a “Semantic Web”?

Read the full version of the interview here.

Sphere: Related Content

Lyndon Nixon: “With the hundreds of TV channels available, content selection becomes a significant challenge for users.”

June 07, 2010 By: Tassilo Pellegrini Category: Conferences & Events, Internet & Media No Comments →

Lyndon Nixon

From June 9 – 11, 2010 the EuroITV Conference discusses latest advances and research of media technology, HCI, media studies, and the content creation community. Tassilo Pellegrini talked to Lyndon Nixon, STI International, about the future role of semantic technologies in the television industry and how a Social Semantic Web might influence the traditional television experience.


At this year’s EuroITV conference you will hold a workshop on the EU project NoTube. Can you give us a brief insight what this project is about?

NoTube is all about the future of television! We are seeing a significant shift in viewing patterns driven by the Web, which breaks the linear programming model and makes TV or video on demand a reality, whether it is being provided directly by the broadcasters or via a third party like Hulu or YouTube. The Web-based model taken up by viewers using their PC is being transferred back to the TV set in the lounge by IPTV applications running on Set Top Boxes or Internet TVs which come with Web access built into them. The strong interaction between the desires of users and technology has had its impact on the Web and as the gap between the Web and TV experience grows, we aim to translate features of the Web to TV, such as the personalised and community aspects. The NoTube European project puts the TV user back in the driver’s seat by generating user profiles from data the user creates on the Social Web, and in this way facilitating a personalised TV experience without an intrusive user profiling process.

What promises does the Social Semantic Web hold with respect to innovate the television experience? What is the vision?

With the hundreds of channels available via modern TV providers, content selection and dealing with the vast amount of TV-related information become significant challenges for users. TV metadata is created and distributed by a small group of people, as a result of the closed-source information exchange protocols that are the standard for providing electronic programme guide (EPG) data to users. Yet people often have several clusters of personal data on the Web, such as their profiles on social networks, or ratings of videos on YouTube and IMDB.

Analogously, there are many isolated clusters of broadcast data on the Web, such as broadcast data on EPGs and background information on Wikipedia. Within the NoTube vision context, we speculate that the conjunction of all these bits and pieces of data provide accurate information on someone’s interests, which is suitable for generating relevant recommendations on TV broadcasts. We see progress on opening up this data with open standards and APIs such as Google’s OpenSocial, Facebook’s OpenGraph, DBPedia, the BBC ontologies and FOAF. Further, we assume that Semantic Web technologies provide important building blocks for realizing this vision, as they enable the global identification mechanism of URIs and the means to define relations between data anywhere on the Web. By integrating these different pockets of data, we can provide TV viewers with personalised recommendations for their viewing.

What economic effects on the value chain do you expect from semantically empowered television? Will there be new revenue opportunities with respect to advertising or Pay TV models?

Our primary focus is on open source and open standards, so for example we are extending the open source MythTV media centre to develop first scenarios of personalised EPGs. However, down the road there are clearly commercialisation opportunities.

Another scenario in the project looks at personalised advertising, which is clearly somewhere where there are revenue opportunities. However, we take user privacy very seriously, and one aspect we need to tackle in NoTube is the fine line between analysing user activity (in order to personalise their TV experience) and using that analysis commercially.

The third NoTube scenario involves pushing personalised news streams to TV viewers. Here, one could imagine that such a service could be packaged within a Pay TV offer, and used to give competitive advantage or justify a higher fee.

Despite many attempts experience has shown that television is a rather conservative and innovation-averse medium. What can be done to stimulate the uptake of semantic technologies in the television sector?

That’s true; in the traditional broadcasting sector the larger companies are extremely slow to adopt new technologies. However, I think Web video and TV has really shook up the sector – traditional broadcasters are seeing that they lose viewer share to Web-based offers and have been quick to take their video material to the Web. There is a clear demand for this, look at the viewing numbers for BBC’s iPlayer in the UK for example.

IPTV also means that new applications and services can be built on top of traditional TV. I think once the broadcasters see the added value of offering applications and services tied into the content of their programming – such as through semantic analysis of the program metadata, which NoTube is doing – they will be encouraged to support better these efforts. The BBC is really taking a lead in this, publishing a lot of their data already in RDF.

Workshop Information

The NoTube workshop on Future Television: integrating the Social and
Semantic Web
will take place at the EuroITV 2010 conference in Tampere, Finland on June 9, 2010.
For more information please see

http://www.euroitv2010.org

and

http://www.notube.tv/news/73-futuretv-2010

For more information about NoTube, please see

http://notube.tv and follow our blog, at http://blog.notu.be

About Lyndon Nixon

Dr. Lyndon Nixon joined STI International as senior postdoctoral researcher in November 2008. Previously he was a researcher at the FU Berlin, where he acted as Industry Area Co-Manager of the EU Network of Excellence KnowledgeWeb and double Workpackage Leader in the EU project TripCom. In KnowledgeWeb, Dr. Nixon organized and led activities promoting the transfer of semantic technology to industry. He received his PhD in January 2007 with the topic ‘Semantic Web enabled Multimedia Presentation system’. His research focus is Web-based TV/video and the semantically guided integration of Web-based content, and he has several publications and has organized a number of workshops around related themes.

Sphere: Related Content