Semantic Web Company

The Semantic Puzzle

Open World Assumptions

subscribe RSS

Archive for the ‘Social Software’

Invited Talk at IFRA 2009

September 25, 2009 By: Tassilo Pellegrini Category: Calls & Competitions, Conferences & Events, Internet & Media, Knowledge Management, Mashups & Web services, Social Software No Comments →

I will give a talk about the relevance of Semantic Web and Linked Data for news publishers at this year’s IFRA summit in Vienna on October 15, 2009. IFRA is the World Association of Newspapers and News publishers and within their Technical Group Publishing they are starting to deal with Semantic Web. Further invited speakers are Michael Steidl (IPTC) and Robert Schmidt-Nia (dpa mediatechnology).

Reblog this post [with Zemanta]
Sphere: Related Content

Semantic Web Awareness Barometer 2008 – Preliminary Results

March 20, 2009 By: Tassilo Pellegrini Category: Literature & Publications, Social Software 1 Comment →

First results from our last online survey “Semantic Web Awareness Barometer” are now available. We conducted the survey togetehr with the Corporate Semantic Web Initiative from the FU Berlin and the Know Center in Graz. We got 256 valid cases (from 561 responses) which reveal some intertesting results concerning the experience , expectations and readiness for Social Software and the Semantic Web. In short:

Social Software
1. Wikis are king! Social Bookmarking stays behind.
2. Differring applications & usage patterns of social software
3. Differring notions about the benefits of and barriers to Social Software
Semantic Web
1. Semantic Web is something familiar!
2. Application-oriented catch up – but where are the young academics?
3. „I taught myself about the Semantic Web.“
4. Semantic Web has a corporate relevance!: Search – the killer app! Integration costs & data control might be important aspects.
5. Differring notions about the barriers?
6. Competencies and collaboration will change …
7. Time to market 2 – 5 years!
8. No differences in region, IT competence & familiarity
We will give a short presentation at today’s Semantic Web Meetup in Berlin. If you can’t join us, don’t worry! You can download the slides right here: Semantic Web Awareness Barometer 2008 – Preliminary Results
A detailed report will be available by April.
Reblog this post [with Zemanta]
Sphere: Related Content

OntoWiki Kick-off in Leipzig

December 03, 2008 By: Andreas Blumauer Category: Companies & Institutions, Conferences & Events, Linked Data & Open Data, Ontology Engineering, Search Engines, Semantics & Philosophy, Social Software 1 Comment →

Virtuoso+DBpedia+OntoWiki together with several industry relevant uses cases – that´s about the formula of the OntoWiki project, which was launched yesterday in Leipzig.

Sören Auer and his team from AKSW at Uni Leipzig are the coordinators of this EU funded project which supports the development of innovative software products. All industry partners are SMEs which offer services for different fields like E-learning, E-tourism or Business Intelligence. Leipzig and OpenLink Software will work on an integration of OntoWiki & Virtuoso.

The first day of the meeting was, of course, dedicated to socialize and get to know each other. The mixture of the project team turned out to be well chosen – and in the evening we flew at higher game: We had a nice overview over Leipzig standing on the highest building of the town.

On the second day of the meeting Orri Erling, Program Manager at OpenLink Software, came up with an idea which is pretty forward: Why shouldn´t we provide OntoWiki as a Linked Data Browser, e.g. on top of DBpedia etc.? One possible outcome of this project.

Some other use cases which make already use of the existing OntoWiki system were demonstrated: Take a look at Vakantieland (…and start to plan your holidays in the Netherlands) and also at LinkedGeoData where a nice user interface can be tried out.

The Kick-Off Meeting will proceed with two workshops dedicated to semantic technologies and to Application Development with the OntoWiki Framework. Thanks to Sören and his team for the excellent hosting of this event!

Sphere: Related Content

KiWi as a Social Wiki Platform for Software Development, Open Ontology Management

November 28, 2008 By: Jana Herwig Category: Knowledge Management, Social Software, Tools & Software 1 Comment →

KiWi – Knowledge in a Wiki, Day 2 – Josef Holy from Sun Microsystems Prague led the first part of today’s use case presentation. With the KiWi semantic wiki system (or: wiki on steroids, as Josef Holy put it), they want to be able to increase the productivity of knowledge workers. Sun Microsystems have extensive experience with online and community collaboration and they want Kiwi to become a social wiki platform that is deployable in various contexts, i.e. that ties in with other platforms such as Netbeans or Zembly.

One of Sun’s further assumptions is that users will migrate to KiWi neither immediately nor completely – and that’s an insight anyone developing yet another social platform should take to their heart. What was true in Field of Dreams – “If you build it, they will come” – does not quite apply here. The network effect works in favour of existing communities, and instead of striving to replace an existing platform, one might be better off with mashable contents and services.

The particular benefit of a semantic wiki is that it allows moving from unstructured to structured information (relatively) easily. For KiWi @ Sun (and in favour of mashed information), this means that what is relevant will be structured, both by people and by machines – a process that is going to extend beyond company boundaries. People will bring in structure by creating links from KiWi documents to external systems as well as by writing new facts (which the KiWi system will represent as triples) about external information. What is not relevant, won’t be structured – and will be forgotten. After all, it’s forgetting that makes you remember the important stuff.

Sun Microsystems use Case

One note about the users of KiWi at Sun: Since this use case focuses on knowledge management for software development, it can be taken for granted that users will have an above-average level of web savvyness. Primary users will be software designers (i.e. the people who design for the users of the final product) and developers – learn more about the different roles in a software development project at Sun here.

Consequently, the User Interface (UI) concept Josef introduced also comprises a social networking unit – things such as a ‘My Contacts’, ‘My Pages’ list, but most importantly an activity feed, which will help users to collaborate, participate, discover activities that others are currently working, develop a mental ‘social map’ of the community. Such an activity stream (similar to Facebook’s News Stream) would contain items such as:

  • Szaby wrote a blog post
  • Josef rated document XUI specs: five stars
  • Peter created document ToDoList KiWi-UI
  • Stephanie is now a contact of Marek
  • Klara shared a document with Sebastian

Considering the target group, it is also planned that the UI will be extensible through widgets that users are able to write themselves.

*coffee break*
KiWi Team Meeting Vienna
Above: The KiWi-Team, hailing (officially) from Austria, the Czech republic, Denmark and Germany

After the break, Andreas Blumauer (Semantic Web Company, Vienna) followed up with a talk entitled “Open Ontology Management & Linked Data” which explored the uses of the Web of Data for the Sun usecase.

His argument was that content and topic-centred, open communities should have mechanisms at their disposal for relating content and activities to particular parts of a shared concept model, e.g. of an ontology. In particular in projects like NetBeans, where contents and related processes evolve over time, different NetBeans groups utilizing the KIWI system should be allowed to maintain and share their own concept models. The combination of bottom-up and top-down approaches would, for instance, come as the combination of free tagging (where people often use different labels to refer to the same, or the same label to refer to different things) and concept tagging.

Free and Controlled Tags

Free concepts can be turned into controlled ones, too, by being inserted into an existing controlled vocabulary, as either a narrower or related concept of any existing controlled concept. Open Ontology Management done this way is a Learning system: Through the combination of a Free Extraction Model (FEM) and a Controlled Extraction Model (CEM), text extraction improves over time.

Andreas also revealed a first glimpse of a project currently in stealth mode, code name ‘PoolParty’, which is an Open Ontology Management System that can be used to enrich local knowledge with data from the web. PoolParty consumes Linked Data and provides Linked Data; in the context of the current use case, it will be able to communicate with the KiWi System. Please contact Andreas if you would like to be notified about the further development of PoolParty.

Reblog this post [with Zemanta]
Sphere: Related Content

Knowledge Management for Project Management: from unstructured to structured information

November 28, 2008 By: Jana Herwig Category: Knowledge Management, Social Software, Tools & Software 1 Comment →

KiWi – Knowledge in a Wiki session, pt. 2: This afternoon, we turned to the Logica use case, which is dedicated to the development and optimization of KiWi as a knowledge management tool specifically tailored to the needs of project management.

Regarding the use case requirements: As Daniel Grolin, a process expert and business architect at Logica (formerly WM Data), pointed out, what is most required at the moment is an application for designing processes, i.e. for designing the ways that people do things. This can be a painful process, in particular if one group of people (consisting of process designers) thinks about the ways that another group of people (e.g. the project managers) are going to do certain things – a collaborative approach should be able to

1) alleviate this challenge
2) generate commitment among the involved parties.

The primary users will be on the one hand the process engineers, and on the other hand the project managers who are the recipients and users of these processes.

In his presentation, Daniel Grolin chose one of four scenarios in which KiWi would ideally be employed: the risk analysis process – which is a vital process for Logica, as the outcomes of this analysis influence the decision whether or not a project will be accepted. From an architectural point of view, KiWi is going to mediate between the process guidance column – which consists of process and workflow features – and the final work product, i.e the result of a process, in this case the report of the risk analysis.

In practice this means that if, for instance, a user has selected the risk analysis process, the Kiwi core system and enabling technologies will provide concepts related to risk analysis, supporting the user in the tagging process. Wiki technology is already being used in the industry, said Daniel, but what is lacking at the moment is the integration of structure, and this is also where he sees the potential of KiWi as a knowledge management tool, and as a means to move easily from unstructured to structured information (by the way, if you are interested in using wikis in the enterprise, I also recommend this article: Wikis for Knowledge Engineering, and in Global Businesses).

Karsten Jahn

Karsten Jahn (Aalborg University) then gave us a preview of a possible user interface (i.e. not of the screen design, but the functionalities) which seeks to address one particular problem: Many companies use many different, sophisticated tools which operate fine on their own, but are not integrated (i.e. there is no communication or exchange of data between them). With KiWi, the aim is to develop a tool that is going to be able to cover all features and processes currently being taken care of by individual tools, to allow for an optimum of data integration.

To conclude, Rolf Sint (Salzburg Research) showed us screens of the current configuration of KiWi for Logica’s needs – the example below is related to the risk analysis process outlined by Daniel Grolin above.

Logica Kiwi Wiki

Reblog this post [with Zemanta]
Sphere: Related Content

Content Versatility in the KiWi Core System

November 27, 2008 By: Jana Herwig Category: Social Software, Software Development, Tools & Software No Comments →

It’s been five months since the last Joint Work Package (WP) meeting in the KiWi – Knowledge in a Wiki – project. This morning, we gathered in Vienna for the next round – focus this time around will be on the core system (architecture developed by the WP3 team, handing over and paving the way for WP 4 team) and the use cases (Logica, Sun Microsystems) where it is of particular importance that everyone involved in the project understands the requirements of the use cases.

In the first presentation today, Sebastian Schaffert from Salzburg Research gave us a tour of two different configurations of the KiWi system. The KiWi core system is oriented towards content versatility, meaning that content items can be displayed and used in various contexts and configurations. As a service to the user, KiWi uses Javascript-based WYSIWYG Editor TinyMCE enhanced with a few home-grown plug-ins which, for instance, make it easier to set links to other wiki pages. Memorizing wiki shorthand is sometimes a challenge, so this feature helps getting things done.

Using a different skin and interface, KiWi can take various forms and shapes – even shapes where you might not spot the wiki in it at first glance. TagIT is such an example of an adaptation of the KiWi core system: a geotagging platform targeting youth in Salzburg who can locate, tag and comment on places that matter to them.

Vice versa, KiWi in its wiki incarnation displays a little map, provided a content item is enhanced with geoinformation; technically, the map on the wiki page is an interpretation of a georelated tag (learn more about complex, structured tags proposed by the KiWi Enabling Technologies Work Package in this article: Usage Data Model Day in the KiWi Project).

Take a look at the screenshots below:

KiWi-Screenshot

It is the same article that is being displayed, in the first example using the classic KiWi interface, in the second example using the TagIT interface with the article appearing as an info page.

TagIt Screenshot

This afternoon, we expect to see another configuration of the system, in a presentation about how the system is specifically tailored to the needs of Logica’s “Knowledge Management for Project Management” usecase.

N.B. The system is not yet publicly available, if you have questions, please contact Sebastian Schaffert.

Reblog this post [with Zemanta]
Sphere: Related Content

Short Semantic MediaWiki Tutorial (with link to sandbox)

November 05, 2008 By: Thomas Schandl Category: Conferences & Events, Knowledge Management, Linked Data & Open Data, Social Software, Tools & Software, Videos & Tutorials 3 Comments →

On the occasion of the recent publication of our book, Social Semantic Web, we have created an accompanying wiki for you to explore the contents of the book and obtain information about its authors. Staying true to the motto “Eat your own dog food”, the Semantic Web Company has used a semantic wiki for that purpose.

We opted for Semantic MediaWiki (SMW) and the extensions Semantic Forms and Semantic Drilldown. In this blog post we’ll take a look at the handy features you get with these. This short tutorial is based on my SMW demonstration at the Web of Data Practitioners’ Days in Vienna two weeks ago.

As the book is in German, the wiki is set up in German, too, but that shouldn’t be a problem for understanding the demonstrated features. For the following examples, we have created a mirror of our productive wiki, so don’t hesitate to edit and play with this mirror wiki (we might refresh it occasionally, so don’t write any data into the wiki that you don’t also have stored elsewhere). This tutorial is going to take you through the following SMW features:

  • Automatically created lists
  • Faceted search
  • Semantic queries
  • Entering data via forms
  • RDF export

So let’s see what these features hold for us.

  • Automatically created lists

A common problem in wikis like Wikipedia is the (amount of) effort it requires to create and maintain various lists like the list of the EU’s largest cities. It’s an equally laborious and error-prone activity to keep such lists up to date; as a result, there are a lot of useful Wikipedia lists we can imagine that don’t exist at all, like a list of the world’s largest corporations with a CEO younger than 35.

In SMW it is easy to create all kinds of lists with queries. This page for the book’s table of contents is an example. View its source to see the inline queries used to generate the page (click to enlarge or view the source on the wiki):

Semantic Media Wiki Query

As a result, the list is generated afresh any time the table of contents page is called up. If the data on an article’s page has changed, it will also be updated in that list – while in regular MediaWikis one has to manually update the data in both places (the article, and the list), which, apart from the extra work, also makes errors and inconsistencies much more likely.

  • Faceted search

Take at look at the list of articles page… (more…)

Sphere: Related Content

The Future, Quantum Encryption, Privacy on the Social Semantic Web

October 28, 2008 By: Jana Herwig Category: Semantics & Philosophy, Social Software No Comments →

Just two memos: There is a talk tonight with Thomas Länger from the Viennese quantum encryption project (BBC article about the project), co-organized by quintessenz (an organisation devoted to civil rights in the information age) and Transforming Freedom (who are dedicated to documenting the discourse of the battle zones of digital culture; I volunteer for them). ORF wrote a German article about it, with information about the venue and start time. The key issue quintessenz want to raise with this talk is: Who is going to benefit? Will “unbrekable” quantum encryption become available to citizens, too? Quantum encryption cartridges for your PC, anyone?

Secondly: I published an “inaugural interview” Marion Fugléwicz-Bren did with two of my colleagues, Matthias Samwald and Thomas Schandl (not so inaugural for the former, as he already joined SWC in January). I’d like to extract this quote by W3C member Samwald regarding privacy on the (corporation owned) social web and the future (user-managed) social semantic web:

I also think that Semantic Web technologies will receive a lot of media attention when the first big, public breach in security / privacy happens in one of the websites that currently dominate the whole world wide web. At the moment, we all are uploading most of our private and business lives to web sites such as Google, Facebook, Flickr and others. It is just a matter of time until a big scandal happens, be it the companies themselves that misuse the vast amounts of data they have, or be it a government agency in an overzealous effort of crime prevention.

When this will happen, people will re-evaluate the trend towards massive centralisation on the web, and will search for opportunities to make the same feeling of being ‘in the network’ happen in a distributed environment, without selling ones soul to a multinational corporation. Then we will find that such an opportunity already exists — the Semantic Web.

Read the whole interview here.

Sphere: Related Content

Multimedia in the Web of Data – Annotating and Interlinking Photos, Music, Multimedia [WOD-PD]

October 23, 2008 By: Jana Herwig Category: Conferences & Events, Internet & Media, Linked Data & Open Data, Mashups & Web services, Social Software 4 Comments →

The Web of Data Practitioners Days concluded with the session on Multimedia in the Web of Data, the first part of which was led by Ansgar Scherp (University of Koblenz-Landau, Germany).

Multimedia content, as Ansgar pointed out, is hardly annotated, badly organized, and hardly ever looked at again – just think of the 300 something pics you might take on an average week-end getaway, and which you never touch again. Annotating multimedia content requires a lot of work and dedication – but most of the time, these pictures eventually dissappear in the “digital shoe box” that is your photo management software.

The most obvious remedy is to annotate content as early as possible, ideally when creating the content, ideally already on your portable camera (formerly known as: mobile phone:) Ansgar suggested to provide incentives for people to encourage picture annotation – professionals could for instance receive a higher financial reward if the deliver already annotated pictures. And of course there are ‘Games with a purpose’ such as Google Image Labeler, where players tag images in pairs, with and against each other, and are rewarded with the entertainment factor of the game.

The slide below shows what has happened (or will happen) to the process of creating photo books in the digital age and the age of mashups:

Ansgar Scherp's slides

After all, this is the age of the social semantic web, so why not try and (re-)use the content, structure and contexts that other users have already created on the web? Content augmentation, for the scope that Ansgar is concerned with, consists in the reuse of content and structures (e.g. from sources such as Flickr and Wikipedia, Geonames) made possible through the definition of rules, e.g.:

  • If there are two or less pictures on a page*
  • then automatically augment the page with additional photos using location information.

* Page here means a page in the album you are currently working on – you probably took a picture of yourself and your friend in Paris, and even though you went to the Centre Pompidou, you forgot to actually take a pic of the building itself – well, let the web be your library!

So the goal is clear: develop a procedure for applying automatic content augmentation in the creation of good photo books.

But what makes a ‘good’ photo book anyway? Here are some of the results of a structural analysis of real, human-created photobooks conducted at CeWe Color:

  • % of photos with faces: 36%
  • Number of album pages: 16.96
  • Photos per page: 6.69
  • Text fields per page: 1.45
  • % of pages with text: 87%

There are many rules that can be established from the structural analysis, which can be applied in turn in the creation of photoboooks, e.g. rules like this one,

  • If the text located in the upper third of a page
  • if the font size is equal or larger that 16 points
  • if the number of words is less than 10
  • if there is no caption on the page that has a bigger font size
  • then this page is the title

Ansgar recommended xSmart, which he described as a “context-driven authoring tool for page-based multimedia presentations.”

Ansgar’s presentation was followed by two more: one by Yves Raimond on Interlinking Music on the Web of Data, and one on Interlinking Multimedia – in spite of better intentions, I did not manage to cover these two in detail, but at least I gathered the links to relevant resources from all three sessions… (more…)

Sphere: Related Content

Semantic Desktop, Lifting and Human Language Technology [WOD-PD]

October 22, 2008 By: Jana Herwig Category: Conferences & Events, Search Engines, Social Software 2 Comments →

The next session at WOD-PD was given by Leo Sauermann (German Research Center for Artificial Intelligence DFKI, Germany), and Brian Davis (DERI Galway, Ireland). Leo introduced the idea of the Semantic Desktop, and more specifically, the Nepomuk Social Semantic Desktop. There’s good article about Nepomuk on Linux.com, written by Bruce Byfield on August 26, 2008, from which I quote the following, enlightening passages:

Ansgar Bernardi, deputy head of the Knowledge Management Department at Deutsches Forschungszentrum für Künstliche Intelligenz (DFKI, or the German Research Center for Artificial Intelligence) and Nepomuk’s coordinator, explains, “The basic problem that we all face nowadays is how to handle vast amounts of information at a sensible rate.” [...] “The point is, you have a vast amount of information on your desktop, hidden in files, hidden in emails, hidden in the names and structures of your folders. Nepomuk gives a standard way to handle such information.”

At a high level of generalization, Nepomuk has three main aspects, according to Bernardi. First, there is a standard framework for annotating pieces of information so that connections can be made between them. Second, there are ontologies, the sets of “documented shared understanding” or common concepts that can be defined for particular types of information, such as bio-science or computer desktop use. Finally, there are the tools for making or using the annotations and ontologies, what Bernardi calls the “workspaces that connect to other workspaces and help you in your day to day activities of collecting information, structuring it, making sense of it, and creating new information and communicating it.”

Leo has provided the relevant download links for those who “want to get their hands dirty” with Nepomuk (as he put it) on his blog. Leo Sauermann and Ansgar Bernardi also contributed an article about the Semantic Desktop to the recently published Social Semantic Web volume – a preview of the article is available here (in German – I’m sorry!).

Brian Davis‘ part of the talk focused on Lifting and Human Language Technology (HLT) for the Semantic Desktop – Semantic Lifting means to capture semantics and translate them into ontologies. Human language technology (HLT), in its broadest sense, can be described as computational methods for processing and manipulating language (for instance text analysis).

One of the goals of the Semantic Desktop is speech act detection for email – speech act here as defined by John Searle. At its most basic definition, a speech act is simply an utterance, but is also often understood more specifically as an illocutionary act (which is a term introduced by John L. Austin in How to do things with words), or a ‘performative utterance’, meaning that by saying something, one actually does something. For instance, the sentence “Please have the document ready for Workshop 1.” contains an instruction: It informs the reader about the requirements for a particular event, and asks him or her to meet these requirements.

Brian also introduced Roundtrip Ontology Authoring (ROA), which is a process that allows non-expert users to author or amend an ontology by using simple, easy to learn, controlled natural language. The process is a combination of Controlled Language for Information Extraction (CLIE) and Text Generation which is developed on top of GATE. ROA is documented on the the Nepomuk website; for further information about CLIE, read this article by Valentin Tablan, Tamara Polajnar, Hamish Cunningham and Kalina Bontcheva: User-friendly ontology authoring using a controlled language (PDF, 64 KB).

Reblog this post [with Zemanta]
Sphere: Related Content