Thomas Schandl

Linked data based thesaurus management in collaborative settings

The creation and management of controlled vocabularies in companies often takes place in a distributed manner. Different departments in different branch offices often rather create their own vocabularies, than have one large central knowledge model, where everyone contributes.

How to model divergent views on one concept?

Such a central model is not only much harder to manage, but there is also the general problem that differerent departments like marketing, quality assurance, R&D, etc. will have divergent views on the model and its concepts. These different perspectives on one and the same concept are hard to unify in a single model.

Think of a company that sells mobile phones and wants to create a model of its line of products. It wants to utilize this model in the context of its online shop as well as in the context of its user support forum. While the structure of the model (i.e. the relationships between the products) might be very similar or the same in both contexts, there will be differences in which properties of the products are actually relevant in the respective contexts.

In the model of the marketing department there might be a concept for a “Phantastax StamiMaxx” cell phone with a definiton “The StamiMaxx has a powerful battery and is great for professionals who travel a lot”. They might relate it to manufacturer “ACME Corporation” and to several concepts representing different features like “Android OS”, “Multi-touch touchscreen”, etc.
The very same phone has different properties that are interesting from the Quality Assurance departement’s perspective. They might call it by a more specific name like “Phantastax i3000 StamiMaxx S”, have a different definition for it like “3G cell phone implementing the new WTF3000 protocol, …” and relate it to concepts representing known problems and their solutions.

Now they face the task to integrate these different models, as it is not desirable to use a bunch of isolated models within one company.

Support of collaborative work on distributed models

To support this kind of collaborative work on distributed knowledge models, we would like to link the concepts of the models, just as is we link documents in the World Wide Web. Fortunately the Simple Knowledge Organisation System (SKOS) offers mapping properties that can be used to define relationships between concepts from different knowledge models.

E.g. when we want to say that concept “Phantastax StamiMaxx” in the product line thesaurus refers to the same real world entity as concept “Phantastax i3000 StamiMaxx S” in the Quality Assurance thesaurus, then we can use skos:exactMatch to express that. If we want to express that the concepts are merly similar, skos:closeMatch could be used.

The other SKOS mapping properties express a hierarchical (narrowMatch, broadMatch) or an associative (relatedMatch) mapping relation between concepts from different concept schemes. With those we can say that my Samsung Galaxy concept has a skos:broadMatch “Smartphone” in the product line vocabulary and a skos:relatedMatch “ACME Corporation” in a controlled vocabulary about Tech companies.

Modularisation of knowledge models

In this way SKOS thesaurus management systems like PoolParty make it possible to modularise knowledge models, represent concepts in their different contexts and consequently enable collaborative work on those models: The marketing guy can work on his model with the concept properties focused on sales without disrupting the work of the quality assurance expert on her own thesaurus. Later one or both of them can create the skos:exactMatch link between the concepts that are the same, like seen in the “Exact Matching Concepts” box in screenshot of PoolParty below.

Enrich your knowledge: Get connected with the LOD Cloud

Going a step further the models could be connected to external knowledge, e.g. a source from the Linked Open Data (LOD) Cloud. Once we establish links to LOD hubs like DBpedia, we can import additional information for their concepts or use it to establish whether similar concepts from different models really refer to the same real world resource.

Thomas Thurner

Report on developments at the European Semantic Technology Market

The present state of development, future trends and expected market scenarios for Semantic Technologies are shown in the just published “Demand driven Mapping Report”. The report is part of the EU-funded project Value It, which is about bringing together the various stakeholders within the sector: Industry, Research and Government. VALUE-IT preliminary findings show that the STE potential market in Europe will size up to €1.44B for 2014. Scanning furthermore the executive summary of the report, some findings attract attention:

The survey results also show considerable variation by sector, both of policy and technology implementation. With respect to technologies, ICT companies are also the most willing to consider semantic approaches. The ICT sector has an unusually high interest in all ST components, with 20% or more being willing to consider all of them, and over half of IT respondents looking at Web 2.0 (social computing). [...]  The use of tagging technologies – which overall is the least mature approach in the survey – is most advanced in Life Sciences. The Life Sciences, Media & Entertainment, and ICT sectors all have a reasonably strong interest in Natural Language Processing (roughly 25% on average). Ontologies and RDF/OWL are the technologies least often considered, though the interest in these Semantic Technologies is not insignificant. Taxonomies are slightly more popular, perhaps indicating that companies are taking the first step to prepare for a more semantic approach to IT solutions. The ICT, Energy & Utilities, and Media & Entertainment sectors all have a reasonably strong interest in using taxonomies.

The 190 pages report gives an actual overview of the status quo on European Semantic Technology Market and is now available for download: Final demand driven mapping Report

Tassilo Pellegrini

Eric A. Franzon: “Semantic Technologies are becoming mainstream.”

Started in 2005 the Semantic Technology Conference has become one of the international community hot spots for the commercial application of and trend scouting in semantic technologies. Tassilo Pellegrini talked to the organizer Eric A. Franzon, VP of Wilshire Conferences and Semantic Universe, about what to expect from the upcoming event and how semantic technologies are becoming mainstream.

From June 21 – 25, 2010 the annual Semantic Technology conference will take place for the 6th time. Looking back: what has changed over time? What are the hot topics at this year’s conference?

We launched SemTech in 2005 in San Francisco.  It was a good turnout for a new event, with around 300 attendees.  By 2009, that number had grown to 1100, so audience size has been a significant change, certainly.  However, our interest all along was to grow an industry as well as an event, and I have absolutely seen that growth and maturation.  Ours was the first conference devoted to the commercialization of Semantic Technologies, and at that first conference, there was a predominant academic presence.  That’s not a bad thing – this, like so many technical industries, came out of academia.  Nonetheless, it’s nice to see that by 2010, there is significant adoption by businesses and organizations. I actually feel comfortable saying that Semantic Technologies are becoming mainstream; certainly not ubiquitous, but widely adopted.

The hot topics at the 2010 conference include exciting news in areas we have covered extensively before such as Linked Data, Semantic Search, Healthcare, and Publishing.  But we also are delving much more deeply into new domains that have received a lot of attention recently such as Open Government, Marketing & Advertising, and Social Networks.  There are new standards benchmarks to discuss such as SPARQL 1.1 and the business rules work that is being done with RIF.  Additionally, we are seeing a lot of traction in Semantics in the Enterprise, so SemTech will have quite a bit to offer in that area as well.

While semantic technologies have been around for quite some years now the advent of the Semantic Web added a new spin to the community. What do you expect for the future when it comes to the convergence of semantic technologies and the Semantic Web?

I see Semantic Technologies as a superset of the space that is the Semantic Web.  The Semantic Web is public; the area I call Semantic Technologies includes non-public, closed systems – behind firewalls.  We’ve actually seen this before.  At the same time that the World Wide Web really hit its stride in the mid-1990’s, we saw widespread adoption of portals and corporate intranets.  Even though they did not sit on the public Web, these systems used the technologies of the Web to link documents, enabling organizations to share those documents globally, quickly, and inexpensively.

As the tools become better and we see more use cases in the Semantic Web, I see parallel development of semantically enabled enterprise systems.  In the same way enterprises were using early Web technologies to share documents behind firewalls, they are now using semantic systems to share data globally, quickly, and inexpensively.  At first – and we are seeing this already – in-house systems will consume data from the public Web, essentially mixing public and private data.  This is relatively easy to do when both systems are built on a similar set of technologies, and there are an increasing number of rich data sets for companies to use.  Think of a corporate system that consumes real-time stock data, for example.  The system is not generating that information itself, but it might be using it in a corporate application.

One of the prominent topics at the moment is Linked Data which in connection with Semantic Web might evoke a paradigm shift in data integration issues. How do you experience this trend? How should companies react?

If you think about the ‘traditional’ challenges that enterprises have faced in managing data and meta data — issues like integration, disparate data, unstructured data, governance, legacy systems, and data quality (to name a few) — Semantic Technologies offer solutions.  They’re not always the best solution for every problem, and I don’t expect that RDBMS systems will go away, but there are companies using Semantic Technology today to make money and save money.

From your perspective: what are the most exciting things to look out for in the near future?

There is a great opportunity for tool developers to enter the marketplace. The community is hungry for new tools and for semantic development to be integrated into the tools and development environments they are already using.  Another area that I believe the industry is hungry for is good UI development.  Data is powerful, but its usefulness is often only seen in solid visualizations and reporting.  I expect that more of these tools will emerge in the very near future.

Tools for publishers like OpenCalais, Zemanta, and the rich semantics available in Drupal 7 are making it possible for less-technical people to include semantics in their web pages.

Another area to watch is consumer applications. Tripit, Siri, and Adaptive Blue’s Glue have shown that there is a market for data-driven applications for consumers.

About Eric A. Franzon

Over the last decade, Eric Franzon has served as VP of Wilshire Conferences, where he has been exploring the world of enterprise data. As VP of Semantic Universe, he has worked to raise awareness and explain the usage of Semantic Technologies and Web 3.0 in business and consumer settings.  A lifelong learner and teacher, Eric is frequently called on as a consultant, coach, and trainer around complex technical topics. He is an advisory committee representative with the World Wide Web Consortium and an Affiliate Analyst with Guidewire Group.  Eric has also taught improvisational comedy, early childhood education, blues harmonica, and gender studies.  A Chicago native, he now lives in Los Angeles.

Tassilo Pellegrini

Looking back I-Semantics 2009

isemantics_logoLast Friday, September 4, 2009 I-Semantics, the 5th International Conference on Semantic Systems, ended. I am extremely happy about the positive response from so many people I got in the last few days. It was a lot of work and I am glad everything worked out fine.

I-Semantics, which started on Wednesday, September 2, and was colocated with I-Know, the International Conference on Knowledge Management, for the third time now, attracted 450 participants. As inteded by our original idea – bringing the Semantic Web out of the echo chamber – this colocation has proven to be absolutely fertile as the semantic systems community and the knowledge management community really fit well together and complement each other. So we had a rich program consisting of 64 scientific talks (30 I-Semantics / 34 I-Know), a poster session, an industry track and numerous mini tracks and discussion panels. Read a review of the first, second and third conference day on Harald Sack’s blog (with whom I enjoyed pondering about Net Neutrality and IPV6.)

For the first time we had the Pragmatic Web Community on board, which held a special track bringing in lots of new ideas and views on computational semantics. Beside that I recognized that in this track we had quite large amount of people from the social sciences and humanities among the audience, which is a promising signal and hopefully leads to new research and human-oriented technologies.

Another highlight was this year’s matchmaking event which aims at initiating business contacts between industry and academia. According to the organizers the Styrian Research Agency and the Enterprise Europe Network,  120 bilateral meetings took place. Astonishingly 56 of the 71 registered participants had a company background.

And finally we hosted the second Triplification Challenge where Chris Bizer gave a keynote and introduced quite a bunch of people to the idea of Linked Data. Unfortunatelly Michael Hausenblas who chaired this year’s challenge could not attend so I did the moderation during the award ceremony and Chris assisited me handing over the awards to the winners. For the results of the challenge go to Soeren Auer’s blog.

Wrapping up, all this would not have been possible without the great support of Prof. Klaus Tochtermann and his team from Know Center. Year after year they do a great job and it is a great opportunity and pleasure to work together with them. Big thanks also go to Adrian Paschke from Corporate Semantic Web of Free University of Berlin, Hans Weigand from Tilburg University and the guys from Salzburg New Media Lab, who helped to set up the I-Semantics conference this year.

The next I-Semantics will take place from September 1 – 3, 2010. Hope to see you next year in Graz!

Reblog this post [with Zemanta]