Semantic Web Company

The Semantic Puzzle

Open World Assumptions

subscribe RSS

Kingsley Idehen: “By declaring its context, Linked Data can be made more easily reusable by others”

June 16, 2010 By: Andreas Blumauer Category: Corporate Semantic Web, Enterprise 2.0, Linked Data & Open Data, Tools & Software No Comments →

Semantic Web Company talked with Kingsley Idehen who is CEO of OpenLink Software and probably one of the most profound experts on data integration issues about “Linked Data”.

The interview covers questions like:

  • How can Linked Data help to make companies more productive?
  • Do you think that the Linked Data Initiative can build upon a stable architecture or will it face more and more problems the bigger the “cloud” will grow?
  • What´s the ultimate argument for an Enterprise Architect to use languages like SPARQL at least in addition to SQL?
  • How will a “Real Time Semantic Web” change the whole game?
  • How will the “Semantic Web” be called in 10 years? Will there still be a “Semantic Web”?

Read the full version of the interview here.

Sphere: Related Content

TuQS QuadStore combines the best of two worlds

March 22, 2010 By: Andreas Blumauer Category: Software Development, Tools & Software 4 Comments →

A new QuadStore which combines the best of two worlds (Lucene/Fulltext search engines & TripleStores/RDF/SPARQL) is out and can be evaluated online.

TuQs offers the following feature:

  • SAIL accessible
  • True QuadStore with GraphSupport
  • HighSpeed regex SPARQL filters
  • Userrights on TripleBasis
  • Extendable to a QuintStore (or more generally to an n-Store)
  • Cachable SPARQL Queries for further speed improvement
  • Clusterable
  • Federationable
  • FullTextSearchable

Some queries are really complex and high-speed, e.g.:

SELECT ?s ?o
WHERE {
?s <http://www.w3.org/2004/02/skos/core#definition> ?o .
?o <http://www.turnguard.com/tuqs/function#BooleanTerm> ‘Computer AND (java* OR HTML)’
}

The best starting point to find out, what´s the speciality of TuQS is here: Just click the sample queries on the right side and see how fast they perform even on very simple hardware.

Next steps: The developer of TuQS, Jürgen Jakobitsch (aka Turnguard), is currently working on SAIL inferencing.

Sphere: Related Content

Attending TopQuadrant’s SemWeb Technology Training

October 14, 2009 By: Thomas Schandl Category: Companies & Institutions, Tools & Software No Comments →

There’s a lot to know about semantic standards, languages, technologies and their application, so last week I attended TopQuadrant’s first European training from Oct 5th to 9th in Amsterdam.

We kicked off with Eddy Vanderlinden elaborating on the lessons he learned from 30 years of work in the financial sector. He outlined how improvements could be achieved by using data models relying on semantic web standards. You can read about his ideas in this essay.

TQ’s chief scientist Dean Allemang then continued with his talk “Enabling Creativity at the Edge”. “The edge” refers to the boundary between an information system and the real world, where the end users of a system work. As business needs change faster and faster, the people working at the edge need to be able to adapt the company’s applications on their own and shape them to their everyday needs.

Dean Allemang

Dean Allemang

Nowadays end user often achieve this kind of creativity on the edge by using self-made spreadsheets. The problem with that is their lack of interoperability. These data from different spreadsheets, databases, reports, etc. are often connected through business processes that rely on repetitive and error prone human processing, like copying things from a spreadsheet to a database, creating a report and pasting its result into another system, and so on.

The result is a complex system with many heterogenous parts and an organisation that cannot possibly know what it knows.

As a solution Dean proposed to “think outside the table” and go beyond the relational database way of orgranising data. This of course can be achieved by integrating the data using semantic technologies. TopQuadrant’s software offers possibilities to do just that, and makes it possible to create highly customizable dashboards and applications that all rely on the same data.

During the following days we learned about the ins and out of using semantic standards and languages and tried out TopBraid tools in several hands-on excercises. The TopBraid Suite is a very powerful, commercial toolkit. It includes TopBraid Composer, Live and Ensemble. Composer is a semantic web modeling and application developement tool, that uses the Eclipse framework. TopBraid Live is a server for semantic applications built with TopBraid Ensemble. Ensemble is a graphical application assembly toolkit, that enables end users to create custom apps that run in a browser and use RDF data and data models – thereby allowing for the above mentioned “creativity at the edge”.

I am very impressed with the capabilities of these tools, they enable the user to realize manifold possibilities that come with using semantic web standards – and that without programming. You can see some of these tools in action and learn about applying semantic standards in a series of webcasts from Semantic Universe. For the latter topic you might also attend one of our webinars.

On the last day Dean coverd several case studies, like connecting ontologies to legacy data sources (using e.g. D2RQ inside Composer), applying semantic technologies to the customer service management of a larger retailer or using ontologies in Federal Enterprise Architecture.

All in all I am very happy to have attended TopQuadrant’s training and hope they will establish a successful series of trainings in Europe just as they did in the US.

Sphere: Related Content

Greetings from Crete!

June 04, 2009 By: Andreas Blumauer Category: Conferences & Events No Comments →

Michael Hausenblas & Chris Bizer

Michael Hausenblas & Chris Bizer

ESWC 2009 is not over yet – but I am happy to announce: The Semantic Web Community is more alive than ever before! We had four days of brilliant talks, vibrant meetings, and great atmosphere so far. Some highlights:

  • Chris Bizer presentation of Berlin SPARQL Benchmark (BSBM) or Enrico Minacks´s talk about benchmarking RDF stores showed that base technologies of the semantic web are mature enough for real-world applications.
  • Use cases from many domains like biodiversity, astronomy or multi-media showed clearly the trend that the semantic web becomes “ubiquitous” and has left the labs.
  • The idea of Linking (Open) Data became pre-dominant in the community, many projects are built around this infrastructure already. But there is a clear demand for improved ontology matching or brokering services like the recently released <sameAs>
  • Martin Hepp´s lightning talk about “What makes for a good ontology?” and emotional reactions from the audience on that showed, that grass-root approaches and top-down approaches for ontology building still haven´t grown together, but they are getting closer ;-)
  • Weather and food here in Crete is great!
Reblog this post [with Zemanta]
Sphere: Related Content

Tim Berners-Lee: “We need data on the Web to work better together”

April 22, 2009 By: Christoph Wieser Category: Conferences & Events, Linked Data & Open Data 3 Comments →

Today, the 18th WWW conference started in Madrid, Spain. In his opening talk, Tim Berners-Lee outlined the status quo of the current Web and focused on areas for ongoing research.

tbl_klein

According to Tim Berners-Lee the Web is still static and consists mostly of archived HTML and PDF documents. There is still a need for a read/write Web and the standards are still not used to a sufficient extend. Changes in the Web are the ‘move to mobile’ and the climb up of ‘advertizing to being a science’.

Beside the still existing challenges of the current Web, additional ones arrived. Web Applications as well as Open Social Networking and Open Linked Data count to the area of current interest.

Web Applications are supposed to become new computing platforms and need a serious clean trust system. In the future Web Applications could offer a decentralized modular installation like a webized Debian.

Open Social Networking has become a great application in the Web. Currently it suffers from the ‘Social Silo Problem’. Users have often accounts in several platforms like Facebook or MySpace. The platforms, however, are separated from each other like in a field of silos. The challenge of the Semantic Web Community is now to interconnect the silos via RDF, OWL, HTTP, and SPARQL. A further requirement of Tim Berners-Lee are to focus on a Secure Web id.

Open Linked Data attracted the attention of Tim Berners-Lee most of all. Being one of the chairs of the co-located workshop ‘Linked Data on the Web’ he stressed that “we need data on the Web to work better together” in government, enterprise, and science. Open Linked Data could be a wizard for users of existing relational database systems. As query language he proposed a federated/delegated SPARQL.

Finally, Tim Berners-Lee described the role of researchers in those challenges. Researchers should ‘build a platform for others that follow’. Thereby, one should not assume what people will use the platform for.

(Report by Christoph Wieser / Salzburg Research)

Sphere: Related Content

Using Triplify to expose the semantics of a site

April 20, 2009 By: Thomas Schandl Category: Calls & Competitions, Linked Data & Open Data 2 Comments →

Recently the SWC took a thorough look at Triplify, a tool for mapping the contents of a relational DB to RDF, in the course of which we could convince ourselves of Triplify’s ease of use and its potent capabilities.
We take this opportunity to given an account of the philosophy behind Triplify, how it is used and also had the chance to interview the creator Sören Auer.

Triplify Logo

A common objection from critics of the semantic web is that regular users or webmasters won’t go to the trouble of marking up their content or whole web sites with RDF.
While it is obvious that nobody is going to decorate their web pages with hand-carved RDF triples, it is also apparent that a lot of the current web’s pages are generated by transforming information from relational databases to HTML pages, which are perfectly suited for human consumption, but which suffer from a big loss of machine-readable semantics.

As the information in the relational databases is highly structured and contains rich semantics, it is only natural to also use the already existing structured data to generate RDF representations of the same information.

Triplify is all about this approach of bootstrapping data for the semantic web. It does this for web applications which are built on PHP and MySQL.
Triplify consists of a lightweight PHP script and a configuration file. The latter is used to do the mapping of the columns of an application’s relational database to appropriate RDF classes and properties.

In many cases a site administrator who wants to export her site’s content as RDF, only has to save Triplify with a premade configuration file for her site’s application into the right folder, as for many popular applications like Wordpress, Joomla! or phpBB all the work has already been done.
Once installed, Triplify can be used to generate a dump of the site’s complete RDF graph, or to generate Linked Data, as each of the site’s main concepts’ RDF graph is provided under its own URL, e.g. the semantic description of a user with the ID 123 can be accessed under http://yoursite.com/triplify/user/123.

If no configuration for an application exits, it is fairly easy to create one by yourself.
All one has to do is to look at the app’s database schema, find appropriate classes and properties from well known ontologies and create MySQL queries that grab the data from the relational database and map them to RDF classes or properties.
An example for a query that takes the data from a table describing the user of a CMS:
"SELECT id, name AS 'foaf:name', url AS 'foaf:homepage', short_description AS 'dc:abstract' FROM user_table",

Triplify’s creator Sören Auer kindly gave us the opportunity for an interview:

Triplify is very easy to configure for web developers. For which scenarios would you recommend to use Triplify, and in which situations other approaches of semantifying your data might be more suitable?

As you already mentioned Triplify was primarily developed for Web applications developed in PHP. These usually have a relatively small and simple set of tables. Triplify creates complete RDF exports, Linked Data or JSON, but does not include SPARQL endpoint functionality. When SPARQL is required you are better off with D2R Server or Virtuosos RDF views.

Triplify creates semantic representations of the data in relational databases. Do you think there would also be benefit in the inverse approach i. e. creating an application that parses triples and writes it to a relational DB according to a mapping file?

In certain scenarios this might make sense, but for the most cases I think the database schema has to be developed separately. Database schemata contain more storage and retrieval oriented information, such as for example about data indexing. Vocabularies and ontologies on the other hand represent information on a conceptually higher level and are more flexible with regard to evolution of the information structures than databases.

Are there plans for further development of Triplify?

Sure. We want to add SPARQL support and possibly port Triplify to other scripting languages such as Ruby and Python.

Thank you Sören, we will stay tuned about the news from your great application and look forward to the Triplification Challenge 2009!

Sphere: Related Content

semantic technolgies for non-SQL-writers

January 30, 2009 By: Thomas Thurner Category: Tools & Software No Comments →

isd_banner3Andreas Blumauer (Semantic Web Company) talked with Brian Donnelly about a new system on the market called “Semantic Discovery System” (SDS), which helps to do sophisticated queries across existing datasets. Also talking why complex scripts or triple stores should not be exposed to the end-users anymore.

SDS is doing, what semantic web enterprises promised for years: An application that allows users to formulate sophisticated questions on their datasets and getting back data without writing SQL statements or going down to OWL concepts.

SDS leave the data in its orignal format and doing no transformation into triple stores. And then give the user through a graphical desktop software – with the use of OWL and SPARQL – the possibility to formulate questions on this datasets. So this is a software engine that focuses “at business people with a tool as easy to use as Excel or Mind Manager – with zero need to know or care about OWL, SPARQL” as Donnelly explains.

The next times will show if Donnelly’s “Semantic Discovery System” may be a semantic web killer application. In any case it seems to be a good step in bringing semantic technologies out of the teccie’s corner onto the desktops of business users.

Read the full interview at www.semantic-web.at

Reblog this post [with Zemanta]
Sphere: Related Content

SWC’s Matthias Samwald contributes to W3C notes

July 14, 2008 By: Jana Herwig Category: Ontology Engineering, Vocabularies & Languages No Comments →

Early June saw the release of two notes drafted by the Semantic Web Health Care and Life Sciences (HCLS) Interest Group within the W3C. One of the contributors, and editor of one note, is Matthias Samwald, a project coordinator at SWC, who is a member of this SIG and who has worked on several Semantic Web projects for the Yale Center for Medical Informatics (USA), Science Commons (USA) and DERI Galway (Ireland).

A Prototype Knowledge Base for the Life Sciences
W3C Interest Group Note 4 June 2008
Editors: M. Scott Marshall, Eric Prud’hommeaux
Contributors: Alan Ruttenberg, Jonathan Rees, Susie Stephens, Matthias Samwald, Kei-Hoi Cheung
Abstract: The prototype we describe is a biomedical knowledge base, constructed for a demonstration at Banff WWW2007 , that integrates 15 distinct data sources using currently available Semantic Web technologies such as the W3C standard Web Ontology Language [RDF]. This report outlines which resources were integrated, how the knowledge base was constructed using free and open source triple store technology, how it can be queried using the W3C Recommended RDF query language SPARQL [SPARQL], and what resources and inferences are involved in answering complex queries. While the utility of the knowledge base is illustrated by identifying a set of genes involved in Alzheimer’s Disease, the approach described here can be applied to any use case that integrates data from multiple domains.

Experiences with the conversion of SenseLab databases to RDF/OWL
W3C Interest Group Note 4 June 2008
Editors: Matthias Samwald, Kei-Hoi Cheung
Contributors: Alan Ruttenberg, Huajun Chen
Abstract: One of the challenges facing Semantic Web for Health Care and Life Sciences is that of converting relational databases into Semantic Web format. The issues and the steps involved in such a conversion have not been well documented. To this end, we have created this document to describe the process of converting SenseLab databases into OWL. SenseLab is a collection of relational (Oracle) databases for neuroscientific research. The conversion of these databases into RDF/OWL format is an important step towards realizing the benefits of Semantic Web in integrative neuroscience research. This document describes how we represented some of the SenseLab databases in Resource Description Framework (RDF) and Web Ontology Language (OWL), and discusses the advantages and disadvantages of these representations. Our OWL representation is based on the reuse and extension of existing standard OWL ontologies developed in the biomedical ontology communities. The purpose of this document is to share our implementation experience with the community.

Zemanta Pixie
Sphere: Related Content

And the winner is: The vision of a future where ordinary people publish structured data

May 20, 2008 By: Jana Herwig Category: Calls & Competitions, Linked Data & Open Data 5 Comments →

Vision CompetitionThe Semantic Web Company is one of the partners of this year’s LinkedData Planet Conference in New York (June 17-18, 2008). As part of this partnership, we launched a competition, asking for your vision of a future with Linked Open Data – and we have a winner!

Aman Shakya, who is a PhD student at the Department of Informatics at The Graduate University for Advanced Studies (SOKENDAI) in Tokyo, developed his vision around the idea of ordinary people being able to publish structured data instead of unstructured text:

The current gigantic network of web documents could be realized by enabling any user to publish any document and link to other documents. If we want to see the network of Linked Open Data explode on a similar scale, we need to enable general users to publish “data” directly on the web and link to other “data”. We need to move the paradigm of web page publishing and hyperlinking towards data publishing and data linking. We should enable people to post structured data about anything rather than just unstructured text. We need the active participation and contribution of the billions of worldwide internet users. Recently, the web has seen enormous user participation with the rise of easy-to-use social software. We should exploit this trend of social web applications, however, for enabling people to create, share and link “data” on the global Linked Data Web.

To endorse his vision, Aman Shakya also introduced his StYLiD application, which I would like to describe as a ’semantically enhanced tumblelog’, and which “enables people to share a wide variety of structured data with the freedom to define their own structured concepts on the fly.” We have chosen his proposal because it met the criteria of the competition in various ways:

  • The feasibility of the vision is clearly laid out in the proposal, which describes the process of the creation of structured data and the interaction with existing data on the web.
  • The proposal has innovative potential in that it seeks to further and harness the collaborative sharing of structured data, and combines bottom-up and top-down governance for the social semantic web.
  • Sustainability is achieved by its reliance on open standards such as SPARQL.

Read his full proposal here.

We would also like to make an honorary mention of Mike Veytsel’s quadruple-fold approach to a semantic future in which users will be able “to easily and finely tune in to the long tail of knowledge and find content with low friction and high precision.”

Finally, I would also like to give my personal bookworm award to Rob Styles, for his prose account of a life with the semweb which he develops as an antithesis to Orwellian dystopia.

A big ‘Thank you’ to everyone who contributed!

Zemanta Pixie
Sphere: Related Content

LinkedData Planet – Conference & Expo 2008

April 17, 2008 By: Jana Herwig Category: Conferences & Events 3 Comments →

Come share your expertise with linked data and semantic technologies and learn from others at LinkedData Planet in New York City (June 17-18, 2008).

In creating the modern generation of enterprise and web applications, we typically integrate information from multiple sources. Relating data from disparate sources presents a challenge of deriving information. However, semantic tools and technologies are evolving that enable us to understand information derived by linking data from different sources, including data from applications, databases, ontologies and content management systems. Semantic technologies and tools support techniques such as tagging online information to make it more readily accessible for data integration. This makes it easier to understand data in relation to other data, even if some of this data is inside your firewall, some is in a business partner’s system, and some is part of the growing collection of useful publicly available data on the web.

LinkedData Planet provides insights into those technologies that enableus to:

  • connect data contained in silos within organizations in a meaningful way
  • extract and correlate data from web sites and databases for purposes such as analyzing trends and decision support, customer and vendor relationship management, and social networking

(more…)

Sphere: Related Content