Jana Herwig

Common vs. Marginalized Knowledge – a Potential Showstopper for the Semantic Web?

Earlier today I published an interview that my colleague Marion Fugléwicz-Bren led with Corinna Bath from the Institute for Advanced Studies in Science, Technology and Society (IAS-TS). Corinna Bath is a researcher with a focus on gender studies in computer science and has been working specifically towards a methodology for de-gendering IT design processes and is now also turning towards the Semantic Web. Now that CYC seems to be coming into wider, or renewed use (e.g. Zitgist’s UMBEL is deriving its subject concepts and relationships from OpenCYC), it was interesting for me to read her remarks about the CYC project and specifically the research undertaken by Alison Adam in this context:

Alison Adam analyzed the well-known ontology CYC that was build to capture common sense knowledge from the 1980ies on. Her criticism focussed on the built-in assumption that we would all share a consensus reality: “be it a professor, a waitress, a six-year old child, or even a lawyer” (Lenat and Guha 1990). She revealed that the knowing subject implicitly assumed by the system is a white, middle-class male professional.

Hence, in contrast to its own agenda CYC ignores minority views, quieter voices, and allows the dominant voice to speak for everyone, which seems highly problematic. Other studies give more evidence for the highly problematic prerequisite of computer science modelling that rests on the Cartesian epistemology. Even the modelling concepts themselves should be questioned as Cecile Crutzen suggest, since e.g. the class concept and the inheritance concept lack to represent social processes, because of limited formal expressiveness for conflict, change and fluidity. Such an ontology abstracts from human sociality, situated action and real meaning construction processes.

This also made me think about my own role within and attachment to the Semantic Web Community – from a professional point of view, I see myself as a sort of mouthpiece for the Semantic Web (at least within the professional community that I am a part of), and while I am convinced that the movement is going to see its big break within the next five years, I don’t see myself as playing a significant role in it. And I’m always inclined to leave all the ‘hard stuff’, i.e. all the technology-related questions to the ‘boys’ in our team.

But one of the good things about the Semantic web is that it is actually EASY to understand – I’ve also been told by Henry Story for instance that N3 (Notation3, a shorthand non-XML serialization of Resource Description Framework models) is relatively easy to learn; and since I am one of the few women I know (sadly) who actually know what an ontology is, maybe it would be about time that I learned to model one myself.

Because we cannot expect that white, middle-class male professionals are going to be able to explore the feminine or queer knowledge in this world and mold it into a common knowledge base. Even if marginalized voiced can hardly expect that the hegemony is going to advocate their cause: The Semantic Web project itself is at stake if some voices, views and knowledge are excluded. This could indeed be a showstopper for the Semantic Web – not immediately on a technology level, but with regard to meeting the societal goals of its own agenda.

Read the entire interview with Corinna Bath here.

Alison Adam’s cited work is contained in: Building Large Knowledge-Based Systems: Representation and Inference in the Cyc Project (D.B. Lenat and R.V. Guha 1990).

Reblog this post [with Zemanta]
Jana Herwig

SWC’s Matthias Samwald contributes to W3C notes

Early June saw the release of two notes drafted by the Semantic Web Health Care and Life Sciences (HCLS) Interest Group within the W3C. One of the contributors, and editor of one note, is Matthias Samwald, a project coordinator at SWC, who is a member of this SIG and who has worked on several Semantic Web projects for the Yale Center for Medical Informatics (USA), Science Commons (USA) and DERI Galway (Ireland).

A Prototype Knowledge Base for the Life Sciences
W3C Interest Group Note 4 June 2008
Editors: M. Scott Marshall, Eric Prud’hommeaux
Contributors: Alan Ruttenberg, Jonathan Rees, Susie Stephens, Matthias Samwald, Kei-Hoi Cheung
Abstract: The prototype we describe is a biomedical knowledge base, constructed for a demonstration at Banff WWW2007 , that integrates 15 distinct data sources using currently available Semantic Web technologies such as the W3C standard Web Ontology Language [RDF]. This report outlines which resources were integrated, how the knowledge base was constructed using free and open source triple store technology, how it can be queried using the W3C Recommended RDF query language SPARQL [SPARQL], and what resources and inferences are involved in answering complex queries. While the utility of the knowledge base is illustrated by identifying a set of genes involved in Alzheimer’s Disease, the approach described here can be applied to any use case that integrates data from multiple domains.

Experiences with the conversion of SenseLab databases to RDF/OWL
W3C Interest Group Note 4 June 2008
Editors: Matthias Samwald, Kei-Hoi Cheung
Contributors: Alan Ruttenberg, Huajun Chen
Abstract: One of the challenges facing Semantic Web for Health Care and Life Sciences is that of converting relational databases into Semantic Web format. The issues and the steps involved in such a conversion have not been well documented. To this end, we have created this document to describe the process of converting SenseLab databases into OWL. SenseLab is a collection of relational (Oracle) databases for neuroscientific research. The conversion of these databases into RDF/OWL format is an important step towards realizing the benefits of Semantic Web in integrative neuroscience research. This document describes how we represented some of the SenseLab databases in Resource Description Framework (RDF) and Web Ontology Language (OWL), and discusses the advantages and disadvantages of these representations. Our OWL representation is based on the reuse and extension of existing standard OWL ontologies developed in the biomedical ontology communities. The purpose of this document is to share our implementation experience with the community.

Zemanta Pixie
Jana Herwig

Conceptualizing knowledge with CMMI: the KiWi/Logica usecase

Peter Axel NielsenPeter Axel Nielsen, a researcher from Aalborg University who is working on the Logica usecase, started his report by giving us an overview of CMMI, a process improvement approach that is being used by Logica and that is thus going to be of eminent importance for the KiWi/Logica usecase.

CMMI stands for Capability Maturity Model® Integration, and, according to its inventors, “helps integrate traditionally separate organizational functions, set process improvement goals and priorities, provide guidance for quality processes, and provide a point of reference for appraising current processes.” CMMI was created by the Carnegie Mellon Software Engineering Institute, and has been adopted worldwide.

CMMI provides a framework for a 5-step path towards maturity in software process improvement – a path that can be expected to take years and which demands a company’s full commitment. Peter drew our attention to a volume, fresh from the printing press, which he edited together with Karlheinz Kautz: Software Process & Knowledge. Beyond Conventional Software Process Improvement.*) The volume contains an interesting CMMI case study, “The Road to High Maturity. How the first Danish company reached CMMI level 5 in 100 months,” in which the authors emphasize that software process improvement, in essence, is “an organizational change process, that is, the processes in an organization, and the behaviour and interaction of people, groups, projects and, in fact, the whole organization.” Some of the benefits reported by the participants in the case study were: reduction of overtime, increased satisfaction of employees, increased opportunities to delegate as a result of formal definitions of roles and greater ease to move between projects.

Switching to CMMI brings with it the benefits of a well-defined, ready to use conceptual model of knowledge – this is, of course, an invaluable asset when it comes to defining the requirements of the KiWi system, which is going to be used as a wiki-based, semantically enhanced knowledge management system for IT project management in the Logica usecase.

*) Peter Axel Nielsen, Karlheinz Kautz (eds.): Software Process & Knowledge. Beyond Conventional Software Process Improvement. Software Innovation Publisher. Aalborg. 2008. ISBN 978-87-992586-0-4 See also: website of the symposium on Software Process & Knowledge that preceeded the publication.

Zemanta Pixie
Andreas Blumauer

The social hub @ LinkedData Planet 2008

Eric HofferThe LinkedData Planet conference is over now. I had a great time here and met a lot of great and inspiring people. The exhibition area especially turned out to be THE meeting point of the conference. People from media companies, major IT-companies like IBM or from governmental and non-governmental organizations were there, meeting up with some of the most prestigious software providers and experts of the semantic web world.

Mike Bergman in SWC gear

And that says a lot about the semantic web both as a technology and a movement: The semantic future is made happen not behind closed doors or in some ivory tower, not thought up by some secluded genius, but by people, companies and research institutions that are as close to the heart of the web as one can be.

I learned a lot about the upcoming new release of UMBEL (Upper Mapping and Binding Exchange Layer) thanks to Mike Bergman (who you can see in the picture below, sporting Semantic Web Company gear). UMBEL (in the words of the project itself) has two purposes:

1) to provide a lightweight structure of subject concepts as a reference to what Web content or data “is about”;
and 2) to define a variety of binding protocols for different Web data formats to map to this “backbone.”

You might want have a look at the UMBEL subject concepts explorer provided by Mike’s Zitgist: Start exploring here, with a preset concept search for ‘Manager’.

I also learned more about the huge variety of possible applications which can be built on top of the Talis platform – thanks to Ian Davis. One example is the Lancashire Lantern WiCI – WiCI because it is a service providing Community Information.

And finally I met Richard Cyganiak in person who gave me a thorough overview of the Semantic Web index Sindice – try a search for Richard Cyganiak to see how it works (and to learn more about him, of course).

I ended up discussing possible applications using linked data with Tom Heath, Mike Bergman, Gregory Williams, Eric Hoffer (picture on top, see also his blogpost where he features the SWC “Escape from the Data Silo” logo) and Marco Neumann, both from Semantic Web Meetup NYC. It was a great evening!

Thank you, folks!

Read also pt. 1 of our conference report: LinkedData Planet in New York: A great community event for all things semantic

Zemanta Pixie