Jana Herwig

Common vs. Marginalized Knowledge – a Potential Showstopper for the Semantic Web?

Earlier today I published an interview that my colleague Marion Fugléwicz-Bren conducted with Corinna Bath from the Institute for Advanced Studies in Science, Technology and Society (IAS-TS). Corinna Bath is a researcher with a focus on gender studies in computer science. She has been working specifically towards a methodology for de-gendering IT design processes and is now also turning towards the Semantic Web. Now that CYC seems to be coming into wider, or renewed, use (e.g. Zitgist’s UMBEL is deriving its subject concepts and relationships from OpenCYC), it was interesting for me to read her remarks about the CYC project and specifically the research undertaken by Alison Adam in this context:

Alison Adam analyzed the well-known ontology CYC that was built to capture common sense knowledge from the 1980s on. Her criticism focussed on the built-in assumption that we would all share a consensus reality: “be it a professor, a waitress, a six-year old child, or even a lawyer” (Lenat and Guha 1990). She revealed that the knowing subject implicitly assumed by the system is a white, middle-class male professional.

Hence, in contrast to its own agenda, CYC ignores minority views and quieter voices, and allows the dominant voice to speak for everyone, which seems highly problematic. Other studies give more evidence for the highly problematic prerequisite of computer science modelling that rests on the Cartesian epistemology. Even the modelling concepts themselves should be questioned, as Cecile Crutzen suggests, since e.g. the class concept and the inheritance concept fail to represent social processes because of their limited formal expressiveness for conflict, change and fluidity. Such an ontology abstracts from human sociality, situated action and real meaning construction processes.

This also made me think about my own role within and attachment to the Semantic Web Community – from a professional point of view, I see myself as a sort of mouthpiece for the Semantic Web (at least within the professional community that I am a part of), and while I am convinced that the movement is going to see its big break within the next five years, I don’t see myself as playing a significant role in it. And I’m always inclined to leave all the ‘hard stuff’, i.e. all the technology-related questions to the ‘boys’ in our team.

But one of the good things about the Semantic Web is that it is actually EASY to understand – I’ve also been told, by Henry Story for instance, that N3 (Notation3, a shorthand non-XML serialization of Resource Description Framework models) is relatively easy to learn; and since I am one of the few women I know (sadly) who actually know what an ontology is, maybe it is about time that I learned to model one myself.
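For readers who have not seen N3 before, here is a minimal sketch of what it looks like, written as a few lines of Python that print a tiny N3 document. The `ex:` namespace, the class names and the helper `to_n3` are hypothetical and chosen only for illustration; a real project would more likely use an RDF library than build the text by hand.

```python
# Hypothetical example data: the example.org namespace and all names under it
# are illustrative assumptions, not taken from CYC, UMBEL or any real ontology.
PREFIXES = {
    "ex": "http://example.org/terms#",
    "rdfs": "http://www.w3.org/2000/01/rdf-schema#",
}

# Each statement is a (subject, predicate, object) triple.
# In N3, the keyword "a" is shorthand for rdf:type.
TRIPLES = [
    ("ex:Waitress", "rdfs:subClassOf", "ex:Person"),
    ("ex:Professor", "rdfs:subClassOf", "ex:Person"),
    ("ex:alice", "a", "ex:Professor"),
]


def to_n3(prefixes, triples):
    """Render prefix declarations and triples as N3 text, one statement per line."""
    lines = [f"@prefix {name}: <{iri}> ." for name, iri in prefixes.items()]
    lines.append("")  # blank line between the prefixes and the statements
    lines.extend(f"{s} {p} {o} ." for s, p, o in triples)
    return "\n".join(lines)


n3_text = to_n3(PREFIXES, TRIPLES)
print(n3_text)
```

Each statement is simply subject, predicate, object, followed by a full stop, which is a large part of why the notation is considered approachable.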

After all, we cannot expect that white, middle-class male professionals are going to be able to explore the feminine or queer knowledge in this world and mold it into a common knowledge base. And even if marginalized voices can hardly expect the hegemony to advocate their cause, the Semantic Web project itself is at stake if some voices, views and knowledge are excluded. This could indeed be a showstopper for the Semantic Web – not immediately on a technology level, but with regard to meeting the societal goals of its own agenda.

Read the entire interview with Corinna Bath here.

The passage quoted in Alison Adam’s analysis is taken from: Building Large Knowledge-Based Systems: Representation and Inference in the Cyc Project (D.B. Lenat and R.V. Guha, 1990).

Jana Herwig

Just released: UMBEL – A New Vocabulary for the Semantic Web

News has reached me this morning that UMBEL has now been publicly released! UMBEL is a new vocabulary for the Semantic Web – I first learned about it when Andreas Blumauer returned from LinkedData Planet, where he had met up with Mike Bergman from Zitgist LLC, the company working on UMBEL.

Here is the release announcement Mike communicated via email yesterday:

UMBEL (Upper Mapping and Binding Exchange Layer) [1] is a lightweight ontology for relating Web content and data to a standard set of 20,000 subject concepts. Based on OpenCyc [2], these subject concepts have defined relationships between them, and can act as semantic binding nodes for any data or Web content. A further 1.5 million named entities have been extracted from Wikipedia and mapped to the UMBEL reference structure with cross-links to YAGO [3] and DBpedia [4]. The system can easily be extended with additional dictionaries of named entities, including ones specific to enterprises or domains.

UMBEL is provided as open source under the Creative Commons 3.0 Attribution-Share Alike license. The complete ontology with all subject concepts, definitions, terms and relationships can be freely downloaded [see 5]. All subject concepts and named entities are available as Linked Data [see 5]. Five volumes of documentation [5] are also available.

The release is accompanied by about a dozen Web services [6] for using or manipulating UMBEL, along with a new introductory slide show [7]. Additional release information may be found on Fred’s [8] or my [9] separate blog postings. We welcome those with interest or suggestions for improvements to share them through the UMBEL discussion forum [10]. We will shortly be putting easier services online for such input.

So, enjoy! We look forward to your commentary, suggestions and putting UMBEL under production-grade stress. We know we will be doing the same!

Regards, Mike

Great release! They have also given us access to a media-oriented article which you can read on our portal.