Subscribe to

Let me disambiguate that title: Reuters is offering a Web service, called Calais, that will parse text and return it in a form (RDF) that can be utilized by Semantic Web applications. It uses natural language processing (from ClearForest) to find structures of meaning such as places, jobs, facts, events, etc. It apparently has its own metadata schema, but it allows users to extend it. It’s an open API, and Reuters is being quite generous in how much they’ll let you submit during this beta period. It’s English only for now, although they plan to support other languages, opening the exciting prospect of being able to find items of interest in languages you don’t understand via a unified metadata framework.

I’m going by the site’s FAQ. I haven’t tried it and can’t tell how well it works, how accurate it is, how comprehensive or detailed its metadata are, and how much post-processing cleanup uses will want to provide (which of course depends on the application). There are some points I just don’t understand, such as the claim “Calais carries your own metadata anywhere in the content universe.” But if it works within some reasonable definition of “works,” and if it gets widely adopted, Calais could make a lot more information a lot easier to find, and to process for further meaning. [Tags:semantic_web semweb reuters calais nlp ]

3 Responses to “Reuters Semantic Web Web service”

  1. on 13 Feb 2008 at 10:57 amMark V. McDonnell

    David, somewhat OT, but I just wanted to alert you that your HTML is rendering as text, at least for me [up-to-date WinXP, up-to-date FF, Opera *and* Safari].

  2. on 13 Feb 2008 at 11:15 amDavid Weinberger

    definitely my fault. Fixed it. Thanks.

    PS:L I am an idiot,.

  3. on 17 Feb 2008 at 12:08 amJustin Knol

    Hi David,

    Can you check the Calais and ClearForest links in this story. The links both have your site address prefixing the destination & are therefore not working.