Introducing omnsplit

2009-06-17

After a bit of struggle, I now have another literate OWL tool working, along the lines discussed in a previous blog post. Rather than generating the OWL documentation, I now split a Manchester syntax file up, so that I can refer to bits of it. I have this working with OBI, using Protege to produce a single merged ontology file, in Manchester syntax.
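To make the splitting step concrete, here is a minimal Python sketch. It is not the literate_omn code itself. It assumes that every entity frame starts at an unindented line beginning with a frame keyword such as `Class:` or `Individual:`, and that everything up to the next such line belongs to the frame. The function name `split_frames` and the keyword list are assumptions made for illustration.

```python
import re

# Frame keywords that open a new entity frame in Manchester syntax.
# Assumption: a frame starts in column 0; anything else belongs to the
# frame that is currently open.
FRAME_START = re.compile(
    r"^(Class|Individual|ObjectProperty|DataProperty|AnnotationProperty|Datatype):\s*(\S+)"
)


def split_frames(text):
    """Return a list of (kind, name, body) tuples, one per entity frame."""
    frames = []
    current = None
    for line in text.splitlines():
        match = FRAME_START.match(line)
        if match:
            # A new frame begins: remember its kind, its name and its first line.
            current = [match.group(1), match.group(2), [line]]
            frames.append(current)
        elif current is not None:
            current[2].append(line)
        # Lines before the first frame (prefixes, ontology header) are skipped.
    return [(kind, name, "\n".join(body).rstrip()) for kind, name, body in frames]
```

A wrapped annotation value that happens to start a line with one of the keywords would fool this, so a real tool would probably want a proper parser.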

The current implementation is rather simple; it produces one file per entity in the OWL file, which I don't think is entirely good. When run on OBI, it creates over 1400 files, which is a lot. The other problem is that I've had to do some dubious hacking to get the file names to work out: I have to remove spaces and "\"s, as well as ":", which is illegal on NTFS.
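The file name hack might look something like the following Python sketch. It assumes that "\" means a literal backslash, and the function name `entity_filename` and the `.omn` extension are made up for the example.

```python
import re


def entity_filename(name, extension=".omn"):
    """Map an entity name to a file name without spaces, backslashes or colons."""
    # Drop the three characters mentioned above: space, "\" and ":".
    cleaned = re.sub(r"[ \\:]", "", name)
    return cleaned + extension
```

For example, `entity_filename("owl:Thing")` gives `owlThing.omn`. Simply deleting characters means that two different entity names can end up with the same file name, which is part of why the hack is dubious.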

There's also a problem with some of the OWL. Unfortunately, the OBI to OWL conversion process has a reification step whose purpose I don't quite understand. It comes out as anonymous individuals like the one below. I'm not at all sure how the definition has come out as the rdfs:label, but, for sure, you can't use this as a filename!

------------------------------
Individual: relationship:genid7

    Annotations:
        rdfs:label "C located_in C' if and only if: given any c that
instantiates C at a time t, there is some c' such that: c' instantiates
C' at time t and c *located_in* c'. (Here *located_in* is the
instance-level location relation.)"@en,
        oboInOwl:hasDbXref relationship:genid8

    Types:
        oboInOwl:Definition
------------------------------


I think I might change the implementation a bit, though. Having 1400 files in
one directory is not good. My idea is to serialize the entire file out as
latex, with lots of autogenerated macros.

[source,latex]
------------------------------
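%% this would appear in the generated file
\newcommand{\OwlClassowlThing}{
  \begin{omn}
Class: owl:Thing
  \end{omn}
}

%% a dispatcher, defined once, builds the macro name from its arguments
\newcommand{\owlclass}[2]{\csname OwlClass#1#2\endcsname}

%% then in your latex file you would do
\owlclass{owl}{Thing}

%% which would just resolve to the class above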
------------------------------
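Generating these macros could be sketched in Python roughly as follows. This is only an illustration under assumptions: the function names `macro_name` and `latex_macro` are hypothetical, the `omn` environment is assumed to be defined elsewhere, and the macro name keeps the case of the entity name. TeX control words may only contain letters, so everything else is stripped from the name.

```python
import re


def macro_name(kind, name):
    """Build a control-sequence name such as OwlClassowlThing."""
    # TeX control words may contain only letters, so drop everything else.
    letters = re.sub(r"[^A-Za-z]", "", name)
    return "Owl" + kind + letters


def latex_macro(kind, name, body):
    """Wrap one entity frame in a newcommand definition."""
    lines = [
        "\\newcommand{\\" + macro_name(kind, name) + "}{",
        "  \\begin{omn}",
        body,
        "  \\end{omn}",
        "}",
    ]
    return "\n".join(lines) + "\n"
```

Stripping digits has an obvious cost: `relationship:genid7` and `relationship:genid8` would both map to the same macro name, so a real generator would need some way to keep such names apart.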

The only worry with this is that latex would then have to read a large file,
even if most of the macros are not used. This might be really, really slow.
Well, we can but try.

As before, the current version is available at
`git://github.com/phillord/literate_omn.git`.