[NLP2RDF] NIF ontology modifications for dependency relations and ambiguous OLiA properties
Martin Brümmer
der.bruemmer at googlemail.com
Wed Aug 20 16:39:21 CEST 2014
Hi there,
while converting the TIGER corpus and its dependency trees to NIF, I
ran into some limitations of the NIF ontology:
At the moment, the only dedicated property for dependency structures is
"nif:dependency", pointing from the head to the dependent. However, I
think an inverse property would also be nice to have. So I
propose "nif:phraseHead", pointing in the other direction.
A completely missing property is "nif:dependencyRelationType",
annotating the type of the dependency relation as a literal, just like
"nif:posTag" does for POS tags.
It would also be nice to have a property that marks the root node
of a sentence, so that it can be used to traverse the dependency tree of
the sentence, just like "nif:firstWord" and "nif:nextWord" enable
traversing the surface structure of the sentence.
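To make the three proposals concrete, here is a minimal sketch in plain
Python, with triples as tuples rather than a real RDF store. The "ex:"
resources and the relation labels are made-up sample data, and
"nif:dependencyRoot" is only a placeholder name for the root property,
which I have not named yet. Only "nif:dependency" exists in NIF today.

```python
# Triples as (subject, predicate, object) tuples. All "ex:" resources are
# illustrative; nif:dependencyRoot is a hypothetical name for the proposed
# root property, nif:dependencyRelationType and nif:phraseHead are the
# proposed additions.
TRIPLES = [
    ("ex:sentence", "nif:dependencyRoot", "ex:sleeps"),
    ("ex:sleeps", "nif:dependency", "ex:dog"),   # head -> dependent
    ("ex:dog", "nif:dependency", "ex:the"),
    ("ex:dog", "nif:dependencyRelationType", "SB"),
    ("ex:the", "nif:dependencyRelationType", "NK"),
]


def objects(triples, subject, predicate):
    """Return all objects of triples matching subject and predicate."""
    return [o for s, p, o in triples if s == subject and p == predicate]


def with_phrase_heads(triples):
    """Add nif:phraseHead triples as the inverse of nif:dependency."""
    inverse = [(o, "nif:phraseHead", s)
               for s, p, o in triples if p == "nif:dependency"]
    return triples + inverse


def walk(triples, node, depth=0):
    """Depth-first traversal yielding (depth, node, relation type or None)."""
    relation = objects(triples, node, "nif:dependencyRelationType")
    yield (depth, node, relation[0] if relation else None)
    for child in objects(triples, node, "nif:dependency"):
        yield from walk(triples, child, depth + 1)


root = objects(TRIPLES, "ex:sentence", "nif:dependencyRoot")[0]
tree = list(walk(TRIPLES, root))
```

Starting from the root property, "walk" visits every word of the
sentence through "nif:dependency", and "with_phrase_heads" shows that
the inverse property can be derived mechanically from existing data.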
I think these additions to the ontology are not controversial and rather
useful. Do you agree?
On a different note, the OLiA properties may need some changes:
At the moment, nif:oliaLink and nif:oliaCategory are used to link
annotated words to the respective OLiA resources. However, these resources
can be either morphological or syntactic annotations. The properties
themselves don't make it sufficiently clear whether the oliaLink is used to
link to a POS tag category or a syntactic category, like "NounPhrase". I
think this is semantically ambiguous. If OLiA is used for different
classes of annotation, the properties should reflect this. So the properties
should rather be "nif:oliaPosLink" and "nif:oliaSyntaxLink" or something
like that.
Another point in question is that NIF is rather dependent on OLiA
categories. Some tagsets used to annotate corpora are not mapped by
OLiA. Users might also not agree with the OLiA categories themselves and
might like to define their own categories. There is currently no way to
support such additions. Of course we could speak to Christian Chiarcos about
additions to OLiA, but I don't know how open he will be to collaborative
additions and changes to his model. My vague proposal (just an idea at
this point) would be:
A new ontology, "nif-annotation", and some additions to the core ontology
that define classes that are superclasses of OLiA categories and
properties that are superproperties of nif:oliaLink and nif:oliaCategory
(like "nif:posLink/posCategory" and "nif:syntaxLink/syntaxCategory"),
essentially setting an ontology on top of the existing OLiA model and
allowing users to define fitting categories for tags not included in
OLiA without breaking existing NIF applications.
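A rough sketch of how the superproperty idea could work, again in plain
Python with tuples instead of a real RDF store. It assumes one possible
reading of the proposal: nif:oliaLink becomes a subproperty of the
proposed "nif:posLink", and a user adds a custom link property next to
it. "ex:myTagsetLink", "ex:MyCategory" and the word resources are
hypothetical names.

```python
# Child property -> parent property, standing in for rdfs:subPropertyOf.
# nif:posLink is the proposed superproperty; ex:myTagsetLink is a
# hypothetical user-defined property for a tagset not mapped by OLiA.
SUBPROPERTY_OF = {
    "nif:oliaLink": "nif:posLink",
    "ex:myTagsetLink": "nif:posLink",
}

TRIPLES = [
    ("ex:word1", "nif:oliaLink", "olia:Noun"),
    ("ex:word2", "ex:myTagsetLink", "ex:MyCategory"),
]


def is_subproperty(prop, ancestor):
    """True if prop equals ancestor or reaches it via the parent chain."""
    while prop is not None:
        if prop == ancestor:
            return True
        prop = SUBPROPERTY_OF.get(prop)
    return False


def query(predicate):
    """Return (subject, object) pairs linked by predicate or a subproperty."""
    return [(s, o) for s, p, o in TRIPLES if is_subproperty(p, predicate)]


all_pos_links = query("nif:posLink")    # OLiA links and custom links
olia_only = query("nif:oliaLink")       # what existing applications see
```

Existing applications that ask for nif:oliaLink still get exactly the
OLiA annotations, while applications that ask for the superproperty
also see the user-defined categories.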
So what do you think?
regards,
Martin