<html>

  <head>

    <meta content="text/html; charset=ISO-8859-1"

      http-equiv="Content-Type">

  </head>

  <body bgcolor="#FFFFFF" text="#000000">

    <div class="moz-cite-prefix">On 30.05.13 08:07, Steve Cassidy wrote:<br>

    </div>

    <blockquote

cite="mid:CADg8aoiuc00hmyOK=v2YENFbwuF-458E_ETgZ=5rM7p8R9PP7Q@mail.gmail.com"

      type="cite">

      <div dir="ltr"><br>

        <div class="gmail_extra">

          <div class="gmail_quote">

            <blockquote class="gmail_quote" style="margin:0px 0px 0px

0.8ex;border-left-width:1px;border-left-color:rgb(204,204,204);border-left-style:solid;padding-left:1ex"><br>

              The basic unit in NIF is the nif:Context, so the

              document-level is covered, when the string in a

              nif:Context equals the content of a document.&nbsp;</blockquote>

            <blockquote class="gmail_quote" style="margin:0px 0px 0px

0.8ex;border-left-width:1px;border-left-color:rgb(204,204,204);border-left-style:solid;padding-left:1ex">...<br>

              &lt;Alcoholism.txt#char=37028,37043&gt;<br>

              &nbsp; &nbsp; &nbsp; &nbsp; a &nbsp;nif:RFC5147String ;<br>

              &nbsp; &nbsp; &nbsp; &nbsp; nif:beginIndex "37028" ;<br>

              &nbsp; &nbsp; &nbsp; &nbsp; nif:endIndex "37043" ;<br>

              &nbsp; &nbsp; &nbsp; &nbsp; itsrdf:taIdentRef &lt;<a moz-do-not-send="true"

                href="http://dbpedia.org/resource/Benzodiazepine"

                target="_blank">http://dbpedia.org/resource/Benzodiazepine</a>&gt;

              ;<br>

              &nbsp; &nbsp; &nbsp; &nbsp; nif:referenceContext

              &lt;Alcoholism.txt#char=0,91429&gt; &nbsp;.<br>

            </blockquote>

            <div><br>

            </div>

            <div style="">Just wondering why you don't use

              &lt;Alcoholism.txt&gt; when making assertions about the

              document as a whole rather than giving the entire

              character range as a qualifier. </div>

          </div>

        </div>

      </div>

    </blockquote>

    <br>

    Hi Steve,<br>

    <br>

    Sebastian may have a different answer, but here is my view, based on
    how this is used in ITS 2.0: when you convert a document like<br>
<a class="moz-txt-link-freetext" href="http://www.w3.org/International/multilingualweb/lt/drafts/its20/its20.html#EX-HTML-whitespace-normalization">http://www.w3.org/International/multilingualweb/lt/drafts/its20/its20.html#EX-HTML-whitespace-normalization</a><br>
    to NIF, you make many decisions about what to drop (white-space
    nodes, the content of HTML "head", or "script" inside "body") and how
    to segment (e.g. not extracting the content of "span" separately but
    rather as part of "p"). The extracted string can therefore differ from
    the document itself, so the document URI alone does not say which
    string the offsets refer to. nif:referenceContext, together with
    nif:isString, states clearly what the complete extracted string
    is.<br>
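    <br>
    To illustrate the point, here is a minimal Python sketch. It assumes
    white-space normalization is the only extraction decision, and the
    names normalize, char_uri and describe, as well as the file name
    doc.txt, are hypothetical. The result is a plain dictionary whose keys
    mirror the NIF property names, not real RDF output.<br>

```python
def normalize(raw):
    # One possible extraction decision (an assumption): collapse
    # every run of white space into a single space.
    return " ".join(raw.split())


def char_uri(base, begin, end):
    # Build a "#char=begin,end" fragment URI in the style of the
    # Alcoholism.txt example quoted in this thread.
    return f"{base}#char={begin},{end}"


def describe(base, raw, phrase):
    # Offsets are computed on the extracted string, not on the raw input.
    text = normalize(raw)
    begin = text.index(phrase)
    end = begin + len(phrase)
    context = char_uri(base, 0, len(text))
    return {
        "context": {"uri": context, "nif:isString": text},
        "annotation": {
            "uri": char_uri(base, begin, end),
            "nif:beginIndex": begin,
            "nif:endIndex": end,
            "nif:referenceContext": context,
        },
    }


raw = "Alcohol   and\n benzodiazepines  interact."
result = describe("doc.txt", raw, "benzodiazepines")
print(result["context"]["uri"])
print(result["annotation"]["uri"])
print(raw.index("benzodiazepines"), result["annotation"]["nif:beginIndex"])
```

    The last line prints two different offsets for the same phrase, one
    in the raw input and one in the extracted string, which is why the
    annotation points to a context that carries the string itself.<br>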

    <br>

    Best,<br>

    <br>

    Felix<br>

    <br>

    <blockquote

cite="mid:CADg8aoiuc00hmyOK=v2YENFbwuF-458E_ETgZ=5rM7p8R9PP7Q@mail.gmail.com"

      type="cite">

      <div dir="ltr">

        <div class="gmail_extra">

          <div class="gmail_quote">

            <div style="">&nbsp;Presumably the same assertion would be true

              of &lt;Alcoholism.txt#char=0,91427&gt; &nbsp;too but if you are

              trying to encode document level meta-data and you have an

              identifier for the document, why not use it?&nbsp;</div>

            <div style=""><br>

            </div>

            <div style="">Steve</div>

            <div>&nbsp;</div>

          </div>

          -- <br>

          Department of Computing, Macquarie University

          <div><a moz-do-not-send="true"

              href="http://web.science.mq.edu.au/%7Ecassidy/"

              target="_blank">http://web.science.mq.edu.au/~cassidy/</a></div>

        </div>

      </div>


    </blockquote>

    <br>

  </body>

</html>