<html>

  <head>

    <meta content="text/html; charset=ISO-8859-1"

      http-equiv="Content-Type">

  </head>

  <body bgcolor="#FFFFFF" text="#000000">

    <div class="moz-cite-prefix">Am 30.05.13 09:01, schrieb Steve

      Cassidy:<br>

    </div>

    <blockquote

cite="mid:CADg8aoinp3bPqSPK=hkNwG0NHpK_b+R7Ec5L2oiVAjgkQ-SVrg@mail.gmail.com"

      type="cite">

      <div dir="ltr">On 30 May 2013 16:39, Felix Sasaki <span dir="ltr">&lt;<a

            moz-do-not-send="true" href="mailto:fsasaki@w3.org"

            target="_blank">fsasaki@w3.org</a>&gt;</span> wrote:

        <div><br>

          <div class="gmail_extra">

            <div class="gmail_quote">

              <blockquote class="gmail_quote" style="margin:0px 0px 0px

0.8ex;border-left-width:1px;border-left-color:rgb(204,204,204);border-left-style:solid;padding-left:1ex">

                <div bgcolor="#FFFFFF" text="#000000"> Well, do avoid

                  the problem you need two pieces of information:<br>

                  - document URI independent of complete character range<br>

                  - document URI + complete character range <br>

                  <a moz-do-not-send="true"

                    href="http://example.com/exampledoc.html#=char=0,29"

                    target="_blank">http://example.com/exampledoc.html#=char=0,29</a>

                  gives you both, and the ability to distinguish between

                  different calculations of complete character ranges.<br>

                </div>

              </blockquote>

              <div style=""><br>

              </div>

              <div>&lt;<a moz-do-not-send="true"

                  href="http://example.com/exampledoc.html#=char=0,29"

                  target="_blank">http://example.com/exampledoc.html#=char=0,29</a>&gt;

                xx:wordcount 5 .</div>

              <div>&lt;<a moz-do-not-send="true"

                  href="http://example.com/exampledoc.html#=char=0,29"

                  target="_blank">http://example.com/exampledoc.htm</a>l&gt;

                xx:wordcount 5 .<br>

              </div>

              <div><br>

              </div>

              <div style="">These are two separate statements and not

                related unless we say</div>

              <div style=""><br class="">

                &lt;<a moz-do-not-send="true"

                  href="http://example.com/exampledoc.html#=char=0,29"

                  target="_blank">http://example.com/exampledoc.htm</a>l&gt;&nbsp;</div>

              <div style="">&nbsp; &nbsp; &nbsp; &nbsp; xx:full_character_range &lt;<a

                  moz-do-not-send="true"

                  href="http://example.com/exampledoc.html#=char=0,29"

                  target="_blank">http://example.com/exampledoc.html#=char=0,29</a>&gt;

                .</div>

              <div style=""><br>

              </div>

              <div style="">which of course you could assert. &nbsp;</div>

              <div style=""><br>

              </div>

              <div style="">I guess the question is for a processing

                component that wants to make an assertion in its output

                about the document as a whole so that a subsequent step

                can use it. &nbsp;Should it use the input document URI or

                make an assertion about the character range that it used

                to represent the document internally. &nbsp;Given that the

                character range might be different between different

                components, it would seem useful to have a way of making

                assertions about the whole document that didn't depend

                on how it was pre-processed.</div>

            </div>

          </div>

        </div>

      </div>

    </blockquote>

    <br>

    <br>

    I think you have the pre-processing information via

    nif:wasConvertedFrom, see<br>

<a class="moz-txt-link-freetext" href="http://www.w3.org/International/multilingualweb/lt/drafts/its20/examples/nif/EX-nif-conversion-output.xml">http://www.w3.org/International/multilingualweb/lt/drafts/its20/examples/nif/EX-nif-conversion-output.xml</a><br>

    and a URI like<br>

    <a class="moz-txt-link-freetext" href="http://example.com/exampledoc.html#xpath(/html/body">http://example.com/exampledoc.html#xpath(/html/body</a>[1]/h2[1]/b[1])<br>

    gives you the source of the NIF data before the pre-processing. It

    is defined in <br>

<a class="moz-txt-link-freetext" href="http://persistence.uni-leipzig.org/nlp2rdf/ontologies/nif-core/version-1.0/nif-core.ttl">http://persistence.uni-leipzig.org/nlp2rdf/ontologies/nif-core/version-1.0/nif-core.ttl</a><br>

    as a sub property of prov:wasDerivedFrom.<br>

    <br>

    Best,<br>

    <br>

    Felix<br>

    <br>

    <br>

    <blockquote

cite="mid:CADg8aoinp3bPqSPK=hkNwG0NHpK_b+R7Ec5L2oiVAjgkQ-SVrg@mail.gmail.com"

      type="cite">

      <div dir="ltr">

        <div>

          <div class="gmail_extra">

            <div class="gmail_quote">

              <div style=""><br>

              </div>

              <blockquote class="gmail_quote" style="margin:0px 0px 0px

0.8ex;border-left-width:1px;border-left-color:rgb(204,204,204);border-left-style:solid;padding-left:1ex">

                <div bgcolor="#FFFFFF" text="#000000"> Can you give a

                  triple and a sparql query that only works if we drop

                  #=char=0,29 from the URI?<br>

                  <div class="im"><br>

                  </div>

                </div>

              </blockquote>

              <div style="">Well, it would be the result of two

                components making assertions about different character

                ranges each believing that it is making an assertion

                about the whole document.</div>

              <div style=""><br>

              </div>

              <div style="">Steve</div>

              <div style=""><br>

              </div>

            </div>

            -- <br>

            Department of Computing, Macquarie University

            <div><a moz-do-not-send="true"

                href="http://web.science.mq.edu.au/%7Ecassidy/"

                target="_blank">http://web.science.mq.edu.au/~cassidy/</a></div>

          </div>

        </div>

      </div>

    </blockquote>

    <br>

  </body>

</html>