<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Friendly XML, trusted XML and parity bits</title>
	<atom:link href="http://labnotes.org/2007/02/01/friendly-xml-trusted-xml-and-parity-bits/feed/" rel="self" type="application/rss+xml" />
	<link>http://labnotes.org/2007/02/01/friendly-xml-trusted-xml-and-parity-bits/</link>
	<description></description>
	<lastBuildDate>Tue, 15 May 2012 02:41:51 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.2</generator>
	<item>
		<title>By: Assaf</title>
		<link>http://labnotes.org/2007/02/01/friendly-xml-trusted-xml-and-parity-bits/comment-page-1/#comment-137526</link>
		<dc:creator>Assaf</dc:creator>
		<pubDate>Thu, 01 Feb 2007 17:28:56 +0000</pubDate>
		<guid isPermaLink="false">http://blog.labnotes.org/2007/02/01/friendly-xml-trusted-xml-and-parity-bits/#comment-137526</guid>
		<description>Eran,

Canonical XML (which XML sig uses) is UTF-8 encoded. So for hasing purposes, there&#039;s only one encoding to get right. And if you use any other encoding for the actual document, it verifies you got the implementation right (or wrong, but consistently on both ends).

Whitespace is a bit trickier, but that too is solvable.

The point is to make a strong assertion that two implementations are working on the same data, at least in the XML sense of it. Then you can relax well-formed to be just a means to that end, and turn it off in some implementations.</description>
		<content:encoded><![CDATA[<p>Eran,</p>
<p>Canonical XML (which XML sig uses) is UTF-8 encoded. So for hasing purposes, there&#8217;s only one encoding to get right. And if you use any other encoding for the actual document, it verifies you got the implementation right (or wrong, but consistently on both ends).</p>
<p>Whitespace is a bit trickier, but that too is solvable.</p>
<p>The point is to make a strong assertion that two implementations are working on the same data, at least in the XML sense of it. Then you can relax well-formed to be just a means to that end, and turn it off in some implementations.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Aristotle</title>
		<link>http://labnotes.org/2007/02/01/friendly-xml-trusted-xml-and-parity-bits/comment-page-1/#comment-137525</link>
		<dc:creator>Aristotle</dc:creator>
		<pubDate>Thu, 01 Feb 2007 12:49:50 +0000</pubDate>
		<guid isPermaLink="false">http://blog.labnotes.org/2007/02/01/friendly-xml-trusted-xml-and-parity-bits/#comment-137525</guid>
		<description>The problem will be defining a canonical form that everyone agrees and which actually looks like what any application would actually use directly. If you define a canonical form thatâ€™s just another intermediate format it might help a little, but you wonâ€™t really gain that much.</description>
		<content:encoded><![CDATA[<p>The problem will be defining a canonical form that everyone agrees and which actually looks like what any application would actually use directly. If you define a canonical form thatâ€™s just another intermediate format it might help a little, but you wonâ€™t really gain that much.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Eran</title>
		<link>http://labnotes.org/2007/02/01/friendly-xml-trusted-xml-and-parity-bits/comment-page-1/#comment-137523</link>
		<dc:creator>Eran</dc:creator>
		<pubDate>Thu, 01 Feb 2007 11:04:15 +0000</pubDate>
		<guid isPermaLink="false">http://blog.labnotes.org/2007/02/01/friendly-xml-trusted-xml-and-parity-bits/#comment-137523</guid>
		<description>Hashing XML files is a very delicate process.
Just look at the &lt;a href=&quot;http://www.w3.org/Signature/&quot;&gt;XML Signature specs&lt;/a&gt; which has a section about hashing an XML document (or part of it) for change verification purposes.

There are a couple of problems there:
1) Encoding, some implementation will not handle various encoding correcting and will hash something that is not the exact coding thus producing a wrong hash.
2) Whitespaces - the holy pain the butt for XML - some will hash with whitespaces, some without.

Even hashing this simply plain crap is going to be a nightmare which might suffer from the same stuff every other XML document has.

Then again, good idea though... It&#039;s not that I&#039;m just seeing the half empty glass :-)</description>
		<content:encoded><![CDATA[<p>Hashing XML files is a very delicate process.<br />
Just look at the <a href="http://www.w3.org/Signature/">XML Signature specs</a> which has a section about hashing an XML document (or part of it) for change verification purposes.</p>
<p>There are a couple of problems there:<br />
1) Encoding, some implementation will not handle various encoding correcting and will hash something that is not the exact coding thus producing a wrong hash.<br />
2) Whitespaces &#8211; the holy pain the butt for XML &#8211; some will hash with whitespaces, some without.</p>
<p>Even hashing this simply plain crap is going to be a nightmare which might suffer from the same stuff every other XML document has.</p>
<p>Then again, good idea though&#8230; It&#8217;s not that I&#8217;m just seeing the half empty glass :-)</p>
]]></content:encoded>
	</item>
</channel>
</rss>

