<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Avi's Code &#187; Parser</title>
	<atom:link href="http://telyas.com/wordpress2/tag/parser/feed/" rel="self" type="application/rss+xml" />
	<link>http://telyas.com/wordpress2</link>
	<description>My Open source and "free time" work</description>
	<lastBuildDate>Mon, 11 May 2009 07:08:19 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0.1</generator>
		<item>
		<title>Android Nekohtml &#8211; Project postponed</title>
		<link>http://telyas.com/wordpress2/2009/01/31/android-nekohtml-project-postponed/</link>
		<comments>http://telyas.com/wordpress2/2009/01/31/android-nekohtml-project-postponed/#comments</comments>
		<pubDate>Sat, 31 Jan 2009 20:45:32 +0000</pubDate>
		<dc:creator>Avi Telyas</dc:creator>
				<category><![CDATA[Android]]></category>
		<category><![CDATA[HTML]]></category>
		<category><![CDATA[nekohtml]]></category>
		<category><![CDATA[Parser]]></category>

		<guid isPermaLink="false">http://telyas.com/wordpress2/?p=95</guid>
		<description><![CDATA[I am sorry to announce that the Html project is postponed to a later date. I would like to complete the 2Chan browser as soon as possible, and although an Html parser would have been nice, it is not really necessary. For the 2chan browser, the parser required is quite simple and can be accomplished by [...]]]></description>
			<content:encoded><![CDATA[<p><img class="alignnone size-full wp-image-99" title="2Ch Android Logo" src="http://telyas.com/wordpress2/wp-content/uploads/2009/01/try1.jpg" alt="2Ch Android Logo" width="400" height="400" /></p>
<p>I am sorry to announce that the Html project is postponed to a later date.</p>
<p>I would like to complete the 2Chan browser as soon as possible, and although an Html parser would have been nice, it is not really necessary. The 2chan browser needs only simple parsing, which a small dedicated parser can handle.</p>
<p>Again, sorry, and I do promise to continue this project at a later date.</p>
<p>One more comment on Nekohtml: it requires <a href="http://xerces.apache.org/xerces2-j/">Xerces</a>, so any porting effort should start there. There is a <a href="http://groups.google.com/group/android-developers/browse_thread/thread/50f5e3075d08b58c/84bb655efa405552?show_docid=84bb655efa405552&amp;pli=1">guy who claimed he has done that</a>, but I couldn&#8217;t find any link to the ported lib.</p>
<p><em>The Android Logo (AKA Droid Robot) was modified. The original logo is <a href="http://code.google.com/policies.html">shared by Google</a> and used according to terms described in the <a href="http://creativecommons.org/licenses/by/2.5/">Creative Commons 2.5 Attribution License</a>.</em></p>
]]></content:encoded>
			<wfw:commentRss>http://telyas.com/wordpress2/2009/01/31/android-nekohtml-project-postponed/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Android + Nekohtml &#8211; the Naive approach</title>
		<link>http://telyas.com/wordpress2/2009/01/30/android-nekohtml-the-naive-approach/</link>
		<comments>http://telyas.com/wordpress2/2009/01/30/android-nekohtml-the-naive-approach/#comments</comments>
		<pubDate>Fri, 30 Jan 2009 20:15:25 +0000</pubDate>
		<dc:creator>Avi Telyas</dc:creator>
				<category><![CDATA[Android]]></category>
		<category><![CDATA[HTML]]></category>
		<category><![CDATA[nekohtml]]></category>
		<category><![CDATA[Parser]]></category>
		<category><![CDATA[SAX]]></category>

		<guid isPermaLink="false">http://telyas.com/wordpress2/?p=90</guid>
		<description><![CDATA[Nekohtml zip includes a JAR file. Naive approach: Add the JAR to the build path, and hope for the best. How: Quite simple, create a lib directory under the project, copy the JAR file. Add code that accesses nekohtml: DOMParser parser = new DOMParser(); Result: Failed! W/dalvikvm(  170): Unable to resolve superclass of Lorg/cyberneko/html/parsers/DOMParser; [...]]]></description>
			<content:encoded><![CDATA[<p>Nekohtml zip includes a JAR file.</p>
<h3><strong>Naive approach</strong>:</h3>
<p>Add the JAR to the build path, and hope for the best.</p>
<h3>How:</h3>
<ol>
<li>Create a lib directory under the project and copy the JAR file into it.</li>
<li>Add code that accesses nekohtml:</li>
</ol>
<blockquote>
<pre>DOMParser parser = new DOMParser();</pre>
</blockquote>
<h3>Result:</h3>
<p><strong>Failed! </strong></p>
<blockquote>
<pre>W/dalvikvm(  170): Unable to resolve superclass of Lorg/cyberneko/html/parsers/DOMParser; (115)
W/dalvikvm(  170): Link of class 'Lorg/cyberneko/html/parsers/DOMParser;' failed
W/dalvikvm(  170): VFY: unable to resolve new-instance 196 (Lorg/cyberneko/html/parsers/DOMParser;)</pre>
</blockquote>
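<p>The first log line says the VM could not resolve the superclass of <code>DOMParser</code>, which suggests that a dependency of the JAR is missing rather than the JAR itself. One way to narrow this down is to probe for the classes by name. A rough sketch only: the helper <code>isClassAvailable</code> is hypothetical, and the Xerces class name is an assumption about which superclass is missing (NekoHTML is built on Xerces):</p>

```java
class DependencyCheck {

    /** Returns true if the named class can be loaded and linked by the current class loader. */
    static boolean isClassAvailable(String className) {
        try {
            Class.forName(className);
            return true;
        } catch (ClassNotFoundException | LinkageError e) {
            // Either the class is not on the classpath, or one of its
            // own dependencies (such as a superclass) failed to link.
            return false;
        }
    }

    public static void main(String[] args) {
        String[] required = {
            "org.apache.xerces.parsers.DOMParser",  // assumed superclass, from Xerces
            "org.cyberneko.html.parsers.DOMParser"  // the NekoHTML class named in the log
        };
        for (String name : required) {
            System.out.println(name + " available: " + isClassAvailable(name));
        }
    }
}
```

<p>If the Xerces class is reported as unavailable, adding the NekoHTML JAR alone cannot work, whatever the build path looks like.</p>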
]]></content:encoded>
			<wfw:commentRss>http://telyas.com/wordpress2/2009/01/30/android-nekohtml-the-naive-approach/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Android HTML parser</title>
		<link>http://telyas.com/wordpress2/2009/01/30/android-html-parser/</link>
		<comments>http://telyas.com/wordpress2/2009/01/30/android-html-parser/#comments</comments>
		<pubDate>Fri, 30 Jan 2009 19:54:14 +0000</pubDate>
		<dc:creator>Avi Telyas</dc:creator>
				<category><![CDATA[Android]]></category>
		<category><![CDATA[HTML]]></category>
		<category><![CDATA[Parser]]></category>

		<guid isPermaLink="false">http://telyas.com/wordpress2/?p=87</guid>
		<description><![CDATA[Android API has a SAX XML parser. But what do you do when you want to parse HTML? The SAX XML parser will work great until it encounters an XML error, a mismatched tag as an obvious example. The parser will throw an exception. You can implement the warning and error functions, but it will [...]]]></description>
			<content:encoded><![CDATA[<p>The Android API has a <a href="http://code.google.com/android/reference/javax/xml/parsers/SAXParser.html">SAX XML parser</a>. But what do you do when you want to parse HTML?</p>
<p>The SAX XML parser works well until it encounters an XML error, a mismatched tag being the obvious example. At that point the parser throws an exception. You can implement the warning and error callbacks, but that does not help: the exception still occurs and halts parsing.</p>
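<p>A minimal sketch of that failure, using the standard JAXP SAX API on a desktop JVM rather than on Android itself (an assumption; the names <code>SaxOnHtmlDemo</code>, <code>LenientHandler</code> and <code>parsesCleanly</code> are made up for illustration). The handler swallows warnings and recoverable errors, yet a mismatched tag is a fatal well-formedness error, so parsing still stops:</p>

```java
import java.io.StringReader;
import javax.xml.parsers.SAXParser;
import javax.xml.parsers.SAXParserFactory;
import org.xml.sax.Attributes;
import org.xml.sax.InputSource;
import org.xml.sax.SAXParseException;
import org.xml.sax.helpers.DefaultHandler;

class SaxOnHtmlDemo {

    /** Handler that ignores warnings and recoverable errors instead of rethrowing them. */
    static class LenientHandler extends DefaultHandler {
        int elementsSeen = 0;

        @Override
        public void startElement(String uri, String localName, String qName, Attributes attributes) {
            elementsSeen++;
        }

        @Override
        public void warning(SAXParseException e) {
            // ignored on purpose
        }

        @Override
        public void error(SAXParseException e) {
            // ignored on purpose
        }
    }

    /** Returns true if the parser reached the end of the input without a fatal error. */
    static boolean parsesCleanly(String markup) {
        try {
            SAXParser parser = SAXParserFactory.newInstance().newSAXParser();
            parser.parse(new InputSource(new StringReader(markup)), new LenientHandler());
            return true;
        } catch (SAXParseException e) {
            // Fatal errors (such as a mismatched end tag) end up here.
            return false;
        } catch (Exception e) {
            throw new IllegalStateException("parser setup or I/O failed", e);
        }
    }

    public static void main(String[] args) {
        System.out.println(parsesCleanly("<p><b>bold</b></p>")); // well-formed input
        System.out.println(parsesCleanly("<p><b>bold</p></b>")); // closing tags swapped
    }
}
```

<p>With this handler, the well-formed string parses to the end, while the string with swapped closing tags aborts with a <code>SAXParseException</code>.</p>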
<p>Therefore, the only solution is to import an external HTML parser to Android.</p>
<p>I searched, read, compared, and decided to use <a href="http://nekohtml.sourceforge.net/index.html">NekoHtml</a>. I am not sure how the programming team chose the name, but it looks like a strong implementation.</p>
<p>I will document all the steps I take to include this library, until the hopefully successful outcome.</p>
<h3>Step 1: <a href="http://sourceforge.net/project/platformdownload.php?group_id=195122">Download NekoHtml from sourceforge</a></h3>
]]></content:encoded>
			<wfw:commentRss>http://telyas.com/wordpress2/2009/01/30/android-html-parser/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
