This is an exported version of the JIRA issue tracker. Please use the Google Code site to open new tickets or report updates to these existing tickets. Feel free to contact the mailing list with any questions.

[GJC-10] Recode XmlW/HtmlW to be two different childish parsers
Created: Wed, 21 Jan 2004 00:06:55 -0800 (PST)  Updated: Fri, 18 Jun 2004 15:22:41 -0700 (PDT)

Status:Open
Project:Genjava
Component/s:gj-scrape
Affects Version/s:scrape-1.0
Fix Version/s:

Type:ImprovementPriority: Major
Reporter:Henri YandellAssignee:Henri Yandell
Resolution:Unresolved 
Environment:


 Description   
XmlW and HtmlW are pretty much the same except that HtmlW is case insensitive and has more in the way of escaping. These should probably be split so you can have HtmlW escaping with case sensitive parsing, not that it is really parsing, just lots of indexOf trickery.

Also need to check which of the removeXml methods are correct. XmlW's has an extra continue statement and is probably the more correct.
Comment by bayard [ Fri, 18 Jun 2004 15:22:41 -0700 (PDT) ]
Assuming the continue XmlW variant is correct, so killed HtmlW's remove method. Rest of this is being left for a later release.