shlax: a shallow, lazy XML parser in python

Leave a Comment
Recently, I stumbled upon a paper from the dawn age of XML:"REX: XML Shallow Parsing with Regular Expressions", Robert D. Cameronhttp://www.cs.sfu.ca/~cameron/REX.htmlIt describes how to do something I'd never seen done before: parse the entirety of standard XML syntax in a single regular expression. We've all written short regexes to find some particular feature in an XML document, but we've also all seen those fail because of oddities of whitespace, quoting, linebreaks, etc., that are perfectly legal, but hard to account...
Read More
Next PostNewer Posts Previous PostOlder Posts Home