New Code!

After an intermission of a few weeks, Discount has been updated to version 1.2.1 with the addition of a completely reworked emphasis parser. Previously, it did a naive “turn all *’s and _’s into <em>; turn all ** and __ into <strong>”, which led to incorrect XML when I interleaved * and **, but it now attempts to pair matching tokens before it starts spitting out emphasis.

The way I did this was to split the existing second pass (Discount has a first pass that breaks the input into blocks, and a second pass that does text substitutions on the contents of those blocks) into two passes. The second pass now converts runs of * into emphasis tokens, interleaved with fully processed other text, and the third pass concatenates them, matching open and close emphasis as it goes.
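To make the tokenize-then-match idea concrete, here is a rough sketch in Python. It is not Discount's code (Discount is written in C), and the names tokenize, render and to_html, along with the specific pairing rules, are assumptions made up for illustration. It handles only *, skips _, and does no HTML escaping.

```python
import re

TAGS = {1: "em", 2: "strong"}  # stars consumed -> tag name


def tokenize(text):
    """First step: split text into ("star", run) and ("text", chunk) tokens."""
    return [("star" if piece[0] == "*" else "text", piece)
            for piece in re.findall(r"\*+|[^*]+", text)]


def render(tokens):
    """Second step: concatenate tokens, pairing open and close emphasis."""
    out = []    # output fragments; opener slots are filled in once matched
    stack = []  # pending openers as [index of slot in out, stars left]
    for i, (kind, piece) in enumerate(tokens):
        if kind == "text" or len(piece) > 3:
            out.append(piece)  # plain text, or a run too long to be emphasis
            continue
        n = len(piece)
        before = tokens[i - 1][1] if i > 0 else " "
        after = tokens[i + 1][1] if i + 1 < len(tokens) else " "
        can_close = bool(stack) and not before[-1].isspace()
        can_open = not after[0].isspace()
        if can_close and can_open:
            # Assumed rule: a run with text on both sides closes only if the
            # innermost opener can absorb all of it; otherwise it opens.
            m = stack[-1][1]
            can_close = m == n or m == 3
        closing = ""
        while can_close and n and stack:
            slot, m = stack[-1]
            if n >= m:
                k = m          # use up the whole opener
            elif m == 3:
                k = n          # peel <em> or <strong> off a *** opener
            else:
                break          # a lone * cannot close half of a ** opener
            for unit in ([1, 2] if k == 3 else [k]):
                # Later closers are outer, so their open tags go in front.
                out[slot] = f"<{TAGS[unit]}>" + out[slot]
                closing += f"</{TAGS[unit]}>"
            stack[-1][1] -= k
            if stack[-1][1] == 0:
                stack.pop()
            n -= k
        if closing:
            out.append(closing)
        if n and can_open:
            stack.append([len(out), n])  # reserve a slot for the open tags
            out.append("")
        elif n:
            out.append("*" * n)          # leftover stars stay literal
    for slot, left in stack:
        out[slot] = "*" * left + out[slot]  # unmatched openers stay literal
    return "".join(out)


def to_html(text):
    return render(tokenize(text))


for sample in ("***A*B**", "***A**B*", "**A*B***", "*A**B***"):
    print(sample, "->", to_html(sample))
```

Because the open tags are written into a reserved slot only when a closer turns up, the output stays properly nested however the runs are split. With these rules, ***A*B** comes out as <strong><em>A</em>B</strong> and *A**B*** as <em>A<strong>B</strong></em>; Discount itself may well nest things differently.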

For example, the four variants of **A*B*** now produce correct XML:

***A*B** –> AB
***A**B* –> AB
**A*B*** –> AB
*A**B*** –> AB

which is much better than the status quo ante.

So it doesn’t dump core (as the presence of this weblog post shows), it fixes a few memory leaks, and it produces better XML on pathological emphasis cases. And that’s good enough to be New Code! to amaze and educate your friends and family.

—orc Thu Apr 10 19:55:44 2008
