Thanks for sharing. I'm curious about a couple of things:
- What's your favourite way to preprocess images? Most real-world data isn't already segmented into images of complete words, and that preprocessing step could affect the quality of the final output.
- Do you know of any open source models I could use as a starting point to recognise text chunks with some 2D structure, like math equations that span multiple lines?
Sort of off-topic, but an interesting angle on handwriting recognition:
The Sony DPT-RP1 digital paper system has a stylus for handwritten notes and highlighting. It does not have handwriting recognition; however, there is a search feature that finds occurrences of the handwritten characters _star_ and _asterisk_ in your document. Personally, I would like to see these two tokens expanded to a customizable set of handwritten search tokens. The reMarkable tablet's SDK was recently discussed [1] here at HN, and this article is inspiring in this regard.
I don't think it's off-topic :) This is a really useful and, I suspect, under-rated feature. There was a similar feature in an older version of the Livescribe digital pen software. You could search for any text string without having run the handwriting-to-text function. This was incredibly useful, as you could search for terms not in the dictionary (for example, in my lab book I could search for gene names that handwriting-to-text would routinely scramble).
Sadly they removed it, and I've never seen anything similar again (with the exception of the DPT-RP1 example you mention).