is there a type of quasi-OCR already done that i can leverage to just recognize exact characters which are always rendered identically like this or do i need to get my paws dirty in the python mines
@mavica_again FWIW I have gone through the Python mines doing exactly this. Even with custom code I could not get better than roughly 99% accuracy, mostly because of lookalike characters. Every off-the-shelf OCR system I tried was much worse (no better than roughly 85% IIRC).
Honestly, unless you have some kind of checksum to verify the result, you might have to just do it manually. Either manually typing in the characters ("human OCR" π), or having some code do a first pass and manually reviewing the result.
@diazona given that windows cleartype makes every single character pixel perfect identical and i've reduced it to 16 colors for even higher accuracy and even I and l are distinctive (dunno which one is which, but i can tell the two apart) i think i'll get 100%. there's no way i'm typing all of it out lol it's massive
@mavica_again Gotcha, makes sense. I dunno, you might actually be the first!