![]() ![]() ![]() I couldn't get Tesseract to pick Z against 2, I against 1 etc reliably when no serifs, no matter what i did for preprocessing - dilate/erode, threshold, border, contours with smoothing, x2 and x5 resize, all the modes and so on.īecause it's engineering diagrams only limited contextual guidance can be used, it's basically independent characters each time. I have just given up on Tesseract for parsing engineering diagrams where the one single font is used throughout and rolled my own non-ML OCR using opencv that requires a little setup but easy hits 99.9% on a character basis and is way quicker, plus uses way less data stored. As usual with Tesseract it's great if you want generic OCR that gives 97-98% accuracy with little to no work, but it's never going to hit 100% or near across all fonts, diagrammatic input etc. ![]()
0 Comments
Leave a Reply. |
AuthorWrite something about yourself. No need to be fancy, just an overview. ArchivesCategories |