Image-to-text (OCR) functionality omits top-most line for recognition / output

#8
by snowboarder04 - opened

I tried uploading a PNG containing handwritten notes using LM Studio and it returned the text but in almost all cases the top line of text from the document is missing, especially when near a possible margin.

I have tried several docs, with varying results which are unique to this model.

Other more purpose-trained models, such as olmocr-7b DO recognise / output the topmost line.
Gemma template_1.png

Your need to confirm your account before you can post a new comment.

Sign up or log in to comment