accessibility - OCRMyPdf conversion issue specially in table format - Stack Overflow

programmeradmin, 2025-04-05

I’m working with a scanned PDF that contains a table with two columns, where each column has two lines of text. When I convert the scanned PDF with OCRmyPDF, the text in the output is not structured the way the table is.

Tesseract processes the text line by line, so OCRmyPDF generates a separate span for each piece of content: one span for row 1, cell 1, another for row 1, cell 2, and then separate spans for row 2, cell 1 and row 2, cell 2.
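To make the ordering concrete, here is a minimal sketch in plain Python. It does not use OCRmyPDF or Tesseract at all. The `Span` class, the coordinates, and the `column_boundary` value are hypothetical and exist only to illustrate the difference between line-by-line order and cell-by-cell order.

```python
from dataclasses import dataclass


@dataclass
class Span:
    # Hypothetical stand-in for one OCR text span; not an OCRmyPDF type.
    text: str
    x: float  # left edge of the span
    y: float  # top edge of the span (larger means lower on the page)


def line_order(spans):
    """Order spans top to bottom, then left to right (line-by-line reading)."""
    return sorted(spans, key=lambda s: (s.y, s.x))


def cell_order(spans, column_boundary):
    """Order spans column by column: everything left of the boundary first,
    each group sorted top to bottom."""
    return sorted(spans, key=lambda s: (s.x >= column_boundary, s.y))


# Made-up coordinates for a two-column table with two lines per column.
spans = [
    Span("Cell 1, line 1", 50, 100),
    Span("Cell 2, line 1", 300, 100),
    Span("Cell 1, line 2", 50, 120),
    Span("Cell 2, line 2", 300, 120),
]

print([s.text for s in line_order(spans)])
print([s.text for s in cell_order(spans, column_boundary=200)])
```

The first ordering interleaves the two columns, which matches the span order described above. The second keeps each column's lines together, which is closer to what a screen reader user would expect.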

This results in accessibility problems for screen readers, as the content is not structured properly. Is there any way to resolve this issue and ensure the table is interpreted correctly by screen readers?
