你的位置：首页>programmer>c# - Extract all content using MUPDF.net - Stack Overflow

c# - Extract all content using MUPDF.net - Stack Overflow

programmeradmin2025-04-203浏览0评论

Is there a way to extract all content from mupdf? For example the following code using the GetText() method will extract all text in html format:

using MuPDF.NET

var document = new Document("path-to-doc.pdf")
for (int i = 0; i < document.PageCount; i++) {
           var htmlContent = page.GetText("html");
           
}

this will not necessairly include form fields, vector graphics e.t.c. How would i get all of these and their relative positions within the PDF?

与本文相关的文章

c# - Extract all content using MUPDF.net - Stack Overflow

评论列表(0)

暂无评论