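Hi! I am looking for a library in Python that would read PDF files, so that I could extract information from the PDF with it. I have searched with Google, but only found libraries that can be used to write PDF files. Any ideas? Peter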
> I am looking for a library in Python that would read PDF files and I could extract information from the PDF with it. I have searched with google, but only found libraries that can be used to write PDF files.
ReportLab has a library called PageCatcher; it is fully supported with Python, but it is not free. Harald
"Peter Galfi" <ga****@freestart.hu> wrote in message news:<ma**************************************@pyt hon>...
> I am looking for a library in Python that would read PDF files and I could extract information from the PDF with it. I have searched with google, but only found libraries that can be used to write PDF files. Any ideas?
I quickly searched back through Google, but I knew exactly what I was looking for: ;-) groups.google/groups?selm...ing.google The page referred to is here: www.boddie.uk/david/Proje...thon/pdftools/ The module is very much a "work in progress". You can probably get some text and bitmap images out of a few documents, but that's probably all you can expect unless you want to improve it (and submit patches). Good luck! David
In article <Xn**********************************@62.153.159.1 34>, Harald Massa <cp*********@spamgourmet> wrote:
> I am looking for a library in Python that would read PDF files and I could extract information from the PDF with it. I have searched with google, but only found libraries that can be used to write PDF files.
reportlab has a lib called pagecatcher; it is fully supported with python, it is not free. Harald
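ReportLab's libraries are great, but they don't extract information from PDF in the sense that I believe the original questioner intended. As Andreas suggested, he is probably best off using an existing stand-alone application as a separate process, under Python's control. -- Cameron Laird <cl****@phaseit> Business: www.Phaseit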
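The "separate process under Python's control" approach could look roughly like the sketch below. The thread does not name a specific application, so this assumes the `pdftotext` command-line tool (shipped with Xpdf and Poppler) is installed and on the PATH. The helper names `build_command` and `run_extractor` are made up for illustration.

```python
import subprocess
from typing import List


def build_command(pdf_path: str, tool: str = "pdftotext") -> List[str]:
    """Build the argument list for the external extractor.

    Assumption: the tool is pdftotext, where a trailing "-" means
    "write the extracted text to standard output instead of a file".
    """
    return [tool, pdf_path, "-"]


def run_extractor(command: List[str]) -> str:
    """Run an external program and return what it wrote to stdout.

    Raises subprocess.CalledProcessError if the program exits with a
    non-zero status, and FileNotFoundError if it is not installed.
    """
    completed = subprocess.run(
        command,
        stdout=subprocess.PIPE,
        stderr=subprocess.PIPE,
        check=True,
    )
    # Undecodable bytes are replaced rather than aborting the extraction.
    return completed.stdout.decode("utf-8", errors="replace")
```

With `pdftotext` available, `run_extractor(build_command("report.pdf"))` would return the text of a (hypothetical) `report.pdf` as a single string. How well the layout survives depends entirely on the external tool, not on Python.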