Does a library exist for programmatically removing passwords from PDFs?

Question:

Does a library exist that will remove "owner" passwords from PDF documents so that the text can then be programmatically extracted from them? Something like PDF Technologies' Password Recovery tool, but callable from the command line or from Python. A GUI interface is not really useful to me, since the number of documents is so large.

Please, no comments on the legality of the process. The PDFs in question are owned, and the text needs to be extracted in order to form keyword clouds for the document set.

I do not know of any Python libraries, but for batch removal of passwords from PDF documents, my colleagues have had good experience with PwdRemover (not free).
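If a free command-line route is acceptable, here is a minimal sketch of how the batch step could be driven from Python. It assumes the open-source qpdf tool is installed and on the PATH, and that the PDFs carry only an owner password (that is, they open without prompting). Neither assumption comes from the post above, and the helper names `qpdf_decrypt_command` and `decrypt_folder` are made up for illustration.

```python
import subprocess
from pathlib import Path


def qpdf_decrypt_command(src: Path, dst: Path) -> list[str]:
    """Build the qpdf command that writes an unencrypted copy of src to dst."""
    return ["qpdf", "--decrypt", str(src), str(dst)]


def decrypt_folder(in_dir: Path, out_dir: Path) -> list[Path]:
    """Run qpdf over every *.pdf in in_dir and return the files that failed.

    Assumption: qpdf is installed; subprocess.run raises FileNotFoundError
    if the executable cannot be found.
    """
    out_dir.mkdir(parents=True, exist_ok=True)
    failed = []
    for src in sorted(in_dir.glob("*.pdf")):
        cmd = qpdf_decrypt_command(src, out_dir / src.name)
        result = subprocess.run(cmd, capture_output=True, text=True)
        # qpdf exits with 0 on success and 3 on success with warnings.
        if result.returncode not in (0, 3):
            failed.append(src)
    return failed
```

Calling `decrypt_folder(Path("locked"), Path("unlocked"))` would then leave unencrypted copies in `unlocked/`, ready for whatever text extractor is used afterwards. Files that need a user password to open will end up in the returned list.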