如何使用Perl提取Word文档中的数据?

问题描述:

如何使用Perl从Word文档中提取数据?

How to extract the data from a word doc using Perl?

如果您不在Windows上,我认为最好的方法可能是先将其转换.

If you are not on Windows, I think the best route might be to convert it first.

如果您不使用Windows,并且无权访问 Win32 :: OLE ,则可以使用 OpenOffice进行转换文件.

If you are not using Windows and don't have access to Win32::OLE, you can use OpenOffice to convert the documents.

您可以将链接中的脚本包装到您的Perl程序中.尽管该链接以PDF开头(如果您继续阅读),则可以将其转换为文本.另请参见此stackoverflow帖子,内容涉及转换doc和docx文件.

You could wrap up the script in the link into your Perl program. Although the link starts with PDF if you read on it can convert it to text. Also see this stackoverflow post about converting doc and docx files.