使用Java以编程方式将Word文档转换为HTML
我需要将一个Word文档转换为Java中的HTML文件。该函数将输入一个单词文档,并且输出将是基于单词文档具有的页数的html文件,即如果单词文档具有3个页面,则将生成具有所需分页符的3个html文件。
I need to convert a Word document into HTML file(s) in Java. The function will take input an word document and the output will be html file(s) based on the number of pages the word document has i.e. if the word document has 3 pages then there will be 3 html files generated having the required page break.
我搜索了可以将doc转换为html但无结果的开源/非商业API。
I searched for open source/non-commercial APIs which can convert doc to html but for no result. Anybody who have done this type of job before please help.
谢谢
Thanks
我们使用tm-extractors( http://mvnrepository.com/artifact/org .textmining / tm-extractors ),并退回到商业Aspose( http://www.aspose的.com / )。两者都有本地Java API。
We use tm-extractors (http://mvnrepository.com/artifact/org.textmining/tm-extractors), and fall back to the commercial Aspose (http://www.aspose.com/). Both have native Java APIs.