阅读使用C#一个Word 2007表格
我在Word 2007中的编程比较新的。请原谅我,如果这个问题已经问。我想读一个字表及其子细胞,并提取出文字在C#(VSTO工具)。我想打造出来以后,从数据提取XML。
I am relatively new in Word 2007 programming. Pardon me if this question is already asked. I would like to read a word table and its child cells and extract that text out in C# (VSTO tools). I would like to build out an xml from the data extract later.
请指引我,如果有人做这种东西。真的很感激。
Please guide me if anyone has done something of this sort. Would really appreciate.
感谢您。除非在向后兼容模式下使用
安键
Thank you. Anjan
,Word 2007中产生的文档中的的Office Open XML格式为微软提供了一个.NET库
Unless used in backward compatibility mode, Word 2007 produces documents in the "Office Open XML Format" for which Microsoft provides an library in .NET
这的 MSDN文章提供了各种指针和片段,在C#中,如何做这样的事情。另外这款 Walkthough Word 2007的格式可能是有用的。
This MSDN article provides various pointers and snippet, in C#, on how to do this kind of things. Also this Walkthough Word 2007 format may be useful.
如果您需要访问旧的MS-Word中的格式,你可以使用或从的文本挖掘开源项目(JAVA)。
If you need to access older MS-Word formats, you may be able to use or inspire yourself from the text-mining open source project (java).