Word Doc to XML
How do you convert a Word Document into an XML document or XML format?
By Ryan at 2007-11-9 15:20:13

# 1 Re: Word Doc to XML
Why? A Word document is usually unstructured data, while an XML document is
structured. If your Word doc is structured, you will need to parse it and
map its parts to XML elements. If your Word doc is unstructured, you can
write a macro that selects the whole document and puts it into XML:
<WordDoc>
<Body>Your word doc data goes here ...</Body>
</WordDoc>
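
Both cases above can be sketched in Python rather than as a Word macro. This
is only an illustration, not the macro itself: it assumes the text (and, for
the structured case, each paragraph's style name) has already been extracted
from the Word file by some other means. The <Section>, <Title> and <Para>
element names and the "Heading 1" style check are hypothetical choices made
for the example.

```python
import xml.etree.ElementTree as ET


def wrap_unstructured(text):
    """Put the whole document text into a single <Body> element."""
    root = ET.Element("WordDoc")
    body = ET.SubElement(root, "Body")
    body.text = text  # ElementTree escapes &, < and > when serializing
    return ET.tostring(root, encoding="unicode")


def map_structured(paragraphs):
    """Map (style, text) pairs to nested elements.

    Assumption: a "Heading 1" paragraph starts a new <Section>; every
    other paragraph becomes a <Para> inside the current section.
    """
    root = ET.Element("WordDoc")
    section = None
    for style, text in paragraphs:
        if style == "Heading 1":
            section = ET.SubElement(root, "Section")
            ET.SubElement(section, "Title").text = text
        else:
            # Paragraphs that appear before any heading attach to the root.
            parent = section if section is not None else root
            ET.SubElement(parent, "Para").text = text
    return ET.tostring(root, encoding="unicode")


if __name__ == "__main__":
    print(wrap_unstructured("Your word doc data goes here ..."))
    print(map_structured([("Heading 1", "Intro"), ("Normal", "Hello")]))
```

The unstructured wrapper only makes the text well-formed XML. The structured
version is where the real mapping work lies, and its rules depend entirely on
how the Word document is styled.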
"Ryan" <rsmm1999@ev1.net> wrote:
>
>How do you convert a Word Document into an XML document or XML format?
markn at 2007-11-11 23:30:38

# 2 Re: Word Doc to XML
"Ryan" <rsmm1999@ev1.net> wrote
>
> How do you convert a Word Document into an XML document
> or XML format?
>
Microsoft Word Document?
Here are some related resources:
http://www.wvware.com/
http://www.wvware.com/wvInfo.html
http://sourceforge.net/projects/poi
http://poi.sourceforge.net/
http://api.openoffice.org/
http://www.gnu.org/gnulist/production/filter.html
http://xml.openoffice.org/
http://www.openoffice.org/
"sot" module
http://util.openoffice.org/
http://sw.openoffice.org/
AbiWord
http://sourceforge.net/projects/abiword
http://www.koffice.org/kword/
http://freshmeat.net/projects/libole2/
Corel Office for Java (do they still offer it?)
http://www.corel.com/

# 3 Re: Word Doc to XML
"Ryan" <rsmm1999@ev1.net> wrote:
>
>How do you convert a Word Document into an XML document or XML format?
Please have a look at the following MSDN article, which describes how to
convert a Word document to XML using Microsoft Word 2000 or XP:
http://msdn.microsoft.com/library/default.asp?url=/library/en-us/dnword2k/html/odc_expwordtoxml.asp
Frank

# 4 Re: Word Doc to XML
"Ryan" <rsmm1999@ev1.net> wrote:
>
>How do you convert a Word Document into an XML document or XML format?
We have developed a tool (a Word template/macro) that adds a "Save as XML"
button to the Word toolbar.
It does not work for every Word document (because most Word documents are
quite unstructured and really not worth putting into XML). The document must:
* Be structured using the Information Mapping methodology
* Be produced using our Formatting Solutions software
That said, it produces very versatile and powerful XML.
Take a look at http://www.imap.dk/FSXMLViewer/ to see what the output looks
like when formatted with various XSL stylesheets. (You MUST use IE5+ to see
the example.)
Steffen

# 5 Re: Word Doc to XML
Ryan,
Our company has developed an application that converts Word documents to
pure XML, with properly nested tags and structure. The program comes with a
standard template and XSL, but we can create a custom template and XSL to
suit your specific needs.
Let me know if you are interested. The program is free with the standard
template and XSL.
Kurt
DocSoft LLC
"Ryan" <rsmm1999@ev1.net> wrote:
>
>How do you convert a Word Document into an XML document or XML format?