This discovery just made my job easier:
Converting a .doc to an .html file inside Word results in a rat’s nest of MS-brand HTML and CSS. A quick way to get a clean version: Send the file to your Gmail account, then hit “View as HTML” and then view>page source in your browser’s toolbar.
I'm not sure what version you have but if you do a Save As and then select "Web page - Filtered", it should clean it up quite a bit as well. But there are still quite a few tags left but they are not MS Office specific.
Good god, stay as far away from MS Office html as possible. It inserts OCEANS of garbage. I don't care if it's compliant or not, it's grossly unnecessary bloat. This is a useful tip.
Or, switch to OpenOffice.org or GoogleDocs and don't worry about dealing with Microsoft crap at all. And yes, they do save as .doc files so you can send them to people who won't give up MS Word.
OpenOffice cannot handle DOC very well. It constantly screws up the formatting of anything more complex then a bulleted list. If you're using it solely amongst other OpenOffice users, great. But I would never send a DOC created in OO to somebody who may not have it. Export to PDF would be better.
Comments
I'm not sure what version you have but if you do a Save As and then select "Web page - Filtered", it should clean it up quite a bit as well. But there are still quite a few tags left but they are not MS Office specific.
AWESOME! thank you!
Good god, stay as far away from MS Office html as possible. It inserts OCEANS of garbage. I don't care if it's compliant or not, it's grossly unnecessary bloat. This is a useful tip.
Or http://lifehacker.com/384560/convert-word-documents-to-cruft+free-html
Have you heard of Markdown, Jimmy? Do you like baseball?
Or, switch to OpenOffice.org or GoogleDocs and don't worry about dealing with Microsoft crap at all. And yes, they do save as .doc files so you can send them to people who won't give up MS Word.
I agree with OpenOffice comment.
And it's not even good HTML if you let MSFT do it ... I mean, do I need to keep specifying the font for every line? No, and neither does the HTML.
Not just every line -- if you change it and then change it back, it will often keep ALL THREE sets of tags. It really is amazingly awful.
OpenOffice cannot handle DOC very well. It constantly screws up the formatting of anything more complex then a bulleted list. If you're using it solely amongst other OpenOffice users, great. But I would never send a DOC created in OO to somebody who may not have it. Export to PDF would be better.
OMG you are so cool thank you! For real!
I've had really good luck with this website:
http://www.zamzar.com/
Comments Closed
In order to combat spam, we are no longer accepting comments on this post (or any post more than 14 days old).