Blog

Docx HTMLizer: Convert docx files into clean, simple and well formatted HTML code

Docx HTMLizer: Convert docx files into clean, simple and well formatted HTML code

I published in my GitHub the source code and a release of a tiny App that I developed mainly for two reasons: first of all because I didn't want to become obsolete on .NET stack, the second one is that I needed a tool to clean docx files when sometimes I received a terrible and monstrous docx file plenty of styles and creativity to publish on web.

Generated code is clean, simple and well formatted and you can save the entire file or copy any selection to clipboard.

This App is not possible without the great Mammoth library from Michael Williamson, available on many languages like Python, JavaScript, Java..., and without the great work of AngleSharp for it's powerfull way to work with DOM and define beauty ways to format a HTML document.

Source code and installer are available in my GitHub:

https://github.com/antoniobarranco/Docx-HTMLizer

I wish it can be useful. I will try to work on it and make improvements in future.