If you don't need formatting information, images, or any of the other exotic extras, the task is much simpler: about 5 to 10 lines of code in total.
- A DOCX file is a zip archive. Unpack it and find "document.xml" inside. ZipInputStream will do for that. (Try opening a docx with a zip tool!)
- Parse that file with SAX and collect the text of the body/p/r/t nodes - voila, done!
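
The two steps above could be sketched roughly as follows. This is only an illustration under a few assumptions, not a finished implementation: the class name `DocxText` and the method `extractText` are made up for the example, the main part is assumed to sit at `word/document.xml` (where Word normally puts it), and the text nodes are assumed to use the usual WordprocessingML namespace. With the boilerplate included it runs a bit longer than 10 lines.

```java
import java.io.InputStream;
import java.util.zip.ZipEntry;
import java.util.zip.ZipInputStream;
import javax.xml.parsers.SAXParserFactory;
import org.xml.sax.Attributes;
import org.xml.sax.helpers.DefaultHandler;

// Hypothetical helper class, named for this example only.
public class DocxText {
    // Assumption: the document uses the transitional WordprocessingML namespace.
    private static final String W_NS =
            "http://schemas.openxmlformats.org/wordprocessingml/2006/main";

    /** Returns the plain text of a DOCX stream, one line per paragraph. */
    public static String extractText(InputStream docx) throws Exception {
        StringBuilder out = new StringBuilder();
        try (ZipInputStream zip = new ZipInputStream(docx)) {
            ZipEntry entry;
            // Step 1: walk the zip entries until the main document part is found.
            while ((entry = zip.getNextEntry()) != null) {
                // Assumption: the main part is stored as word/document.xml.
                if (!entry.getName().equals("word/document.xml")) {
                    continue;
                }
                SAXParserFactory factory = SAXParserFactory.newInstance();
                factory.setNamespaceAware(true);
                // Step 2: stream the XML through SAX and keep only the text of t nodes.
                factory.newSAXParser().parse(zip, new DefaultHandler() {
                    private boolean inText;

                    @Override
                    public void startElement(String uri, String local, String qName, Attributes atts) {
                        if (W_NS.equals(uri) && local.equals("t")) {
                            inText = true;
                        }
                    }

                    @Override
                    public void characters(char[] ch, int start, int length) {
                        if (inText) {
                            out.append(ch, start, length);
                        }
                    }

                    @Override
                    public void endElement(String uri, String local, String qName) {
                        if (!W_NS.equals(uri)) {
                            return;
                        }
                        if (local.equals("t")) {
                            inText = false;
                        } else if (local.equals("p")) {
                            out.append('\n'); // end of paragraph
                        }
                    }
                });
                break; // nothing else in the archive is needed
            }
        }
        return out.toString();
    }
}
```

It could then be called as `DocxText.extractText(new FileInputStream("report.docx"))`, where `report.docx` is just a placeholder file name. SAX fits here because the document is read once, front to back, and nothing but the text runs needs to be kept in memory.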