Is there a .NET library / utility that converts a Word document to MP3?

Does anyone know of well-supported / proven methods for converting a Microsoft Word document to MP3 or WAV format so that people with hearing impairments can “listen” to documents that I saved in my document management system on the website

I already have an interface built in such a way that someone can use the phone to get a list of available documents, with dates and headings that they “read” on the phone, but now I would like someone to really listen to the content Word files stored on the system.

Ideally, a library or .NET utility that would allow me to convert DOC → MP3 after each download would be better, but the one that “read” the file on demand would also be fine.

+4
source share
5 answers

If your Word document is stored in the new OpenXML format introduced in Office 2007, you may need to look at the Daisy toolchain.

SourceForge has a free converter for creating a Daisy file from your Word document. The Daisy format is understood by screen readers, and converters are also available to convert the Daisy file to a sound file.

For more details see:

+2
source

You can use System.Speech.Recognition to configure your TTS engine and System.Speech.Synthesis to write / save (saves to .WAV). In System.Speech.AudioFormat you can configure the output parameters of the WAV file. For Word, you can simply use COM Interop to capture the file, open it and receive it in any form that you need, and submit it to a managed SAPI. I would probably divide things into paragraphs / runs, tables, links, and others.

If you are looking for a much more reliable technology, one of the most famous TTS engines is the NaturallySpeaking Server SDK - they have recently added an Audio Stream feature that allows you to use TTS on the fly, so there’s no need to pre-record to a permanent audio file (but if you want it, it can also be pre-recorded). You can also check out their AudioMining SDK , which allows you to fully index and find spoken voice in audio files. Their SDKs are all COM, but not .NET.

+4
source

Getting text from Word is the easy part. There are some tts libraries, but goods are usually expensive. Acapela ( http://www.acapela-group.com ) is probably the best I've found, and the API supports output to audio files. I'm not sure MP3 is supported, but audio format conversion is trivial.

Good luck.

+2
source

You can try Panopreter , it has a free and professional version. I used it before with pretty good success.

I'm not sure that the free version can be saved directly to mp3, but you can always use another program to write output to mp3.

+1
source

I would use the mintext version of the command line to first convert the .doc file to text, and then save that text in your database (at boot, maybe?)

Then use any text library to convert and save this file, possibly with the link identifier that links this file to the database.

+1
source

Source: https://habr.com/ru/post/1312475/


All Articles