Why does this reduce the sound quality?

Question

Why does this reduce the sound quality?

I transmit audio in my flash application from client to server, but the received sound is of very poor quality.

On the client, I pre-process the audio buffer as follows:

this.node.onaudioprocess = function(e){ var buf = e.inputBuffer.getChannelData(0); var out = new Int16Array(buf.length); for (var i = 0; i < buf.length; i++){ var s = Math.max(-1, Math.min(1, buf[i])); out[i] = s < 0 ? s * 0x8000 : s * 0x7FFF; } socket.emit('audio event',{data: out}) return; }

On the server side, I get the sound as follows:

 audio_file = open('tempfile.raw', 'w') @socketio.on('audio event') def audio_message(message): dat = [v[1] for v in sorted(message['data'].iteritems())] n = len(dat) byteval = struct.pack('<'+str(n)+'h',*dat) audio_file.write(byteval)

But the resulting sound sounds metallic, interrupted and noisy. Here's what the resulting waveform looks like:

Where in my code is sound quality lost? How can I transmit audio without losing quality?

+5

javascript flask audio pack flask-socketio

user2212461 Sep 2 '16 at 19:05

source share

1 answer

Miguel · Accepted Answer · 2016-09-02T21:57:02+0000

My first impression of how you process the sound is that it works too slowly in real time.

On the client, you iterate over each individual sample, apply border checking (do you really need to do this?), Then convert from float32 to int16 format using a conditional expression and multiplication by each individual sample.

Then, on the server side, you do another cycle through each sample, only to get the samples into the list (is this not the data already arriving to you in the form of a list?). And only then you pack this list into a binary array, which is written to disk.

It is a lot of work to just write a buffer, you are probably losing data.

Here's what I recommend you try: delete all conversions and see if you can get the data passing through the system in native float32 format. With socket.io, you can send float32 data packaged directly from the client. Did not check this, but I believe that socket.emit('audio event',{data: buf.buffer}) will receive a binary payload sent directly and without conversion on the client side. Then on the server, message['data'] will be a binary payload that you can write directly to disk. To check if the data looks good, you can use the courage using the 32-bit float option in the Import Raw dialog box.

Once you upload the raw data of float32, if you need data in a different format, you can see if the addition adds conversions (I hope only in one place) to maintain real-time exposure. I suspect that you may need to code this conversion in C / C ++, since Python is too slow for this type of thing. If you go this route, searching in Cython might be a good idea.

Why does this reduce the sound quality?

More articles: