Python struct.unpack not working

Question

Python struct.unpack not working

I am trying to run this:

def ReadWord(fid,fmt,Addr): fid.seek(Addr) s = fid.readline(2) s = unpack(fmt + 'h', s) if(type(s) == tuple): return s[0] else: return s

with:

 len(s) = 2 len(fmt) = 1 calcsize(fmt) = 0 calcsize(fmt + 'h') = 2

However, Python returns:

struct.error: unpack requires a string argument of length 4

According to python struct.unpack documentation :

The string must contain exactly the amount of data required by the format (len (string) must equal calcsize (fmt)).

So, if my string length is 2, and calcsize from fmt+'h' also 2, why does python say "unpack requires a string argument of length 4 " ??

EDIT:

Thanks for all your answers. Here is the complete code:

http://qtwork.tudelft.nl/gitdata/users/guen/qtlabanalysis/analysis_modules/general/lecroy.py

So, as you can see in the read_timetrace function, fmt set to '<' or '>' in the if...else . Printing of this document confirms this.

But you should also know that I am working on windowsx64 (for work).

EDIT2

Here's the full trace, sorry for the error.

 Traceback (most recent call last): File "C:\Users\maxime.vast\Desktop\Test Campaign Template\Test Suite\Include\readLecroyTRCFile.py", line 139, in <module> read_timetrace("C:\Users\maxime.vast\Desktop\Test Campaign Template\Test Suite\Traces\KL.ES.001.001.trc") File "C:\Users\maxime.vast\Desktop\Test Campaign Template\Test Suite\Include\readLecroyTRCFile.py", line 60, in read_timetrace WAVE_ARRAY_1 = ReadLong(fid, fmt, aWAVE_ARRAY_1) File "C:\Users\maxime.vast\Desktop\Test Campaign Template\Test Suite\Include\readLecroyTRCFile.py", line 100, in ReadLong s = unpack(fmt + 'l', s) struct.error: unpack requires a string argument of length 4 [Finished in 0.2s]

EDIT3:

I replaced readline with read and add:

 print "len(s) ", len(s) print "len(fmt) ", len(fmt) print "calcsize(fmt) ", calcsize(fmt) print "calcsize(fmt + 'h') ", calcsize(fmt + 'h') print "fmt ", fmt

to ReadLong .

Here's the new trace:

 len(s) 4 len(fmt) 1 calcsize(fmt) 0 calcsize(fmt + 'h') 2 fmt < len(s) 4 len(fmt) 1 calcsize(fmt) 0 calcsize(fmt + 'h') 2 fmt < len(s) 4 len(fmt) 1 calcsize(fmt) 0 calcsize(fmt + 'h') 2 fmt < len(s) 1 len(fmt) 1 calcsize(fmt) 0 calcsize(fmt + 'h') 2 fmt < Traceback (most recent call last): File "C:\Users\maxime.vast\Desktop\Test Campaign Template\Test Suite\Include\readLecroyTRCFile.py", line 143, in <module> read_timetrace("C:\Users\maxime.vast\Desktop\Test Campaign Template\Test Suite\Traces\KL.ES.001.001.trc") File "C:\Users\maxime.vast\Desktop\Test Campaign Template\Test Suite\Include\readLecroyTRCFile.py", line 60, in read_timetrace WAVE_ARRAY_1 = ReadLong(fid, fmt, aWAVE_ARRAY_1) File "C:\Users\maxime.vast\Desktop\Test Campaign Template\Test Suite\Include\readLecroyTRCFile.py", line 104, in ReadLong s = unpack(fmt + 'l', s) struct.error: unpack requires a string argument of length 4 [Finished in 0.2s]

+5

python

Maxime VAST Sep 11 '15 at 9:05

source share

3 answers

The length of the format does not matter on its own. What matters is what formats you specify there. For example, there are format specifications that define one byte or even eight bytes. So it really depends on the format, how many characters should be in s .

For instance:

 >>> struct.unpack('b', 'A') (65,) >>> struct.unpack('L', 'A') Traceback (most recent call last): File "<pyshell#3>", line 1, in <module> struct.unpack('L', 'A') error: unpack requires a string argument of length 4 >>> struct.unpack('L', 'AAAA') (1094795585,)

If fmt really > as you say, then it should work fine:

 >>> struct.unpack('>h', 'AA') (16705,)

So, I assume that when the fmt error fmt not just > but something else that will consume an additional 2 bytes. Try printing fmt to unpack .

+3

poke Sep 11 '15 at 9:15

source share

Like len(fmt) = 1, this means that fmt matters. If fmt = 'h', then fmt+'h' will be "hh". Therefore, unpack () will expect 4 bytes of data, since each "h" requires a short integer (2 bytes).

0

acw1668 Sep 11 '15 at 9:18

source share

PM 2Ring · Accepted Answer · 2015-09-11T09:30:10+0000

FWIW, you should use read(2) , not readline(2) . And if the fmt line is really equal to '>' , you should not get this error. Here is a short demo that works as expected.

 from struct import unpack fname = 'qbytes' #Create a file of all byte values with open(fname, 'wb') as f: f.write(bytearray(range(256))) def ReadWord(fid, fmt, addr): fid.seek(addr) s = fid.read(2) s = unpack(fmt + 'h', s) return s[0] fid = open(fname, 'rb') for i in range(16): addr = i n = 256*i + i+1 #Interpret file data as big-endian print i, ReadWord(fid, '>', addr), n fid.close()

Output

 0 1 1 1 258 258 2 515 515 3 772 772 4 1029 1029 5 1286 1286 6 1543 1543 7 1800 1800 8 2057 2057 9 2314 2314 10 2571 2571 11 2828 2828 12 3085 3085 13 3342 3342 14 3599 3599 15 3856 3856

BTW, struct.unpack() always returns a tuple, even if the return value is a single element.

Using readline(2) in a binary may give unexpected results. In my test file in the above code there is (in Linux style) a new line \xa0 in the file. Therefore, if you change s = fid.read(2) to s = fid.readline(2) , everything works fine at first, but on line 10 it crashes because it only reads one byte because of this new line char:

 from struct import unpack fname = 'qbytes' #Create a file of all byte values with open(fname, 'wb') as f: f.write(bytearray(range(256))) def ReadWord(fid, fmt, addr): fid.seek(addr) s = fid.readline(2) print repr(s), s = unpack(fmt + 'h', s) return s[0] with open(fname, 'rb') as fid: for i in range(16): addr = i n = 256*i + i+1 #Interpret file data as big-endian print i, ReadWord(fid, '>', addr), n

Output

 0 '\x00\x01' 1 1 1 '\x01\x02' 258 258 2 '\x02\x03' 515 515 3 '\x03\x04' 772 772 4 '\x04\x05' 1029 1029 5 '\x05\x06' 1286 1286 6 '\x06\x07' 1543 1543 7 '\x07\x08' 1800 1800 8 '\x08\t' 2057 2057 9 '\t\n' 2314 2314 10 '\n' Traceback (most recent call last): File "./qtest.py", line 30, in <module> print i, ReadWord(fid, '>', addr), n File "./qtest.py", line 22, in ReadWord s = unpack(fmt + 'h', s) struct.error: unpack requires a string argument of length 2

P.S.

You have several functions in your code that almost do the same. This violates the DRY principle: do not repeat yourself. Here is one way to fix this using the partial function application. For more information, see functools docs .

 from functools import partial def ReadNumber(fid, datalen=1, fmt='>', conv='b', addr=0): fid.seek(addr) s = fid.read(datalen) if len(s) != datalen: raise IOError('Read %d bytes but expected %d at %d' % (len(s), datalen, addr)) return unpack(fmt+conv, s)[0] ReadByte = partial(ReadNumber, datalen=1, conv='b') ReadWord = partial(ReadNumber, datalen=2, conv='h') ReadLong = partial(ReadNumber, datalen=4, conv='l') ReadFloat = partial(ReadNumber, datalen=4, conv='f') ReadDouble = partial(ReadNumber, datalen=8, conv='d')

To call these new functions you need to use keywords. For instance,

 ReadLong(fid, fmt='>', addr=addr)

True, this is a little longer, but it makes the code a little more readable.

Python struct.unpack not working

P.S.

More articles: