1) extract from a large zip file
I want to extract files from a large zip file (30Gb +) on a linux server. Enough free disk space.
I tried jar xf dataset.zip . However, there is an error that push button is full , and it was not possible to extract all the files.
I tried unzip , but the zipfile is corrupted.
Archive: dataset.zip warning [dataset.zip]: 35141564204 extra bytes at beginning or within zipfile (attempting to process anyway) error [dataset.zip]: start of central directory not found; zipfile corrupt. (please check that you have transferred or created the zipfile in the appropriate BINARY mode and that you have compiled UnZip properly)
I tried zip -FF dataset.zip --out data.zip and there the error is too big:
zip error: Entry too big to split, read, or write (Poor compression resulted in unexpectedly large entry - try -fz)
Anyway, can I efficiently extract files from a really big zip file?
2) extract specific files from a large zip file
If I need only some files from this large zip file, can I still extract only these files? For example, data1.txt from dataset.zip? It seems that I cannot use the zip or unzip command (there is always a problem with zipfile corruption).
Thanks!
linux extract large-files zip
Irene W.
source share