Another thing is to look at some of the common compression methods, especially zip and gzip, and find out their "signatures." Most of these formats are "self-defined", so when they start to unpack, they can quickly check that what they are working on is in a format that they understand.
The prohibition of encryption, the file format of the archive - this is basically some kind of indexing mechanism (directory or sorting) and a way to find these elements from the archive through pointers in the index.
With the ubiquity of standard compression algorithms, it is basically a matter of searching where these blocks begin, and trying to find an index or table of contents.
Some will have an index in just one place (for example, the file system), others will simply precede each element in the archive with their identification information. But in the end, there is information about offsets from one block to another, there is information about data types (for example, if they store GIF files, GIFs also have a signature), etc.
Here are the patterns you are trying to track down in a file.
It would be nice if you could somehow get your hand on two versions of the data in the same format. For example, in the game you can get the original version from the CD and a newer fixed version. They can really highlight the information you are looking for.
source share