I heard a discussion about how OpenOffice (ODF) files are compressed zip files for XML and other data. Therefore, making minor changes to the file can potentially completely change the data, so delta compression does not work in version control systems.
I did some basic testing in an OpenOffice file, unzipping it and then rearranging it with zero compression. I used the zip Linux utility for testing. OpenOffice will happily open it anyway.
So I'm wondering if it's worth developing a small utility to work in ODF files every time before I start using version control. Any thoughts on this idea? Possible alternatives?
Secondly, what would be a good and reliable way to implement this small utility? Bash shell that calls zip (maybe only Linux)? Python Can you think of any problems? Obviously, I don't want to accidentally distort the file, and there are several ways that can happen.
Possible errors that I can think of:
- Not enough disk space
- Some other permissions issues that prevent writing a file or temporary files
- The ODF document is encrypted (perhaps it should just leave it alone, which probably also leads to large file changes and thus prevents effective delta compression)
Craig McQueen Jun 10 '09 at 12:01 2009-06-10 12:01
source share