Compression will never give guaranteed compression ratios. The best you can get is the average ratio for sample data.
So, download the sample data, insert it into the test instance and measure the disk usage.
You may have data that is very badly compressed with Snappy and actually leads to more disk usage than storing raw bytes.
When it comes to compressing your data, there is one and only one rule: MEASURE
source share