DCT Compression - block size, selection of coefficients

Question

DCT Compression - block size, selection of coefficients

I am trying to understand the effect of block size and the best DCT compression coefficient selection strategy. Basically I want to ask what I wrote here:

Video compression: what is discrete cosine transform?

Let's look at the most primitive compression. Creating an image block. Running a DCT on every blog and zeroing out some odds.

As far as I understand, the smaller the block, the better. Smaller blocks mean that the pixels are more correlated, so the energy in the DCT spectrum is more "compact." This should be more emphasized in fast-changing images (High Frequency).

Let's say we zero out a certain percentage of coefficients, which will lead to better image quality, small or large blocks? Say we save 10%, 25%, 50%, 75%, would you say that this is a different answer for a different percentage?

Another problem is how to choose the odds you leave untouched. Suppose I have to make a decision based on location, not energy. Will you take the square from the upper left? I averaged many blocks in the DCT spectrum and came to the conclusion that it is best to take a triangle from the upper left corner. What do you think?

Hope we have an effective discussion.

+4

image-processing compression signal-processing dct

Royi Dec 23 '09 at 10:26

source share

1 answer

metal · Accepted Answer · 2009-12-24T15:56:17+0000

The essence of your question seems to be regarding image quality. There was considerable literature on this subject, and the result is that image quality is difficult to determine.

Standard mathematical errors, such as signal-to-noise ratio (SNR) and standard error (MSE), can give a quantitative answer, but it is well known that they do not correlate well with the subjective opinions of the viewer, which should be our final authority. No other methods, even based on the psycho-visual models of the viewer (for example, S. A. Karunaseker and N. G. Kingsbury, “A measure of distortion to block artifacts in images based on human visual sensitivity,” IEEE Trans. On Image Proc., Volume 4 , No. 6, June 1995, pp. 713-724, and M. Miyahara, K. Kotani and V. R. Algazi, "Objective picture quality scale (PQS) for image encoding", IEEE Trans. On Comm., Volume 46, No. 9, September 1998, pp. 1215-1226), turned out to be better than SNR.

In addition, when you change the type of image (line drawing, cartoon, photograph, portrait, etc.), some types of compression distortion become more apparent. Mosquito noise may be undesirable in one image, while staircase noise may be the culprit in another.

In short, there is no answer to your question, "what can lead to better image quality?"

Speaking of which, we can say some things about DCT that matter. The pixels in the DCT block go from a low deviation to a large change in the zigzag pattern from the upper left corner [(0,0) → (0,1) → (1,0) → (2, 0) → (1,1) → (0 , 2) → etc.], since your choice of triangle mirrors. The closer the pixel is to the upper left corner, the smoother the information contained in it [in fact, the (0,0) DCT value is the average value for the entire block], and the further you get from this angle, the more “high-frequency” details you will get . The closer to the top and left of the image, the more horizontal and vertical details you will represent with this DCT coefficient, and the closer to the block diagonal, the more diagonal details you will have.

In short, lossy compression usually entails discarding some “parts” that may not be perceived by the eye. (Dropping "smoother" DCT values causes severe distortion.) The more DCT values you throw, the greater the compression ratio, but also the greater the distortion that you cause.

As for the block size, it all depends. The more differences and details in a block, the more you lose by throwing out the odds. Some compression algorithms adaptively use different block sizes in the same image, so that highly detailed areas get more and less blocks, and smooth areas get fewer and more blocks.

For algorithms that use the same block size, 8x8, 16x16, and 32x32 are common to things like JPEG and MPEG. The processing required to compress them will be smaller than the size of the adaptive block, but overall the quality will also be lower.

DCT Compression - block size, selection of coefficients

More articles: