How to speed up complex image processing?

Question

How to speed up complex image processing?

Each user will be able to upload 100 TIFF (black and white) images.

The process requires:

Convert tif to jpg .
Resize image to xx.
Crop image to 200 pixels.
Add a text watermark.

Here is my PHP code:

 move_uploaded_file($image_temp,$destination_folder.$image_name); $image_name_only = strtolower($image_info["filename"]); $name=$destination_folder.$image_name_only.".jpg"; $thumb=$destination_folder."thumb_".$image_name_only.".jpg"; $exec = '"C:\Program Files\ImageMagick-6.9.0-Q16\convert.exe" '.$destination_folder.$image_name. ' '.$name.' 2>&1'; exec($exec, $exec_output, $exec_retval); $exec = '"C:\Program Files\ImageMagick-6.9.0-Q16\convert.exe" '.$name. ' -resize 1024x '.$name; exec($exec, $exec_output, $exec_retval); $exec = '"C:\Program Files\ImageMagick-6.9.0-Q16\convert.exe" '.$name. ' -thumbnail 200x200! '.$thumb; exec($exec, $exec_output, $exec_retval); $exec = '"C:\Program Files\ImageMagick-6.9.0-Q16\convert.exe" '.$name. " -background White label:ش.پ12355 -append ".$name; exec($exec, $exec_output, $exec_retval);

This code works. But the average processing time for each image is 1 second. Thus, for 100 images, it will probably take about 100 seconds.

How to speed up this process (convert, resize, crop, watermark)?

EDIT

I have a G8 server: Ram: 32G, CPU: Intel Xeon E5-2650 (4 processes)
version: ImageMagick 6.9.0-3 Q16 x64
FEATURES: OpenMP

 convert logo: -resize 500% -bench 10 1.png Performance[1]: 10i 0.770ips 1.000e 28.735u 0:12.992 Performance[2]: 10i 0.893ips 0.537e 26.848u 0:11.198 Performance[3]: 10i 0.851ips 0.525e 27.285u 0:11.756 Performance[4]: 10i 0.914ips 0.543e 26.489u 0:10.941 Performance[5]: 10i 0.967ips 0.557e 25.803u 0:10.341 Performance[6]: 10i 0.797ips 0.509e 27.737u 0:12.554 Performance[7]: 10i 0.963ips 0.556e 25.912u 0:10.389 Performance[8]: 10i 0.863ips 0.529e 26.707u 0:11.586

Resource Limitations:

Width: 100 MP; Height: 100 megapixels; Area: 17.16 GP; Memory: 7.9908GiB; Map: 15.982GiB; Disk: unlimited; File: 1536; Subject: 8; Throttle: 0; Time: unlimited

+5

php image imagemagick converter imagemagick-convert

user4397664 Feb 24 '15 at 19:51

source share

4 answers

, @KurtPfeifle , , , , .

, , , , , Kurt's...

, imput Kurt, 3000x2000 , , , . 42 , - 36 , , .

GNU Parallel - , Xeon. ...

 time for i in $(seq -w 1 100); do cat <<EOF convert image.tiff \ -respect-parentheses \ +write mpr:XY \ \( mpr:XY -resize 1024x \ +write image-1024-pipel-run-${i}.jpg \) \ \( mpr:XY -thumbnail 200x200 \ +write image-thumb-pipel-run-${i}.jpg \) \ \( mpr:XY -background white label:12345 -append \ +write image-labelled-pipel-run-${i}.jpg \) \ null: echo "DONE: run pipeline $i ..." EOF done | parallel

, , , , stdout GNU Parallel. , 10 .

ffmpeg , , - .

 #!/bin/bash for i in $(seq -w 1 100); do echo ffmpeg -y -loglevel panic -i image.tif ff-$i.jpg echo ffmpeg -y -loglevel panic -i image.tif -vf scale=1024:682 ff-$i-1024.jpg echo ffmpeg -y -loglevel panic -i image.tif -vf scale=200:200 ff-$i-200.jpg done | parallel

7 iMac image.tif 3000x2000.

, libturbo-jpeg ImageMagick homebrew .

+4

Mark setchell Feb 25 '15 at 10:48

source share

, GraphicsMagick ( , ImageMagick) , ImageMagick.

, , . .

4 gm . 4 convert , . :

 time for i in $(seq -w 1 100); do gm convert image.tiff gm-${i}-image.jpg gm convert gm-${i}-image.jpg -resize 1024x gm-${i}-image-1024.jpg gm convert gm-${i}-image-1024.jpg -thumbnail 200x200 gm-${i}-image-thumb.jpg gm convert -background white \ gm-${i}-image-1024.jpg label:12345 -append gm-${i}-image-labelled.jpg echo "GraphicsMagick run no. $i ..." done

:

 real 1m4.225s user 0m51.577s sys 0m8.247s

: Q8 GraphicsMagick ( 1.3.20 2014-08-16 Q8 ) 64 , Q16 ImageMagick ( 6.9.0-0 Q16 x86_64 2014-12-06 ), 50 100 .

_{, .}

_{: ? ? .., . - . , , : !)}

+3

Kurt pfeifle Feb 26 '15 at 2:19

source share

vips . :

 #!/bin/bash for file in $*; do convert $file \ -respect-parentheses \ \( -clone 0 -resize 200x200 \ +write $file-thumb.jpg \) \ \( -clone 0 -resize 1024x \ -background white label:12345 -append \ +write $file-labelled.jpg \) \ null: done

vips, Python:

 #!/usr/bin/python import sys from gi.repository import Vips for filename in sys.argv[1:]: im = Vips.Image.new_from_file(filename, access = Vips.Access.SEQUENTIAL) im = im.resize(1024.0 / im.width) mem = Vips.Image.new_memory() im.write(mem) thumb = mem.resize(200.0 / mem.width) thumb.write_to_file(filename + "-thumb.jpg") txt = Vips.Image.text("12345", dpi = 72) footer = txt.embed(10, 10, mem.width, txt.height + 20) mem = mem.join(footer, "vertical") mem.write_to_file(filename + "-labelled.jpg")

100 3000 x 2000 RGB tiff- IM 6.8.9-9 vips 8.0, libjpeg-turbo, :

 $ time ../im-bench.sh * real 0m32.033s user 1m40.416s sys 0m3.316s $ time ../vips-bench.py * real 0m22.559s user 1m8.128s sys 0m1.304s

+1

user894763 Jun 08 '15 at 16:38

source share

Kurt pfeifle · Accepted Answer · 2015-02-24T23:50:42+0000

0. Two approaches

In principle, this problem can be solved in two different ways or by a combination of two:

Build your teams as smart as possible.
Speeding up sales for quality loss.

The next two sections discuss both approaches.

1. Check who has ImageMagick: “Q8”, “Q16”, “Q32” or “Q64”?

First check your exact version of ImageMagick and run:

 convert -version

If your ImageMagick has Q16 (or even Q32 or Q64 , which is possible, but overkill!) In the version line: This means that all internal ImageMagick functions treat all images as having 16-bit (or 32 or 64-bit) channel depths . This gives you the best quality image processing. But it also requires dual memory compared to the Q8 . Therefore, at the same time, this means poor performance.

Therefore: you can check what benefits you will achieve by switching to Q8 -build. ( Q is the "quantum depth" symbol supported by the ImageMagick layout.)

Nevertheless, you pay for possible loss of Q8 -performance with loss of quality. Just check what speed you achieve with Q8 over Q16 , and what quality losses you suffer. Then decide whether you can live with disabilities or not ...

In any case, Q16 will use twice as much RAM for each image for processing, and Q32 will again use twice as much Q16 . This is independent of the actual bits on the pixels that are visible in the input files. When saved, 16-bit image files will also consume more disk space than 8-bit ones.

If Q16 or Q32 requires more memory, you always need to make sure that you have enough. Because exceeding your physical memory would be very bad news. If a larger Q does the swap to disk process, performance will drop. A 1074 x 768 pixel image ( width x height ) will require the following amounts of virtual memory depending on the quantum depth:

 Quantum Virtual Memory Depth (consumed by 1 image 1024x768) ------- ------------------------------ 8 3.840 kiB (=~ 3,75 MiB) 16 7.680 kiB (=~ 7,50 MiB) 32 15.360 kiB (=~ 14,00 MiB)

Also keep in mind that some “optimized” processing pipelines (see below) must contain multiple copies of the image in virtual memory! When virtual memory cannot be satisfied with available RAM, the system will begin to replace and will require "memory" from the disk. In this case, all the smart optimization of the command pipeline, of course, is gone, and begins to move on to the very opposite.

ImageMagick's birthday was in aera, when processors could only process 1 bit at a time. That was decades ago. Since then, the processor architecture has changed a lot. 16-bit operations performed twice as much as 8-bit operations or even longer. Then came the 16-bit processors. 16-bit operators have become standard. CPUs have been optimized for 16-bit: Suddenly, some 8-bit operations may take even more than 16-bit equivalents.

Currently 64-bit processors are shared. Therefore, the argument of Q8 vs. Q16 vs. Q32 in real conditions may even be invalid. Who knows? I do not know about any serious benchmarks about this. It would be interesting if someone (with really deep know-how about processors and about testing real programs) would one day work with such a project.

_{Yes, I see that you are using Q16 on Windows.} _{But I still wanted to mention this, for completeness ... In the future, other users reading this question and answers will be read.}

_{Most likely, since your input TIFFs are only black + white, the output image quality of the Q8 assembly will be good enough for your workflow.} _{(I just don't know if it will also be much faster: it pretty much also depends on the hardware resources you are running this on ...)}

_{In addition, if your sports support the installation of HDRI (high dynamic resolution images), this may also cause some speed limitation.} _{Who knows?} _{Thus, creating IM with the settings --disable-hdri --quantum-depth 8 may or may not lead to an improvement in speed.} _{No one has ever tested this seriously ... The only thing we know about it: these settings will reduce image quality.} _{However, most people will not even notice this if they do not show a very close look and do not make direct comparisons in the image ...}

2. Test Your ImageMagick Capabilities

Then check if your ImageMagick installation is installed with OpenCL and / or OpenMP Support :

 convert -list configure | grep FEATURES

If so (like mine), you should see something like this:

 FEATURES DPC HDRI OpenCL OpenMP Modules

OpenCL (for C omputing L anguage) uses ImageMagick parallel computing functions (if compiled). This will allow your GPU to be used in addition to the processor for image processing operations.

OpenMP (for M completion P ) does something similar: it allows ImageMagick to run in parallel on all the cores of your system. Therefore, if you have a quad-core system and image resizing, resizing occurs on 4 cores (or even 8 if you have hyper-threading).

Command

 convert -version

Displays basic information about supported features. If OpenCL / OpenMP is available, you will see one of them (or both) at the output.

If none of the two appears: look at getting the latest version of ImageMagick, which supports OpenCL and / or OpenMP support.

If you yourself create a package from sources, make sure that OpenCL / OpenMP are used. Do this by including the appropriate options in your configure step:

 ./configure [...other options-] --enable-openmp --enable-opencl

ImageMagick's documentation on OpenMP and OpenCL is here:

Concurrent execution using OpenMP . Read carefully. Since OpenMP is not a silver bullet, and it does not work under any circumstances ...
Concurrent execution with OpenCL . It applies the same as above. In addition, not all ImageMagick operations are supported by OpenCL. The link here contains a list of those that are. -resize is one of them.

Tips and instructions for creating ImageMagick from sources and customizing the assembly, explaining the various options, are here:

ImageMagick Advanced Unix Installation

This page also contains a brief description of the --with-quantum-depth configure option.

3. Control your ImageMagick

Now you can use the built-in -bench option to force ImageMagick to run a test for your team. For instance:

 convert logo: -resize 500% -bench 10 logo.png [....] Performance[4]: 10i 1.489ips 1.000e 6.420u 0:06.510

The above command with -resize 500% tells ImageMagick to run the convert command and scale the embedded image IM logo: by 500% in each direction. The -bench 10 reports that it runs the same command 10 times in a loop, and then prints the performance results:

Since I have OpenMP enabled, I have 4 threads ( Performance[4]: .
It is reported that he performed 10 iterations ( 10i ).
The speed was almost 1.5 iterations per second ( 1.489ips ).
The total battery life is 6.420 seconds.

If your result includes Performance[1]: and only one line, then your system does not support OpenMP. (You may be able to enable it if your assembly supports it: run convert -limit thread 2 )

4. Change ImageMagick Resource Limits

Find out how your ImageMagick system is configured for resource limits. Use this command:

  identify -list resource
   File Area Memory Map Disk Thread Time
   -------------------------------------------------- ------------------
    384 8.590GB 4GiB 8GiB unlimited 4 unlimited

My current system settings are shown above (and not the default values - I already configured them in the past). Numbers is the maximum amount of each resource that ImageMagick will use. You can use each of the keywords in the column headings to pimp your system. To do this, use convert -limit <resource> <number> to set it to a new limit.

Perhaps your result would look like this:

  identify -list resource
   File Area Memory Map Disk Thread Time
   -------------------------------------------------- ------------------
    192 4.295GB 2GiB 4GiB unlimited 1 unlimited

files defines the maximum simultaneously open files that ImageMagick can use.
The resource, memory , map , area and disk resource limits are defined in bytes. To set them to different values, you can use SI prefixes, for example 500 MB).

If you do have OpenMP for ImageMagick on your system, you can run.

 convert -limit thread 2

Include 2 parallel threads as a first step. Then run the test and see if it really matters, and if so, how much. After that, you can set the limit to 4 or even 8 and repeat the exercise ....

5. Use the Magic Pixel Cache (MPC) and / or the Magic Permanent Register (MPR)

Finally , you can experiment with ImageMagick's special internal pixel cache format. This format is called MPC (Magick Pixel Cache). It exists only in memory.

When an MPC is created, the processed input image is saved in RAM as an uncompressed raster format. Thus, MPC is ImageMagick's own uncompressed memory format in memory. This is just a direct dump of memory to disk. Reading is a quick memory card from disk to memory as needed (similar to exchanging a memory page). But image decoding is not required.

(Additional technical details: MPC is not portable as a format. It is also not suitable as a long-term archive format. Its only suitability is an intermediate format for high-performance image processing. Two files are required to support one image.)

If you still want to save this format to disk, keep this in mind:

Image attributes are written to a file with the extension .mpc.
Image icons are written to a file with a .cache extension.

Its main advantage arises when ...

... processing very large images or when
... applying multiple operations on the same image in "control pipelines".

MPC was designed specifically for workflow templates that meet the criteria for "read many times, write once."

Some say that for such operations, performance improves here, but I have no personal experience with it.

First convert the base image to MPC:

 convert input.jpeg input.mpc

and only then run:

 convert input.mpc [...your long-long-long list of crops and operations...]

Then see how much this will save your time.

Most likely, you can use this MPC format even "inline" (using the special mpc: notation, see below).

The MPR format (persistent memory register) does something similar. It reads the image into a named memory register. The process pipeline can also read the image from this register again if it needs to access it several times. The image is saved in the register from which the current instruction pipeline exits.

But I never applied this technique to a real problem, so I can’t say how it works in real life.

6. Build a suitable IM processing pipeline to complete all tasks in one go

As you describe your process, it consists of 4 highlighted steps:

Convert TIFF to JPEG.
Resize JPEG image to xx (? What value?)
Crop JPEG to 200px.
Add a text watermark.

Tell me, did I understand your intentions from reading code fragments correctly:

You have 1 input file, TIFF.
You want to get 2 final output files:
- 1 thumbnail JPEG, size 200x200 pixels;
- 1 tagged JPEG, with a width of 1024 pixels (aspect ratio while maintaining the input TIFF);
- 1 (untagged) JPEG is just an intermediate file that you really don't want to store.

In principle, each step uses its own team - only 4 teams. This can be greatly accelerated by using a single command pipeline that performs all the steps on its own.

In addition, you apparently do not need to save unlabeled JPEG as the final result, but your one command to create it as an intermediate temporary file saves it to disk. We can try to skip this step as a whole and try to achieve the final result without additional writing to disk.

Different approaches are possible for this change. I will show you (and other readers) only one - and only for the CLI, not for PHP. I'm not a PHP guy - it is your own job to “translate” my CLI method to the appropriate PHP calls.

(But, anyway, please check my commands first, really using the CLI to see if the translation approach is worth the PHP!)

But first, make sure you really understand the architecture and structure of the more complex ImageMagick commands! For this purpose, please refer to this other answer :

Order ImageMagick command-line option (and command-line option categories)

Your 4 steps are converted to the following separate ImageMagick commands:

 convert image.tiff image.jpg convert image.jpg -resize 1024x image-1024.jpg convert image-1024.jpg -thumbnail 200x200 image-thumb.jpg convert -background white image-1024.jpg label:12345 -append image-labelled.jpg

Now, to convert this workflow into one pipeline command ... The following command is executed. It should run faster (no matter what your results are, following my previous steps 0 .-- 4. ):

 convert image.tiff \ -respect-parentheses \ +write mpr:XY \ \( mpr:XY +write image-1024.jpg \) \ \( mpr:XY -thumbnail 200x200 +write image-thumb.jpg \) \ \( mpr:XY -background white label:12345 -append +write image-labelled.jpg \) \ null:

Explanations:

-respect-parentheses : to really make subcommands executed inside brackets \( .... \) independent of each other.
+write mpr:XY : used to write the input file to the MPR memory register. XY is just a label (you can use anything) that is needed for subsequent recall of the same image.
+write image-1024.jpg : writes the result of a subcommand executed inside the first pair of parentheses to disk.
+write image-thumb.jpg : writes the result of a subcommand executed in the second pair of parentheses to disk.
+write image-labelled.jpg : writes the result of a subcommand executed inside a pair of third parentheses to disk.
null: terminates the command pipeline. Required, because otherwise we would end the last copied sub-command bracket.

7. Benchmarking 4 separate teams compared to a single pipeline

To get a rough idea of my proposal, I ran the commands below.

The first one starts a sequence of 4 separate commands 100 times (and saves all the resulting images under different file names).

 time for i in $(seq -w 1 100); do convert image.tiff \ image-indiv-run-${i}.jpg convert image-indiv-run-${i}.jpg -sample 1024x \ image-1024-indiv-run-${i}.jpg convert image-1024-indiv-run-${i}.jpg -thumbnail 200x200 \ image-thumb-indiv-run-${i}.jpg convert -background white image-1024-indiv-run-${i}.jpg label:12345 -append \ image-labelled-indiv-run-${i}.jpg echo "DONE: run indiv $i ..." done

My result for 4 separate commands (repeated 100 times!):

 real 0m49.165s user 0m39.004s sys 0m6.661s

The second command starts one pipeline:

 time for i in $(seq -w 1 100); do convert image.tiff \ -respect-parentheses \ +write mpr:XY \ \( mpr:XY -resize 1024x \ +write image-1024-pipel-run-${i}.jpg \) \ \( mpr:XY -thumbnail 200x200 \ +write image-thumb-pipel-run-${i}.jpg \) \ \( mpr:XY -resize 1024x \ -background white label:12345 -append \ +write image-labelled-pipel-run-${i}.jpg \) \ null: echo "DONE: run pipeline $i ..." done

The result for one pipeline (repeated 100 times!):

 real 0m29.128s user 0m28.450s sys 0m2.897s

As you can see, a single pipeline is approximately 40% faster than four separate commands!

Now you can also invest in multiprocessor, a lot of RAM, fast hardware SSD, to speed up the work :-)

But first, translate this CLI approach into PHP code ...

There are a few more things to say about this topic. But the time has come. I will probably return to this answer in a few days and update it more ...

_{Update: I had to update this answer with new benchmarking numbers: initially I forgot to include the -resize 1024x operation (dumb me!) In the pipelined version.} _{By turning it on, the performance gain still exists, but not so big.}

8. Use `-clone 0` to copy the image to memory

Here is another alternative to try instead of the mpr: approach mpr: with a named memory register, as suggested above.

It uses (again inside "side processing in parentheses") the -clone 0 operation. How it works:

convert reads the input TIFF from the disk once and loads it into memory.
Each -clone 0 statement creates a copy of the first loaded image (since it has index 0 in the current image stack).
The sub-category "inside-brackets" of the complete pipeline of commands performs some operation on the clone.
Each +write operation saves the corresponding result to disk.

So here is the command to compare:

 time for i in $(seq -w 1 100); do convert image.tiff \ -respect-parentheses \ \( -clone 0 -thumbnail 200x200 \ +write image-thumb-pipel-run-${i}.jpg \) \ \( -clone 0 -resize 1024x \ -background white label:12345 -append \ +write image-labelled-pipel-run-${i}.jpg \) \ null: echo "DONE: run pipeline $i ..." done

My result:

 real 0m19.432s user 0m18.214s sys 0m1.897s

, , , mpr: !

9. `-scale` `-sample` `-resize`

, , . , , ( , ).

-resize , -sample -scale . :

sample/resample/scale/resize/adaptive-resize/thumbnail ImageMagick convert?

:

 time for i in $(seq -w 1 100); do convert image.tiff \ -respect-parentheses \ \( -clone 0 -thumbnail 200x200 \ +write image-thumb-pipel-run-${i}.jpg \) \ \( -clone 0 -scale 1024x \ -background white label:12345 -append \ +write image-labelled-pipel-run-${i}.jpg \) \ null: echo "DONE: run pipeline $i ..." done

My result:

 real 0m16.551s user 0m16.124s sys 0m1.567s

( +clone ).

, , 4 .

10. `Q8` `-depth 8` .

, .

 time for i in $(seq -w 1 100); do convert image.tiff \ -respect-parentheses \ \( -clone 0 -thumbnail 200x200 -depth 8 \ +write d08-image-thumb-pipel-run-${i}.jpg \) \ \( -clone 0 -scale 1024x -depth 8 \ -background white label:12345 -append \ +write d08-image-labelled-pipel-run-${i}.jpg \) \ null: echo "DONE: run pipeline $i ..." done

" 4 " - .

11. GNU `parallel` ,

, , , .

. - , ...

 time for i in $(seq -w 1 100); do \ cat <<EOF convert image.tiff \ \( -clone 0 -scale 1024x -depth 8 \ -background white label:12345 -append \ +write d08-image-labelled-pipel-run-${i}.jpg \) \ \( -clone 0 -thumbnail 200x200 -depth 8 \ +write d08-image-thumb-pipel-run-${i}.jpg \) \ null: echo "DONE: run pipeline $i ..." EOF done | parallel --will-cite

Results:

 real 0m6.806s user 0m37.582s sys 0m6.642s

user real : user , 8 .

, , : 10 .

12.

- :

( , ) . ( convert ). . " ". -clone mbr: mbc: .
: :
- -depth 8 ( OP) -depth 16 ( OP)
- -resize 1024 vs. -sample 1024x vs. -scale 1024x
GNU parallel , .

How to speed up complex image processing?

0. Two approaches

1. Check who has ImageMagick: “Q8”, “Q16”, “Q32” or “Q64”?

2. Test Your ImageMagick Capabilities

3. Control your ImageMagick

4. Change ImageMagick Resource Limits

5. Use the Magic Pixel Cache (MPC) and / or the Magic Permanent Register (MPR)

6. Build a suitable IM processing pipeline to complete all tasks in one go

7. Benchmarking 4 separate teams compared to a single pipeline

8. Use -clone 0 to copy the image to memory

9. -scale -sample -resize

10. Q8 -depth 8 .

11. GNU parallel ,

12.

More articles:

8. Use `-clone 0` to copy the image to memory

9. `-scale` `-sample` `-resize`

10. `Q8` `-depth 8` .

11. GNU `parallel` ,