Let me make an assumption - Mmmmm ... No. It is impossible to do it faster.
Why am I so sure? Because copying files requires talking to the disk, and this is an awfully slow operation. Moreover, if you try to use multithreading, the results will be slower, and not faster, because the "mechanical" operation of moving the head across the disk is no longer sequential, which was previously possible by chance.
See the answers to this question I asked earlier .
So, try switching to SSDs if you are not already using them, otherwise you will already get the best.
Below is something for us to imagine in the long run that slow means writing to disk compared to caches. If access to the cache takes 10 minutes, this means that it takes 2 years to read from disk. All hits are shown in the image below. Obviously, when your code is executed, the bottleneck will be writing to disk. The best you can do to keep your discs consistent.

source share