My suggestion was to cache sound on each device and then sync playback.
NTP can give you amazingly good clock synchronization.
Then, maybe, you repeat the clock synchronization so often and restart playback from the common point in the file after taking into account the drift in the clock frequency. How often do you need it, it depends on how many hours it takes on devices.
An interesting research project.
Nick palmer
source share