The data transfer process starts reading the BigQuery export results in parallel with the export job.
What are the semantics of a parallel read? Should I expect a performance difference between a parallel read and a read that starts only after the BigQuery export job has finished (in my case, about 10 TB)? Can I force the Dataflow service to wait for the BigQuery export to complete and only then start processing the data?