TL; DR: The difference between spark "actions" and "transformations": https://spark.apache.org/docs/2.2.0/rdd-programming-guide.html#rdd-operations
Please note that all that you specified using the asynchronous option is βactionsβ , which means that they will immediately start processing the data and try to return synchronously. This can take a lot of time if there is a lot of data, so it's nice to have an asynchronous option.
At the same time, the operations that you specified without the asynchronous option are spark "transformations that are evaluated lazily, which means that it instantly creates a plan for the job, but will not actually process any data until you apply the" action "later to return the results.
Meanwhile, do you have special code or a problem that you are trying to solve with this?
combinatorist
source share