Try the below option -
val numExecutors = spark.conf.get("spark.executor.instances").toInt
val numExecutorsCores = spark.conf.get("spark.executor.cores").toInt
val numShufflePartitions = (numExecutors * numExecutorsCores)
spark.conf.set("spark.sql.shuffle.partitions", numShufflePartitions)
, , .
, -
spark.conf.set("spark.executor.memoryOverhead", "3G")
- Dataframe didvie hdfs hdfs spark.sql.shuffle.partitions.