Shuffle in Spark | Session-10 | Apache Spark Series from A-Z

Hi friends, Apache Spark is a distributed computing framework, which means the data being processed is distributed among the nodes of a cluster. When that data has to be computed on, however, it often needs to be shuffled across partitions, for example so that all records with the same key end up in the same partition. In this video I explain what Spark shuffle is, why it is important, and why it is an inevitable part of Apache Spark.
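To make the idea concrete, here is a small plain-Python sketch of what a shuffle does conceptually. It is not Spark code and not how Spark implements shuffle internally: the function names `shuffle_by_key` and `reduce_by_key`, the sample data, and the simple hash-modulo routing are all assumptions made for illustration.

```python
from collections import defaultdict


def shuffle_by_key(partitions, num_output_partitions):
    """Move (key, value) records so that equal keys land in the same output partition."""
    output = [[] for _ in range(num_output_partitions)]
    for partition in partitions:
        for key, value in partition:
            # Hypothetical routing rule: hash of the key modulo the partition count.
            target = hash(key) % num_output_partitions
            output[target].append((key, value))
    return output


def reduce_by_key(partitions, num_output_partitions=2):
    """Sum values per key; this only works per partition because of the shuffle."""
    shuffled = shuffle_by_key(partitions, num_output_partitions)
    totals = {}
    for partition in shuffled:
        local = defaultdict(int)
        for key, value in partition:
            local[key] += value
        # After the shuffle no key spans two partitions, so a plain update is safe.
        totals.update(local)
    return totals


# Illustrative input: the key "a" starts out spread over three partitions.
input_partitions = [
    [("a", 1), ("b", 2)],
    [("a", 3), ("c", 4)],
    [("b", 5), ("a", 6)],
]
shuffled = shuffle_by_key(input_partitions, 2)
result = reduce_by_key(input_partitions)
```

Before the shuffle, the records for key "a" sit in three different partitions, so no single partition can compute its total. After the shuffle every key lives in exactly one partition, which is why key-based operations such as grouping, joining, or aggregating generally require moving data between nodes.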