This session explains how Spark internally executes a job submitted through the provided Spark shells or a standalone program. It covers the following foundational Spark concepts:
1) Driver Manager
2) Spark Executors
3) Spark Context
4) Resilient Distributed Dataset - RDD
It walks step by step through how your code gets shipped to the cluster, executed, and turned into output whenever you submit a Spark job. This session is highly recommended before taking a deeper dive into the more detailed Spark sessions.
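To make the flow above concrete, here is a minimal sketch in plain Python. It is a toy model and does not use the Spark API: `ToyContext` and `ToyRDD` are hypothetical names invented for illustration, and a thread pool stands in for the executors. It only shows the general idea that the driver records transformations lazily, and that an action such as `collect()` sends one task per partition to the executors and gathers the results.

```python
from concurrent.futures import ThreadPoolExecutor


class ToyRDD:
    """Toy stand-in for an RDD: partitioned data plus a lazy lineage of steps."""

    def __init__(self, ctx, partitions, lineage=()):
        self.ctx = ctx
        self.partitions = partitions
        self.lineage = tuple(lineage)

    def map(self, fn):
        # Transformation: nothing runs yet, the step is only recorded.
        return ToyRDD(self.ctx, self.partitions, self.lineage + (("map", fn),))

    def filter(self, pred):
        # Transformation: also lazy.
        return ToyRDD(self.ctx, self.partitions, self.lineage + (("filter", pred),))

    def _run_task(self, partition):
        # One task: apply every recorded step to a single partition.
        rows = iter(partition)
        for kind, fn in self.lineage:
            rows = map(fn, rows) if kind == "map" else filter(fn, rows)
        return list(rows)

    def collect(self):
        # Action: the "driver" hands one task per partition to the
        # "executors" and merges the results in partition order.
        results = self.ctx.executors.map(self._run_task, self.partitions)
        return [row for part in results for row in part]


class ToyContext:
    """Toy stand-in for the Spark Context: entry point that owns the executors."""

    def __init__(self, num_executors=2):
        self.executors = ThreadPoolExecutor(max_workers=num_executors)

    def parallelize(self, data, num_partitions=2):
        # Split the input into contiguous chunks, one per partition.
        data = list(data)
        size = -(-len(data) // num_partitions)  # ceiling division
        parts = [data[i * size:(i + 1) * size] for i in range(num_partitions)]
        return ToyRDD(self, parts)

    def stop(self):
        self.executors.shutdown()


sc = ToyContext(num_executors=2)
rdd = sc.parallelize(range(1, 11), num_partitions=3)
squares_of_evens = rdd.filter(lambda x: x % 2 == 0).map(lambda x: x * x)
result = squares_of_evens.collect()  # work happens only here
sc.stop()
print(result)  # [4, 16, 36, 64, 100]
```

In real Spark the executors are separate processes on the cluster nodes and the code is serialized before it is shipped to them; the sketch leaves that out to keep the driver, executor, and RDD roles easy to see.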
Visit our website for more tutorials-
www.limeguru.com
How Spark Executes A Program | Introduction To Driver Manager, Executor, Spark Context & RDD