Back to Browse

Understanding Apache Spark

2.8K views
Nov 29, 2021
10:24

In this Apache Spark tutorial, learn how Apache Spark works as part of a Transformer pipeline and how the Transformer Engine works under the hood and enables us to transform data fast and efficiently. Apache Spark is a cluster-computing engine focused on data operations. The Transformer Engine is an execution engine that runs data processing pipelines on Apache Spark, benefitting large datasets and giving the ability to access parallel processing. This video explains the entire schema in-depth. This video is part of a comprehensive course that covers the fundamentals of the StreamSets DataOps Platform. To enroll in this free course, follow this link: https://academy.streamsets.com/courses/dataops-platform-fundamentals/?utm_source=youtube&utm_medium=social&utm_campaign=dataops-platform Learn more about StreamSets: https://streamsets.com/products/dataops-platform/?utm_source=youtube&utm_medium=social&utm_campaign=dataops-platform Try StreamSets now: https://streamsets.com/try-dataops/?utm_source=youtube&utm_medium=social&utm_campaign=dataops-platform

Download

1 formats

Video Formats

360pmp413.4 MB

Right-click 'Download' and select 'Save Link As' if the file opens in a new tab.

Understanding Apache Spark | NatokHD