Manipulating Geospatial Data at Massive Scale

1.8K views
Sep 1, 2021
19:24

John Deere ingests petabytes of precision agriculture data every year from its customers' farms across the globe. To scale our data science efforts globally, our data scientists need to perform geospatial analysis on our data lake in an efficient and scalable manner. In this talk, we will describe some of the methods our data engineering team developed for efficient geospatial queries, including:

- Leveraging Quadtree spatial indexing to partition our Delta Lake tables
- Extending the Spark Catalyst Optimizer to perform efficient geospatial joins in our data lake
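To illustrate the idea behind quadtree indexing, here is a minimal sketch of one common scheme, the Web Mercator "quadkey" used by public map tile systems. It is not taken from the talk and may differ from what the John Deere team actually built; the function names and the choice of zoom level are hypothetical. Each extra digit in the key splits a tile into four children, so nearby points share a key prefix, which is what makes such a key a plausible partition column.

```python
import math

# Web Mercator is only defined up to this latitude.
MAX_LAT = 85.05112878


def tile_to_quadkey(tile_x: int, tile_y: int, level: int) -> str:
    """Interleave the bits of the tile coordinates into a base-4 string."""
    digits = []
    for i in range(level, 0, -1):
        mask = 1 << (i - 1)
        digit = 0
        if tile_x & mask:
            digit += 1
        if tile_y & mask:
            digit += 2
        digits.append(str(digit))
    return "".join(digits)


def lat_lon_to_quadkey(lat: float, lon: float, level: int) -> str:
    """Map a WGS84 point to the quadkey of the tile containing it."""
    lat = max(min(lat, MAX_LAT), -MAX_LAT)
    lon = max(min(lon, 180.0), -180.0)

    # Normalise to [0, 1] in Web Mercator space (y grows southward).
    x = (lon + 180.0) / 360.0
    sin_lat = math.sin(math.radians(lat))
    y = 0.5 - math.log((1 + sin_lat) / (1 - sin_lat)) / (4 * math.pi)

    # Scale to the tile grid at this level and clamp to valid indices.
    n = 1 << level
    tile_x = min(n - 1, max(0, int(x * n)))
    tile_y = min(n - 1, max(0, int(y * n)))
    return tile_to_quadkey(tile_x, tile_y, level)


if __name__ == "__main__":
    # Tile (3, 5) at level 3 has quadkey "213".
    print(tile_to_quadkey(3, 5, 3))
    # A coarse key is always a prefix of the finer key for the same point.
    print(lat_lon_to_quadkey(41.5, -90.5, 6))
    print(lat_lon_to_quadkey(41.5, -90.5, 12))
```

In a table partitioned by a short quadkey prefix, a query for a bounding box would only need to read the partitions whose prefixes cover that box. How the prefix length is chosen, and whether partitions are split adaptively by data density, are design decisions this sketch leaves open.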

