Distributed Computation of NDVI from Landsat Images Using Cloud Dataflow

GSP011

Overview

In this lab you process Landsat data in a distributed manner using Apache Beam and Cloud Dataflow. This lab is part of a series of labs on processing scientific data.

Before starting this lab, we highly recommend you read this blog post on what this pipeline does, what methods it uses, and what the results look like.

What you learn

In this lab, you:

  • Examine Apache Beam code to carry out Landsat processing
  • Submit the Beam pipeline to the Dataflow runner
  • View job details

Consider using Apache Beam on Cloud Dataflow to scale out compute-intensive jobs that meet these conditions:

  1. Your data is not tabular and you cannot use SQL to do the analysis. (If it is tabular, use BigQuery.)
  2. Large portions of the job are embarrassingly parallel, meaning that you can process different subsets of the data on different machines.
  3. Your logic involves custom functions, iterations, and similar constructs that are hard to express in SQL.
  4. The distribution of the work varies across your data subsets.
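
To make conditions 2 and 3 concrete, here is a minimal sketch of an embarrassingly parallel NDVI computation. It is not the lab's pipeline code: it uses only the Python standard library, and the scene names, the tiny pixel lists, and the helpers `ndvi`, `scene_ndvi`, and `run` are all hypothetical. It relies on the standard definition NDVI = (NIR - Red) / (NIR + Red). Each scene is processed without reference to any other scene, which is the property that lets a runner such as Cloud Dataflow spread the work across machines.

```python
from concurrent.futures import ThreadPoolExecutor


def ndvi(red, nir):
    """Return NDVI for one pixel: (NIR - red) / (NIR + red), or 0.0 if both are zero."""
    total = nir + red
    if total == 0:
        return 0.0  # assumption: treat empty pixels as 0.0 rather than dividing by zero
    return (nir - red) / total


def scene_ndvi(scene):
    """Compute per-pixel NDVI for one scene; needs no data from any other scene."""
    name, red_band, nir_band = scene
    return name, [ndvi(r, n) for r, n in zip(red_band, nir_band)]


# Hypothetical toy scenes: (name, red reflectances, near-infrared reflectances).
SCENES = [
    ("scene_a", [0.1, 0.2], [0.5, 0.2]),
    ("scene_b", [0.0, 0.3], [0.0, 0.1]),
]


def run(scenes, workers=2):
    """Process scenes independently, here on local threads instead of remote workers."""
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return dict(pool.map(scene_ndvi, scenes))


if __name__ == "__main__":
    for name, values in run(SCENES).items():
        print(name, [round(v, 3) for v in values])
```

In a Beam pipeline, a per-scene function like `scene_ndvi` would typically be applied as a per-element transform, and the runner, not a local thread pool, would decide how to distribute the scenes.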
