Distributed Image Processing in Cloud Dataproc

Access time: 60 minutes · Completion time: 60 minutes
This lab costs 7 Credits to run. You can purchase credits or a subscription under My Account.

GSP010

Google Cloud Self-Paced Labs

Overview

In this hands-on lab, you will learn how to use Apache Spark on Cloud Dataproc to distribute a computationally intensive image processing task onto a cluster of machines. This lab is part of a series of labs on processing scientific data.
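
The idea behind distributing such a task can be sketched locally. Each image is processed independently, so the work can be mapped over the inputs in parallel. The sketch below is not the lab's code: it uses a thread pool from the Python standard library as a stand-in for a cluster, and `process_image`, `process_all`, and the file names are hypothetical placeholders.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import List, Tuple


def process_image(path: str) -> Tuple[str, int]:
    """Hypothetical stand-in for an expensive per-image operation.

    A real job would decode the image and analyze it. Here the "result"
    is just the length of the path, so the example runs anywhere.
    """
    return path, len(path)


def process_all(paths: List[str], workers: int = 4) -> List[Tuple[str, int]]:
    # No image depends on another, so the function can be mapped over
    # the inputs in parallel. A cluster framework applies the same
    # pattern across machines instead of threads.
    with ThreadPoolExecutor(max_workers=workers) as pool:
        # pool.map returns results in the same order as the inputs.
        return list(pool.map(process_image, paths))


if __name__ == "__main__":
    sample = ["a.jpg", "bb.jpg", "ccc.jpg"]  # placeholder file names
    for path, result in process_all(sample):
        print(path, result)
```

Because the per-image function has no shared state, the same function could be handed to a distributed map without changes to its logic.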

What you'll learn

  • How to create a managed Cloud Dataproc cluster with Apache Spark pre-installed.

  • How to build and run jobs that use external packages that aren't already installed on your cluster.

  • How to shut down your cluster.
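
The three steps above roughly correspond to three `gcloud dataproc` commands. The sketch below only builds and prints those command lines and does not run them. The cluster name, region, main class, and jar location are placeholder assumptions, not values from this lab, and bundling external packages into the submitted jar is one common approach rather than necessarily the one the lab uses.

```python
import shlex
from typing import List


def create_cluster_cmd(cluster: str, region: str, num_workers: int = 2) -> List[str]:
    # Creates a managed cluster; Dataproc images ship with Spark installed.
    return [
        "gcloud", "dataproc", "clusters", "create", cluster,
        f"--region={region}",
        f"--num-workers={num_workers}",
    ]


def submit_spark_job_cmd(cluster: str, region: str,
                         main_class: str, jar_uri: str) -> List[str]:
    # Assumption: external packages are bundled into the jar passed
    # via --jars, so nothing extra has to be installed on the cluster.
    return [
        "gcloud", "dataproc", "jobs", "submit", "spark",
        f"--cluster={cluster}",
        f"--region={region}",
        f"--class={main_class}",
        f"--jars={jar_uri}",
    ]


def delete_cluster_cmd(cluster: str, region: str) -> List[str]:
    # --quiet skips the interactive confirmation prompt.
    return [
        "gcloud", "dataproc", "clusters", "delete", cluster,
        f"--region={region}",
        "--quiet",
    ]


if __name__ == "__main__":
    # All values below are hypothetical placeholders.
    commands = [
        create_cluster_cmd("my-cluster", "us-central1"),
        submit_spark_job_cmd("my-cluster", "us-central1",
                             "com.example.Main", "gs://my-bucket/app.jar"),
        delete_cluster_cmd("my-cluster", "us-central1"),
    ]
    for cmd in commands:
        print(" ".join(shlex.quote(arg) for arg in cmd))
```

Deleting the cluster once the job finishes matters because a running cluster keeps accruing charges.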

Prerequisites

This is an advanced-level lab. Familiarity with Cloud Dataproc and Apache Spark is recommended, but not required. If you're looking to get up to speed on these services, be sure to check out the following labs:

Once you're ready, scroll down to learn more about the services that you'll be using in this lab.

Score: 30 points total, 5 per task

  • Create a development machine in Compute Engine
  • Install software on the development machine
  • Create a GCS bucket
  • Download some sample images into your bucket
  • Create a Cloud Dataproc cluster
  • Submit your job to Cloud Dataproc
