Dataproc: Qwik Start - Command Line

30m access · 30m completion
Student Resources
  • Dataproc: Qwik Start - Qwiklabs Preview
  • Run Spark and Hadoop Faster with Cloud Dataproc
GSP104

Google Cloud Self-Paced Labs

Overview

Cloud Dataproc is a fast, easy-to-use, fully managed cloud service for running Apache Spark and Apache Hadoop clusters more simply and cost-efficiently. Operations that used to take hours or days take seconds or minutes instead. You can create Cloud Dataproc clusters quickly and resize them at any time, so your data pipelines won't outgrow your clusters.

This lab shows you how to use the gcloud command-line tool on Google Cloud Platform to create a Cloud Dataproc cluster, run a simple Apache Spark job on the cluster, and then change the number of workers in the cluster.
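The three steps named above (create a cluster, run a Spark job, change the worker count) might look roughly like the sketch below. This is not the lab's own script. The cluster name, region, worker counts, and the `run` helper are illustrative assumptions, and the SparkPi example jar path assumes the default Dataproc image layout. By default the script only prints each gcloud command; set DRY_RUN=0 to actually execute them in a project where Dataproc is enabled.

```shell
#!/bin/sh
# Illustrative values (assumptions, not taken from the lab text).
CLUSTER="example-cluster"
REGION="us-central1"

# Dry-run by default: commands are printed, not executed.
DRY_RUN="${DRY_RUN:-1}"

# Hypothetical helper: echo the command, and run it only when DRY_RUN=0.
run() {
  echo "+ $*"
  if [ "$DRY_RUN" = "0" ]; then
    "$@"
  fi
}

# 1. Create a Dataproc cluster with two workers.
run gcloud dataproc clusters create "$CLUSTER" \
  --region="$REGION" --num-workers=2

# 2. Submit a simple Spark job (the SparkPi example; jar path is an
#    assumption about the cluster image).
run gcloud dataproc jobs submit spark \
  --cluster="$CLUSTER" --region="$REGION" \
  --class=org.apache.spark.examples.SparkPi \
  --jars=file:///usr/lib/spark/examples/jars/spark-examples.jar \
  -- 1000

# 3. Change the number of workers in the cluster.
run gcloud dataproc clusters update "$CLUSTER" \
  --region="$REGION" --num-workers=4
```

Wrapping each call in a dry-run helper lets you review the exact commands before any billable resources are created.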
