Analyzing Natality Data Using Datalab and BigQuery
In this lab you analyze a large (137 million rows) natality dataset using Google BigQuery and Cloud Datalab.
If you are not yet familiar with Datalab, here is a graphical cheat sheet for the main Datalab functionality:
What you learn
In this lab, you:
- Launch Cloud Datalab
- Invoke a BigQuery query
- Create charts in Datalab
- Export data for machine learning
This lab illustrates how you can carry out data exploration of large datasets, but continue to use familiar tools like Pandas and Juypter. The trick is to do the first part of your aggregation in BigQuery, get back a Pandas DataFrame, then work with the smaller Pandas DataFrame locally. Datalab provides a managed Jupyter experience, so you don't need to run notebook servers yourself.
Join Qwiklabs to read the rest of this lab...and more!
- Get temporary access to the Google Cloud Console.
- Over 200 labs from beginner to advanced levels.
- Bite-sized so you can learn at your own pace.
Launch Cloud Datalab
Create a notebook