• Course overview
  • Course details
  • Prerequisites

Course overview

About this course

In this course, you learn about the internals of BigQuery and best practices for designing, optimizing, and administering your data warehouse. The course covers BigQuery architecture and how to design optimal storage and schemas for data ingestion and changes. Next, you learn techniques to improve read performance, optimize queries, manage workloads, and use logging and monitoring tools. You also learn various methods to secure data, automate workloads, and build machine learning models with BigQuery ML.

Audience

Data analysts, data scientists, data engineers, and developers who perform work on a scale that requires advanced BigQuery internals knowledge to optimize performance.

Show More Show Less

Course details

Module 1: BigQuery Architecture Fundamentals
• Introduction
• BigQuery Core Infrastructure
• BigQuery Storage
• BigQuery Query Processing
• BigQuery Data Shuffling

Module 2: Storage and Schema Optimizations
• BigQuery Storage
• Partitioning and Clustering
• Nested and Repeated Fields
• ARRAY and STRUCT syntax
• Best Practices

Module 3: Ingesting Data
• Data Ingestion Options
• Batch Ingestion
• Streaming Ingestion
• Legacy Streaming API
• BigQuery Storage Write API
• Query Materialization
• Query External Data Sources
• Data Transfer Service

Module 4: Changing Data
• Managing Change in Data Warehouses
• Handling Slowly Changing Dimensions (SCD)
• DML statements
• DML Best Practices and Common Issues

Module 5: Improving Read Performance
• BigQuery’s Cache
• Materialized Views
• BI Engine
• High Throughput Reads
• BigQuery Storage Read API

Module 6: Optimizing and Troubleshooting Queries
• Simple Query Execution
• SELECTs and Aggregation
• JOINs and Skewed JOINs
• Filtering and Ordering
• Best Practices for Functions

Module 7: Workload Management and Pricing
• BigQuery Slots
• Pricing Models and Estimates
• Slot Reservations
• Controlling Costs

Module 8: Logging and Monitoring
• Cloud Monitoring
• BigQuery Admin Panel
• Cloud Audit Logs
• INFORMATION_SCHEMA
• Query Path and Common Errors

Module 9: Security in BigQuery
• Secure Resources with IAM
• Authorized Views
• Secure Data with Classification
• Encryption
• Data Discovery and Governance

Module 10: Automating Workloads
• Scheduling Queries
• Scripting
• Stored Procedures
• Integration with Big Data Products

Module 11: Machine Learning in BigQuery 
• Introduction to BigQuery ML
• How to Make Predictions with BigQuery ML
• How to Build and Deploy a Recommendation System with BigQuery ML
• How to Build and Deploy a Demand Forecasting Solution with BigQuery ML
• Time-Series Models with BigQuery ML
• BigQuery ML Explainability

Show More Show Less

Prerequisites

  •  Knowledge of Big Data and Machine Learning Fundamentals

Our Technology Partners

Spectrum Networks is the Authorised Learning Partner for some of the leaders in IT technology for Digital Transformation