Data Engineering with Google BigQuery & Google Cloud
Building data pipeline with BigQuery, Data Studio, and other Google Cloud. Learn basic BigQuery sql & techniques.
Description
A data warehouse is a repository of historical data that is queried to answer questions, gain insight from data, and make business decisions. BigQuery is Google’s product for data warehouse. It is designed to store and query terabytes, even petabytes of data without we need to setup and manage any infrastructure. It is not a transactional database for day-to-day operation.
BigQuery supports standard SQL, so if you ever develop with relational database like Oracle, PostgreSQL, MySQL, Microsoft SQL Server, etc, it is easy to familiarize yourself with BigQuery. There are a few BigQuery functions to support modern-day requirements, and learning about them will make your job easier.
There is no infrastructure required. We don’t need to worry about the size of storage, number of processors, or memory allocation for processing query. BigQuery scales automatically to run query, and then release the resource when it is done. We don't even charged for memory or processor allocation.
Google provides sample database for practice and trial.
This course has several topics:
an introduction, where we will see what this course about
what is data warehouse and in which part BigQuery take roles
how we can create a simple data pipeline, including data input, data cleansing, and data visualization
tools and methods that can be used for data engineering, particularly on data ingestion from various sources to BigQuery
data visualization using Google Sheets & Data Studio
This course is for people with basic technical knowledge on SQL.
This course is not basic SQL course, so we will not learn the meaning of basic sql such as SELECT, FROM, WHERE, GROUP BY, ORDER BY
See the preview video Technology in This Course for SQL keyword that we will not discuss on detail
However, we will still learn some of modern SQL syntax that can be used in BigQuery
In this course we will also learn how to fetch data from several sources, so this is a good course if you are an engineer that responsible for creating data pipeline.
What You Will Learn!
- Google BigQuery Introduction
- Building data pipeline using Google BigQuery from various sources (file, database, API, message broker)
- Various techniques & tools for loading data into Google BigQuery
- Overview of data cleansing & preparation
- Overview of data visualization using Google Sheets & Data Studio
Who Should Attend!
- Software engineers, data analyst, everyone who interested on data engineering
- Technical architect that want to know how Google platform can fit into overall data & application architecture