GCP Data Engineer

A Professional Data Engineer enables data-driven decision making by collecting, transforming, and publishing data. A Data Engineer should be able to design, build, operationalize, secure, and monitor data processing systems with a particular emphasis on security and compliance; scalability and efficiency; reliability and fidelity; and flexibility and portability. A Data Engineer should also be able to leverage, deploy, and continuously train pre-existing machine learning models.


Course content

 

Data Engineering Concepts
  • Overview
  • Types of Data
  • Batch vs Streaming Processing of Data
Introduction
  • What is GCP? Why GCP?
  • GCP vs Azure vs AWS
  • GCP fundamentals – setting up a free-trial GCP account
  • GCP Data Engineering exam
  • https://cloud.google.com/certification/data-engineer
  • https://cloud.google.com/certification/guides/data-engineer
GCP Fundamentals
  • GCP Regions and Zones
  • How to create a free-tier account
  • GCP Services
  • GCP Cloud Shell, Service Account
  • IAM Basics & Identity
  • Roles and Permissions
  • Assigning roles to identities
  • IAM – Service Account
  • Compute Engine (IaaS)
  • Deploying an application on App Engine (PaaS)
  • Google Kubernetes Engine – GKE (CaaS)
  • Cloud Run
  • Cloud Functions
Pricing
  • Overview
  • Price List
  • Price Calculator
Google Cloud Storage
  • Overview of storage options
  • Cloud Storage in GCP
  • Cloud Storage Interaction
  • Working with Cloud Storage (Create Bucket and CLI)
  • Securing data with encryption
  • Object versioning
  • Cloud Storage access control
  • Signed URLs for temporary access
  • Bucket retention policy
  • Bucket pricing
  • Google Cloud Storage
  • Data Transfer Services
  • Cloud Block Storage
  • Database concepts
Cloud Block Storage & Filestore
  • Block storage: Persistent Disk and Local SSD
  • Filestore
  • Which to use when?
Data Transfer Services
  • Data Migration services
  • Hands-on with the data transfer service
Database Concepts
  • OLTP vs OLAP
  • Vertical vs Horizontal Scaling
  • RPO & RTO
  • Availability & Durability
  • Managing databases on GCP: overview
GCP Database Products
  • Google Cloud SQL
  • Semi-structured data storage – NoSQL
  • Cloud Firestore
  • Cloud Datastore
  • Cloud Memorystore
  • Cloud Bigtable
Cloud SQL
  • Introduction to Cloud SQL
  • Provisioning a Cloud SQL instance
  • Creating a Cloud SQL database & executing SQL
  • Migrating databases to Cloud SQL
  • Cloud SQL failover
  • Bulk load in Cloud SQL
Cloud Spanner
  • Introduction to Cloud Spanner
  • Hands-on with Cloud Spanner
  • Deleting a Spanner instance
  • Which to use when?

 

Introduction to NoSQL database  

 

Cloud Firestore / Datastore
  • Datastore introduction
  • Firestore Native mode vs Datastore mode
  • Lab on Datastore
Cloud Memorystore
  • Cloud Memorystore for Redis
  • Hands-on with Memorystore
Cloud Bigtable
  • Introduction to Bigtable
  • Bigtable model & architecture
  • Columnar database & performance
  • Labs on Bigtable

 

Data Processing and Exploration Products

 

Google BigQuery
  • BigQuery basics
  • BigQuery pricing
  • Partitioning and clustering
  • BigQuery monitoring and logging
  • Hands-on – CSV, JSON, uploading data from a local machine
Google Cloud Pub/Sub
  • Pub/Sub basics
  • Pub/Sub concepts
  • Hands-on with Cloud Pub/Sub (console)
  • Hands-on with Cloud Pub/Sub (gcloud)
  • Hands-on with Pub/Sub using Python and schemas
  • Streaming data through Cloud Pub/Sub to BigQuery
Cloud Dataflow
  • Introduction to Cloud Dataflow
  • Apache Beam
  • Running a word count Python Dataflow job
  • Cloud Dataflow – submitting a job with a predefined template
  • Cloud Dataflow notebook instances
  • Cloud Dataflow – submitting a custom job
Big Data Ecosystem
  • Hadoop & HDFS
  • MapReduce
  • Apache Pig
  • Apache Spark – PySpark
  • Apache Kafka
Cloud Dataproc
  • Introduction to Dataproc
  • Dataproc basics
  • Advanced Dataproc
  • Creating a Dataproc cluster
  • Running a PySpark job on Dataproc
  • Provisioning a Dataproc cluster from the command line
Data Loss Prevention API
  • Introduction to DLP API
  • Templates, info_types and match
  • Lab on DLP API
Cloud Data Fusion
  • Introduction to Data Fusion
  • Introduction to Data Fusion – creating an instance
  • Lab on Data Fusion – deploying a pipeline
  • Lab on Data Fusion – running a pipeline on a Dataproc cluster
Cloud Composer – Apache Airflow
  • Introduction to Cloud Composer
  • Cloud Composer architecture
  • Lab on Composer instance creation
  • Advanced Cloud Composer
Data Preparation with Dataprep
  • Dataprep overview
  • Lab on Dataprep
Data Catalog
  • Introduction to Data Catalog
  • Lab on Data Catalog – tag template
Google Data Studio
  • Introduction to Data Studio
  • Hands-on with Data Studio

 

Data Analysis and ML

 

Machine learning basics – Theory
  • Introduction to machine learning
  • Different types of ML systems
  • Workflow in machine learning
  • GCP solutions for machine learning
ML APIs – Prebuilt Models
  • Introduction to prebuilt models
  • Labs on different prebuilt ML models
Auto Machine Learning (AutoML)
  • Introduction to AutoML
  • Labs on AutoML
Custom Machine Learning
  • Introduction to Custom Machine learning
  • Labs on custom machine learning
BigQuery ML
  • Introduction to BigQuery ML
  • Lab on BigQuery ML
Data Security and Industry regulation
  • IAM best practices
  • Data Security
  • Data Privacy
  • Industry Regulation
Sample Test Walkthrough
  • Two Tests

 


Course Prerequisites

 
  • If you are already a data scientist, data engineer, data analyst, or machine learning engineer, or are looking for a career change into the world of data, the Google Cloud Professional Data Engineer certification is for you.
  • Being able to use cloud technologies is becoming a requirement for any kind of data-focused role.
  • Learners should be familiar with the fundamentals of cloud computing and have relevant practical experience.

Who can attend

 
  • Cloud engineers who want to get certified as a Google Cloud Data Engineer
  • Anyone looking to use Google Cloud for data pipelines in their organization
  • Data engineers who want to learn the various GCP products for data engineering
  • Anyone who wants to learn about the various storage and database products for storing data
  • Anyone who wants to deploy ML models or data pipelines on Google Cloud

Number of Hours: 40

Certification

Google Cloud Professional Data Engineer (GCP PDE)

Key features

  • One to One Training
  • Online Training
  • Fast Track & Normal Track
  • Resume Modification
  • Mock Interviews
  • Video Tutorials
  • Materials
  • Real Time Projects
  • Virtual Live Experience
  • Preparing for Certification

FAQs

DASVM Technologies offers 300+ IT training courses taught by expert-level trainers with 10+ years of experience.

  • One to One Training
  • Online Training
  • Fast Track & Normal Track
  • Resume Modification
  • Mock Interviews
  • Video Tutorials
  • Materials
  • Real Time Projects
  • Preparing for Certification

Call now: +91-99003 49889 and know the exciting offers available for you!

We work and coordinate exclusively with companies to get our students placed. We have a placement cell focused on training and placements in Bangalore. Our placement cell helps more than 600 students per year.

Learn from experts active in their field, not out-of-touch trainers: leading practitioners who bring current best practices and case studies to sessions that fit into your work schedule. Our pool of trainers is highly skilled and experienced in supporting you with specific tasks and providing professional support. You get 24x7 learning support from mentors and a community of like-minded peers to resolve any conceptual doubts. Our trainers have contributed to the growth of our clients as well as individual professionals.

All of our highly qualified trainers are industry experts with at least 10-12 years of relevant teaching experience. Each of them has gone through a rigorous selection process which includes profile screening, technical evaluation, and a training demo before they are certified to train for us. We also ensure that only those trainers with a high alumni rating continue to train for us.

No worries. DASVM Technologies ensures that no one misses a single lecture topic. We will reschedule classes at your convenience within the stipulated course duration. If required, you can also attend that topic with another batch.

DASVM Technologies offers several modes of training, including:

  • Classroom training
  • One to One training
  • Fast track training
  • Live instructor-led online training
  • Customized training

Yes, once you have enrolled in the course, you get lifetime access to the course material.

You will receive a DASVM Technologies course completion certificate, and our training will help you pass the global certification exam.

Yes, DASVM Technologies provides corporate training with course customization, learning analytics, cloud labs, certifications, and real-time projects, backed by 24x7 support.

Yes, DASVM Technologies provides group discounts for its training programs. Depending on the group size, we offer discounts as per the terms and conditions.

We accept all major payment options: cash, cards (Mastercard, Visa, Maestro, etc.), wallets, net banking, and cheques.

DASVM Technologies has a no-refund policy. Fees once paid will not be refunded. If a candidate is unable to attend a training batch, he/she must reschedule for a future batch. Any balance due must be cleared by the date given. If a trainer cancels or is unavailable, DASVM will arrange training sessions with a backup trainer.

You have lifetime access to the support team, which is available 24/7. The team will help you resolve queries during and after the course.

Please contact our course advisor at +91-99003 49889, or share your queries at info@dasvmtechnologies.com.
