Google Cloud Professional ML Engineer Guide

Q: When should I use BigQuery ML vs. Vertex AI for model training?

Use BigQuery ML when your data already lives in BigQuery and you need to quickly prototype models without moving data, or for SQL-based feature engineering. Use Vertex AI when you need custom model architectures, complex preprocessing, GPU training, or production-grade MLOps pipelines. BigQuery ML is great for analyst-friendly ML; Vertex AI is for ML engineering teams.

Q: What is the difference between Vertex AI AutoML and pre-trained APIs?

AutoML trains a custom model on your specific labeled training data for your specific task. Pre-trained APIs (Cloud Vision API, Natural Language API) are already-trained models you call for general tasks. Use pre-trained APIs for general tasks; use AutoML when you need custom classification on your specific domain data.

What does the Google Cloud Professional ML Engineer exam cover?

The Google Cloud Professional ML Engineer exam covers framing ML problems, architecting ML solutions, designing data preparation and processing pipelines, developing ML models, automating and orchestrating ML pipelines, monitoring and optimizing ML solutions, and responsible AI practices on GCP using Vertex AI, BigQuery ML, and related services.

The Google Cloud Professional Machine Learning Engineer (PMLE) certification validates expertise in designing and implementing ML pipelines on GCP. ML engineers with this certification can translate business problems into ML solutions, build scalable training and inference pipelines, and operationalize models in production.

The PMLE exam covers both theoretical ML knowledge and practical GCP ML service expertise, making it one of the more challenging GCP professional certifications.

Exam Overview

Detail	Information
Certification	Professional Machine Learning Engineer
Provider	Google Cloud
Number of Questions	60
Time Limit	2 hours
Passing Score	Not published
Cost	$200 USD
Prerequisites	Data engineering or ML background recommended
Validity	2 years

The exam covers six domains:

Framing ML problems (10%)
Architecting ML solutions (18%)
Designing data preparation and processing (23%)
Developing ML models (22%)
Automating and orchestrating ML pipelines (17%)
Monitoring ML solutions (10%)

"The PMLE exam tests both ML engineering and GCP service knowledge. Pure data scientists without GCP experience struggle with the platform sections; pure cloud engineers without ML knowledge struggle with the modeling sections. The most successful candidates have both — understanding when to use AutoML vs. custom training, how to structure training pipelines with Vertex AI Pipelines, and how to monitor models for drift in production." -- Google Cloud PMLE certified engineer community

Framing ML Problems

Problem Types and Approach Selection

Business Problem	ML Problem Type	Approach
Email spam detection	Binary classification	Logistic regression, neural network
Customer churn prediction	Binary classification	Gradient boosting, neural network
Product recommendation	Ranking/recommendation	Collaborative filtering, neural network
Image content moderation	Multi-class classification	CNN, Vision API (pre-trained)
Time-series demand forecasting	Regression/forecasting	ARIMA, LSTM, Temporal Fusion Transformer
Document similarity	Embedding/clustering	Word2Vec, BERT, k-means

When NOT to use ML:

When rules-based logic handles the problem perfectly
When insufficient labeled training data exists (< 1000 examples for complex problems)
When the cost/complexity of ML exceeds business value
When the decision needs to be fully explainable and auditable

Feature Engineering

Numerical features:

Normalization: Scale to [0, 1] range: (x - min) / (max - min)
Standardization: Z-score scaling: (x - mean) / std_dev
Log transform: Reduce skewness in power-law distributed data
Bucketization: Convert continuous to categorical (age → age groups)

Categorical features:

One-hot encoding: Create binary column per category (low cardinality)
Embedding: Dense vector representation (high cardinality, neural networks)
Target encoding: Replace category with target variable mean (risk of data leakage)

Common feature engineering mistakes:

Data leakage: Using future information to predict current events
Not scaling features for distance-based algorithms (k-means, SVM)
Using test data statistics for training normalization

Vertex AI Platform

Vertex AI Components

Vertex AI Platform
├── Vertex AI Datasets (managed data)
├── Feature Store (feature registry and serving)
├── Workbench (managed Jupyter notebooks)
├── AutoML (no-code model training)
├── Custom Training (container-based training jobs)
├── Experiments (training run tracking)
├── Model Registry (versioned model storage)
├── Model Evaluation (performance metrics)
├── Vertex AI Pipelines (ML workflow orchestration)
├── Endpoints (online prediction serving)
└── Batch Prediction (offline inference)

Training Options Comparison

Option	Expertise	Data Size	Customization
AutoML Tabular	Minimal	1K-100M rows	Low
AutoML Vision/NLP	Minimal	100+ images/docs	Low
Pre-trained APIs	None	N/A (no training)	None
Custom Training (scikit-learn)	Moderate	Any	High
Custom Training (TensorFlow/PyTorch)	High	Any	Maximum

Custom Training Job

# custom_train.py
import os
import pandas as pd
from sklearn.ensemble import GradientBoostingClassifier
from sklearn.model_selection import train_test_split
import joblib
from google.cloud import storage

# Load training data from GCS
df = pd.read_csv(os.environ['AIP_TRAINING_DATA_URI'])
X = df.drop('label', axis=1)
y = df['label']

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2)

# Train model
model = GradientBoostingClassifier(n_estimators=100, learning_rate=0.1)
model.fit(X_train, y_train)

# Save model to GCS
model_dir = os.environ['AIP_MODEL_DIR']
joblib.dump(model, 'model.joblib')

# Upload to GCS
client = storage.Client()
bucket_name = model_dir.replace('gs://', '').split('/')[0]
blob_path = '/'.join(model_dir.replace('gs://', '').split('/')[1:]) + '/model.joblib'
bucket = client.bucket(bucket_name)
bucket.blob(blob_path).upload_from_filename('model.joblib')

ML Pipelines with Vertex AI Pipelines

Vertex AI Pipelines orchestrate multi-step ML workflows using Kubeflow Pipelines SDK:

from kfp import dsl
from kfp.v2 import compiler
from google.cloud import aiplatform

@dsl.component(base_image='python:3.9', packages_to_install=['pandas', 'scikit-learn'])
def preprocess(input_path: str, output_path: dsl.Output[dsl.Dataset]):
    import pandas as pd
    df = pd.read_csv(input_path)
    # Feature engineering
    df['feature_log'] = df['feature'].apply(lambda x: max(0, x) ** 0.5)
    df.to_csv(output_path.path, index=False)

@dsl.component(base_image='python:3.9', packages_to_install=['sklearn', 'joblib'])
def train(dataset: dsl.Input[dsl.Dataset], model: dsl.Output[dsl.Model]):
    import pandas as pd
    from sklearn.ensemble import GradientBoostingClassifier
    import joblib
    
    df = pd.read_csv(dataset.path)
    X = df.drop('label', axis=1)
    y = df['label']
    
    clf = GradientBoostingClassifier()
    clf.fit(X, y)
    joblib.dump(clf, model.path + '/model.joblib')

@dsl.pipeline(name='ml-pipeline', description='Training pipeline')
def pipeline(input_path: str):
    preprocess_task = preprocess(input_path=input_path)
    train_task = train(dataset=preprocess_task.outputs['output_path'])

compiler.Compiler().compile(pipeline_func=pipeline, package_path='pipeline.json')

Model Monitoring

Detecting Training-Serving Skew and Drift

Issue	Description	Detection Method
Training-serving skew	Feature distributions differ between training and production	Compare distributions via statistical tests
Data drift	Input feature distribution shifts over time	Monitor feature statistics (mean, std, quantiles)
Concept drift	Relationship between features and label changes	Monitor prediction accuracy against ground truth
Label drift	Distribution of true labels changes	Monitor ground truth labels when available

Vertex AI Model Monitoring:

from google.cloud import aiplatform

# Enable monitoring on an existing endpoint
endpoint = aiplatform.Endpoint('projects/PROJECT/locations/us-central1/endpoints/ENDPOINT_ID')

monitoring_job = aiplatform.ModelDeploymentMonitoringJob.create(
    display_name='my-monitoring-job',
    endpoint=endpoint.resource_name,
    feature_thresholds={"age": 0.3, "income": 0.3},  # Alert if distribution shifts > 30%
    analysis_instance_schema_uri='gs://my-bucket/schema.yaml',
    predict_instance_schema_uri='gs://my-bucket/schema.yaml',
    monitoring_interval=1  # hours
)

Frequently Asked Questions

When should I use BigQuery ML vs. Vertex AI for model training? Use BigQuery ML when your data already lives in BigQuery and you need to quickly prototype models without moving data, or for models that benefit from SQL-based feature engineering. BigQuery ML supports logistic regression, k-means, neural networks, and even imports TensorFlow models. Use Vertex AI when you need custom model architectures, complex preprocessing, GPU training, or production-grade MLOps pipelines. BigQuery ML is great for analyst-friendly ML; Vertex AI is for ML engineering teams.

What is the difference between Vertex AI AutoML and pre-trained APIs? AutoML trains a custom model on your specific data for your specific task — you provide labeled training data, Google trains a model. Pre-trained APIs (Cloud Vision API, Natural Language API, Speech-to-Text) are already-trained models you call with API requests for general tasks. Use pre-trained APIs when your task is general (detect objects in photos, translate text). Use AutoML when you need custom classification (your specific product categories, your company's specific entities).

How is responsible AI tested on the PMLE exam? Responsible AI questions test your knowledge of bias detection and mitigation, model explainability (Vertex Explainable AI with feature attributions and SHAP values), fairness metrics (demographic parity, equalized odds), data privacy practices (differential privacy, federated learning concepts), and governance practices (model cards, data cards). These questions account for approximately 10% of the exam and require understanding both the concepts and how GCP tools support responsible AI practices.

References

Google Cloud. (2025). Professional Machine Learning Engineer Certification. https://cloud.google.com/certifications/machine-learning-engineer
Google Cloud. (2025). Vertex AI Documentation. https://cloud.google.com/vertex-ai/docs
Lakshmanan, V., Robinson, S., & Munn, M. (2021). Machine Learning Design Patterns. O'Reilly Media.
Aurélien, G. (2022). Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow, 3rd Edition. O'Reilly Media.
Google Cloud. (2025). Responsible AI Practices. https://ai.google/responsibility/responsible-ai-practices/
Kubeflow. (2025). Kubeflow Pipelines Documentation. https://www.kubeflow.org/docs/components/pipelines/

Frequently Asked Questions

What does the Google Cloud Professional ML Engineer exam cover?

When should I use BigQuery ML vs. Vertex AI for model training?

Use BigQuery ML when your data already lives in BigQuery and you need to quickly prototype models without moving data, or for SQL-based feature engineering. Use Vertex AI when you need custom model architectures, complex preprocessing, GPU training, or production-grade MLOps pipelines. BigQuery ML is great for analyst-friendly ML; Vertex AI is for ML engineering teams.

What is the difference between Vertex AI AutoML and pre-trained APIs?

AutoML trains a custom model on your specific labeled training data for your specific task. Pre-trained APIs (Cloud Vision API, Natural Language API) are already-trained models you call for general tasks. Use pre-trained APIs for general tasks; use AutoML when you need custom classification on your specific domain data.

Google Cloud Professional ML Engineer Guide

What does the Google Cloud Professional ML Engineer exam cover?

Exam Overview

Framing ML Problems

Problem Types and Approach Selection

Feature Engineering

Vertex AI Platform

Vertex AI Components

Training Options Comparison

Custom Training Job

ML Pipelines with Vertex AI Pipelines

Model Monitoring

Detecting Training-Serving Skew and Drift

Frequently Asked Questions

References

Tags

Frequently Asked Questions

Share this article

Continue Reading

Google Cloud Professional Cloud Architect Guide

Google Cloud Professional Cloud Security Engineer Guide

Google Cloud Digital Leader Study Guide

Google Cloud Associate Cloud Engineer Study Guide

Google Cloud Professional Data Engineer Guide

Google Cloud Professional Cloud Network Engineer Guide

What does the Google Cloud Professional ML Engineer exam cover?

Exam Overview

Framing ML Problems

Problem Types and Approach Selection

Feature Engineering

Vertex AI Platform

Vertex AI Components

Training Options Comparison

Custom Training Job

ML Pipelines with Vertex AI Pipelines

Model Monitoring

Detecting Training-Serving Skew and Drift

Frequently Asked Questions

References

Tags

Frequently Asked Questions

Share this article

Continue Reading

Google Cloud Professional Cloud Architect Guide

Google Cloud Professional Cloud Security Engineer Guide

Google Cloud Digital Leader Study Guide

Google Cloud Associate Cloud Engineer Study Guide

Google Cloud Professional Data Engineer Guide

Google Cloud Professional Cloud Network Engineer Guide

We Value Your Privacy

Cookie Preferences

Essential Cookies

Analytics & Performance Cookies

Advertising & Marketing Cookies