Foreword
Preface
1. Introduction
The ML Lifecycle
Data Collection and Analysis
ML Training Pipelines
Build and Validate Applications
Quality and Performance Evaluation
Defining and Measuring SLOs
Launch
Monitoring and Feedback Loops
Lessons from the Loop
2. Data Management Principles
Data as Liability
The Data Sensitivity of ML Pipelines
Phases of Data
Creation
Ingestion
Processing
Storage
Management
Analysis and Visualization
Data Reliability
Durability
Consistency
Version Control
Performance
Availability
Data Integrity
Security
Privacy
Policy and Compliance
Conclusion
3. Basic Introduction to Models
What Is a Model?
A Basic Model Creation Work_flow
Model Architecture Versus Model Definition Versus Trained Model
Where Are the Vulnerabilities?
Training Data
Labels
Training Methods
Infrastructure and Pipelines
Platforms
Feature Generation
Upgrades and Fixes
A Set of Useful Questions to Ask About Any Model
An Example ML System
Yarn Product Click-Prediction Model
Features
Labels for Features