Preface
1. Machine Learning for Computer Vision
Machine Learning
Deep Learning Use Cases
Summary
2. ML Models for Vision
A Dataset for Machine Perception
5-Flowers Dataset
Reading Image Data
Visualizing Image Data
Reading the Dataset File
A Linear Model Using Keras
Keras Model
Training the Model
A Neural Network Using Keras
Neural Networks
Deep Neural Networks
Summary
Glossary
3. Image Vision
Pretrained Embeddings
Pretrained Model
Transfer Learning
Fine-Tuning
Convolutional Networks
Convolutional Filters
Stacking Convolutional Layers
Pooling Layers
AlexNet
The Quest for Depth
Filter Factorization
lxl Convolutions
VGG19
Global Average Pooling
Modular Architectures
Inception
SqueezeNet
ResNet and Skip Connections
DenseNet
Depth-Separable Convolutions
Xception
Neural Architecture Search Designs
NASNet
The MobileNet Family
Beyond Convolution: The Transformer Architecture
Choosing a Model
Performance Comparison
Ensembling
Recommended Strategy
Summary
4. Object Detection and Image Segmentation
Object Detection
YOLO
RetinaNet
Segmentation
Mask R-CNN and Instance Segmentation
U-Net and Semantic Segmentation
Summary
5. Creating Visi0n Datasets
Collecting Images
Photographs
6. Preprocessing
7. Training Pipeline
8. Model Quality and Continuous Evaluation
9. Model Predictions
10. Trends in Production ML
11. Advanced Vision Problems
12. Image and Text Generation
Afterword
Index