乌代?卡马特(Dr. Uday Kamath) is the chief data scientist at BAE Systems Applied Intelligence. He speizes in scalable machine learning and has spent 20 years in the domain of AML, fraud detection in finan crime, cyber security, and bioinformatics, to name a few. Dr. Kamath is responsible for key products in areas focusing on the behavioral, so networking and big data machine learning aspects of analytics at BAE AI. He received his PhD at George Mason University, under the able guidance of Dr. Kenneth De Jong, where his dissertation research focused on machine learning for big data and automated sequence mining.
目录
Preface
Chapter 1:Machine Learning Review
Machine learning-history and definition
What is not machine learning?
Machine learning-concepts and terminology
Machine learning-types and subtypes
Datasets used in machine learning
Machine learning applications
Practical issues in machine learning
Machine learning-roles and process
Roles
Process
Machine learning-tools and datasets
Datasets
Summary
Chapter 2:Practical Approach to Real-World Supervised Learning
Formal description and notation
Data quality analysis
Descriptive data analysis
Basic label analysis
Basic feature analysis
Visualization analysis
Univariate feature analysis
Multivariate feature analysis
Data transformation and preprocessing
Feature construction
Handling missing values
Outliers
Discretization
Data sampling
Is sampling needed?
Undersampling and oversampling
Training, validation, and test set
Feature relevance analysis and dimensionality reduction
Feature search techniques
Feature evaluation techniques
Filter approach
Wrapper approach
Embedded approach
Model building
Linear models
Linear Regression
Naive Bayes
Logistic Regression
Non-linear models
Decision Trees
K-Nearest Neighbors(KNN)
Support vector machines(SVM)
Ensemble learning and meta learners
Bootstrap aggregating or bagging
Boosting
……