Three types of machine learning: supervised learning, unsupervised learning, and reinforcement learning
Supervised learning: classification (predicting discrete class labels) and regression (predicting continuous values)
Unsupervised learning: finding subgroups with clustering, and dimensionality reduction (a commonly used approach in feature preprocessing to remove noise from data and compress the data onto a lower-dimensional subspace while retaining most of the relevant information)
Feature scaling: transforming the features into the range [0, 1], or standardizing them to zero mean and unit variance
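The two scaling options above can be sketched in plain Python. The function names `min_max_scale` and `standardize` and the toy feature values are illustrative assumptions, not part of these notes; in practice a library scaler would normally be used.

```python
from statistics import mean, pstdev

def min_max_scale(values):
    """Rescale values linearly into the range [0, 1]."""
    lo, hi = min(values), max(values)
    if hi == lo:
        # A constant feature carries no spread; map it to 0.0 by convention.
        return [0.0 for _ in values]
    return [(v - lo) / (hi - lo) for v in values]

def standardize(values):
    """Center values at zero mean and scale them to unit variance."""
    mu = mean(values)
    sigma = pstdev(values)  # population standard deviation
    if sigma == 0:
        return [0.0 for _ in values]
    return [(v - mu) / sigma for v in values]

# Hypothetical single feature column.
feature = [2.0, 4.0, 6.0, 8.0]
scaled = min_max_scale(feature)
standardized = standardize(feature)
```

Note that the scaling parameters (minimum, maximum, mean, standard deviation) should be computed on the training set only and then reused to transform the test set.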
Dimensionality reduction
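As a rough illustration of compressing data onto a smaller subspace, the sketch below projects 2-D points onto their direction of largest variance (the first principal component). Principal component analysis is one common technique and is chosen here as an assumption; the notes do not name a specific method. The function name `pca_project_2d_to_1d` and the sample points are hypothetical, and the closed-form angle only works for the 2-D case.

```python
import math

def pca_project_2d_to_1d(points):
    """Project 2-D points onto their axis of maximum variance."""
    n = len(points)
    mean_x = sum(p[0] for p in points) / n
    mean_y = sum(p[1] for p in points) / n
    centered = [(x - mean_x, y - mean_y) for x, y in points]

    # Entries of the symmetric 2x2 covariance matrix [[a, b], [b, c]].
    a = sum(x * x for x, _ in centered) / n
    b = sum(x * y for x, y in centered) / n
    c = sum(y * y for _, y in centered) / n

    # Angle of the major axis of a symmetric 2x2 matrix.
    theta = 0.5 * math.atan2(2 * b, a - c)
    direction = (math.cos(theta), math.sin(theta))

    # One coordinate per point instead of two.
    projected = [x * direction[0] + y * direction[1] for x, y in centered]
    return direction, projected

# Hypothetical points lying close to the line y = x.
points = [(1.0, 1.1), (2.0, 1.9), (3.0, 3.2), (4.0, 3.8)]
direction, projected = pca_project_2d_to_1d(points)
```

Each point is now described by a single number, while most of the spread in the data is retained along the chosen direction.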
Divide the dataset into separate training and test sets. We use the training set to train and optimize our machine learning model, while we keep the test set until the very end to evaluate the final model.
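A minimal sketch of such a split, assuming a random shuffle and a 70/30 ratio (both are illustrative choices; the notes do not fix a ratio). The helper `train_test_split` here is a hypothetical stand-in written with the standard library, not a library function.

```python
import random

def train_test_split(samples, test_fraction=0.3, seed=0):
    """Shuffle sample indices and split them into training and test parts."""
    rng = random.Random(seed)  # fixed seed makes the split reproducible
    indices = list(range(len(samples)))
    rng.shuffle(indices)

    n_test = int(round(len(samples) * test_fraction))
    test_idx = indices[:n_test]
    train_idx = indices[n_test:]

    train = [samples[i] for i in train_idx]
    test = [samples[i] for i in test_idx]
    return train, test

# Hypothetical dataset of ten samples.
data = list(range(10))
train, test = train_test_split(data, test_fraction=0.3, seed=42)
```

The test part is set aside and not consulted during training or model selection.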
Training and selecting a predictive model
We first have to decide upon a metric to measure performance (for example, classification accuracy).
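Two commonly used metrics can be written down directly: accuracy for classification and mean squared error for regression. These are examples chosen for illustration; the function names and toy labels are assumptions.

```python
def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true class labels."""
    correct = sum(t == p for t, p in zip(y_true, y_pred))
    return correct / len(y_true)

def mean_squared_error(y_true, y_pred):
    """Average squared difference between true and predicted values."""
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)

# Hypothetical labels and predictions.
acc = accuracy([1, 0, 1, 1], [1, 0, 0, 1])        # 3 of 4 correct
mse = mean_squared_error([2.0, 4.0], [1.0, 6.0])  # (1 + 4) / 2
```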
Compare at least a handful of different algorithms in order to train and select the best-performing model.
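Use cross-validation techniques, which further divide the training set into training and validation subsets, to estimate the generalization performance of the model.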
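k-fold cross-validation is one such technique: the training data is cut into k folds, and each fold serves once as the validation subset while the remaining folds are used for training. The generator below is a sketch under that assumption; `k_fold_indices` is a hypothetical helper, and it does not shuffle the data.

```python
def k_fold_indices(n_samples, k):
    """Yield (train_indices, validation_indices) for each of k folds."""
    indices = list(range(n_samples))
    # Spread any remainder over the first folds so sizes differ by at most 1.
    fold_sizes = [n_samples // k + (1 if i < n_samples % k else 0)
                  for i in range(k)]
    start = 0
    for size in fold_sizes:
        validation = indices[start:start + size]
        train = indices[:start] + indices[start + size:]
        yield train, validation
        start += size

# Ten samples split into three folds.
folds = list(k_fold_indices(10, 3))
```

The performance estimate is then typically the average of the metric over all k validation folds.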
Use hyperparameter optimization techniques to fine-tune the model. (Intuitively, we can think of hyperparameters as parameters that are not learned from the data but represent the knobs of a model that we can turn to improve its performance.)
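To make the "knob" idea concrete, the sketch below tries several values of k for a tiny k-nearest-neighbors classifier and keeps the one with the best validation accuracy. The choice of k-nearest neighbors, the simple search over a candidate list, and the toy data (including one deliberately noisy training label) are all assumptions made for illustration.

```python
from collections import Counter

def knn_predict(train_x, train_y, x, k):
    """Predict the majority label among the k training points nearest to x."""
    nearest = sorted(range(len(train_x)),
                     key=lambda i: abs(train_x[i] - x))[:k]
    return Counter(train_y[i] for i in nearest).most_common(1)[0][0]

def validation_accuracy(train_x, train_y, val_x, val_y, k):
    """Accuracy of the k-NN classifier on a held-out validation set."""
    hits = sum(knn_predict(train_x, train_y, x, k) == y
               for x, y in zip(val_x, val_y))
    return hits / len(val_x)

# Hypothetical 1-D data; the point 2.4 carries a noisy label.
train_x = [1.0, 2.0, 2.4, 3.0, 6.0, 7.0, 8.0]
train_y = [0, 0, 1, 0, 1, 1, 1]
val_x = [2.5, 6.5]
val_y = [0, 1]

# k is a hyperparameter: it is set by us, not learned from the data.
candidate_ks = [1, 3, 5]
scores = {k: validation_accuracy(train_x, train_y, val_x, val_y, k)
          for k in candidate_ks}
best_k = max(scores, key=scores.get)  # first k with the highest score
```

With k = 1 the noisy training point misleads the prediction for 2.5, while larger k values vote it down, which is why the search prefers a larger k on this toy data.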