Classification overview
A common use case for machine learning is classifying new data by using a model trained on similar labeled data. For example, you might want to predict whether an email is spam, or whether a customer product review is positive, negative, or neutral.
You can use any of the following models in combination with the
ML.PREDICT function
to perform classification:
- Logistic regression models:
use
logistic regression
by setting the
MODEL_TYPEoption toLOGISTIC_REG. - Boosted tree models:
use a
gradient boosted decision tree
by setting the
MODEL_TYPEoption toBOOSTED_TREE_CLASSIFIER. - Random forest models:
use a
random forest
by setting the
MODEL_TYPEoption toRANDOM_FOREST_CLASSIFIER. - Deep neural network (DNN) models:
use a
neural network
by setting the
MODEL_TYPEoption toDNN_CLASSIFIER. - Wide & Deep models:
use
wide & deep learning
by setting the
MODEL_TYPEoption toDNN_LINEAR_COMBINED_CLASSIFIER. - AutoML models:
use an
AutoML classification model
by setting the
MODEL_TYPEoption toAUTOML_CLASSIFIER.
Recommended knowledge
By using the default settings in the CREATE MODEL statements and the
ML.PREDICT function, you can create and use a classification model even
without much ML knowledge. However, having basic knowledge about
ML development helps you optimize both your data and your model to
deliver better results. We recommend using the following resources to develop
familiarity with ML techniques and processes: