.. _sec_tabularmultilabel:

Predicting Multiple Columns in a Table (Multi-Label Prediction)
===============================================================


In multi-label prediction, we wish to predict multiple columns of a
table (i.e. labels) based on the values in the remaining columns. Here
we present a simple strategy to do this with AutoGluon, which simply
maintains a separate
`TabularPredictor <../../api/autogluon.predictor.html#autogluon.tabular.TabularPredictor.fit>`__
object for each column being predicted. Correlations between labels can
be accounted for in predictions by imposing an order on the labels and
allowing the ``TabularPredictor`` for each label to condition on the
predicted values for labels that appeared earlier in the order.

MultilabelPredictor Class
~~~~~~~~~~~~~~~~~~~~~~~~~

We start by defining a custom ``MultilabelPredictor`` class to manage a
collection of ``TabularPredictor`` objects, one for each label. You can
use the ``MultilabelPredictor`` similarly to an individual
``TabularPredictor``, except it operates on multiple labels rather than
one.

.. code:: python

    from autogluon.tabular import TabularDataset, TabularPredictor
    from autogluon.core.utils.utils import setup_outputdir
    from autogluon.core.utils.loaders import load_pkl
    from autogluon.core.utils.savers import save_pkl
    import os.path
    
    class MultilabelPredictor():
        """ Tabular Predictor for predicting multiple columns in table.
            Creates multiple TabularPredictor objects which you can also use individually.
            You can access the TabularPredictor for a particular label via: `multilabel_predictor.get_predictor(label_i)`
    
            Parameters
            ----------
            labels : List[str]
                The ith element of this list is the column (i.e. `label`) predicted by the ith TabularPredictor stored in this object.
            path : str
                Path to directory where models and intermediate outputs should be saved.
                If unspecified, a time-stamped folder called "AutogluonModels/ag-[TIMESTAMP]" will be created in the working directory to store all models.
                Note: To call `fit()` twice and save all results of each fit, you must specify different `path` locations or don't specify `path` at all.
                Otherwise files from first `fit()` will be overwritten by second `fit()`.
                Caution: when predicting many labels, this directory may grow large as it needs to store many TabularPredictors.
            problem_types : List[str]
                The ith element is the `problem_type` for the ith TabularPredictor stored in this object.
            eval_metrics : List[str]
                The ith element is the `eval_metric` for the ith TabularPredictor stored in this object.
            consider_labels_correlation : bool
                Whether the predictions of multiple labels should account for label correlations or predict each label independently of the others.
                If True, the ordering of `labels` may affect resulting accuracy as each label is predicted conditional on the previous labels appearing earlier in this list (i.e. in an auto-regressive fashion).
                Set to False if during inference you may want to individually use just the ith TabularPredictor without predicting all the other labels.
            kwargs :
                Arguments passed into the initialization of each TabularPredictor.
    
        """
    
        multi_predictor_file = 'multilabel_predictor.pkl'
    
        def __init__(self, labels, path, problem_types=None, eval_metrics=None, consider_labels_correlation=True, **kwargs):
            if len(labels) < 2:
                raise ValueError("MultilabelPredictor is only intended for predicting MULTIPLE labels (columns), use TabularPredictor for predicting one label (column).")
            self.path = setup_outputdir(path, warn_if_exist=False)
            self.labels = labels
            self.consider_labels_correlation = consider_labels_correlation
            self.predictors = {}  # key = label, value = TabularPredictor or str path to the TabularPredictor for this label
            if eval_metrics is None:
                self.eval_metrics = {}
            else:
                self.eval_metrics = {labels[i] : eval_metrics[i] for i in range(len(labels))}
            problem_type = None
            eval_metric = None
            for i in range(len(labels)):
                label = labels[i]
                path_i = self.path + "Predictor_" + label
                if problem_types is not None:
                    problem_type = problem_types[i]
                if eval_metrics is not None:
                    eval_metric = self.eval_metrics[i]
                self.predictors[label] = TabularPredictor(label=label, problem_type=problem_type, eval_metric=eval_metric, path=path_i, **kwargs)
    
        def fit(self, train_data, tuning_data=None, **kwargs):
            """ Fits a separate TabularPredictor to predict each of the labels.
    
                Parameters
                ----------
                train_data, tuning_data : str or autogluon.tabular.TabularDataset or pd.DataFrame
                    See documentation for `TabularPredictor.fit()`.
                kwargs :
                    Arguments passed into the `fit()` call for each TabularPredictor.
            """
            if isinstance(train_data, str):
                train_data = TabularDataset(train_data)
            if tuning_data is not None and isinstance(tuning_data, str):
                tuning_data = TabularDataset(tuning_data)
            train_data_og = train_data.copy()
            if tuning_data is not None:
                tuning_data_og = tuning_data.copy()
            save_metrics = len(self.eval_metrics) == 0
            for i in range(len(self.labels)):
                label = self.labels[i]
                predictor = self.get_predictor(label)
                if not self.consider_labels_correlation:
                    labels_to_drop = [l for l in self.labels if l!=label]
                else:
                    labels_to_drop = [labels[j] for j in range(i+1,len(self.labels))]
                train_data = train_data_og.drop(labels_to_drop, axis=1)
                if tuning_data is not None:
                    tuning_data = tuning_data_og.drop(labels_to_drop, axis=1)
                print(f"Fitting TabularPredictor for label: {label} ...")
                predictor.fit(train_data=train_data, tuning_data=tuning_data, **kwargs)
                self.predictors[label] = predictor.path
                if save_metrics:
                    self.eval_metrics[label] = predictor.eval_metric
            self.save()
    
        def predict(self, data, **kwargs):
            """ Returns DataFrame with label columns containing predictions for each label.
    
                Parameters
                ----------
                data : str or autogluon.tabular.TabularDataset or pd.DataFrame
                    Data to make predictions for. If label columns are present in this data, they will be ignored. See documentation for `TabularPredictor.predict()`.
                kwargs :
                    Arguments passed into the predict() call for each TabularPredictor.
            """
            return self._predict(data, as_proba=False, **kwargs)
    
        def predict_proba(self, data, **kwargs):
            """ Returns dict where each key is a label and the corresponding value is the `predict_proba()` output for just that label.
    
                Parameters
                ----------
                data : str or autogluon.tabular.TabularDataset or pd.DataFrame
                    Data to make predictions for. See documentation for `TabularPredictor.predict()` and `TabularPredictor.predict_proba()`.
                kwargs :
                    Arguments passed into the `predict_proba()` call for each TabularPredictor (also passed into a `predict()` call).
            """
            return self._predict(data, as_proba=True, **kwargs)
    
        def evaluate(self, data, **kwargs):
            """ Returns dict where each key is a label and the corresponding value is the `evaluate()` output for just that label.
    
                Parameters
                ----------
                data : str or autogluon.tabular.TabularDataset or pd.DataFrame
                    Data to evalate predictions of all labels for, must contain all labels as columns. See documentation for `TabularPredictor.evaluate()`.
                kwargs :
                    Arguments passed into the `evaluate()` call for each TabularPredictor (also passed into the `predict()` call).
            """
            data = self._get_data(data)
            eval_dict = {}
            for label in self.labels:
                print(f"Evaluating TabularPredictor for label: {label} ...")
                predictor = self.get_predictor(label)
                eval_dict[label] = predictor.evaluate(data, **kwargs)
                if self.consider_labels_correlation:
                    data[label] = predictor.predict(data, **kwargs)
            return eval_dict
    
        def save(self):
            """ Save MultilabelPredictor to disk. """
            for label in self.labels:
                if not isinstance(self.predictors[label], str):
                    self.predictors[label] = self.predictors[label].path
            save_pkl.save(path=self.path+self.multi_predictor_file, object=self)
            print(f"MultilabelPredictor saved to disk. Load with: MultilabelPredictor.load('{self.path}')")
    
        @classmethod
        def load(cls, path):
            """ Load MultilabelPredictor from disk `path` previously specified when creating this MultilabelPredictor. """
            path = os.path.expanduser(path)
            if path[-1] != os.path.sep:
                path = path + os.path.sep
            return load_pkl.load(path=path+cls.multi_predictor_file)
    
        def get_predictor(self, label):
            """ Returns TabularPredictor which is used to predict this label. """
            predictor = self.predictors[label]
            if isinstance(predictor, str):
                return TabularPredictor.load(path=predictor)
            return predictor
    
        def _get_data(self, data):
            if isinstance(data, str):
                return TabularDataset(data)
            return data.copy()
    
        def _predict(self, data, as_proba=False, **kwargs):
            data = self._get_data(data)
            if as_proba:
                predproba_dict = {}
            for label in self.labels:
                print(f"Predicting with TabularPredictor for label: {label} ...")
                predictor = self.get_predictor(label)
                if as_proba:
                    predproba_dict[label] = predictor.predict_proba(data, as_multiclass=True, **kwargs)
                data[label] = predictor.predict(data, **kwargs)
            if not as_proba:
                return data[self.labels]
            else:
                return predproba_dict

Training
~~~~~~~~

Let's now apply our multi-label predictor to predict multiple columns in
a data table. We first train models to predict each of the labels.

.. code:: python

    train_data = TabularDataset('https://autogluon.s3.amazonaws.com/datasets/Inc/train.csv')
    subsample_size = 500  # subsample subset of data for faster demo, try setting this to much larger values
    train_data = train_data.sample(n=subsample_size, random_state=0)
    train_data.head()


.. raw:: html

    <div>
    <style scoped>
        .dataframe tbody tr th:only-of-type {
            vertical-align: middle;
        }
    
        .dataframe tbody tr th {
            vertical-align: top;
        }
    
        .dataframe thead th {
            text-align: right;
        }
    </style>
    <table border="1" class="dataframe">
      <thead>
        <tr style="text-align: right;">
          <th></th>
          <th>age</th>
          <th>workclass</th>
          <th>fnlwgt</th>
          <th>education</th>
          <th>education-num</th>
          <th>marital-status</th>
          <th>occupation</th>
          <th>relationship</th>
          <th>race</th>
          <th>sex</th>
          <th>capital-gain</th>
          <th>capital-loss</th>
          <th>hours-per-week</th>
          <th>native-country</th>
          <th>class</th>
        </tr>
      </thead>
      <tbody>
        <tr>
          <th>6118</th>
          <td>51</td>
          <td>Private</td>
          <td>39264</td>
          <td>Some-college</td>
          <td>10</td>
          <td>Married-civ-spouse</td>
          <td>Exec-managerial</td>
          <td>Wife</td>
          <td>White</td>
          <td>Female</td>
          <td>0</td>
          <td>0</td>
          <td>40</td>
          <td>United-States</td>
          <td>&gt;50K</td>
        </tr>
        <tr>
          <th>23204</th>
          <td>58</td>
          <td>Private</td>
          <td>51662</td>
          <td>10th</td>
          <td>6</td>
          <td>Married-civ-spouse</td>
          <td>Other-service</td>
          <td>Wife</td>
          <td>White</td>
          <td>Female</td>
          <td>0</td>
          <td>0</td>
          <td>8</td>
          <td>United-States</td>
          <td>&lt;=50K</td>
        </tr>
        <tr>
          <th>29590</th>
          <td>40</td>
          <td>Private</td>
          <td>326310</td>
          <td>Some-college</td>
          <td>10</td>
          <td>Married-civ-spouse</td>
          <td>Craft-repair</td>
          <td>Husband</td>
          <td>White</td>
          <td>Male</td>
          <td>0</td>
          <td>0</td>
          <td>44</td>
          <td>United-States</td>
          <td>&lt;=50K</td>
        </tr>
        <tr>
          <th>18116</th>
          <td>37</td>
          <td>Private</td>
          <td>222450</td>
          <td>HS-grad</td>
          <td>9</td>
          <td>Never-married</td>
          <td>Sales</td>
          <td>Not-in-family</td>
          <td>White</td>
          <td>Male</td>
          <td>0</td>
          <td>2339</td>
          <td>40</td>
          <td>El-Salvador</td>
          <td>&lt;=50K</td>
        </tr>
        <tr>
          <th>33964</th>
          <td>62</td>
          <td>Private</td>
          <td>109190</td>
          <td>Bachelors</td>
          <td>13</td>
          <td>Married-civ-spouse</td>
          <td>Exec-managerial</td>
          <td>Husband</td>
          <td>White</td>
          <td>Male</td>
          <td>15024</td>
          <td>0</td>
          <td>40</td>
          <td>United-States</td>
          <td>&gt;50K</td>
        </tr>
      </tbody>
    </table>
    </div>


.. code:: python

    labels = ['education-num','education','class']  # which columns to predict based on the others
    problem_types = ['regression','multiclass','binary']  # type of each prediction problem
    save_path = 'agModels-predictEducationClass'  # specifies folder to store trained models
    
    time_limit = 5  # how many seconds to train the TabularPredictor for each label, set much larger in your applications!

.. code:: python

    multi_predictor = MultilabelPredictor(labels=labels, problem_types=problem_types, path=save_path)
    multi_predictor.fit(train_data, time_limit=time_limit)


.. parsed-literal::
    :class: output

    Beginning AutoGluon training ... Time limit = 5s
    AutoGluon will save models to "agModels-predictEducationClass/Predictor_education-num/"
    AutoGluon Version:  0.1.0b20210301
    Train Data Rows:    500
    Train Data Columns: 12
    Preprocessing data ...
    Using Feature Generators to preprocess the data ...
    Fitting AutoMLPipelineFeatureGenerator...
    NumExpr defaulting to 8 threads.
    	Available Memory:                    13765.46 MB
    	Train Data (Original)  Memory Usage: 0.26 MB (0.0% of available memory)
    	Inferring data type of each feature based on column values. Set feature_metadata_in to manually specify special dtypes of the features.
    	Stage 1 Generators:
    		Fitting AsTypeFeatureGenerator...
    	Stage 2 Generators:
    		Fitting FillNaFeatureGenerator...
    	Stage 3 Generators:
    		Fitting IdentityFeatureGenerator...
    		Fitting CategoryFeatureGenerator...
    			Fitting CategoryMemoryMinimizeFeatureGenerator...
    	Stage 4 Generators:
    		Fitting DropUniqueFeatureGenerator...
    	Types of features in original data (raw dtype, special dtypes):
    		('int', [])    : 5 | ['age', 'fnlwgt', 'capital-gain', 'capital-loss', 'hours-per-week']
    		('object', []) : 7 | ['workclass', 'marital-status', 'occupation', 'relationship', 'race', ...]
    	Types of features in processed data (raw dtype, special dtypes):
    		('category', []) : 7 | ['workclass', 'marital-status', 'occupation', 'relationship', 'race', ...]
    		('int', [])      : 5 | ['age', 'fnlwgt', 'capital-gain', 'capital-loss', 'hours-per-week']
    	0.1s = Fit runtime
    	12 features in original data used to generate 12 features in processed data.
    	Train Data (Processed) Memory Usage: 0.02 MB (0.0% of available memory)
    Data preprocessing and feature engineering runtime = 0.07s ...
    AutoGluon will gauge predictive performance using evaluation metric: 'root_mean_squared_error'
    	To change this, specify the eval_metric argument of fit()
    Automatically generating train/validation split with holdout_frac=0.2, Train Rows: 400, Val Rows: 100
    Fitting model: RandomForestMSE ... Training model for up to 4.93s of the 4.93s of remaining time.


.. parsed-literal::
    :class: output

    Fitting TabularPredictor for label: education-num ...


.. parsed-literal::
    :class: output

    	-2.2493	 = Validation root_mean_squared_error score
    	0.5s	 = Training runtime
    	0.11s	 = Validation runtime
    Fitting model: ExtraTreesMSE ... Training model for up to 4.3s of the 4.3s of remaining time.
    	-2.4398	 = Validation root_mean_squared_error score
    	0.4s	 = Training runtime
    	0.11s	 = Validation runtime
    Fitting model: KNeighborsUnif ... Training model for up to 3.77s of the 3.77s of remaining time.
    	-2.703	 = Validation root_mean_squared_error score
    	0.0s	 = Training runtime
    	0.1s	 = Validation runtime
    Fitting model: KNeighborsDist ... Training model for up to 3.67s of the 3.67s of remaining time.
    	-2.7447	 = Validation root_mean_squared_error score
    	0.0s	 = Training runtime
    	0.1s	 = Validation runtime
    Fitting model: LightGBM ... Training model for up to 3.56s of the 3.56s of remaining time.
    	-2.3176	 = Validation root_mean_squared_error score
    	0.2s	 = Training runtime
    	0.01s	 = Validation runtime
    Fitting model: LightGBMXT ... Training model for up to 3.35s of the 3.35s of remaining time.
    	-2.2917	 = Validation root_mean_squared_error score
    	0.17s	 = Training runtime
    	0.01s	 = Validation runtime
    Fitting model: CatBoost ... Training model for up to 3.16s of the 3.15s of remaining time.
    	-2.1916	 = Validation root_mean_squared_error score
    	0.58s	 = Training runtime
    	0.01s	 = Validation runtime
    Fitting model: XGBoost ... Training model for up to 2.57s of the 2.57s of remaining time.
    	-2.1739	 = Validation root_mean_squared_error score
    	0.17s	 = Training runtime
    	0.01s	 = Validation runtime
    Fitting model: NeuralNetMXNet ... Training model for up to 2.37s of the 2.37s of remaining time.
    	Ran out of time, stopping training early. (Stopping on epoch 24)
    	-2.6378	 = Validation root_mean_squared_error score
    	2.51s	 = Training runtime
    	0.02s	 = Validation runtime
    Fitting model: WeightedEnsemble_L2 ... Training model for up to 4.93s of the -0.71s of remaining time.
    	-2.1337	 = Validation root_mean_squared_error score
    	0.39s	 = Training runtime
    	0.0s	 = Validation runtime
    AutoGluon training complete, total runtime = 6.11s ...
    TabularPredictor saved. To load, use: predictor = TabularPredictor.load("agModels-predictEducationClass/Predictor_education-num/")
    Beginning AutoGluon training ... Time limit = 5s
    AutoGluon will save models to "agModels-predictEducationClass/Predictor_education/"
    AutoGluon Version:  0.1.0b20210301
    Train Data Rows:    500
    Train Data Columns: 13
    Preprocessing data ...
    Warning: Some classes in the training set have fewer than 10 examples. AutoGluon will only keep 11 out of 15 classes for training and will not try to predict the rare classes. To keep more classes, increase the number of datapoints from these rare classes in the training data or reduce label_count_threshold.
    Fraction of data from classes with at least 10 examples that will be kept for training models: 0.976
    Train Data Class Count: 11
    Using Feature Generators to preprocess the data ...
    Fitting AutoMLPipelineFeatureGenerator...
    	Available Memory:                    13624.91 MB
    	Train Data (Original)  Memory Usage: 0.25 MB (0.0% of available memory)
    	Inferring data type of each feature based on column values. Set feature_metadata_in to manually specify special dtypes of the features.
    	Stage 1 Generators:
    		Fitting AsTypeFeatureGenerator...
    	Stage 2 Generators:
    		Fitting FillNaFeatureGenerator...
    	Stage 3 Generators:
    		Fitting IdentityFeatureGenerator...
    		Fitting CategoryFeatureGenerator...
    			Fitting CategoryMemoryMinimizeFeatureGenerator...
    	Stage 4 Generators:
    		Fitting DropUniqueFeatureGenerator...
    	Types of features in original data (raw dtype, special dtypes):
    		('int', [])    : 6 | ['age', 'fnlwgt', 'education-num', 'capital-gain', 'capital-loss', ...]
    		('object', []) : 7 | ['workclass', 'marital-status', 'occupation', 'relationship', 'race', ...]
    	Types of features in processed data (raw dtype, special dtypes):
    		('category', []) : 7 | ['workclass', 'marital-status', 'occupation', 'relationship', 'race', ...]
    		('int', [])      : 6 | ['age', 'fnlwgt', 'education-num', 'capital-gain', 'capital-loss', ...]
    	0.1s = Fit runtime
    	13 features in original data used to generate 13 features in processed data.
    	Train Data (Processed) Memory Usage: 0.03 MB (0.0% of available memory)
    Data preprocessing and feature engineering runtime = 0.07s ...
    AutoGluon will gauge predictive performance using evaluation metric: 'accuracy'
    	To change this, specify the eval_metric argument of fit()
    Automatically generating train/validation split with holdout_frac=0.2, Train Rows: 390, Val Rows: 98
    Fitting model: NeuralNetMXNet ... Training model for up to 4.93s of the 4.93s of remaining time.


.. parsed-literal::
    :class: output

    Fitting TabularPredictor for label: education ...


.. parsed-literal::
    :class: output

    	Ran out of time, stopping training early. (Stopping on epoch 77)
    	0.8061	 = Validation accuracy score
    	5.0s	 = Training runtime
    	0.02s	 = Validation runtime
    Fitting model: WeightedEnsemble_L2 ... Training model for up to 4.93s of the -0.14s of remaining time.
    	0.8061	 = Validation accuracy score
    	0.0s	 = Training runtime
    	0.0s	 = Validation runtime
    AutoGluon training complete, total runtime = 5.15s ...
    TabularPredictor saved. To load, use: predictor = TabularPredictor.load("agModels-predictEducationClass/Predictor_education/")
    Beginning AutoGluon training ... Time limit = 5s
    AutoGluon will save models to "agModels-predictEducationClass/Predictor_class/"
    AutoGluon Version:  0.1.0b20210301
    Train Data Rows:    500
    Train Data Columns: 14
    Preprocessing data ...
    Selected class <--> label mapping:  class 1 =  >50K, class 0 =  <=50K
    	Note: For your binary classification, AutoGluon arbitrarily selected which label-value represents positive ( >50K) vs negative ( <=50K) class.
    	To explicitly set the positive_class, either rename classes to 1 and 0, or specify positive_class in Predictor init.
    Using Feature Generators to preprocess the data ...
    Fitting AutoMLPipelineFeatureGenerator...
    	Available Memory:                    13617.31 MB
    	Train Data (Original)  Memory Usage: 0.29 MB (0.0% of available memory)
    	Inferring data type of each feature based on column values. Set feature_metadata_in to manually specify special dtypes of the features.
    	Stage 1 Generators:
    		Fitting AsTypeFeatureGenerator...
    	Stage 2 Generators:
    		Fitting FillNaFeatureGenerator...
    	Stage 3 Generators:
    		Fitting IdentityFeatureGenerator...
    		Fitting CategoryFeatureGenerator...
    			Fitting CategoryMemoryMinimizeFeatureGenerator...
    	Stage 4 Generators:
    		Fitting DropUniqueFeatureGenerator...
    	Types of features in original data (raw dtype, special dtypes):
    		('int', [])    : 6 | ['age', 'fnlwgt', 'education-num', 'capital-gain', 'capital-loss', ...]
    		('object', []) : 8 | ['workclass', 'education', 'marital-status', 'occupation', 'relationship', ...]
    	Types of features in processed data (raw dtype, special dtypes):
    		('category', []) : 8 | ['workclass', 'education', 'marital-status', 'occupation', 'relationship', ...]
    		('int', [])      : 6 | ['age', 'fnlwgt', 'education-num', 'capital-gain', 'capital-loss', ...]
    	0.1s = Fit runtime
    	14 features in original data used to generate 14 features in processed data.
    	Train Data (Processed) Memory Usage: 0.03 MB (0.0% of available memory)
    Data preprocessing and feature engineering runtime = 0.07s ...
    AutoGluon will gauge predictive performance using evaluation metric: 'accuracy'
    	To change this, specify the eval_metric argument of fit()
    Automatically generating train/validation split with holdout_frac=0.2, Train Rows: 400, Val Rows: 100
    Fitting model: RandomForestGini ... Training model for up to 4.93s of the 4.93s of remaining time.


.. parsed-literal::
    :class: output

    Fitting TabularPredictor for label: class ...


.. parsed-literal::
    :class: output

    	0.84	 = Validation accuracy score
    	0.51s	 = Training runtime
    	0.11s	 = Validation runtime
    Fitting model: RandomForestEntr ... Training model for up to 4.3s of the 4.3s of remaining time.
    	0.83	 = Validation accuracy score
    	0.51s	 = Training runtime
    	0.11s	 = Validation runtime
    Fitting model: ExtraTreesGini ... Training model for up to 3.67s of the 3.67s of remaining time.
    	0.83	 = Validation accuracy score
    	0.41s	 = Training runtime
    	0.11s	 = Validation runtime
    Fitting model: ExtraTreesEntr ... Training model for up to 3.14s of the 3.14s of remaining time.
    	0.84	 = Validation accuracy score
    	0.41s	 = Training runtime
    	0.11s	 = Validation runtime
    Fitting model: KNeighborsUnif ... Training model for up to 2.61s of the 2.61s of remaining time.
    	0.73	 = Validation accuracy score
    	0.0s	 = Training runtime
    	0.1s	 = Validation runtime
    Fitting model: KNeighborsDist ... Training model for up to 2.5s of the 2.5s of remaining time.
    	0.65	 = Validation accuracy score
    	0.0s	 = Training runtime
    	0.1s	 = Validation runtime
    Fitting model: LightGBM ... Training model for up to 2.4s of the 2.4s of remaining time.
    	0.85	 = Validation accuracy score
    	0.16s	 = Training runtime
    	0.01s	 = Validation runtime
    Fitting model: LightGBMXT ... Training model for up to 2.22s of the 2.22s of remaining time.
    	0.83	 = Validation accuracy score
    	0.12s	 = Training runtime
    	0.01s	 = Validation runtime
    Fitting model: CatBoost ... Training model for up to 2.09s of the 2.09s of remaining time.
    	0.86	 = Validation accuracy score
    	0.85s	 = Training runtime
    	0.01s	 = Validation runtime
    Fitting model: XGBoost ... Training model for up to 1.22s of the 1.22s of remaining time.
    	0.85	 = Validation accuracy score
    	0.14s	 = Training runtime
    	0.01s	 = Validation runtime
    Fitting model: NeuralNetMXNet ... Training model for up to 1.07s of the 1.07s of remaining time.
    	Ran out of time, stopping training early. (Stopping on epoch 12)
    	0.75	 = Validation accuracy score
    	1.08s	 = Training runtime
    	0.02s	 = Validation runtime
    Fitting model: WeightedEnsemble_L2 ... Training model for up to 4.93s of the -0.81s of remaining time.
    	0.86	 = Validation accuracy score
    	0.3s	 = Training runtime
    	0.0s	 = Validation runtime
    AutoGluon training complete, total runtime = 6.12s ...
    TabularPredictor saved. To load, use: predictor = TabularPredictor.load("agModels-predictEducationClass/Predictor_class/")


.. parsed-literal::
    :class: output

    MultilabelPredictor saved to disk. Load with: MultilabelPredictor.load('agModels-predictEducationClass/')


Inference and Evaluation
~~~~~~~~~~~~~~~~~~~~~~~~

After training, you can easily use the ``MultilabelPredictor`` to
predict all labels in new data:

.. code:: python

    test_data = TabularDataset('https://autogluon.s3.amazonaws.com/datasets/Inc/test.csv')
    test_data = test_data.sample(n=subsample_size, random_state=0)
    test_data_nolab = test_data.drop(columns=labels)  # unnecessary, just to demonstrate we're not cheating here
    test_data_nolab.head()


.. parsed-literal::
    :class: output

    Loaded data from: https://autogluon.s3.amazonaws.com/datasets/Inc/test.csv | Columns = 15 / 15 | Rows = 9769 -> 9769


.. raw:: html

    <div>
    <style scoped>
        .dataframe tbody tr th:only-of-type {
            vertical-align: middle;
        }
    
        .dataframe tbody tr th {
            vertical-align: top;
        }
    
        .dataframe thead th {
            text-align: right;
        }
    </style>
    <table border="1" class="dataframe">
      <thead>
        <tr style="text-align: right;">
          <th></th>
          <th>age</th>
          <th>workclass</th>
          <th>fnlwgt</th>
          <th>marital-status</th>
          <th>occupation</th>
          <th>relationship</th>
          <th>race</th>
          <th>sex</th>
          <th>capital-gain</th>
          <th>capital-loss</th>
          <th>hours-per-week</th>
          <th>native-country</th>
        </tr>
      </thead>
      <tbody>
        <tr>
          <th>5454</th>
          <td>41</td>
          <td>Self-emp-not-inc</td>
          <td>408498</td>
          <td>Married-civ-spouse</td>
          <td>Exec-managerial</td>
          <td>Husband</td>
          <td>White</td>
          <td>Male</td>
          <td>0</td>
          <td>0</td>
          <td>50</td>
          <td>United-States</td>
        </tr>
        <tr>
          <th>6111</th>
          <td>39</td>
          <td>Private</td>
          <td>746786</td>
          <td>Married-civ-spouse</td>
          <td>Prof-specialty</td>
          <td>Husband</td>
          <td>White</td>
          <td>Male</td>
          <td>0</td>
          <td>0</td>
          <td>55</td>
          <td>United-States</td>
        </tr>
        <tr>
          <th>5282</th>
          <td>50</td>
          <td>Private</td>
          <td>62593</td>
          <td>Married-civ-spouse</td>
          <td>Farming-fishing</td>
          <td>Husband</td>
          <td>Asian-Pac-Islander</td>
          <td>Male</td>
          <td>0</td>
          <td>0</td>
          <td>40</td>
          <td>United-States</td>
        </tr>
        <tr>
          <th>3046</th>
          <td>31</td>
          <td>Private</td>
          <td>248178</td>
          <td>Married-civ-spouse</td>
          <td>Other-service</td>
          <td>Husband</td>
          <td>Black</td>
          <td>Male</td>
          <td>0</td>
          <td>0</td>
          <td>35</td>
          <td>United-States</td>
        </tr>
        <tr>
          <th>2162</th>
          <td>43</td>
          <td>State-gov</td>
          <td>52849</td>
          <td>Married-civ-spouse</td>
          <td>Prof-specialty</td>
          <td>Husband</td>
          <td>White</td>
          <td>Male</td>
          <td>0</td>
          <td>0</td>
          <td>40</td>
          <td>United-States</td>
        </tr>
      </tbody>
    </table>
    </div>


.. code:: python

    multi_predictor = MultilabelPredictor.load(save_path)  # unnecessary, just demonstrates how to load previously-trained multilabel predictor from file
    
    predictions = multi_predictor.predict(test_data_nolab)
    print("Predictions:  \n", predictions)


.. parsed-literal::
    :class: output

    Predicting with TabularPredictor for label: education-num ...
    Predicting with TabularPredictor for label: education ...
    Predicting with TabularPredictor for label: class ...
    Predictions:  
           education-num      education   class
    5454      10.634719   Some-college    >50K
    6111      12.948993   Some-college    >50K
    5282       9.537755        HS-grad    >50K
    3046       9.539511        HS-grad   <=50K
    2162      12.567348      Bachelors    >50K
    ...             ...            ...     ...
    6965       9.681354        HS-grad   <=50K
    4762       9.063696        HS-grad   <=50K
    234       10.432962   Some-college   <=50K
    6291      10.400469   Some-college   <=50K
    9575      10.106630   Some-college   <=50K
    
    [500 rows x 3 columns]


We can also easily evaluate the performance of our predictions if our
new data contain the ground truth labels:

.. code:: python

    evaluations = multi_predictor.evaluate(test_data)
    print(evaluations)
    print("Evaluated using metrics:", multi_predictor.eval_metrics)


.. parsed-literal::
    :class: output

    Evaluating TabularPredictor for label: education-num ...
    Predictive performance on given data: root_mean_squared_error = 2.179581924768954
    Evaluating TabularPredictor for label: education ...
    Predictive performance on given data: accuracy = 0.33
    Evaluating TabularPredictor for label: class ...
    Predictive performance on given data: accuracy = 0.816
    {'education-num': 2.179581924768954, 'education': 0.33, 'class': 0.816}
    Evaluated using metrics: {'education-num': root_mean_squared_error, 'education': accuracy, 'class': accuracy}


Accessing the TabularPredictor for One Label
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

We can also directly work with the ``TabularPredictor`` for any one of
the labels as follows. However we recommend you set
``consider_labels_correlation=False`` before training if you later plan
to use an individual ``TabularPredictor`` to predict just one label
rather than all of the labels predicted by the ``MultilabelPredictor``.

.. code:: python

    predictor_class = multi_predictor.get_predictor('class')
    predictor_class.leaderboard(silent=True)


.. raw:: html

    <div>
    <style scoped>
        .dataframe tbody tr th:only-of-type {
            vertical-align: middle;
        }
    
        .dataframe tbody tr th {
            vertical-align: top;
        }
    
        .dataframe thead th {
            text-align: right;
        }
    </style>
    <table border="1" class="dataframe">
      <thead>
        <tr style="text-align: right;">
          <th></th>
          <th>model</th>
          <th>score_val</th>
          <th>pred_time_val</th>
          <th>fit_time</th>
          <th>pred_time_val_marginal</th>
          <th>fit_time_marginal</th>
          <th>stack_level</th>
          <th>can_infer</th>
          <th>fit_order</th>
        </tr>
      </thead>
      <tbody>
        <tr>
          <th>0</th>
          <td>CatBoost</td>
          <td>0.86</td>
          <td>0.009471</td>
          <td>0.850355</td>
          <td>0.009471</td>
          <td>0.850355</td>
          <td>1</td>
          <td>True</td>
          <td>9</td>
        </tr>
        <tr>
          <th>1</th>
          <td>WeightedEnsemble_L2</td>
          <td>0.86</td>
          <td>0.010045</td>
          <td>1.150084</td>
          <td>0.000574</td>
          <td>0.299728</td>
          <td>2</td>
          <td>True</td>
          <td>12</td>
        </tr>
        <tr>
          <th>2</th>
          <td>XGBoost</td>
          <td>0.85</td>
          <td>0.005486</td>
          <td>0.136692</td>
          <td>0.005486</td>
          <td>0.136692</td>
          <td>1</td>
          <td>True</td>
          <td>10</td>
        </tr>
        <tr>
          <th>3</th>
          <td>LightGBM</td>
          <td>0.85</td>
          <td>0.009789</td>
          <td>0.164842</td>
          <td>0.009789</td>
          <td>0.164842</td>
          <td>1</td>
          <td>True</td>
          <td>7</td>
        </tr>
        <tr>
          <th>4</th>
          <td>ExtraTreesEntr</td>
          <td>0.84</td>
          <td>0.107529</td>
          <td>0.407588</td>
          <td>0.107529</td>
          <td>0.407588</td>
          <td>1</td>
          <td>True</td>
          <td>4</td>
        </tr>
        <tr>
          <th>5</th>
          <td>RandomForestGini</td>
          <td>0.84</td>
          <td>0.107544</td>
          <td>0.506645</td>
          <td>0.107544</td>
          <td>0.506645</td>
          <td>1</td>
          <td>True</td>
          <td>1</td>
        </tr>
        <tr>
          <th>6</th>
          <td>LightGBMXT</td>
          <td>0.83</td>
          <td>0.009568</td>
          <td>0.118615</td>
          <td>0.009568</td>
          <td>0.118615</td>
          <td>1</td>
          <td>True</td>
          <td>8</td>
        </tr>
        <tr>
          <th>7</th>
          <td>RandomForestEntr</td>
          <td>0.83</td>
          <td>0.107483</td>
          <td>0.507051</td>
          <td>0.107483</td>
          <td>0.507051</td>
          <td>1</td>
          <td>True</td>
          <td>2</td>
        </tr>
        <tr>
          <th>8</th>
          <td>ExtraTreesGini</td>
          <td>0.83</td>
          <td>0.107528</td>
          <td>0.405294</td>
          <td>0.107528</td>
          <td>0.405294</td>
          <td>1</td>
          <td>True</td>
          <td>3</td>
        </tr>
        <tr>
          <th>9</th>
          <td>NeuralNetMXNet</td>
          <td>0.75</td>
          <td>0.023019</td>
          <td>1.076305</td>
          <td>0.023019</td>
          <td>1.076305</td>
          <td>1</td>
          <td>True</td>
          <td>11</td>
        </tr>
        <tr>
          <th>10</th>
          <td>KNeighborsUnif</td>
          <td>0.73</td>
          <td>0.102981</td>
          <td>0.002154</td>
          <td>0.102981</td>
          <td>0.002154</td>
          <td>1</td>
          <td>True</td>
          <td>5</td>
        </tr>
        <tr>
          <th>11</th>
          <td>KNeighborsDist</td>
          <td>0.65</td>
          <td>0.102837</td>
          <td>0.001876</td>
          <td>0.102837</td>
          <td>0.001876</td>
          <td>1</td>
          <td>True</td>
          <td>6</td>
        </tr>
      </tbody>
    </table>
    </div>


Tips
~~~~

In order to obtain the best predictions, you should generally add the
following arguments to ``MultilabelPredictor.fit()``:

1) Specify ``eval_metrics`` to the metrics you will use to evaluate
   predictions for each label

2) Specify ``presets='best_quality'`` to tell AutoGluon you care about
   predictive performance more than latency/memory usage, which will
   utilize stack ensembling when predicting each label.

If you find that too much memory/disk is being used, try calling
``MultilabelPredictor.fit()`` with additional arguments discussed under
`"If you encounter memory issues" in the In Depth
Tutorial <tabular-indepth.html#if-you-encounter-memory-issues>`__ or
`"If you encounter disk space
issues" <tabular-indepth.html#if-you-encounter-disk-space-issues>`__.

If you find inference too slow, you can try the strategies discussed
under `"Accelerating Inference" in the In Depth
Tutorial <tabular-indepth.html#accelerating-inference>`__. In
particular, simply try specifying the following preset in
``MultilabelPredictor.fit()``:
``presets = ['good_quality_faster_inference_only_refit', 'optimize_for_deployment']``