.. _sec_tabularprediction_text_multimodal:

Multimodal Data Tables: Combining BERT/Transformers and Classical Tabular Models
================================================================================


**Tip**: If your data contains images, consider also checking out
:ref:`sec_tabularprediction_multimodal` which handles images in
addition to text and tabular features.

Here we introduce how to use AutoGluon Tabular to deal with multimodal
tabular data that contains text, numeric, and categorical columns. In
AutoGluon, **raw text data** is considered as a first-class citizen of
data tables. AutoGluon Tabular can help you train and combine a diverse
set of models including classical tabular models like
LightGBM/RF/CatBoost as well as our pretrained NLP model based
multimodal network that is introduced in Section
":ref:`sec_textprediction_architecture`" of
:ref:`sec_textprediction_multimodal` (used by AutoGluon's
``TextPredictor``).

.. code:: python

    %matplotlib inline
    import matplotlib.pyplot as plt
    import numpy as np
    import pandas as pd
    import pprint
    import random
    from autogluon.tabular import TabularPredictor
    import mxnet as mx
    
    np.random.seed(123)
    random.seed(123)
    mx.random.seed(123)

Product Sentiment Analysis Dataset
----------------------------------

We consider the product sentiment analysis dataset from a `MachineHack
hackathon <https://www.machinehack.com/hackathons/product_sentiment_classification_weekend_hackathon_19/leaderboard>`__.
The goal is to predict a user's sentiment towards a product given their
review (raw text) and a categorical feature indicating the product's
type (e.g., Tablet, Mobile, etc.). We have already split the original
dataset to be 90% for training and 10% for development/testing (if
submitting your models to the hackathon, we recommend training them on
100% of the dataset).

.. code:: python

    !mkdir -p product_sentiment_machine_hack
    !wget https://autogluon-text-data.s3.amazonaws.com/multimodal_text/machine_hack_product_sentiment/train.csv -O product_sentiment_machine_hack/train.csv
    !wget https://autogluon-text-data.s3.amazonaws.com/multimodal_text/machine_hack_product_sentiment/dev.csv -O product_sentiment_machine_hack/dev.csv
    !wget https://autogluon-text-data.s3.amazonaws.com/multimodal_text/machine_hack_product_sentiment/test.csv -O product_sentiment_machine_hack/test.csv


.. parsed-literal::
    :class: output

    --2021-08-27 22:20:43--  https://autogluon-text-data.s3.amazonaws.com/multimodal_text/machine_hack_product_sentiment/train.csv
    Resolving autogluon-text-data.s3.amazonaws.com (autogluon-text-data.s3.amazonaws.com)... 52.217.138.121
    Connecting to autogluon-text-data.s3.amazonaws.com (autogluon-text-data.s3.amazonaws.com)|52.217.138.121|:443... connected.
    HTTP request sent, awaiting response... 200 OK
    Length: 689486 (673K) [text/csv]
    Saving to: ‘product_sentiment_machine_hack/train.csv’
    
    product_sentiment_m 100%[===================>] 673.33K  1.82MB/s    in 0.4s    
    
    2021-08-27 22:20:44 (1.82 MB/s) - ‘product_sentiment_machine_hack/train.csv’ saved [689486/689486]
    
    --2021-08-27 22:20:44--  https://autogluon-text-data.s3.amazonaws.com/multimodal_text/machine_hack_product_sentiment/dev.csv
    Resolving autogluon-text-data.s3.amazonaws.com (autogluon-text-data.s3.amazonaws.com)... 52.217.138.121
    Connecting to autogluon-text-data.s3.amazonaws.com (autogluon-text-data.s3.amazonaws.com)|52.217.138.121|:443... connected.
    HTTP request sent, awaiting response... 200 OK
    Length: 75517 (74K) [text/csv]
    Saving to: ‘product_sentiment_machine_hack/dev.csv’
    
    product_sentiment_m 100%[===================>]  73.75K  --.-KB/s    in 0.1s    
    
    2021-08-27 22:20:45 (568 KB/s) - ‘product_sentiment_machine_hack/dev.csv’ saved [75517/75517]
    
    --2021-08-27 22:20:45--  https://autogluon-text-data.s3.amazonaws.com/multimodal_text/machine_hack_product_sentiment/test.csv
    Resolving autogluon-text-data.s3.amazonaws.com (autogluon-text-data.s3.amazonaws.com)... 52.217.138.121
    Connecting to autogluon-text-data.s3.amazonaws.com (autogluon-text-data.s3.amazonaws.com)|52.217.138.121|:443... connected.
    HTTP request sent, awaiting response... 200 OK
    Length: 312194 (305K) [text/csv]
    Saving to: ‘product_sentiment_machine_hack/test.csv’
    
    product_sentiment_m 100%[===================>] 304.88K  1.06MB/s    in 0.3s    
    
    2021-08-27 22:20:46 (1.06 MB/s) - ‘product_sentiment_machine_hack/test.csv’ saved [312194/312194]
    

.. code:: python

    subsample_size = 2000  # for quick demo, try setting to larger values
    feature_columns = ['Product_Description', 'Product_Type']
    label = 'Sentiment'
    
    train_df = pd.read_csv('product_sentiment_machine_hack/train.csv', index_col=0).sample(2000, random_state=123)
    dev_df = pd.read_csv('product_sentiment_machine_hack/dev.csv', index_col=0)
    test_df = pd.read_csv('product_sentiment_machine_hack/test.csv', index_col=0)
    
    train_df = train_df[feature_columns + [label]]
    dev_df = dev_df[feature_columns + [label]]
    test_df = test_df[feature_columns]
    print('Number of training samples:', len(train_df))
    print('Number of dev samples:', len(dev_df))
    print('Number of test samples:', len(test_df))


.. parsed-literal::
    :class: output

    Number of training samples: 2000
    Number of dev samples: 637
    Number of test samples: 2728


There are two features in the dataset: the users' review of the product
and the product's type, and four possible classes to predict.

.. code:: python

    train_df.head()


.. raw:: html

    <div>
    <style scoped>
        .dataframe tbody tr th:only-of-type {
            vertical-align: middle;
        }
    
        .dataframe tbody tr th {
            vertical-align: top;
        }
    
        .dataframe thead th {
            text-align: right;
        }
    </style>
    <table border="1" class="dataframe">
      <thead>
        <tr style="text-align: right;">
          <th></th>
          <th>Product_Description</th>
          <th>Product_Type</th>
          <th>Sentiment</th>
        </tr>
      </thead>
      <tbody>
        <tr>
          <th>4532</th>
          <td>they took away the lego pit but replaced it wi...</td>
          <td>0</td>
          <td>1</td>
        </tr>
        <tr>
          <th>1831</th>
          <td>#Apple to Open Pop-Up Shop at #SXSW [REPORT]: ...</td>
          <td>9</td>
          <td>2</td>
        </tr>
        <tr>
          <th>3536</th>
          <td>RT @mention False Alarm: Google Circles Not Co...</td>
          <td>5</td>
          <td>1</td>
        </tr>
        <tr>
          <th>5157</th>
          <td>Will Google reveal a new social network called...</td>
          <td>9</td>
          <td>2</td>
        </tr>
        <tr>
          <th>4643</th>
          <td>Niceness RT @mention Less than 2 hours until w...</td>
          <td>6</td>
          <td>3</td>
        </tr>
      </tbody>
    </table>
    </div>


.. code:: python

    dev_df.head()


.. raw:: html

    <div>
    <style scoped>
        .dataframe tbody tr th:only-of-type {
            vertical-align: middle;
        }
    
        .dataframe tbody tr th {
            vertical-align: top;
        }
    
        .dataframe thead th {
            text-align: right;
        }
    </style>
    <table border="1" class="dataframe">
      <thead>
        <tr style="text-align: right;">
          <th></th>
          <th>Product_Description</th>
          <th>Product_Type</th>
          <th>Sentiment</th>
        </tr>
      </thead>
      <tbody>
        <tr>
          <th>3170</th>
          <td>Do it. RT @mention Come party w/ Google tonigh...</td>
          <td>3</td>
          <td>3</td>
        </tr>
        <tr>
          <th>6301</th>
          <td>Line for iPads at #SXSW. Doesn't look too bad!...</td>
          <td>6</td>
          <td>3</td>
        </tr>
        <tr>
          <th>5643</th>
          <td>First up: iPad Design Headaches (2 Tablets, Ca...</td>
          <td>6</td>
          <td>2</td>
        </tr>
        <tr>
          <th>1953</th>
          <td>#SXSW: Mint Talks Mobile App Development Chall...</td>
          <td>9</td>
          <td>2</td>
        </tr>
        <tr>
          <th>2658</th>
          <td>ÛÏ@mention Apple store downtown Austin open t...</td>
          <td>9</td>
          <td>2</td>
        </tr>
      </tbody>
    </table>
    </div>


.. code:: python

    test_df.head()


.. raw:: html

    <div>
    <style scoped>
        .dataframe tbody tr th:only-of-type {
            vertical-align: middle;
        }
    
        .dataframe tbody tr th {
            vertical-align: top;
        }
    
        .dataframe thead th {
            text-align: right;
        }
    </style>
    <table border="1" class="dataframe">
      <thead>
        <tr style="text-align: right;">
          <th></th>
          <th>Product_Description</th>
          <th>Product_Type</th>
        </tr>
        <tr>
          <th>Text_ID</th>
          <th></th>
          <th></th>
        </tr>
      </thead>
      <tbody>
        <tr>
          <th>5786</th>
          <td>RT @mention Going to #SXSW? The new iPhone gui...</td>
          <td>7</td>
        </tr>
        <tr>
          <th>5363</th>
          <td>RT @mention 95% of iPhone and Droid apps have ...</td>
          <td>9</td>
        </tr>
        <tr>
          <th>6716</th>
          <td>RT @mention Thank you to @mention for letting ...</td>
          <td>9</td>
        </tr>
        <tr>
          <th>4339</th>
          <td>#Thanks @mention we're lovin' the @mention app...</td>
          <td>7</td>
        </tr>
        <tr>
          <th>66</th>
          <td>At #sxsw? @mention / @mention wanna buy you a ...</td>
          <td>9</td>
        </tr>
      </tbody>
    </table>
    </div>


AutoGluon Tabular with Multimodal Support
-----------------------------------------

To utilize the ``TextPredictor`` model inside of ``TabularPredictor``,
we must specify the ``hyperparameters = 'multimodal'`` in AutoGluon
Tabular. Internally, this will train multiple tabular models as well as
the TextPredictor model, and then combine them via either a weighted
ensemble or stack ensemble, as explained in `AutoGluon Tabular
Paper <https://arxiv.org/pdf/2003.06505.pdf>`__. If you do not specify
``hyperparameters = 'multimodal'``, then AutoGluon Tabular will simply
featurize text fields using N-grams and train only tabular models (which
may work better if your text is mostly uncommon strings/vocabulary).

.. code:: python

    from autogluon.tabular import TabularPredictor
    predictor = TabularPredictor(label='Sentiment', path='ag_tabular_product_sentiment_multimodal')
    predictor.fit(train_df, hyperparameters='multimodal')


.. parsed-literal::
    :class: output

    Beginning AutoGluon training ...
    AutoGluon will save models to "ag_tabular_product_sentiment_multimodal/"
    AutoGluon Version:  0.3.0b20210827
    Train Data Rows:    2000
    Train Data Columns: 2
    Preprocessing data ...
    AutoGluon infers your prediction problem is: 'multiclass' (because dtype of label-column == int, but few unique label-values observed).
    	4 unique label values:  [1, 2, 3, 0]
    	If 'multiclass' is not the correct problem_type, please manually specify the problem_type argument in fit() (You may specify problem_type as one of: ['binary', 'multiclass', 'regression'])
    Train Data Class Count: 4
    Using Feature Generators to preprocess the data ...
    Fitting AutoMLPipelineFeatureGenerator...
    	Available Memory:                    22099.61 MB
    	Train Data (Original)  Memory Usage: 0.34 MB (0.0% of available memory)
    	Inferring data type of each feature based on column values. Set feature_metadata_in to manually specify special dtypes of the features.
    	Stage 1 Generators:
    		Fitting AsTypeFeatureGenerator...
    	Stage 2 Generators:
    		Fitting FillNaFeatureGenerator...
    	Stage 3 Generators:
    		Fitting IdentityFeatureGenerator...
    		Fitting IdentityFeatureGenerator...
    			Fitting RenameFeatureGenerator...
    		Fitting CategoryFeatureGenerator...
    			Fitting CategoryMemoryMinimizeFeatureGenerator...
    		Fitting TextSpecialFeatureGenerator...
    			Fitting BinnedFeatureGenerator...
    			Fitting DropDuplicatesFeatureGenerator...
    		Fitting TextNgramFeatureGenerator...
    			Fitting CountVectorizer for text features: ['Product_Description']
    			CountVectorizer fit with vocabulary size = 230
    	Stage 4 Generators:
    		Fitting DropUniqueFeatureGenerator...
    	Types of features in original data (raw dtype, special dtypes):
    		('int', [])          : 1 | ['Product_Type']
    		('object', ['text']) : 1 | ['Product_Description']
    	Types of features in processed data (raw dtype, special dtypes):
    		('category', ['text_as_category'])  :   1 | ['Product_Description']
    		('int', [])                         :   1 | ['Product_Type']
    		('int', ['binned', 'text_special']) :  30 | ['Product_Description.char_count', 'Product_Description.word_count', 'Product_Description.capital_ratio', 'Product_Description.lower_ratio', 'Product_Description.digit_ratio', ...]
    		('int', ['text_ngram'])             : 231 | ['__nlp__.about', '__nlp__.all', '__nlp__.amp', '__nlp__.an', '__nlp__.an ipad', ...]
    		('object', ['text'])                :   1 | ['Product_Description_raw_text']
    	0.5s = Fit runtime
    	2 features in original data used to generate 264 features in processed data.
    	Train Data (Processed) Memory Usage: 1.34 MB (0.0% of available memory)
    Data preprocessing and feature engineering runtime = 0.54s ...
    AutoGluon will gauge predictive performance using evaluation metric: 'accuracy'
    	To change this, specify the eval_metric argument of fit()
    Automatically generating train/validation split with holdout_frac=0.2, Train Rows: 1600, Val Rows: 400
    Fitting 8 L1 models ...
    Fitting model: LightGBM ...
    	0.8875	 = Validation score   (accuracy)
    	0.83s	 = Training   runtime
    	0.01s	 = Validation runtime
    Fitting model: LightGBMXT ...
    	0.8625	 = Validation score   (accuracy)
    	0.65s	 = Training   runtime
    	0.01s	 = Validation runtime
    Fitting model: CatBoost ...
    	0.8825	 = Validation score   (accuracy)
    	0.66s	 = Training   runtime
    	0.02s	 = Validation runtime
    Fitting model: XGBoost ...
    	0.8925	 = Validation score   (accuracy)
    	0.84s	 = Training   runtime
    	0.01s	 = Validation runtime
    Fitting model: NeuralNetMXNet ...
    	0.88	 = Validation score   (accuracy)
    	2.15s	 = Training   runtime
    	0.02s	 = Validation runtime
    Fitting model: LightGBMLarge ...
    	0.8775	 = Validation score   (accuracy)
    	2.02s	 = Training   runtime
    	0.01s	 = Validation runtime
    Fitting model: TextPredictor ...


.. parsed-literal::
    :class: output

    All Logs will be saved to /var/lib/jenkins/workspace/workspace/autogluon-tutorial-tabular-v3/docs/_build/eval/tutorials/tabular_prediction/ag_tabular_product_sentiment_multimodal/models/TextPredictor/task0/training.log


.. parsed-literal::
    :class: output

    [Iter 3/130, Epoch 0] valid accuracy=7.0500e-01, log_loss=8.0036e-01, accuracy=7.0500e-01, time spent=1.184s, total time spent=0.11min. Find new best=True, Find new top-3=True
    [Iter 6/130, Epoch 0] valid accuracy=8.8000e-01, log_loss=5.6644e-01, accuracy=8.8000e-01, time spent=1.179s, total time spent=0.21min. Find new best=True, Find new top-3=True
    [Iter 9/130, Epoch 0] valid accuracy=8.8250e-01, log_loss=4.5013e-01, accuracy=8.8250e-01, time spent=1.182s, total time spent=0.31min. Find new best=True, Find new top-3=True
    [Iter 12/130, Epoch 0] valid accuracy=8.8750e-01, log_loss=4.5823e-01, accuracy=8.8750e-01, time spent=1.178s, total time spent=0.44min. Find new best=True, Find new top-3=True
    [Iter 15/130, Epoch 1] valid accuracy=8.8750e-01, log_loss=4.3925e-01, accuracy=8.8750e-01, time spent=1.191s, total time spent=0.55min. Find new best=True, Find new top-3=True
    [Iter 18/130, Epoch 1] valid accuracy=8.8250e-01, log_loss=4.2215e-01, accuracy=8.8250e-01, time spent=1.186s, total time spent=0.64min. Find new best=False, Find new top-3=True
    [Iter 21/130, Epoch 1] valid accuracy=8.1750e-01, log_loss=4.9420e-01, accuracy=8.1750e-01, time spent=1.195s, total time spent=0.72min. Find new best=False, Find new top-3=False
    [Iter 24/130, Epoch 1] valid accuracy=8.8250e-01, log_loss=5.0839e-01, accuracy=8.8250e-01, time spent=1.197s, total time spent=0.81min. Find new best=False, Find new top-3=True
    [Iter 27/130, Epoch 2] valid accuracy=8.8250e-01, log_loss=4.3373e-01, accuracy=8.8250e-01, time spent=1.204s, total time spent=0.89min. Find new best=False, Find new top-3=True
    [Iter 30/130, Epoch 2] valid accuracy=8.8750e-01, log_loss=4.5229e-01, accuracy=8.8750e-01, time spent=1.203s, total time spent=1.00min. Find new best=True, Find new top-3=True
    [Iter 33/130, Epoch 2] valid accuracy=8.8500e-01, log_loss=4.1333e-01, accuracy=8.8500e-01, time spent=1.203s, total time spent=1.08min. Find new best=False, Find new top-3=False
    [Iter 36/130, Epoch 2] valid accuracy=8.9250e-01, log_loss=3.9142e-01, accuracy=8.9250e-01, time spent=1.207s, total time spent=1.18min. Find new best=True, Find new top-3=True
    [Iter 39/130, Epoch 2] valid accuracy=8.9750e-01, log_loss=3.8456e-01, accuracy=8.9750e-01, time spent=1.204s, total time spent=1.29min. Find new best=True, Find new top-3=True
    [Iter 42/130, Epoch 3] valid accuracy=8.9750e-01, log_loss=3.7758e-01, accuracy=8.9750e-01, time spent=1.212s, total time spent=1.39min. Find new best=True, Find new top-3=True
    [Iter 45/130, Epoch 3] valid accuracy=9.0500e-01, log_loss=3.8503e-01, accuracy=9.0500e-01, time spent=1.213s, total time spent=1.50min. Find new best=True, Find new top-3=True
    [Iter 48/130, Epoch 3] valid accuracy=9.0000e-01, log_loss=3.8804e-01, accuracy=9.0000e-01, time spent=1.211s, total time spent=1.59min. Find new best=False, Find new top-3=True
    [Iter 51/130, Epoch 3] valid accuracy=9.0000e-01, log_loss=3.7967e-01, accuracy=9.0000e-01, time spent=1.212s, total time spent=1.68min. Find new best=False, Find new top-3=True
    [Iter 54/130, Epoch 4] valid accuracy=9.0250e-01, log_loss=3.8610e-01, accuracy=9.0250e-01, time spent=1.211s, total time spent=1.77min. Find new best=False, Find new top-3=True
    [Iter 57/130, Epoch 4] valid accuracy=9.0250e-01, log_loss=3.7233e-01, accuracy=9.0250e-01, time spent=1.219s, total time spent=1.86min. Find new best=False, Find new top-3=True
    [Iter 60/130, Epoch 4] valid accuracy=9.0250e-01, log_loss=3.9014e-01, accuracy=9.0250e-01, time spent=1.215s, total time spent=1.95min. Find new best=False, Find new top-3=True
    [Iter 63/130, Epoch 4] valid accuracy=9.0000e-01, log_loss=3.6710e-01, accuracy=9.0000e-01, time spent=1.215s, total time spent=2.03min. Find new best=False, Find new top-3=False
    [Iter 66/130, Epoch 5] valid accuracy=9.0250e-01, log_loss=3.8815e-01, accuracy=9.0250e-01, time spent=1.221s, total time spent=2.12min. Find new best=False, Find new top-3=True
    [Iter 69/130, Epoch 5] valid accuracy=9.0500e-01, log_loss=4.2492e-01, accuracy=9.0500e-01, time spent=1.215s, total time spent=2.22min. Find new best=True, Find new top-3=True
    [Iter 72/130, Epoch 5] valid accuracy=8.9500e-01, log_loss=3.7422e-01, accuracy=8.9500e-01, time spent=1.232s, total time spent=2.30min. Find new best=False, Find new top-3=False
    [Iter 75/130, Epoch 5] valid accuracy=9.0250e-01, log_loss=3.8409e-01, accuracy=9.0250e-01, time spent=1.223s, total time spent=2.39min. Find new best=False, Find new top-3=True
    [Iter 78/130, Epoch 5] valid accuracy=9.0000e-01, log_loss=4.0509e-01, accuracy=9.0000e-01, time spent=1.234s, total time spent=2.47min. Find new best=False, Find new top-3=False
    [Iter 81/130, Epoch 6] valid accuracy=8.9750e-01, log_loss=3.8553e-01, accuracy=8.9750e-01, time spent=1.236s, total time spent=2.55min. Find new best=False, Find new top-3=False
    [Iter 84/130, Epoch 6] valid accuracy=8.9750e-01, log_loss=4.1444e-01, accuracy=8.9750e-01, time spent=1.236s, total time spent=2.63min. Find new best=False, Find new top-3=False
    [Iter 87/130, Epoch 6] valid accuracy=8.9250e-01, log_loss=4.0910e-01, accuracy=8.9250e-01, time spent=1.244s, total time spent=2.71min. Find new best=False, Find new top-3=False
    [Iter 90/130, Epoch 6] valid accuracy=9.0250e-01, log_loss=4.2499e-01, accuracy=9.0250e-01, time spent=1.258s, total time spent=2.80min. Find new best=False, Find new top-3=True
    [Iter 93/130, Epoch 7] valid accuracy=9.0500e-01, log_loss=4.3600e-01, accuracy=9.0500e-01, time spent=1.254s, total time spent=2.91min. Find new best=True, Find new top-3=True
    [Iter 96/130, Epoch 7] valid accuracy=8.9500e-01, log_loss=4.2524e-01, accuracy=8.9500e-01, time spent=1.250s, total time spent=2.99min. Find new best=False, Find new top-3=False
    [Iter 99/130, Epoch 7] valid accuracy=8.9750e-01, log_loss=4.3084e-01, accuracy=8.9750e-01, time spent=1.257s, total time spent=3.07min. Find new best=False, Find new top-3=False
    [Iter 102/130, Epoch 7] valid accuracy=9.0500e-01, log_loss=4.4503e-01, accuracy=9.0500e-01, time spent=1.255s, total time spent=3.18min. Find new best=True, Find new top-3=True
    [Iter 105/130, Epoch 8] valid accuracy=9.0000e-01, log_loss=4.3139e-01, accuracy=9.0000e-01, time spent=1.268s, total time spent=3.26min. Find new best=False, Find new top-3=False
    [Iter 108/130, Epoch 8] valid accuracy=8.9500e-01, log_loss=4.3005e-01, accuracy=8.9500e-01, time spent=1.272s, total time spent=3.34min. Find new best=False, Find new top-3=False
    [Iter 111/130, Epoch 8] valid accuracy=9.0000e-01, log_loss=4.4181e-01, accuracy=9.0000e-01, time spent=1.262s, total time spent=3.42min. Find new best=False, Find new top-3=False
    [Iter 114/130, Epoch 8] valid accuracy=9.0500e-01, log_loss=4.4967e-01, accuracy=9.0500e-01, time spent=1.251s, total time spent=3.53min. Find new best=True, Find new top-3=True
    [Iter 117/130, Epoch 8] valid accuracy=9.0250e-01, log_loss=4.5091e-01, accuracy=9.0250e-01, time spent=1.242s, total time spent=3.61min. Find new best=False, Find new top-3=False
    [Iter 120/130, Epoch 9] valid accuracy=9.0000e-01, log_loss=4.4570e-01, accuracy=9.0000e-01, time spent=1.238s, total time spent=3.69min. Find new best=False, Find new top-3=False
    [Iter 123/130, Epoch 9] valid accuracy=9.0000e-01, log_loss=4.4126e-01, accuracy=9.0000e-01, time spent=1.230s, total time spent=3.77min. Find new best=False, Find new top-3=False
    [Iter 126/130, Epoch 9] valid accuracy=8.9500e-01, log_loss=4.3735e-01, accuracy=8.9500e-01, time spent=1.220s, total time spent=3.85min. Find new best=False, Find new top-3=False
    [Iter 129/130, Epoch 9] valid accuracy=8.9500e-01, log_loss=4.3720e-01, accuracy=8.9500e-01, time spent=1.219s, total time spent=3.93min. Find new best=False, Find new top-3=False
    [Iter 130/130, Epoch 9] valid accuracy=8.9500e-01, log_loss=4.3720e-01, accuracy=8.9500e-01, time spent=1.212s, total time spent=3.97min. Find new best=False, Find new top-3=False
    	0.9025	 = Validation score   (accuracy)
    	248.36s	 = Training   runtime
    	1.66s	 = Validation runtime
    Fitting model: ImagePredictor ...
    	No valid features to train ImagePredictor... Skipping this model.
    Fitting model: WeightedEnsemble_L2 ...
    	0.9075	 = Validation score   (accuracy)
    	0.18s	 = Training   runtime
    	0.0s	 = Validation runtime
    AutoGluon training complete, total runtime = 261.73s ...
    TabularPredictor saved. To load, use: predictor = TabularPredictor.load("ag_tabular_product_sentiment_multimodal/")


.. parsed-literal::
    :class: output

    <autogluon.tabular.predictor.predictor.TabularPredictor at 0x7f24bc4c1650>


.. code:: python

    predictor.leaderboard(dev_df)


.. parsed-literal::
    :class: output

                     model  score_test  score_val  pred_time_test  pred_time_val    fit_time  pred_time_test_marginal  pred_time_val_marginal  fit_time_marginal  stack_level  can_infer  fit_order
    0        TextPredictor    0.901099     0.9025        3.280783       1.664501  248.355721                 3.280783                1.664501         248.355721            1       True          7
    1  WeightedEnsemble_L2    0.897959     0.9075        3.346303       1.700953  252.700886                 0.011055                0.000417           0.178008            2       True          8
    2       NeuralNetMXNet    0.888540     0.8800        0.033470       0.023258    2.148541                 0.033470                0.023258           2.148541            1       True          5
    3        LightGBMLarge    0.886970     0.8775        0.016229       0.007511    2.021697                 0.016229                0.007511           2.021697            1       True          6
    4             LightGBM    0.885400     0.8875        0.010350       0.006860    0.830556                 0.010350                0.006860           0.830556            1       True          1
    5             CatBoost    0.885400     0.8825        0.017251       0.015054    0.661368                 0.017251                0.015054           0.661368            1       True          3
    6              XGBoost    0.882261     0.8925        0.109931       0.006701    0.838306                 0.109931                0.006701           0.838306            1       True          4
    7           LightGBMXT    0.869702     0.8625        0.010634       0.006610    0.653536                 0.010634                0.006610           0.653536            1       True          2


.. raw:: html

    <div>
    <style scoped>
        .dataframe tbody tr th:only-of-type {
            vertical-align: middle;
        }
    
        .dataframe tbody tr th {
            vertical-align: top;
        }
    
        .dataframe thead th {
            text-align: right;
        }
    </style>
    <table border="1" class="dataframe">
      <thead>
        <tr style="text-align: right;">
          <th></th>
          <th>model</th>
          <th>score_test</th>
          <th>score_val</th>
          <th>pred_time_test</th>
          <th>pred_time_val</th>
          <th>fit_time</th>
          <th>pred_time_test_marginal</th>
          <th>pred_time_val_marginal</th>
          <th>fit_time_marginal</th>
          <th>stack_level</th>
          <th>can_infer</th>
          <th>fit_order</th>
        </tr>
      </thead>
      <tbody>
        <tr>
          <th>0</th>
          <td>TextPredictor</td>
          <td>0.901099</td>
          <td>0.9025</td>
          <td>3.280783</td>
          <td>1.664501</td>
          <td>248.355721</td>
          <td>3.280783</td>
          <td>1.664501</td>
          <td>248.355721</td>
          <td>1</td>
          <td>True</td>
          <td>7</td>
        </tr>
        <tr>
          <th>1</th>
          <td>WeightedEnsemble_L2</td>
          <td>0.897959</td>
          <td>0.9075</td>
          <td>3.346303</td>
          <td>1.700953</td>
          <td>252.700886</td>
          <td>0.011055</td>
          <td>0.000417</td>
          <td>0.178008</td>
          <td>2</td>
          <td>True</td>
          <td>8</td>
        </tr>
        <tr>
          <th>2</th>
          <td>NeuralNetMXNet</td>
          <td>0.888540</td>
          <td>0.8800</td>
          <td>0.033470</td>
          <td>0.023258</td>
          <td>2.148541</td>
          <td>0.033470</td>
          <td>0.023258</td>
          <td>2.148541</td>
          <td>1</td>
          <td>True</td>
          <td>5</td>
        </tr>
        <tr>
          <th>3</th>
          <td>LightGBMLarge</td>
          <td>0.886970</td>
          <td>0.8775</td>
          <td>0.016229</td>
          <td>0.007511</td>
          <td>2.021697</td>
          <td>0.016229</td>
          <td>0.007511</td>
          <td>2.021697</td>
          <td>1</td>
          <td>True</td>
          <td>6</td>
        </tr>
        <tr>
          <th>4</th>
          <td>LightGBM</td>
          <td>0.885400</td>
          <td>0.8875</td>
          <td>0.010350</td>
          <td>0.006860</td>
          <td>0.830556</td>
          <td>0.010350</td>
          <td>0.006860</td>
          <td>0.830556</td>
          <td>1</td>
          <td>True</td>
          <td>1</td>
        </tr>
        <tr>
          <th>5</th>
          <td>CatBoost</td>
          <td>0.885400</td>
          <td>0.8825</td>
          <td>0.017251</td>
          <td>0.015054</td>
          <td>0.661368</td>
          <td>0.017251</td>
          <td>0.015054</td>
          <td>0.661368</td>
          <td>1</td>
          <td>True</td>
          <td>3</td>
        </tr>
        <tr>
          <th>6</th>
          <td>XGBoost</td>
          <td>0.882261</td>
          <td>0.8925</td>
          <td>0.109931</td>
          <td>0.006701</td>
          <td>0.838306</td>
          <td>0.109931</td>
          <td>0.006701</td>
          <td>0.838306</td>
          <td>1</td>
          <td>True</td>
          <td>4</td>
        </tr>
        <tr>
          <th>7</th>
          <td>LightGBMXT</td>
          <td>0.869702</td>
          <td>0.8625</td>
          <td>0.010634</td>
          <td>0.006610</td>
          <td>0.653536</td>
          <td>0.010634</td>
          <td>0.006610</td>
          <td>0.653536</td>
          <td>1</td>
          <td>True</td>
          <td>2</td>
        </tr>
      </tbody>
    </table>
    </div>


Improve the Performance with Stack Ensemble
-------------------------------------------

You can improve predictive performance by using stack ensembling. One
way to turn it on is as follows:

::

    predictor.fit(train_df, hyperparameters='multimodal', num_bag_folds=5, num_stack_levels=1)

or using:

::

    predictor.fit(train_df, hyperparameters='multimodal', presets='best_quality')

which will automatically select values for ``num_stack_levels`` (how
many stacking layers) and ``num_bag_folds`` (how many folds to split
data into during bagging). Stack ensembling can take much longer, so we
won't run with this configuration here. You may explore more examples in
https://github.com/awslabs/autogluon/tree/master/examples/text\_prediction,
which demonstrate how you can achieve top performance in competitions
with a stack ensembling based solution.