AutoGluon Predictors¶

Example (Predictor for tabular data):

Import TabularDataset and TabularPredictor:

>>> from autogluon.tabular import TabularDataset, TabularPredictor

Load a tabular dataset:

>>> train_data = TabularDataset("https://autogluon.s3.amazonaws.com/datasets/Inc/train.csv")

Fit classification models predicting the “class” column:

>>> predictor = TabularPredictor(label="class").fit(train_data)

Load test data:

>>> test_data = TabularDataset("https://autogluon.s3.amazonaws.com/datasets/Inc/test.csv")

Evaluate predictions on test data:

>>> leaderboard = predictor.leaderboard(test_data)

Predictors¶

Predictors built into AutoGluon such that a single call to fit() can produce high-quality trained models for tabular, image, or text data. For other applications, you can still use AutoGluon to tune the hyperparameters of your own custom models and training scripts.

TabularPredictor

AutoGluon TabularPredictor predicts values in a column of a tabular dataset (classification or regression).

`ImagePredictor`	AutoGluon Predictor for predicting image category based on their whole contents
`ObjectDetector`	AutoGluon Predictor for detecting objects in images

TextPredictor

AutoGluon TextPredictor predicts values in a column of a tabular dataset that contains text fields (classification or regression).

TabularPredictor¶

class autogluon.tabular.TabularPredictor(label, problem_type=None, eval_metric=None, path=None, verbosity=2, sample_weight=None, weight_evaluation=False, groups=None, **kwargs)[source]¶

AutoGluon TabularPredictor predicts values in a column of a tabular dataset (classification or regression).

Parameters

labelstr

Name of the column that contains the target variable to predict.

problem_typestr, default = None

Type of prediction problem, i.e. is this a binary/multiclass classification or regression problem (options: ‘binary’, ‘multiclass’, ‘regression’, ‘quantile’). If problem_type = None, the prediction problem type is inferred based on the label-values in provided dataset.

eval_metricfunction or str, default = None

Metric by which predictions will be ultimately evaluated on test data. AutoGluon tunes factors such as hyperparameters, early-stopping, ensemble-weights, etc. in order to improve this metric on validation data.

If eval_metric = None, it is automatically chosen based on problem_type. Defaults to ‘accuracy’ for binary and multiclass classification, ‘root_mean_squared_error’ for regression, and ‘pinball_loss’ for quantile.

Otherwise, options for classification:: [‘accuracy’, ‘balanced_accuracy’, ‘f1’, ‘f1_macro’, ‘f1_micro’, ‘f1_weighted’, ‘roc_auc’, ‘roc_auc_ovo_macro’, ‘average_precision’, ‘precision’, ‘precision_macro’, ‘precision_micro’, ‘precision_weighted’, ‘recall’, ‘recall_macro’, ‘recall_micro’, ‘recall_weighted’, ‘log_loss’, ‘pac_score’]
Options for regression:: [‘root_mean_squared_error’, ‘mean_squared_error’, ‘mean_absolute_error’, ‘median_absolute_error’, ‘r2’]

For more information on these options, see sklearn.metrics: https://scikit-learn.org/stable/modules/classes.html#sklearn-metrics-metrics

You can also pass your own evaluation function here as long as it follows formatting of the functions defined in folder autogluon.core.metrics.

pathstr, default = None

Path to directory where models and intermediate outputs should be saved. If unspecified, a time-stamped folder called “AutogluonModels/ag-[TIMESTAMP]” will be created in the working directory to store all models. Note: To call fit() twice and save all results of each fit, you must specify different path locations or don’t specify path at all. Otherwise files from first fit() will be overwritten by second fit().

verbosityint, default = 2

Verbosity levels range from 0 to 4 and control how much information is printed. Higher levels correspond to more detailed print statements (you can set verbosity = 0 to suppress warnings). If using logging, you can alternatively control amount of information printed via logger.setLevel(L), where L ranges from 0 to 50 (Note: higher values of L correspond to fewer print statements, opposite of verbosity levels).

sample_weightstr, default = None

If specified, this column-name indicates which column of the data should be treated as sample weights. This column will NOT be considered as a predictive feature. Sample weights should be non-negative (and cannot be nan), with larger values indicating which rows are more important than others. If you want your usage of sample weights to match results obtained outside of this Predictor, then ensure sample weights for your training (or tuning) data sum to the number of rows in the training (or tuning) data. You may also specify two special strings: ‘auto_weight’ (automatically choose a weighting strategy based on the data) or ‘balance_weight’ (equally weight classes in classification, no effect in regression). If specifying your own sample_weight column, make sure its name does not match these special strings.

weight_evaluationbool, default = False

Only considered when sample_weight column is not None. Determines whether sample weights should be taken into account when computing evaluation metrics on validation/test data. If True, then weighted metrics will be reported based on the sample weights provided in the specified sample_weight (in which case sample_weight column must also be present in test data). In this case, the ‘best’ model used by default for prediction will also be decided based on a weighted version of evaluation metric. Note: we do not recommend specifying weight_evaluation when sample_weight is ‘auto_weight’ or ‘balance_weight’, instead specify appropriate eval_metric.

groupsstr, default = None

[Experimental] If specified, AutoGluon will use the column named the value of groups in train_data during .fit as the data splitting indices for the purposes of bagging. This column will not be used as a feature during model training. This parameter is ignored if bagging is not enabled. To instead specify a custom validation set with bagging disabled, specify tuning_data in .fit. The data will be split via sklearn.model_selection.LeaveOneGroupOut. Use this option to control the exact split indices AutoGluon uses. It is not recommended to use this option unless it is required for very specific situations. Bugs may arise from edge cases if the provided groups are not valid to properly train models, such as if not all classes are present during training in multiclass classification. It is up to the user to sanitize their groups.

As an example, if you want your data folds to preserve adjacent rows in the table without shuffling, then for 3 fold bagging with 6 rows of data, the groups column values should be [0, 0, 1, 1, 2, 2].

**kwargs :

learner_typeAbstractLearner, default = DefaultLearner

A class which inherits from AbstractLearner. This dictates the inner logic of predictor. If you don’t know what this is, keep it as the default.

learner_kwargsdict, default = None

Kwargs to send to the learner. Options include:

positive_classstr or int, default = None: Used to determine the positive class in binary classification. This is used for certain metrics such as ‘f1’ which produce different scores depending on which class is considered the positive class. If not set, will be inferred as the second element of the existing unique classes after sorting them.

If classes are [0, 1], then 1 will be selected as the positive class. If classes are [‘def’, ‘abc’], then ‘def’ will be selected as the positive class. If classes are [True, False], then True will be selected as the positive class.
ignored_columnslist, default = None: Banned subset of column names that predictor may not use as predictive features (e.g. unique identifier to a row or user-ID). These columns are ignored during fit().
label_count_thresholdint, default = 10: For multi-class classification problems, this is the minimum number of times a label must appear in dataset in order to be considered an output class. AutoGluon will ignore any classes whose labels do not appear at least this many times in the dataset (i.e. will never predict them).
cache_databool, default = True: When enabled, the training and validation data are saved to disk for future reuse. Enables advanced functionality in predictor such as fit_extra() and feature importance calculation on the original data.
trainer_typeAbstractTrainer, default = AutoTrainer: A class inheriting from AbstractTrainer that controls training/ensembling of many models. If you don’t know what this is, keep it as the default.

Attributes

pathstr

Path to directory where all models used by this Predictor are stored.

problem_typestr

What type of prediction problem this Predictor has been trained for.

eval_metricfunction or str

What metric is used to evaluate predictive performance.

labelstr

Name of table column that contains data from the variable to predict (often referred to as: labels, response variable, target variable, dependent variable, Y, etc).

feature_metadataautogluon.core.features.feature_metadata.FeatureMetadata

Inferred data type of each predictive variable after preprocessing transformation (i.e. column of training data table used to predict label). Contains both raw dtype and special dtype information. Each feature has exactly 1 raw dtype (such as ‘int’, ‘float’, ‘category’) and zero to many special dtypes (such as ‘datetime_as_int’, ‘text’, ‘text_ngram’). Special dtypes are AutoGluon specific feature types that are used to identify features with meaning beyond what the raw dtype can convey.

feature_metadata.type_map_raw: Dictionary of feature name -> raw dtype mappings. feature_metadata.type_group_map_special: Dictionary of lists of special feature names, grouped by special feature dtype.

positive_classstr or int

Returns the positive class name in binary classification.

class_labelslist

For multiclass problems, this list contains the class labels in sorted order of predict_proba() output. For binary problems, this list contains the class labels in sorted order of predict_proba(as_multiclass=True) output.

class_labels[0] corresponds to internal label = 0 (negative class), class_labels[1] corresponds to internal label = 1 (positive class). This is relevant for certain metrics such as F1 where True and False labels impact the metric score differently.

For other problem types, will equal None. For example if pred = predict_proba(x, as_multiclass=True), then ith index of pred provides predicted probability that x belongs to class given by class_labels[i].

class_labels_internallist

For multiclass problems, this list contains the internal class labels in sorted order of internal predict_proba() output. For binary problems, this list contains the internal class labels in sorted order of internal predict_proba(as_multiclass=True) output.

The value will always be class_labels_internal=[0, 1] for binary problems, with 0 as the negative class, and 1 as the positive class.

For other problem types, will equal None.

class_labels_internal_mapdict

For binary and multiclass classification problems, this dictionary contains the mapping of the original labels to the internal labels. For example, in binary classification, label values of ‘True’ and ‘False’ will be mapped to the internal representation 1 and 0.

Therefore, class_labels_internal_map would equal {‘True’: 1, ‘False’: 0}

For other problem types, will equal None. For multiclass, it is possible for not all of the label values to have a mapping.

This indicates that the internal models will never predict those missing labels, and training rows associated with the missing labels were dropped.

Methods

`Dataset`	alias of `autogluon.core.dataset.TabularDataset`
`delete_models`([models_to_keep, …])	Deletes models from predictor. This can be helpful to minimize memory usage and disk usage, particularly for model deployment. This will remove all references to the models in predictor. For example, removed models will not appear in predictor.leaderboard(). WARNING: If delete_from_disk=True, this will DELETE ALL FILES in the deleted model directories, regardless if they were created by AutoGluon or not. DO NOT STORE FILES INSIDE OF THE MODEL DIRECTORY THAT ARE UNRELATED TO AUTOGLUON.
`distill`([train_data, tuning_data, …])	Distill AutoGluon’s most accurate ensemble-predictor into single models which are simpler/faster and require less memory/compute.
`evaluate`(data[, model, silent, …])	Report the predictive performance evaluated over a given dataset.
`evaluate_predictions`(y_true, y_pred[, …])	Evaluate the provided prediction probabilities against ground truth labels.
`feature_importance`([data, model, features, …])	Calculates feature importance scores for the given model via permutation importance.
`features`([feature_stage])	Returns a list of feature names dependent on the value of feature_stage.
`fit`(train_data[, tuning_data, time_limit, …])	Fit models to predict a column of a data table (label) based on the other columns (features).
`fit_extra`(hyperparameters[, time_limit, …])	Fits additional models after the original `TabularPredictor.fit()` call.
`fit_summary`([verbosity, show_plot])	Output summary of information about models produced during fit().
`fit_weighted_ensemble`([base_models, …])	Fits new weighted ensemble models to combine predictions of previously-trained models.
`get_model_best`()	Returns the string model name of the best model by validation score.
`get_model_full_dict`()	Returns a dictionary of original model name -> refit full model name.
`get_model_names`([stack_name, level, …])	Returns the list of model names trained in this predictor object.
`get_model_names_persisted`()	Returns the list of model names which are persisted in memory.
`get_oof_pred`([model, transformed, …])	Note: This is advanced functionality not intended for normal usage.
`get_oof_pred_proba`([model, transformed, …])	Note: This is advanced functionality not intended for normal usage.
`info`()	[EXPERIMENTAL] Returns a dictionary of predictor metadata. Warning: This functionality is currently in preview mode. The metadata information returned may change in structure in future versions without warning. The definitions of various metadata values are not yet documented. The output of this function should not be used for programmatic decisions. Contains information such as row count, column count, model training time, validation scores, hyperparameters, and much more.
`leaderboard`([data, extra_info, …])	Output summary of information about models produced during fit() as a `pd.DataFrame`. Includes information on test and validation scores for all models, model training times, inference times, and stack levels. Output DataFrame columns include: ‘model’: The name of the model.
`load`(path[, verbosity])	Load a TabularPredictor object previously produced by fit() from file and returns this object.
`load_data_internal`([data, return_X, return_y])	Loads the internal data representation used during model training. Individual AutoGluon models like the neural network may apply additional feature transformations that are not reflected in this method. This method only applies universal transforms employed by all AutoGluon models. Warning, the internal representation may: Have different features compared to the original data. Have different row counts compared to the original data. Have indices which do not align with the original data. Have label values which differ from those in the original data. Internal data representations should NOT be combined with the original data, in most cases this is not possible.
`persist_models`([models, with_ancestors, …])	Persist models in memory for reduced inference latency.
`plot_ensemble_model`([prune_unused_nodes])	Output the visualized stack ensemble architecture of a model trained by fit().
`predict`(data[, model, as_pandas])	Use trained models to produce predictions of label column values for new data.
`predict_proba`(data[, model, as_pandas, …])	Use trained models to produce predicted class probabilities rather than class-labels (if task is classification).
`refit_full`([model])	Retrain model on all of the data (training + validation). For bagged models: Optimizes a model’s inference time by collapsing bagged ensembles into a single model fit on all of the training data. This process will typically result in a slight accuracy reduction and a large inference speedup. The inference speedup will generally be between 10-200x faster than the original bagged ensemble model. The inference speedup factor is equivalent to (k * n), where k is the number of folds (num_bag_folds) and n is the number of finished repeats (num_bag_sets) in the bagged ensemble. The runtime is generally 10% or less of the original fit runtime. The runtime can be roughly estimated as 1 / (k * n) of the original fit runtime, with k and n defined above. For non-bagged models: Optimizes a model’s accuracy by retraining on 100% of the data without using a validation set. Will typically result in a slight accuracy increase and no change to inference time. The runtime will be approximately equal to the original fit runtime. This process does not alter the original models, but instead adds additional models. If stacker models are refit by this process, they will use the refit_full versions of the ancestor models during inference. Models produced by this process will not have validation scores, as they use all of the data for training. Therefore, it is up to the user to determine if the models are of sufficient quality by including test data in predictor.leaderboard(test_data). If the user does not have additional test data, they should reference the original model’s score for an estimate of the performance of the refit_full model. Warning: Be aware that utilizing refit_full models without separately verifying on test data means that the model is untested, and has no guarantee of being consistent with the original model. cache_data must have been set to True during the original training to enable this functionality.
`save`()	Save this Predictor to file in directory specified by this Predictor’s path.
`save_space`([remove_data, remove_fit_stack, …])	Reduces the memory and disk size of predictor by deleting auxiliary model files that aren’t needed for prediction on new data.
`transform_features`([data, model, …])	Transforms data features through the AutoGluon feature generator. This is useful to gain an understanding of how AutoGluon interprets the data features. The output of this function can be used to train further models, even outside of AutoGluon. This can be useful for training your own models on the same data representation as AutoGluon. Individual AutoGluon models like the neural network may apply additional feature transformations that are not reflected in this method. This method only applies universal transforms employed by all AutoGluon models. When data=None, `base_models=[{best_model}], and bagging was enabled during fit(): This returns the out-of-fold predictions of the best model, which can be used as training input to a custom user stacker model.
`transform_labels`(labels[, inverse, proba])	Transforms data labels to the internal label representation.
`unpersist_models`([models])	Unpersist models in memory for reduced memory usage.

Dataset¶: alias of autogluon.core.dataset.TabularDataset

delete_models(models_to_keep=None, models_to_delete=None, allow_delete_cascade=False, delete_from_disk=True, dry_run=True)[source]¶

Deletes models from predictor. This can be helpful to minimize memory usage and disk usage, particularly for model deployment. This will remove all references to the models in predictor.

For example, removed models will not appear in predictor.leaderboard().

WARNING: If delete_from_disk=True, this will DELETE ALL FILES in the deleted model directories, regardless if they were created by AutoGluon or not.: DO NOT STORE FILES INSIDE OF THE MODEL DIRECTORY THAT ARE UNRELATED TO AUTOGLUON.

Parameters

models_to_keepstr or list, default = None

Name of model or models to not delete. All models that are not specified and are also not required as a dependency of any model in models_to_keep will be deleted. Specify models_to_keep=’best’ to keep only the best model and its model dependencies. models_to_delete must be None if models_to_keep is set. To see the list of possible model names, use: predictor.get_model_names() or predictor.leaderboard().

models_to_deletestr or list, default = None

Name of model or models to delete. All models that are not specified but depend on a model in models_to_delete will also be deleted. models_to_keep must be None if models_to_delete is set.

allow_delete_cascadebool, default = False

If False, if unspecified dependent models of models in models_to_delete exist an exception will be raised instead of deletion occurring.: An example of a dependent model is m1 if m2 is a stacker model and takes predictions from m1 as inputs. In this case, m1 would be a dependent model of m2.

If True, all dependent models of models in models_to_delete will be deleted. Has no effect if models_to_delete=None.

delete_from_diskbool, default = True

If True, deletes the models from disk if they were persisted. WARNING: This deletes the entire directory for the deleted models, and ALL FILES located there.

It is highly recommended to first run with dry_run=True to understand which directories will be deleted.

dry_runbool, default = True

If True, then deletions don’t occur, and logging statements are printed describing what would have occurred. Set dry_run=False to perform the deletions.

distill(train_data=None, tuning_data=None, augmentation_data=None, time_limit=None, hyperparameters=None, holdout_frac=None, teacher_preds='soft', augment_method='spunge', augment_args={'max_size': 100000, 'size_factor': 5}, models_name_suffix=None, verbosity=None)[source]¶

Distill AutoGluon’s most accurate ensemble-predictor into single models which are simpler/faster and require less memory/compute. Distillation can produce a model that is more accurate than the same model fit directly on the original training data. After calling distill(), there will be more models available in this Predictor, which can be evaluated using predictor.leaderboard(test_data) and deployed with: predictor.predict(test_data, model=MODEL_NAME). This will raise an exception if cache_data=False was previously set in fit().

NOTE: Until catboost v0.24 is released, distill() with CatBoost students in multiclass classification requires you to first install catboost-dev: pip install catboost-dev

Parameters

train_datastr or TabularDataset or pd.DataFrame, default = None

Same as train_data argument of fit(). If None, the same training data will be loaded from fit() call used to produce this Predictor.

tuning_datastr or TabularDataset or pd.DataFrame, default = None

Same as tuning_data argument of fit(). If tuning_data = None and train_data = None: the same training/validation splits will be loaded from fit() call used to produce this Predictor, unless bagging/stacking was previously used in which case a new training/validation split is performed.

augmentation_dataTabularDataset or pd.DataFrame, default = None

An optional extra dataset of unlabeled rows that can be used for augmenting the dataset used to fit student models during distillation (ignored if None).

time_limitint, default = None

Approximately how long (in seconds) the distillation process should run for. If None, no time-constraint will be enforced allowing the distilled models to fully train.

hyperparametersdict or str, default = None

Specifies which models to use as students and what hyperparameter-values to use for them. Same as hyperparameters argument of fit(). If = None, then student models will use the same hyperparameters from fit() used to produce this Predictor. Note: distillation is currently only supported for [‘GBM’,’NN’,’RF’,’CAT’] student models, other models and their hyperparameters are ignored here.

holdout_fracfloat

Same as holdout_frac argument of TabularPredictor.fit().

teacher_predsstr, default = ‘soft’

What form of teacher predictions to distill from (teacher refers to the most accurate AutoGluon ensemble-predictor). If None, we only train with original labels (no data augmentation). If ‘hard’, labels are hard teacher predictions given by: teacher.predict() If ‘soft’, labels are soft teacher predictions given by: teacher.predict_proba() Note: ‘hard’ and ‘soft’ are equivalent for regression problems. If augment_method is not None, teacher predictions are only used to label augmented data (training data keeps original labels). To apply label-smoothing: teacher_preds=’onehot’ will use original training data labels converted to one-hot vectors for multiclass problems (no data augmentation).

augment_methodstr, default=’spunge’

Specifies method to use for generating augmented data for distilling student models. Options include:

None : no data augmentation performed. ‘munge’ : The MUNGE algorithm (https://www.cs.cornell.edu/~caruana/compression.kdd06.pdf). ‘spunge’ : A simpler, more efficient variant of the MUNGE algorithm.

augment_argsdict, default = {‘size_factor’:5, ‘max_size’: int(1e5)}

Contains the following kwargs that control the chosen augment_method (these are ignored if augment_method=None):: ‘num_augmented_samples’: int, number of augmented datapoints used during distillation. Overrides ‘size_factor’, ‘max_size’ if specified. ‘max_size’: float, the maximum number of augmented datapoints to add (ignored if ‘num_augmented_samples’ specified). ‘size_factor’: float, if n = training data sample-size, we add int(n * size_factor) augmented datapoints, up to ‘max_size’. Larger values in augment_args will slow down the runtime of distill(), and may produce worse results if provided time_limit are too small. You can also pass in kwargs for the spunge_augment, munge_augment functions in autogluon.tabular.augmentation.distill_utils.

models_name_suffixstr, default = None

Optional suffix that can be appended at the end of all distilled student models’ names. Note: all distilled models will contain ‘_DSTL’ substring in their name by default.

verbosityint, default = None

Controls amount of printed output during distillation (4 = highest, 0 = lowest). Same as verbosity parameter of TabularPredictor. If None, the same verbosity used in previous fit is employed again.

Returns

List of names (str) corresponding to the distilled models.

Examples

>>> from autogluon.tabular import TabularDataset, TabularPredictor
>>> train_data = TabularDataset('train.csv')
>>> predictor = TabularPredictor(label='class').fit(train_data, auto_stack=True)
>>> distilled_model_names = predictor.distill()
>>> test_data = TabularDataset('test.csv')
>>> ldr = predictor.leaderboard(test_data)
>>> model_to_deploy = distilled_model_names[0]
>>> predictor.predict(test_data, model=model_to_deploy)

evaluate(data, model=None, silent=False, auxiliary_metrics=True, detailed_report=False) → dict[source]¶

Report the predictive performance evaluated over a given dataset. This is basically a shortcut for: pred_proba = predict_proba(data); evaluate_predictions(data[label], pred_proba).

Parameters

datastr or TabularDataset or pd.DataFrame: This dataset must also contain the label with the same column-name as previously specified. If str is passed, data will be loaded using the str value as the file path.
modelstr (optional): The name of the model to get prediction probabilities from. Defaults to None, which uses the highest scoring model on the validation set. Valid models are listed in this predictor by calling predictor.get_model_names().
silentbool, default = False: If False, performance results are printed.
auxiliary_metrics: bool, default = True: Should we compute other (problem_type specific) metrics in addition to the default metric?
detailed_reportbool, default = False: Should we computed more detailed versions of the auxiliary_metrics? (requires auxiliary_metrics = True)

Returns

Returns dict where keys = metrics, values = performance along each metric. To get the eval_metric score, do output[predictor.eval_metric.name]
NOTE: Metrics scores always show in higher is better form.
This means that metrics such as log_loss and root_mean_squared_error will have their signs FLIPPED, and values will be negative.

evaluate_predictions(y_true, y_pred, silent=False, auxiliary_metrics=True, detailed_report=False) → dict[source]¶

Evaluate the provided prediction probabilities against ground truth labels. Evaluation is based on the eval_metric previously specified to fit(), or default metrics if none was specified.

Parameters

y_truenp.array or pd.Series: The ordered collection of ground-truth labels.
y_predpd.Series or pd.DataFrame: The ordered collection of prediction probabilities or predictions. Obtainable via the output of predictor.predict_proba. Caution: For certain types of eval_metric (such as ‘roc_auc’), y_pred must be predicted-probabilities rather than predicted labels.
silentbool, default = False: If False, performance results are printed.
auxiliary_metrics: bool, default = True: Should we compute other (problem_type specific) metrics in addition to the default metric?
detailed_reportbool, default = False: Should we computed more detailed versions of the auxiliary_metrics? (requires auxiliary_metrics = True)

Returns

Returns dict where keys = metrics, values = performance along each metric.
NOTE: Metrics scores always show in higher is better form.
This means that metrics such as log_loss and root_mean_squared_error will have their signs FLIPPED, and values will be negative.

feature_importance(data=None, model=None, features=None, feature_stage='original', subsample_size=1000, time_limit=None, num_shuffle_sets=None, include_confidence_band=True, silent=False)[source]¶

Calculates feature importance scores for the given model via permutation importance. Refer to https://explained.ai/rf-importance/ for an explanation of permutation importance. A feature’s importance score represents the performance drop that results when the model makes predictions on a perturbed copy of the data where this feature’s values have been randomly shuffled across rows. A feature score of 0.01 would indicate that the predictive performance dropped by 0.01 when the feature was randomly shuffled. The higher the score a feature has, the more important it is to the model’s performance. If a feature has a negative score, this means that the feature is likely harmful to the final model, and a model trained with the feature removed would be expected to achieve a better predictive performance. Note that calculating feature importance can be a very computationally expensive process, particularly if the model uses hundreds or thousands of features. In many cases, this can take longer than the original model training. To estimate how long feature_importance(model, data, features) will take, it is roughly the time taken by predict_proba(data, model) multiplied by the number of features.

Note: For highly accurate importance and p_value estimates, it is recommend to set subsample_size to at least 5,000 if possible and num_shuffle_sets to at least 10.

Parameters

datastr or TabularDataset or pd.DataFrame (optional)

This data must also contain the label-column with the same column-name as specified during fit(). If specified, then the data is used to calculate the feature importance scores. If str is passed, data will be loaded using the str value as the file path. If not specified, the original data used during fit() will be used if cache_data=True. Otherwise, an exception will be raised. Do not pass the training data through this argument, as the feature importance scores calculated will be biased due to overfitting.

More accurate feature importances will be obtained from new data that was held-out during fit().

modelstr, default = None

Model to get feature importances for, if None the best model is chosen. Valid models are listed in this predictor by calling predictor.get_model_names()

featureslist, default = None

List of str feature names that feature importances are calculated for and returned, specify None to get all feature importances. If you only want to compute feature importances for some of the features, you can pass their names in as a list of str. Valid feature names change depending on the feature_stage.

To get the list of feature names for feature_stage=’original’, call predictor.feature_metadata_in.get_features(). To get the list of feature names for feature_stage=’transformed’, call list(predictor.transform_features().columns). To get the list of feature names for feature_stage=`transformed_model, call list(predictor.transform_features(model={model_name}).columns).

[Advanced] Can also contain tuples as elements of (feature_name, feature_list) form.: feature_name can be any string so long as it is unique with all other feature names / features in the list. feature_list can be any list of valid features in the data. This will compute importance of the combination of features in feature_list, naming the set of features in the returned DataFrame feature_name. This importance will differ from adding the individual importances of each feature in feature_list, and will be more accurate to the overall group importance. Example: [‘featA’, ‘featB’, ‘featC’, (‘featBC’, [‘featB’, ‘featC’])] In this example, the importance of ‘featBC’ will be calculated by jointly permuting ‘featB’ and ‘featC’ together as if they were a single two-dimensional feature.

feature_stagestr, default = ‘original’

What stage of feature-processing should importances be computed for. Options:

‘original’:
Compute importances of the original features. Warning: data must be specified with this option, otherwise an exception will be raised.

‘transformed’:
Compute importances of the post-internal-transformation features (after automated feature engineering). These features may be missing some original features, or add new features entirely. An example of new features would be ngram features generated from a text column. Warning: For bagged models, feature importance calculation is not yet supported with this option when data=None. Doing so will raise an exception.

‘transformed_model’:
Compute importances of the post-model-transformation features. These features are the internal features used by the requested model. They may differ greatly from the original features. If the model is a stack ensemble, this will include stack ensemble features such as the prediction probability features of the stack ensemble’s base (ancestor) models.

subsample_sizeint, default = 1000

The number of rows to sample from data when computing feature importance. If subsample_size=None or data contains fewer than subsample_size rows, all rows will be used during computation. Larger values increase the accuracy of the feature importance scores. Runtime linearly scales with subsample_size.

time_limitfloat, default = None

Time in seconds to limit the calculation of feature importance. If None, feature importance will calculate without early stopping. A minimum of 1 full shuffle set will always be evaluated. If a shuffle set evaluation takes longer than time_limit, the method will take the length of a shuffle set evaluation to return regardless of the time_limit.

num_shuffle_setsint, default = None

The number of different permutation shuffles of the data that are evaluated. Larger values will increase the quality of the importance evaluation. It is generally recommended to increase subsample_size before increasing num_shuffle_sets. Defaults to 3 if time_limit is None or 10 if time_limit is specified. Runtime linearly scales with num_shuffle_sets.

include_confidence_band: bool, default = True

If True, will include output columns ‘p99_high’ and ‘p99_low’ which indicates that the true feature importance will be between ‘p99_high’ and ‘p99_low’ 99% of the time (99% confidence interval). Increasing subsample_size and num_shuffle_sets will tighten the band.

silentbool, default = False

Whether to suppress logging output.

Returns

pd.DataFrame of feature importance scores with 6 columns:

index: The feature name. ‘importance’: The estimated feature importance score. ‘stddev’: The standard deviation of the feature importance score. If NaN, then not enough num_shuffle_sets were used to calculate a variance. ‘p_value’: P-value for a statistical t-test of the null hypothesis: importance = 0, vs the (one-sided) alternative: importance > 0.

Features with low p-value appear confidently useful to the predictor, while the other features may be useless to the predictor (or even harmful to include in its training data). A p-value of 0.01 indicates that there is a 1% chance that the feature is useless or harmful, and a 99% chance that the feature is useful. A p-value of 0.99 indicates that there is a 99% chance that the feature is useless or harmful, and a 1% chance that the feature is useful.

‘n’: The number of shuffles performed to estimate importance score (corresponds to sample-size used to determine confidence interval for true score). ‘p99_high’: Upper end of 99% confidence interval for true feature importance score. ‘p99_low’: Lower end of 99% confidence interval for true feature importance score.

features(feature_stage: str = 'original')[source]¶

Returns a list of feature names dependent on the value of feature_stage.

Parameters

feature_stagestr, default = ‘original’: If ‘original’, returns the list of features specified in the original training data. This feature set is required in input data when making predictions. If ‘transformed’, returns the list of features after pre-processing by the feature generator.

Returns

Returns a list of feature names

fit(train_data, tuning_data=None, time_limit=None, presets=None, hyperparameters=None, feature_metadata='infer', **kwargs)[source]¶

Fit models to predict a column of a data table (label) based on the other columns (features).

Parameters

train_datastr or TabularDataset or pd.DataFrame

Table of the training data, which is similar to a pandas DataFrame. If str is passed, train_data will be loaded using the str value as the file path.

tuning_datastr or TabularDataset or pd.DataFrame, default = None

Another dataset containing validation data reserved for tuning processes such as early stopping and hyperparameter tuning. This dataset should be in the same format as train_data. If str is passed, tuning_data will be loaded using the str value as the file path. Note: final model returned may be fit on tuning_data as well as train_data. Do not provide your evaluation test data here! In particular, when num_bag_folds > 0 or num_stack_levels > 0, models will be trained on both tuning_data and train_data. If tuning_data = None, fit() will automatically hold out some random validation examples from train_data.

time_limitint, default = None

Approximately how long fit() should run for (wallclock time in seconds). If not specified, fit() will run until all models have completed training, but will not repeatedly bag models unless num_bag_sets is specified.

presetslist or str or dict, default = [‘medium_quality_faster_train’]

List of preset configurations for various arguments in fit(). Can significantly impact predictive accuracy, memory-footprint, and inference latency of trained models, and various other properties of the returned predictor. It is recommended to specify presets and avoid specifying most other fit() arguments or model hyperparameters prior to becoming familiar with AutoGluon. As an example, to get the most accurate overall predictor (regardless of its efficiency), set presets=’best_quality’. To get good quality with minimal disk usage, set presets=[‘good_quality_faster_inference_only_refit’, ‘optimize_for_deployment’] Any user-specified arguments in fit() will override the values used by presets. If specifying a list of presets, later presets will override earlier presets if they alter the same argument. For precise definitions of the provided presets, see file: autogluon/tabular/configs/presets_configs.py. Users can specify custom presets by passing in a dictionary of argument values as an element to the list.

Available Presets: [‘best_quality’, ‘high_quality_fast_inference_only_refit’, ‘good_quality_faster_inference_only_refit’, ‘medium_quality_faster_train’, ‘optimize_for_deployment’, ‘ignore_text’] It is recommended to only use one quality based preset in a given call to fit() as they alter many of the same arguments and are not compatible with each-other.

In-depth Preset Info:

best_quality={‘auto_stack’: True}

Best predictive accuracy with little consideration to inference time or disk usage. Achieve even better results by specifying a large time_limit value. Recommended for applications that benefit from the best possible model accuracy.

high_quality_fast_inference_only_refit={‘auto_stack’: True, ‘refit_full’: True, ‘set_best_to_refit_full’: True, ‘_save_bag_folds’: False}

High predictive accuracy with fast inference. ~10x-200x faster inference and ~10x-200x lower disk usage than best_quality. Recommended for applications that require reasonable inference speed and/or model size.

good_quality_faster_inference_only_refit={‘auto_stack’: True, ‘refit_full’: True, ‘set_best_to_refit_full’: True, ‘_save_bag_folds’: False, ‘hyperparameters’: ‘light’}

Good predictive accuracy with very fast inference. ~4x faster inference and ~4x lower disk usage than high_quality_fast_inference_only_refit. Recommended for applications that require fast inference speed.

medium_quality_faster_train={‘auto_stack’: False}

Medium predictive accuracy with very fast inference and very fast training time. ~20x faster training than good_quality_faster_inference_only_refit. This is the default preset in AutoGluon, but should generally only be used for quick prototyping, as good_quality_faster_inference_only_refit results in significantly better predictive accuracy and faster inference time.

optimize_for_deployment={‘keep_only_best’: True, ‘save_space’: True}

Optimizes result immediately for deployment by deleting unused models and removing training artifacts. Often can reduce disk usage by ~2-4x with no negatives to model accuracy or inference speed. This will disable numerous advanced functionality, but has no impact on inference. This will make certain functionality less informative, such as predictor.leaderboard() and predictor.fit_summary().

Because unused models will be deleted under this preset, methods like predictor.leaderboard() and predictor.fit_summary() will no longer show the full set of models that were trained during fit().

Recommended for applications where the inner details of AutoGluon’s training is not important and there is no intention of manually choosing between the final models. This preset pairs well with the other presets such as good_quality_faster_inference_only_refit to make a very compact final model. Identical to calling predictor.delete_models(models_to_keep=’best’, dry_run=False) and predictor.save_space() directly after fit().

ignore_text={‘_feature_generator_kwargs’: {‘enable_text_ngram_features’: False, ‘enable_text_special_features’: False, ‘enable_raw_text_features’: False}}

Disables automated feature generation when text features are detected. This is useful to determine how beneficial text features are to the end result, as well as to ensure features are not mistaken for text when they are not. Ignored if feature_generator was also specified.

hyperparametersstr or dict, default = ‘default’

Determines the hyperparameters used by the models. If str is passed, will use a preset hyperparameter configuration.

Valid str options: [‘default’, ‘light’, ‘very_light’, ‘toy’, ‘multimodal’]
‘default’: Default AutoGluon hyperparameters intended to maximize accuracy without significant regard to inference time or disk usage. ‘light’: Results in smaller models. Generally will make inference speed much faster and disk usage much lower, but with worse accuracy. ‘very_light’: Results in much smaller models. Behaves similarly to ‘light’, but in many cases with over 10x less disk usage and a further reduction in accuracy. ‘toy’: Results in extremely small models. Only use this when prototyping, as the model quality will be severely reduced. ‘multimodal’: [EXPERIMENTAL] Trains a multimodal transformer model alongside tabular models. Requires that some text columns appear in the data, a GPU, and CUDA-enabled MXNet.

When combined with ‘best_quality’ presets option, this can achieve extremely strong results in multimodal data tables that contain columns with text in addition to numeric/categorical columns.

Reference autogluon/tabular/configs/hyperparameter_configs.py for information on the hyperparameters associated with each preset.

Keys are strings that indicate which model types to train.

Stable model options include:: ‘GBM’ (LightGBM) ‘CAT’ (CatBoost) ‘XGB’ (XGBoost) ‘RF’ (random forest) ‘XT’ (extremely randomized trees) ‘KNN’ (k-nearest neighbors) ‘LR’ (linear regression) ‘NN’ (neural network with MXNet backend) ‘FASTAI’ (neural network with FastAI backend)
Experimental model options include:: ‘FASTTEXT’ (FastText) ‘AG_TEXT_NN’ (Multimodal Text+Tabular model, GPU is required) ‘TRANSF’ (Tabular Transformer, GPU is recommended)

If a certain key is missing from hyperparameters, then fit() will not train any models of that type. Omitting a model key from hyperparameters is equivalent to including this model key in excluded_model_types. For example, set hyperparameters = { ‘NN’:{…} } if say you only want to train neural networks and no other types of models.

Values = dict of hyperparameter settings for each model type, or list of dicts.

Each hyperparameter can either be a single fixed value or a search space containing many possible values. Unspecified hyperparameters will be set to default values (or default search spaces if hyperparameter_tune = True). Caution: Any provided search spaces will be overridden by fixed defaults if hyperparameter_tune = False. To train multiple models of a given type, set the value to a list of hyperparameter dictionaries.

For example, hyperparameters = {‘RF’: [{‘criterion’: ‘gini’}, {‘criterion’: ‘entropy’}]} will result in 2 random forest models being trained with separate hyperparameters.

Advanced functionality: Custom models

hyperparameters can also take special string values instead of a dictionary of model parameters which maps to a pre-configured model configuration (currently supported options = [‘GBMLarge’]).: These additional models will be trained using custom pre-specified hyperparameter settings that are known to work well.

Advanced functionality: Custom stack levels

By default, AutoGluon re-uses the same models and model hyperparameters at each level during stack ensembling. To customize this behaviour, create a hyperparameters dictionary separately for each stack level, and then add them as values to a new dictionary, with keys equal to the stack level.

Example: hyperparameters = {1: {‘RF’: rf_params1}, 2: {‘CAT’: [cat_params1, cat_params2], ‘NN’: {}}} This will result in a stack ensemble that has one custom random forest in level 1 followed by two CatBoost models with custom hyperparameters and a default neural network in level 2, for a total of 4 models.

If a level is not specified in hyperparameters, it will default to using the highest specified level to train models. This can also be explicitly controlled by adding a ‘default’ key.

Default:

hyperparameters = {

‘NN’: {}, ‘GBM’: [

{‘extra_trees’: True, ‘ag_args’: {‘name_suffix’: ‘XT’}}, {}, ‘GBMLarge’,

], ‘CAT’: {}, ‘XGB’: {}, ‘FASTAI’: {}, ‘RF’: [

{‘criterion’: ‘gini’, ‘ag_args’: {‘name_suffix’: ‘Gini’, ‘problem_types’: [‘binary’, ‘multiclass’]}}, {‘criterion’: ‘entropy’, ‘ag_args’: {‘name_suffix’: ‘Entr’, ‘problem_types’: [‘binary’, ‘multiclass’]}}, {‘criterion’: ‘mse’, ‘ag_args’: {‘name_suffix’: ‘MSE’, ‘problem_types’: [‘regression’]}},

], ‘XT’: [

{‘criterion’: ‘gini’, ‘ag_args’: {‘name_suffix’: ‘Gini’, ‘problem_types’: [‘binary’, ‘multiclass’]}}, {‘criterion’: ‘entropy’, ‘ag_args’: {‘name_suffix’: ‘Entr’, ‘problem_types’: [‘binary’, ‘multiclass’]}}, {‘criterion’: ‘mse’, ‘ag_args’: {‘name_suffix’: ‘MSE’, ‘problem_types’: [‘regression’]}},

], ‘KNN’: [

{‘weights’: ‘uniform’, ‘ag_args’: {‘name_suffix’: ‘Unif’}}, {‘weights’: ‘distance’, ‘ag_args’: {‘name_suffix’: ‘Dist’}},

],

}

Details regarding the hyperparameters you can specify for each model are provided in the following files:

NN: autogluon.tabular.models.tabular_nn.hyperparameters.parameters

Note: certain hyperparameter settings may cause these neural networks to train much slower.

GBM: autogluon.tabular.models.lgb.hyperparameters.parameters

See also the lightGBM docs: https://lightgbm.readthedocs.io/en/latest/Parameters.html

CAT: autogluon.tabular.models.catboost.hyperparameters.parameters

See also the CatBoost docs: https://catboost.ai/docs/concepts/parameter-tuning.html

XGB: autogluon.tabular.models.xgboost.hyperparameters.parameters

See also the XGBoost docs: https://xgboost.readthedocs.io/en/latest/parameter.html

FASTAI: autogluon.tabular.models.fastainn.hyperparameters.parameters

See also the FastAI docs: https://docs.fast.ai/tabular.models.html

RF: See sklearn documentation: https://scikit-learn.org/stable/modules/generated/sklearn.ensemble.RandomForestClassifier.html

Note: Hyperparameter tuning is disabled for this model.

XT: See sklearn documentation: https://scikit-learn.org/stable/modules/generated/sklearn.ensemble.ExtraTreesClassifier.html

Note: Hyperparameter tuning is disabled for this model.

KNN: See sklearn documentation: https://scikit-learn.org/stable/modules/generated/sklearn.neighbors.KNeighborsClassifier.html

Note: Hyperparameter tuning is disabled for this model.

LR: autogluon.tabular.models.lr.hyperparameters.parameters

Note: Hyperparameter tuning is disabled for this model. Note: ‘penalty’ parameter can be used for regression to specify regularization method: ‘L1’ and ‘L2’ values are supported.

Advanced functionality: Custom AutoGluon model arguments

These arguments are optional and can be specified in any model’s hyperparameters.

Example: hyperparameters = {‘RF’: {…, ‘ag_args’: {‘name_suffix’: ‘CustomModelSuffix’, ‘disable_in_hpo’: True}}

ag_args: Dictionary of customization options related to meta properties of the model such as its name, the order it is trained, the problem types it is valid for, and the type of HPO it utilizes.

Valid keys:: name: (str) The name of the model. This overrides AutoGluon’s naming logic and all other name arguments if present. name_main: (str) The main name of the model. Example: ‘RandomForest’. name_prefix: (str) Add a custom prefix to the model name. Unused by default. name_suffix: (str) Add a custom suffix to the model name. Unused by default. priority: (int) Determines the order in which the model is trained. Larger values result in the model being trained earlier. Default values range from 100 (KNN) to 0 (custom), dictated by model type. If you want this model to be trained first, set priority = 999. problem_types: (list) List of valid problem types for the model. problem_types=[‘binary’] will result in the model only being trained if problem_type is ‘binary’. disable_in_hpo: (bool) If True, the model will only be trained if hyperparameter_tune_kwargs=None. valid_stacker: (bool) If False, the model will not be trained as a level 2 or higher stacker model. valid_base: (bool) If False, the model will not be trained as a level 1 (base) model. hyperparameter_tune_kwargs: (dict) Refer to TabularPredictor.fit() hyperparameter_tune_kwargs argument. If specified here, will override global HPO settings for this model.

Reference the default hyperparameters for example usage of these options.

ag_args_fit: Dictionary of model fit customization options related to how and with what constraints the model is trained. These parameters affect stacker fold models, but not stacker models themselves.

Clarification: time_limit is the internal time in seconds given to a particular model to train, which is dictated in part by the time_limit argument given during predictor.fit() but is not the same. Valid keys:

stopping_metric: (str or autogluon.core.metrics.Scorer, default=None) The metric to use for early stopping of the model. If None, model will decide. max_memory_usage_ratio: (float, default=1.0) The ratio of memory usage relative to the default to allow before early stopping or killing the model. Values greater than 1.0 will be increasingly prone to out-of-memory errors. max_time_limit_ratio: (float, default=1.0) The ratio of the provided time_limit to use during model fit(). If time_limit=10 and max_time_limit_ratio=0.3, time_limit would be changed to 3. Does not alter max_time_limit or min_time_limit values. max_time_limit: (float, default=None) Maximum amount of time to allow this model to train for (in sec). If the provided time_limit is greater than this value, it will be replaced by max_time_limit. min_time_limit: (float, default=0) Allow this model to train for at least this long (in sec), regardless of the time limit it would otherwise be granted.

If min_time_limit >= max_time_limit, time_limit will be set to min_time_limit. If min_time_limit=None, time_limit will be set to None and the model will have no training time restriction.

num_cpus(int or str, default=’auto’)
How many CPUs to use during model fit. If ‘auto’, model will decide.

num_gpus(int or str, default=’auto’)
How many GPUs to use during model fit. If ‘auto’, model will decide. Some models can use GPUs but don’t by default due to differences in model quality. Set to 0 to disable usage of GPUs.

ag_args_ensemble: Dictionary of hyperparameters shared by all models that control how they are ensembled, if bag mode is enabled.

Valid keys:: use_orig_features: (bool) Whether a stack model will use the original features along with the stack features to train (akin to skip-connections). If the model has no stack features (no base models), this value is ignored and the stack model will use the original features. max_base_models: (int, default=25) Maximum number of base models whose predictions form the features input to this stacker model. If more than max_base_models base models are available, only the top max_base_models models with highest validation score are used. max_base_models_per_type: (int, default=5) Similar to max_base_models. If more than max_base_models_per_type of any particular model type are available, only the top max_base_models_per_type of that type are used. This occurs before the max_base_models filter. save_bag_folds: (bool, default=True)

If True, bagged models will save their fold models (the models from each individual fold of bagging). This is required to use bagged models for prediction. If False, bagged models will not save their fold models. This means that bagged models will not be valid models during inference.

This should only be set to False when planning to call predictor.refit_full() or when refit_full is set and set_best_to_refit_full=True. Particularly useful if disk usage is a concern. By not saving the fold models, bagged models will use only very small amounts of disk space during training. In many training runs, this will reduce peak disk usage by >10x.

feature_metadataautogluon.tabular.FeatureMetadata or str, default = ‘infer’

The feature metadata used in various inner logic in feature preprocessing. If ‘infer’, will automatically construct a FeatureMetadata object based on the properties of train_data. In this case, train_data is input into autogluon.tabular.FeatureMetadata.from_df() to infer feature_metadata. If ‘infer’ incorrectly assumes the dtypes of features, consider explicitly specifying feature_metadata.

**kwargs :

auto_stackbool, default = False

Whether AutoGluon should automatically utilize bagging and multi-layer stack ensembling to boost predictive accuracy. Set this = True if you are willing to tolerate longer training times in order to maximize predictive accuracy! Automatically sets num_bag_folds and num_stack_levels arguments based on dataset properties. Note: Setting num_bag_folds and num_stack_levels arguments will override auto_stack. Note: This can increase training time (and inference time) by up to 20x, but can greatly improve predictive performance.

num_bag_foldsint, default = None

Number of folds used for bagging of models. When num_bag_folds = k, training time is roughly increased by a factor of k (set = 0 to disable bagging). Disabled by default (0), but we recommend values between 5-10 to maximize predictive performance. Increasing num_bag_folds will result in models with lower bias but that are more prone to overfitting. num_bag_folds = 1 is an invalid value, and will raise a ValueError. Values > 10 may produce diminishing returns, and can even harm overall results due to overfitting. To further improve predictions, avoid increasing num_bag_folds much beyond 10 and instead increase num_bag_sets.

num_bag_setsint, default = None

Number of repeats of kfold bagging to perform (values must be >= 1). Total number of models trained during bagging = num_bag_folds * num_bag_sets. Defaults to 1 if time_limit is not specified, otherwise 20 (always disabled if num_bag_folds is not specified). Values greater than 1 will result in superior predictive performance, especially on smaller problems and with stacking enabled (reduces overall variance).

num_stack_levelsint, default = None

Number of stacking levels to use in stack ensemble. Roughly increases model training time by factor of num_stack_levels+1 (set = 0 to disable stack ensembling). Disabled by default (0), but we recommend values between 1-3 to maximize predictive performance. To prevent overfitting, num_bag_folds >= 2 must also be set or else a ValueError will be raised.

holdout_fracfloat, default = None

Fraction of train_data to holdout as tuning data for optimizing hyperparameters (ignored unless tuning_data = None, ignored if num_bag_folds != 0 unless use_bag_holdout == True). Default value (if None) is selected based on the number of rows in the training data. Default values range from 0.2 at 2,500 rows to 0.01 at 250,000 rows. Default value is doubled if hyperparameter_tune_kwargs is set, up to a maximum of 0.2. Disabled if num_bag_folds >= 2 unless use_bag_holdout == True.

use_bag_holdoutbool, default = False

If True, a holdout_frac portion of the data is held-out from model bagging. This held-out data is only used to score models and determine weighted ensemble weights. Enable this if there is a large gap between score_val and score_test in stack models. Note: If tuning_data was specified, tuning_data is used as the holdout data. Disabled if not bagging.

hyperparameter_tune_kwargsstr or dict, default = None

Hyperparameter tuning strategy and kwargs (for example, how many HPO trials to run). If None, then hyperparameter tuning will not be performed. Valid preset values:

‘auto’: Uses the ‘bayesopt’ preset. ‘random’: Performs HPO via random search using local scheduler. ‘bayesopt’: Performs HPO via bayesian optimization using local scheduler.

For valid dictionary keys, refer to autogluon.core.scheduler.FIFOScheduler documentation.: The ‘searcher’ key is required when providing a dict.

ag_argsdict, default = None

Keyword arguments to pass to all models (i.e. common hyperparameters shared by all AutoGluon models). See the ag_args argument from “Advanced functionality: Custom AutoGluon model arguments” in the hyperparameters argument documentation for valid values. Identical to specifying ag_args parameter for all models in hyperparameters. If a key in ag_args is already specified for a model in hyperparameters, it will not be altered through this argument.

ag_args_fitdict, default = None

Keyword arguments to pass to all models. See the ag_args_fit argument from “Advanced functionality: Custom AutoGluon model arguments” in the hyperparameters argument documentation for valid values. Identical to specifying ag_args_fit parameter for all models in hyperparameters. If a key in ag_args_fit is already specified for a model in hyperparameters, it will not be altered through this argument.

ag_args_ensembledict, default = None

Keyword arguments to pass to all models. See the ag_args_ensemble argument from “Advanced functionality: Custom AutoGluon model arguments” in the hyperparameters argument documentation for valid values. Identical to specifying ag_args_ensemble parameter for all models in hyperparameters. If a key in ag_args_ensemble is already specified for a model in hyperparameters, it will not be altered through this argument.

excluded_model_typeslist, default = None

Banned subset of model types to avoid training during fit(), even if present in hyperparameters. Reference hyperparameters documentation for what models correspond to each value. Useful when a particular model type such as ‘KNN’ or ‘custom’ is not desired but altering the hyperparameters dictionary is difficult or time-consuming.

Example: To exclude both ‘KNN’ and ‘custom’ models, specify excluded_model_types=[‘KNN’, ‘custom’].

refit_fullbool or str, default = False

Whether to retrain all models on all of the data (training + validation) after the normal training procedure. This is equivalent to calling predictor.refit_full(model=refit_full) after fit. If refit_full=True, it will be treated as refit_full=’all’. If refit_full=False, refitting will not occur. Valid str values:

all: refits all models. best: refits only the best model (and its ancestors if it is a stacker model). {model_name}: refits only the specified model (and its ancestors if it is a stacker model).

For bagged models:

Reduces a model’s inference time by collapsing bagged ensembles into a single model fit on all of the training data. This process will typically result in a slight accuracy reduction and a large inference speedup. The inference speedup will generally be between 10-200x faster than the original bagged ensemble model.

The inference speedup factor is equivalent to (k * n), where k is the number of folds (num_bag_folds) and n is the number of finished repeats (num_bag_sets) in the bagged ensemble.

The runtime is generally 10% or less of the original fit runtime.: The runtime can be roughly estimated as 1 / (k * n) of the original fit runtime, with k and n defined above.

For non-bagged models:

Optimizes a model’s accuracy by retraining on 100% of the data without using a validation set. Will typically result in a slight accuracy increase and no change to inference time. The runtime will be approximately equal to the original fit runtime.

This process does not alter the original models, but instead adds additional models. If stacker models are refit by this process, they will use the refit_full versions of the ancestor models during inference. Models produced by this process will not have validation scores, as they use all of the data for training.

Therefore, it is up to the user to determine if the models are of sufficient quality by including test data in predictor.leaderboard(test_data). If the user does not have additional test data, they should reference the original model’s score for an estimate of the performance of the refit_full model.

Warning: Be aware that utilizing refit_full models without separately verifying on test data means that the model is untested, and has no guarantee of being consistent with the original model.

The time taken by this process is not enforced by time_limit.

set_best_to_refit_fullbool, default = False

If True, will change the default model that Predictor uses for prediction when model is not specified to the refit_full version of the model that exhibited the highest validation score. Only valid if refit_full is set.

keep_only_bestbool, default = False

If True, only the best model and its ancestor models are saved in the outputted predictor. All other models are deleted.: If you only care about deploying the most accurate predictor with the smallest file-size and no longer need any of the other trained models or functionality beyond prediction on new data, then set: keep_only_best=True, save_space=True. This is equivalent to calling predictor.delete_models(models_to_keep=’best’, dry_run=False) directly after fit().
If used with refit_full and set_best_to_refit_full, the best model will be the refit_full model, and the original bagged best model will be deleted.: refit_full will be automatically set to ‘best’ in this case to avoid training models which will be later deleted.

save_spacebool, default = False

If True, reduces the memory and disk size of predictor by deleting auxiliary model files that aren’t needed for prediction on new data.: This is equivalent to calling predictor.save_space() directly after fit().

This has NO impact on inference accuracy. It is recommended if the only goal is to use the trained model for prediction. Certain advanced functionality may no longer be available if save_space=True. Refer to predictor.save_space() documentation for more details.

feature_generatorautogluon.features.generators.AbstractFeatureGenerator, default = autogluon.features.generators.AutoMLPipelineFeatureGenerator

The feature generator used by AutoGluon to process the input data to the form sent to the models. This often includes automated feature generation and data cleaning. It is generally recommended to keep the default feature generator unless handling an advanced use-case. To control aspects of the default feature generation process, you can pass in an AutoMLPipelineFeatureGenerator object constructed using some of these kwargs:

enable_numeric_featuresbool, default True
Whether to keep features of ‘int’ and ‘float’ raw types. These features are passed without alteration to the models. Appends IdentityFeatureGenerator(infer_features_in_args=dict(valid_raw_types=[‘int’, ‘float’]))) to the generator group.

enable_categorical_featuresbool, default True
Whether to keep features of ‘object’ and ‘category’ raw types. These features are processed into memory optimized ‘category’ features. Appends CategoryFeatureGenerator() to the generator group.

enable_datetime_featuresbool, default True
Whether to keep features of ‘datetime’ raw type and ‘object’ features identified as ‘datetime_as_object’ features. These features will be converted to ‘int’ features representing milliseconds since epoch. Appends DatetimeFeatureGenerator() to the generator group.

enable_text_special_featuresbool, default True
Whether to use ‘object’ features identified as ‘text’ features to generate ‘text_special’ features such as word count, capital letter ratio, and symbol counts. Appends TextSpecialFeatureGenerator() to the generator group.

enable_text_ngram_featuresbool, default True
Whether to use ‘object’ features identified as ‘text’ features to generate ‘text_ngram’ features. Appends TextNgramFeatureGenerator(vectorizer=vectorizer) to the generator group.

enable_raw_text_featuresbool, default False
Whether to keep the raw text features. Appends IdentityFeatureGenerator(infer_features_in_args=dict(required_special_types=[‘text’])) to the generator group.

vectorizerCountVectorizer, default CountVectorizer(min_df=30, ngram_range=(1, 3), max_features=10000, dtype=np.uint8)
sklearn CountVectorizer object to use in TextNgramFeatureGenerator. Only used if enable_text_ngram_features=True.

unlabeled_datapd.DataFrame, default = None

[Experimental Parameter] Collection of data without labels that we can use to pretrain on. This is the same schema as train_data, except without the labels. Currently, unlabeled_data is only used for pretraining a TabTransformer model. If you do not specify ‘TRANSF’ with unlabeled_data, then no pretraining will occur and unlabeled_data will be ignored! After the pretraining step, we will finetune using the TabTransformer model as well. If TabTransformer is ensembled with other models, like in typical AutoGluon fashion, then the output of this “pretrain/finetune” will be ensembled with other models, which will not used the unlabeled_data. The “pretrain/finetune flow” is also known as semi-supervised learning. The typical use case for unlabeled_data is to add signal to your model where you may not have sufficient training data. e.g. 500 hand-labeled samples (perhaps a hard human task), whole data set (unlabeled) is thousands/millions. However, this isn’t the only use case. Given enough unlabeled data(millions of rows), you may see improvements to any amount of labeled data.

verbosityint

If specified, overrides the existing predictor.verbosity value.

Returns

TabularPredictor object. Returns self.

Examples

>>> from autogluon.tabular import TabularDataset, TabularPredictor
>>> train_data = TabularDataset('https://autogluon.s3.amazonaws.com/datasets/Inc/train.csv')
>>> label = 'class'
>>> predictor = TabularPredictor(label=label).fit(train_data)
>>> test_data = TabularDataset('https://autogluon.s3.amazonaws.com/datasets/Inc/test.csv')
>>> leaderboard = predictor.leaderboard(test_data)
>>> y_test = test_data[label]
>>> test_data = test_data.drop(columns=[label])
>>> y_pred = predictor.predict(test_data)
>>> perf = predictor.evaluate_predictions(y_true=y_test, y_pred=y_pred)

To maximize predictive performance, use the following:

>>> eval_metric = 'roc_auc'  # set this to the metric you ultimately care about
>>> time_limit = 3600  # set as long as you are willing to wait (in sec)
>>> predictor = TabularPredictor(label=label, eval_metric=eval_metric).fit(train_data, presets=['best_quality'], time_limit=time_limit)

fit_extra(hyperparameters, time_limit=None, base_model_names=None, **kwargs)[source]¶

Fits additional models after the original TabularPredictor.fit() call. The original train_data and tuning_data will be used to train the models.

Parameters

hyperparametersstr or dict: Refer to argument documentation in TabularPredictor.fit(). If base_model_names is specified and hyperparameters is using the level-based key notation, the key of the level which directly uses the base models should be 1. The level in the hyperparameters dictionary is relative, not absolute.
time_limitint, default = None: Refer to argument documentation in TabularPredictor.fit().
base_model_nameslist, default = None: The names of the models to use as base models for this fit call. Base models will provide their out-of-fold predictions as additional features to the models in hyperparameters. If specified, all models trained will be stack ensembles. If None, models will be trained as if they were specified in TabularPredictor.fit(), without depending on existing models. Only valid if bagging is enabled.
**kwargs :: Refer to kwargs documentation in TabularPredictor.fit(). Note that the following kwargs are not available in fit_extra as they cannot be changed from their values set in fit():

[holdout_frac, num_bag_folds, auto_stack, feature_generator, unlabeled_data]

fit_summary(verbosity=3, show_plot=False)[source]¶

Output summary of information about models produced during fit(). May create various generated summary plots and store them in folder: predictor.path.

Parameters

verbosityint, default = 3: Controls how detailed of a summary to output. Set <= 0 for no output printing, 1 to print just high-level summary, 2 to print summary and create plots, >= 3 to print all information produced during fit().
show_plotbool, default = False: If True, shows the model summary plot in browser when verbosity > 1.

Returns

Dict containing various detailed information. We do not recommend directly printing this dict as it may be very large.

fit_weighted_ensemble(base_models: list = None, name_suffix='Best', expand_pareto_frontier=False, time_limit=None)[source]¶

Fits new weighted ensemble models to combine predictions of previously-trained models. cache_data must have been set to True during the original training to enable this functionality.

Parameters

base_modelslist, default = None: List of model names the weighted ensemble can consider as candidates. If None, all previously trained models are considered except for weighted ensemble models. As an example, to train a weighted ensemble that can only have weights assigned to the models ‘model_a’ and ‘model_b’, set base_models=[‘model_a’, ‘model_b’]
name_suffixstr, default = ‘Best’: Name suffix to add to the name of the newly fitted ensemble model.
expand_pareto_frontierbool, default = False: If True, will train N-1 weighted ensemble models instead of 1, where N=len(base_models). The final model trained when True is equivalent to the model trained when False. These weighted ensemble models will attempt to expand the pareto frontier. This will create many different weighted ensembles which have different accuracy/memory/inference-speed trade-offs. This is particularly useful when inference speed is an important consideration.
time_limitint, default = None: Time in seconds each weighted ensemble model is allowed to train for. If expand_pareto_frontier=True, the time_limit value is applied to each model. If None, the ensemble models train without time restriction.

Returns

List of newly trained weighted ensemble model names.
If an exception is encountered while training an ensemble model, that model’s name will be absent from the list.

get_model_best()[source]¶

Returns the string model name of the best model by validation score. This is typically the same model used during inference when predictor.predict is called without specifying a model.

Returns

String model name of the best model

get_model_full_dict()[source]¶

Returns a dictionary of original model name -> refit full model name. Empty unless refit_full=True was set during fit or predictor.refit_full() was called. This can be useful when determining the best model based off of predictor.leaderboard(), then getting the _FULL version of the model by passing its name as the key to this dictionary.

Returns

Dictionary of original model name -> refit full model name.

get_model_names(stack_name=None, level=None, can_infer: bool = None, models: list = None) → list[source]¶: Returns the list of model names trained in this predictor object.

get_model_names_persisted() → list[source]¶: Returns the list of model names which are persisted in memory.

get_oof_pred(model: str = None, transformed=False, train_data=None, internal_oof=False) → pandas.core.series.Series[source]¶

Note: This is advanced functionality not intended for normal usage.

Returns the out-of-fold (OOF) predictions for every row in the training data.

For more information, refer to get_oof_pred_proba() documentation.

Parameters

modelstr (optional): Refer to get_oof_pred_proba() documentation.
transformedbool, default = False: Refer to get_oof_pred_proba() documentation.
train_datapd.DataFrame, default = None: Refer to get_oof_pred_proba() documentation.
internal_oofbool, default = False: Refer to get_oof_pred_proba() documentation.

Returns

pd.Series object of the out-of-fold training predictions of the model.

get_oof_pred_proba(model: str = None, transformed=False, as_multiclass=True, train_data=None, internal_oof=False) → Union[pandas.core.frame.DataFrame, pandas.core.series.Series][source]¶

Note: This is advanced functionality not intended for normal usage.

Returns the out-of-fold (OOF) predicted class probabilities for every row in the training data. OOF prediction probabilities may provide unbiased estimates of generalization accuracy (reflecting how predictions will behave on new data) Predictions for each row are only made using models that were fit to a subset of data where this row was held-out.

Warning: This method will raise an exception if called on a model that is not a bagged ensemble. Only bagged models (such a stacker models) can produce OOF predictions.: This also means that refit_full models and distilled models will raise an exception.
Warning: If intending to join the output of this method with the original training data, be aware that a rare edge-case issue exists:: Multiclass problems with rare classes combined with the use of the ‘log_loss’ eval_metric may have forced AutoGluon to duplicate rows in the training data to satisfy minimum class counts in the data. If this has occurred, then the indices and row counts of the returned pd.Series in this method may not align with the training data. In this case, consider fetching the processed training data using predictor.load_data_internal() instead of using the original training data. A more benign version of this issue occurs when ‘log_loss’ wasn’t specified as the eval_metric but rare classes were dropped by AutoGluon. In this case, not all of the original training data rows will have an OOF prediction. It is recommended to either drop these rows during the join or to get direct predictions on the missing rows via TabularPredictor.predict_proba().

Parameters

modelstr (optional)

The name of the model to get out-of-fold predictions from. Defaults to None, which uses the highest scoring model on the validation set. Valid models are listed in this predictor by calling predictor.get_model_names()

transformedbool, default = False

Whether the output values should be of the original label representation (False) or the internal label representation (True). The internal representation for binary and multiclass classification are integers numbering the k possible classes from 0 to k-1, while the original representation is identical to the label classes provided during fit. Generally, most users will want the original representation and keep transformed=False.

as_multiclassbool, default = True

Whether to return binary classification probabilities as if they were for multiclass classification.: Output will contain two columns, and if transformed=False, the column names will correspond to the binary class labels. The columns will be the same order as predictor.class_labels.

If False, output will contain only 1 column for the positive class (get positive_class name via predictor.positive_class). Only impacts output for binary classification problems.

train_datapd.DataFrame, default = None

Specify the original train_data to ensure that any training rows that were originally dropped internally are properly handled. If None, then output will not contain all rows if training rows were dropped internally during fit.

internal_oofbool, default = False

[Advanced Option] Return the internal OOF preds rather than the externally facing OOF preds. Internal OOF preds may have more/fewer rows than was provided in train_data, and are incompatible with external data. If you don’t know what this does, keep it as False.

Returns

pd.Series or pd.DataFrame object of the out-of-fold training prediction probabilities of the model.

info()[source]¶

[EXPERIMENTAL] Returns a dictionary of predictor metadata. Warning: This functionality is currently in preview mode.

The metadata information returned may change in structure in future versions without warning. The definitions of various metadata values are not yet documented. The output of this function should not be used for programmatic decisions.

Contains information such as row count, column count, model training time, validation scores, hyperparameters, and much more.

Returns

Dictionary of predictor metadata.

leaderboard(data=None, extra_info=False, extra_metrics=None, only_pareto_frontier=False, silent=False)[source]¶

Output summary of information about models produced during fit() as a pd.DataFrame. Includes information on test and validation scores for all models, model training times, inference times, and stack levels. Output DataFrame columns include:

‘model’: The name of the model.

‘score_val’: The validation score of the model on the ‘eval_metric’.
NOTE: Metrics scores always show in higher is better form. This means that metrics such as log_loss and root_mean_squared_error will have their signs FLIPPED, and values will be negative. This is necessary to avoid the user needing to know the metric to understand if higher is better when looking at leaderboard.

‘pred_time_val’: The inference time required to compute predictions on the validation data end-to-end.
Equivalent to the sum of all ‘pred_time_val_marginal’ values for the model and all of its base models.

‘fit_time’: The fit time required to train the model end-to-end (Including base models if the model is a stack ensemble).
Equivalent to the sum of all ‘fit_time_marginal’ values for the model and all of its base models.

‘pred_time_val_marginal’: The inference time required to compute predictions on the validation data (Ignoring inference times for base models).
Note that this ignores the time required to load the model into memory when bagging is disabled.

‘fit_time_marginal’: The fit time required to train the model (Ignoring base models). ‘stack_level’: The stack level of the model.

A model with stack level N can take any set of models with stack level less than N as input, with stack level 1 models having no model inputs.

‘can_infer’: If model is able to perform inference on new data. If False, then the model either was not saved, was deleted, or an ancestor of the model cannot infer.
can_infer is often False when save_bag_folds=False was specified in initial fit().

‘fit_order’: The order in which models were fit. The first model fit has fit_order=1, and the Nth model fit has fit_order=N. The order corresponds to the first child model fit in the case of bagged ensembles.

Parameters

datastr or TabularDataset or pd.DataFrame (optional)

This Dataset must also contain the label-column with the same column-name as specified during fit(). If specified, then the leaderboard returned will contain additional columns ‘score_test’, ‘pred_time_test’, and ‘pred_time_test_marginal’.

‘score_test’: The score of the model on the ‘eval_metric’ for the data provided.
NOTE: Metrics scores always show in higher is better form. This means that metrics such as log_loss and root_mean_squared_error will have their signs FLIPPED, and values will be negative. This is necessary to avoid the user needing to know the metric to understand if higher is better when looking at leaderboard.

‘pred_time_test’: The true end-to-end wall-clock inference time of the model for the data provided.
Equivalent to the sum of all ‘pred_time_test_marginal’ values for the model and all of its base models.

‘pred_time_test_marginal’: The inference time of the model for the data provided, minus the inference time for the model’s base models, if it has any.
Note that this ignores the time required to load the model into memory when bagging is disabled.

If str is passed, data will be loaded using the str value as the file path.

extra_infobool, default = False

If True, will return extra columns with advanced info. This requires additional computation as advanced info data is calculated on demand. Additional output columns when extra_info=True include:

‘num_features’: Number of input features used by the model.
Some models may ignore certain features in the preprocessed data.

‘num_models’: Number of models that actually make up this “model” object.
For non-bagged models, this is 1. For bagged models, this is equal to the number of child models (models trained on bagged folds) the bagged ensemble contains.

‘num_models_w_ancestors’: Equivalent to the sum of ‘num_models’ values for the model and its’ ancestors (see below). ‘memory_size’: The amount of memory in bytes the model requires when persisted in memory. This is not equivalent to the amount of memory the model may use during inference.

For bagged models, this is the sum of the ‘memory_size’ of all child models.

‘memory_size_w_ancestors’: Equivalent to the sum of ‘memory_size’ values for the model and its’ ancestors.
This is the amount of memory required to avoid loading any models in-between inference calls to get predictions from this model. For online-inference, this is critical. It is important that the machine performing online inference has memory more than twice this value to avoid loading models for every call to inference by persisting models in memory.

‘memory_size_min’: The amount of memory in bytes the model minimally requires to perform inference.
For non-bagged models, this is equivalent to ‘memory_size’. For bagged models, this is equivalent to the largest child model’s ‘memory_size_min’. To minimize memory usage, child models can be loaded and un-persisted one by one to infer. This is the default behavior if a bagged model was not already persisted in memory prior to inference.

‘memory_size_min_w_ancestors’: Equivalent to the max of the ‘memory_size_min’ values for the model and its’ ancestors.
This is the minimum required memory to infer with the model by only loading one model at a time, as each of its ancestors will also have to be loaded into memory. For offline-inference where latency is not a concern, this should be used to determine the required memory for a machine if ‘memory_size_w_ancestors’ is too large.

‘num_ancestors’: Number of ancestor models for the given model.

‘num_descendants’: Number of descendant models for the given model.

‘model_type’: The type of the given model.
If the model is an ensemble type, ‘child_model_type’ will indicate the inner model type. A stack ensemble of bagged LightGBM models would have ‘StackerEnsembleModel’ as its model type.

‘child_model_type’: The child model type. None if the model is not an ensemble. A stack ensemble of bagged LightGBM models would have ‘LGBModel’ as its child type.
child models are models which are used as a group to generate a given bagged ensemble model’s predictions. These are the models trained on each fold of a bagged ensemble. For 10-fold bagging, the bagged ensemble model would have 10 child models. For 10-fold bagging with 3 repeats, the bagged ensemble model would have 30 child models. Note that child models are distinct from ancestors and descendants.

‘hyperparameters’: The hyperparameter values specified for the model.
All hyperparameters that do not appear in this dict remained at their default values.

‘hyperparameters_fit’: The hyperparameters set by the model during fit.
This overrides the ‘hyperparameters’ value for a particular key if present in ‘hyperparameters_fit’ to determine the fit model’s final hyperparameters. This is most commonly set for hyperparameters that indicate model training iterations or epochs, as early stopping can find a different value from what ‘hyperparameters’ indicated. In these cases, the provided hyperparameter in ‘hyperparameters’ is used as a maximum for the model, but the model is still able to early stop at a smaller value during training to achieve a better validation score or to satisfy time constraints. For example, if a NN model was given epochs=500 as a hyperparameter, but found during training that epochs=60 resulted in optimal validation score, it would use epoch=60 and hyperparameters_fit={‘epoch’: 60} would be set.

‘ag_args_fit’: Special AutoGluon arguments that influence model fit.
See the documentation of the hyperparameters argument in TabularPredictor.fit() for more information.

‘features’: List of feature names used by the model.

‘child_hyperparameters’: Equivalent to ‘hyperparameters’, but for the model’s children.

‘child_hyperparameters_fit’: Equivalent to ‘hyperparameters_fit’, but for the model’s children.

‘child_ag_args_fit’: Equivalent to ‘ag_args_fit’, but for the model’s children.

‘ancestors’: The model’s ancestors. Ancestor models are the models which are required to make predictions during the construction of the model’s input features.
If A is an ancestor of B, then B is a descendant of A. If a model’s ancestor is deleted, the model is no longer able to infer on new data, and its ‘can_infer’ value will be False. A model can only have ancestor models whose ‘stack_level’ are lower than itself. ‘stack_level’=1 models have no ancestors.

‘descendants’: The model’s descendants. Descendant models are the models which require this model to make predictions during the construction of their input features.
If A is a descendant of B, then B is an ancestor of A. If this model is deleted, then all descendant models will no longer be able to infer on new data, and their ‘can_infer’ values will be False. A model can only have descendant models whose ‘stack_level’ are higher than itself.

extra_metricslist, default = None

A list of metrics to calculate scores for and include in the output DataFrame. Only valid when data is specified. The scores refer to the scores on data (same data as used to calculate the score_test column). This list can contain any values which would also be valid for eval_metric in predictor init. For example, extra_metrics=[‘accuracy’, ‘roc_auc’, ‘log_loss’] would be valid in binary classification. This example would return 3 additional columns in the output DataFrame, whose column names match the names of the metrics. Passing extra_metrics=[predictor.eval_metric] would return an extra column in the name of the eval metric that has identical values to score_test. This also works with custom metrics. If passing an object instead of a string, the column name will be equal to the .name attribute of the object. NOTE: Metrics scores always show in higher is better form. This means that metrics such as log_loss and root_mean_squared_error will have their signs FLIPPED, and values will be negative. This is necessary to avoid the user needing to know the metric to understand if higher is better when looking at leaderboard.

only_pareto_frontierbool, default = False

If True, only return model information of models in the Pareto frontier of the accuracy/latency trade-off (models which achieve the highest score within their end-to-end inference time). At minimum this will include the model with the highest score and the model with the lowest inference time. This is useful when deciding which model to use during inference if inference time is a consideration. Models filtered out by this process would never be optimal choices for a user that only cares about model inference time and score.

silentbool, default = False

Should leaderboard DataFrame be printed?

Returns

pd.DataFrame of model performance summary information.

classmethod load(path: str, verbosity: int = None)[source]¶

Load a TabularPredictor object previously produced by fit() from file and returns this object. It is highly recommended the predictor be loaded with the exact AutoGluon version it was fit with.

Parameters

pathstr: The path to directory in which this Predictor was previously saved.
verbosityint, default = None: Sets the verbosity level of this Predictor after it is loaded. Valid values range from 0 (least verbose) to 4 (most verbose). If None, logging verbosity is not changed from existing values. Specify larger values to see more information printed when using Predictor during inference, smaller values to see less information. Refer to TabularPredictor init for more information.

load_data_internal(data='train', return_X=True, return_y=True)[source]¶

Loads the internal data representation used during model training. Individual AutoGluon models like the neural network may apply additional feature transformations that are not reflected in this method. This method only applies universal transforms employed by all AutoGluon models. Warning, the internal representation may:

Have different features compared to the original data. Have different row counts compared to the original data. Have indices which do not align with the original data. Have label values which differ from those in the original data.

Internal data representations should NOT be combined with the original data, in most cases this is not possible.

Parameters

datastr, default = ‘train’

The data to load. Valid values are:

‘train’:
Load the training data used during model training. This is a transformed and augmented version of the train_data passed in fit().

‘val’:
Load the validation data used during model training. This is a transformed and augmented version of the tuning_data passed in fit(). If tuning_data=None was set in fit(), then tuning_data is an automatically generated validation set created by splitting train_data. Warning: Will raise an exception if called by a bagged predictor, as bagged predictors have no validation data.

return_Xbool, default = True

Whether to return the internal data features If set to False, then the first element in the returned tuple will be None.

return_ybool, default = True

Whether to return the internal data labels If set to False, then the second element in the returned tuple will be None.

Returns

Tuple of (pd.DataFrame, pd.Series) corresponding to the internal data features and internal data labels, respectively.

persist_models(models='best', with_ancestors=True, max_memory=0.1) → list[source]¶

Persist models in memory for reduced inference latency. This is particularly important if the models are being used for online-inference where low latency is critical. If models are not persisted in memory, they are loaded from disk every time they are asked to make predictions.

Parameters

modelslist of str or str, default = ‘best’: Model names of models to persist. If ‘best’ then the model with the highest validation score is persisted (this is the model used for prediction by default). If ‘all’ then all models are persisted. Valid models are listed in this predictor by calling predictor.get_model_names().
with_ancestorsbool, default = True: If True, all ancestor models of the provided models will also be persisted. If False, stacker models will not have the models they depend on persisted unless those models were specified in models. This will slow down inference as the ancestor models will still need to be loaded from disk for each predict call. Only relevant for stacker models.
max_memoryfloat, default = 0.1: Proportion of total available memory to allow for the persisted models to use. If the models’ summed memory usage requires a larger proportion of memory than max_memory, they are not persisted. In this case, the output will be an empty list. If None, then models are persisted regardless of estimated memory usage. This can cause out-of-memory errors.

Returns

List of persisted model names.

plot_ensemble_model(prune_unused_nodes=True) → str[source]¶

Output the visualized stack ensemble architecture of a model trained by fit(). The plot is stored to a file, ensemble_model.png in folder predictor.path

This function requires graphviz and pygraphviz to be installed because this visualization depends on those package. Unless this function will raise ImportError without being able to generate the visual of the ensemble model.

To install the required package, run the below commands (for Ubuntu linux):

$ sudo apt-get install graphviz $ pip install graphviz

For other platforms, refer to https://graphviz.org/ for Graphviz install, and https://pygraphviz.github.io/documentation.html for PyGraphviz.

Returns

The file name with the full path to the saved graphic

property positive_class¶

Returns the positive class name in binary classification. Useful for computing metrics such as F1 which require a positive and negative class. In binary classification, TabularPredictor.predict_proba(as_multiclass=False) returns the estimated probability that each row belongs to the positive class. Will print a warning and return None if called when predictor.problem_type != ‘binary’.

Returns

The positive class name in binary classification or None if the problem is not binary classification.

predict(data, model=None, as_pandas=True)[source]¶

Use trained models to produce predictions of label column values for new data.

Parameters

datastr or TabularDataset or pd.DataFrame: The data to make predictions for. Should contain same column names as training Dataset and follow same format (may contain extra columns that won’t be used by Predictor, including the label-column itself). If str is passed, data will be loaded using the str value as the file path.
modelstr (optional): The name of the model to get predictions from. Defaults to None, which uses the highest scoring model on the validation set. Valid models are listed in this predictor by calling predictor.get_model_names()
as_pandasbool, default = True: Whether to return the output as a pd.Series (True) or np.ndarray (False).

Returns

Array of predictions, one corresponding to each row in given dataset. Either np.ndarray or pd.Series depending on as_pandas argument.

predict_proba(data, model=None, as_pandas=True, as_multiclass=True)[source]¶

Use trained models to produce predicted class probabilities rather than class-labels (if task is classification). If predictor.problem_type is regression, this functions identically to predict, returning the same output.

Parameters

datastr or TabularDataset or pd.DataFrame

The data to make predictions for. Should contain same column names as training dataset and follow same format (may contain extra columns that won’t be used by Predictor, including the label-column itself). If str is passed, data will be loaded using the str value as the file path.

modelstr (optional)

The name of the model to get prediction probabilities from. Defaults to None, which uses the highest scoring model on the validation set. Valid models are listed in this predictor by calling predictor.get_model_names().

as_pandasbool, default = True

Whether to return the output as a pandas object (True) or numpy array (False). Pandas object is a DataFrame if this is a multiclass problem or as_multiclass=True, otherwise it is a Series. If the output is a DataFrame, the column order will be equivalent to predictor.class_labels.

as_multiclassbool, default = True

Whether to return binary classification probabilities as if they were for multiclass classification.: Output will contain two columns, and if as_pandas=True, the column names will correspond to the binary class labels. The columns will be the same order as predictor.class_labels.

If False, output will contain only 1 column for the positive class (get positive_class name via predictor.positive_class). Only impacts output for binary classification problems.

Returns

Array of predicted class-probabilities, corresponding to each row in the given data.
May be a np.ndarray or pd.DataFrame / pd.Series depending on as_pandas and as_multiclass arguments and the type of prediction problem.
For binary classification problems, the output contains for each datapoint the predicted probabilities of the negative and positive classes, unless you specify as_multiclass=False.

refit_full(model='all')[source]¶

Retrain model on all of the data (training + validation). For bagged models:

Optimizes a model’s inference time by collapsing bagged ensembles into a single model fit on all of the training data. This process will typically result in a slight accuracy reduction and a large inference speedup. The inference speedup will generally be between 10-200x faster than the original bagged ensemble model.

The inference speedup factor is equivalent to (k * n), where k is the number of folds (num_bag_folds) and n is the number of finished repeats (num_bag_sets) in the bagged ensemble.

The runtime is generally 10% or less of the original fit runtime.
The runtime can be roughly estimated as 1 / (k * n) of the original fit runtime, with k and n defined above.

For non-bagged models:: Optimizes a model’s accuracy by retraining on 100% of the data without using a validation set. Will typically result in a slight accuracy increase and no change to inference time. The runtime will be approximately equal to the original fit runtime.

This process does not alter the original models, but instead adds additional models. If stacker models are refit by this process, they will use the refit_full versions of the ancestor models during inference. Models produced by this process will not have validation scores, as they use all of the data for training.

Therefore, it is up to the user to determine if the models are of sufficient quality by including test data in predictor.leaderboard(test_data). If the user does not have additional test data, they should reference the original model’s score for an estimate of the performance of the refit_full model.

Warning: Be aware that utilizing refit_full models without separately verifying on test data means that the model is untested, and has no guarantee of being consistent with the original model.

cache_data must have been set to True during the original training to enable this functionality.

Parameters

modelstr, default = ‘all’

Model name of model to refit.: If ‘all’ then all models are refitted. If ‘best’ then the model with the highest validation score is refit.

All ancestor models will also be refit in the case that the selected model is a weighted or stacker ensemble. Valid models are listed in this predictor by calling predictor.get_model_names().

Returns

Dictionary of original model names -> refit_full model names.

save()[source]¶: Save this Predictor to file in directory specified by this Predictor’s path. Note that TabularPredictor.fit() already saves the predictor object automatically (we do not recommend modifying the Predictor object yourself as it tracks many trained models).

save_space(remove_data=True, remove_fit_stack=True, requires_save=True, reduce_children=False)[source]¶

Reduces the memory and disk size of predictor by deleting auxiliary model files that aren’t needed for prediction on new data. This function has NO impact on inference accuracy. It is recommended to invoke this method if the only goal is to use the trained model for prediction. However, certain advanced functionality may no longer be available after save_space() has been called.

Parameters

remove_databool, default = True

Whether to remove cached files of the original training and validation data. Only reduces disk usage, it has no impact on memory usage. This is especially useful when the original data was large. This is equivalent to setting cache_data=False during the original fit().

Will disable all advanced functionality that requires cache_data=True.

remove_fit_stackbool, default = True

Whether to remove information required to fit new stacking models and continue fitting bagged models with new folds. Only reduces disk usage, it has no impact on memory usage. This includes:

out-of-fold (OOF) predictions

This is useful for multiclass problems with many classes, as OOF predictions can become very large on disk. (1 GB per model in extreme cases) This disables predictor.refit_full() for stacker models.

requires_savebool, default = True

Whether to remove information that requires the model to be saved again to disk. Typically this only includes flag variables that don’t have significant impact on memory or disk usage, but should technically be updated due to the removal of more important information.

An example is the is_data_saved boolean variable in trainer, which should be updated to False if remove_data=True was set.

reduce_childrenbool, default = False

Whether to apply the reduction rules to bagged ensemble children models. These are the models trained for each fold of the bagged ensemble. This should generally be kept as False since the most important memory and disk reduction techniques are automatically applied to these models during the original fit() call.

transform_features(data=None, model=None, base_models=None, return_original_features=True)[source]¶

Transforms data features through the AutoGluon feature generator. This is useful to gain an understanding of how AutoGluon interprets the data features. The output of this function can be used to train further models, even outside of AutoGluon. This can be useful for training your own models on the same data representation as AutoGluon. Individual AutoGluon models like the neural network may apply additional feature transformations that are not reflected in this method. This method only applies universal transforms employed by all AutoGluon models. When data=None, `base_models=[{best_model}], and bagging was enabled during fit():

This returns the out-of-fold predictions of the best model, which can be used as training input to a custom user stacker model.

Parameters

datastr or TabularDataset or pd.DataFrame (optional)

The data to apply feature transformation to. This data does not require the label column. If str is passed, data will be loaded using the str value as the file path. If not specified, the original data used during fit() will be used if fit() was previously called with cache_data=True. Otherwise, an exception will be raised.

For non-bagged mode predictors:
The data used when not specified is the validation set. This can either be an automatically generated validation set or the user-defined tuning_data if passed during fit(). If all parameters are unspecified, then the output is equivalent to predictor.load_data_internal(data=’val’, return_X=True, return_y=False)[0]. To get the label values of the output, call predictor.load_data_internal(data=’val’, return_X=False, return_y=True)[1]. If the original training set is desired, it can be passed in through data.

Warning: Do not pass the original training set if model or base_models are set. This will result in overfit feature transformation.

For bagged mode predictors:
The data used when not specified is the full training set. If all parameters are unspecified, then the output is equivalent to predictor.load_data_internal(data=’train’, return_X=True, return_y=False)[0]. To get the label values of the output, call predictor.load_data_internal(data=’train’, return_X=False, return_y=True)[1]. base_model features generated in this instance will be from out-of-fold predictions. Note that the training set may differ from the training set originally passed during fit(), as AutoGluon may choose to drop or duplicate rows during training. Warning: Do not pass the original training set through data if model or base_models are set. This will result in overfit feature transformation. Instead set data=None.

modelstr, default = None

Model to generate input features for. The output data will be equivalent to the input data that would be sent into model.predict_proba(data).

Note: This only applies to cases where data is not the training data.

If None, then only return generically preprocessed features prior to any model fitting. Valid models are listed in this predictor by calling predictor.get_model_names(). Specifying a refit_full model will cause an exception if data=None. base_models=None is a requirement when specifying model.

base_modelslist, default = None

List of model names to use as base_models for a hypothetical stacker model when generating input features. If None, then only return generically preprocessed features prior to any model fitting. Valid models are listed in this predictor by calling predictor.get_model_names(). If a stacker model S exists with base_models=M, then setting base_models=M is equivalent to setting model=S. model=None is a requirement when specifying base_models.

return_original_featuresbool, default = True

Whether to return the original features. If False, only returns the additional output columns from specifying model or base_models.

This is useful to set to False if the intent is to use the output as input to further stacker models without the original features.

Returns

pd.DataFrame of the provided data after feature transformation has been applied.
This output does not include the label column, and will remove it if present in the supplied data.
If a transformed label column is desired, use predictor.transform_labels.

Examples

>>> from autogluon.tabular import TabularPredictor
>>> predictor = TabularPredictor(label='class').fit('train.csv', label='class', auto_stack=True)  # predictor is in bagged mode.
>>> model = 'WeightedEnsemble_L2'
>>> train_data_transformed = predictor.transform_features(model=model)  # Internal training DataFrame used as input to `model.fit()` for each model trained in predictor.fit()`
>>> test_data_transformed = predictor.transform_features('test.csv', model=model)  # Internal test DataFrame used as input to `model.predict_proba()` during `predictor.predict_proba(test_data, model=model)`

transform_labels(labels, inverse=False, proba=False)[source]¶

Transforms data labels to the internal label representation. This can be useful for training your own models on the same data label representation as AutoGluon. Regression problems do not differ between original and internal representation, and thus this method will return the provided labels. Warning: When inverse=False, it is possible for the output to contain NaN label values in multiclass problems if the provided label was dropped during training.

Parameters

labelsnp.ndarray or pd.Series

Labels to transform. If proba=False, an example input would be the output of predictor.predict(test_data). If proba=True, an example input would be the output of predictor.predict_proba(test_data, as_multiclass=False).

inverseboolean, default = False

When True, the input labels are treated as being in the internal representation and the original representation is outputted.

probaboolean, default = False

When True, the input labels are treated as probabilities and the output will be the internal representation of probabilities.: In this case, it is expected that labels be a pd.DataFrame or np.ndarray. If the problem_type is multiclass:

The input column order must be equal to predictor.class_labels. The output column order will be equal to predictor.class_labels_internal. if inverse=True, the same logic applies, but with input and output columns interchanged.
When False, the input labels are treated as actual labels and the output will be the internal representation of the labels.: In this case, it is expected that labels be a pd.Series or np.ndarray.

Returns

pd.Series of labels if proba=False or pd.DataFrame of label probabilities if proba=True.

unpersist_models(models='all') → list[source]¶

Unpersist models in memory for reduced memory usage. If models are not persisted in memory, they are loaded from disk every time they are asked to make predictions. Note: Another way to reset the predictor and unpersist models is to reload the predictor from disk via predictor = TabularPredictor.load(predictor.path).

Parameters

modelslist of str or str, default = ‘all’: Model names of models to unpersist. If ‘all’ then all models are unpersisted. Valid models are listed in this predictor by calling predictor.get_model_names_persisted().

Returns

List of unpersisted model names.

ImagePredictor¶

class autogluon.vision.ImagePredictor(label='label', problem_type=None, eval_metric=None, path=None, verbosity=2)[source]¶

AutoGluon Predictor for predicting image category based on their whole contents

Parameters

labelstr, default = ‘label’

Name of the column that contains the target variable to predict.

problem_typestr, default = None

Type of prediction problem. Options: (‘multiclass’). If problem_type = None, the prediction problem type is inferred: based on the provided dataset. Currently only multiclass(or single class vs. background) classification is supported.

eval_metricstr, default = None

Metric by which to evaluate the data with. Options: (‘accuracy’). Currently only supports accuracy for multiclass classification.

pathstr, default = None

The directory for saving logs or intermediate data. If unspecified, will create a sub-directory under current working directory.

verbosityint, default = 2

Verbosity levels range from 0 to 4 and control how much information is printed. Higher levels correspond to more detailed print statements (you can set verbosity = 0 to suppress warnings). If using logging, you can alternatively control amount of information printed via logger.setLevel(L), where L ranges from 0 to 50 (Note: higher values of L correspond to fewer print statements, opposite of verbosity levels)

Attributes

path

Methods

`Dataset`	alias of `gluoncv.auto.data.dataset.ImageClassificationDataset`
`evaluate`(data)	Evaluate model performance on validation data.
`fit`(train_data[, tuning_data, time_limit, …])	Automatic fit process for image prediction.
`fit_summary`()	Return summary of last fit process.
`list_models`()	Get the list of supported model names in model zoo that can be used for image classification.
`load`(path[, verbosity])	Load previously saved predictor.
`predict`(data[, as_pandas])	Predict images as a whole, return labels(class category).
`predict_feature`(data[, as_pandas])	Predict images visual feature representations, return the features as numpy (1xD) vector.
`predict_proba`(data[, as_pandas])	Predict images as a whole, return the probabilities of each category rather than class-labels.
`save`([path])	Dump predictor to disk.

Dataset¶: alias of gluoncv.auto.data.dataset.ImageClassificationDataset

evaluate(data)[source]¶

Evaluate model performance on validation data.

Parameters

datapd.DataFrame or iterator: The validation data.

fit(train_data, tuning_data=None, time_limit='auto', presets=None, hyperparameters=None, **kwargs)[source]¶

Automatic fit process for image prediction.

Parameters

train_datapd.DataFrame or str

Training data, can be a dataframe like image dataset. For dataframe like datasets, image and label columns are required. image: raw image paths. label: categorical integer id, starting from 0. For more details of how to construct a dataset for image predictor, check out: http://preview.d2l.ai/d8/main/image_classification/getting_started.html. If a string is provided, will search for d8 built-in datasets.

tuning_datapd.DataFrame or str, default = None

Another dataset containing validation data reserved for model selection and hyperparameter-tuning, can be a dataframe like image dataset. If a string is provided, will search for k8 datasets. If None, the validation dataset will be randomly split from train_data according to holdout_frac.

time_limitint, default = ‘auto’ (defaults to 2 hours if no presets detected)

Time limit in seconds, if None, will run until all tuning and training finished. If time_limit is hit during fit, the HPO process will interrupt and return the current best configuration.

presetslist or str or dict, default = [‘medium_quality_faster_train’]

List of preset configurations for various arguments in fit(). Can significantly impact predictive accuracy, memory-footprint, and inference latency of trained models, and various other properties of the returned predictor. It is recommended to specify presets and avoid specifying most other fit() arguments or model hyperparameters prior to becoming familiar with AutoGluon. As an example, to get the most accurate overall predictor (regardless of its efficiency), set presets=’best_quality’. To get good quality with faster inference speed, set presets=’good_quality_faster_inference’ Any user-specified arguments in fit() will override the values used by presets. If specifying a list of presets, later presets will override earlier presets if they alter the same argument. For precise definitions of the provided presets, see file: autogluon/vision/configs/presets_configs.py. Users can specify custom presets by passing in a dictionary of argument values as an element to the list. Available Presets: [‘best_quality’, ‘high_quality_fast_inference’, ‘good_quality_faster_inference’, ‘medium_quality_faster_train’] It is recommended to only use one quality based preset in a given call to fit() as they alter many of the same arguments and are not compatible with each-other.

Note that depending on your specific hardware limitation(# gpu, size of gpu memory…) your mileage may vary a lot, you may choose lower quality presets if necessary, and try to reduce batch_size if OOM(“RuntimeError: CUDA error: out of memory”) happens frequently during the fit.

In-depth Preset Info:

# Best predictive accuracy with little consideration to inference time or model size. Achieve even better results by specifying a large time_limit value. # Recommended for applications that benefit from the best possible model accuracy. best_quality={

‘hyperparameters’: {
‘model’: Categorical(‘coat_lite_small’, ‘twins_pcpvt_base’, ‘swin_base_patch4_window7_224’), ‘lr’: Real(1e-5, 1e-2, log=True), ‘batch_size’: Categorical(8, 16, 32, 64, 128), ‘epochs’: 200, ‘early_stop_patience’: 50 },

‘hyperparameter_tune_kwargs’: {
‘num_trials’: 1024, ‘searcher’: ‘random’,

}, ‘time_limit’: 12*3600,

},

# Good predictive accuracy with fast inference. # Recommended for applications that require reasonable inference speed and/or model size. good_quality_fast_inference={

‘hyperparameters’: {
‘model’: Categorical(‘resnet50d’, ‘efficientnet_b1’, ‘mobilenetv3_large_100’), ‘lr’: Real(1e-4, 1e-2, log=True), ‘batch_size’: Categorical(8, 16, 32, 64, 128), ‘epochs’: 150, ‘early_stop_patience’: 20 },

‘hyperparameter_tune_kwargs’: {
‘num_trials’: 512, ‘searcher’: ‘random’,

}, ‘time_limit’: 8*3600,

},

# Medium predictive accuracy with very fast inference and very fast training time. medium_quality_faster_train={

‘hyperparameters’: {
‘model’: ‘resnet50d’, ‘lr’: 0.01, ‘batch_size’: 64, ‘epochs’: 50, ‘early_stop_patience’: 5 },

‘time_limit’: 1*3600,

},

# Medium predictive accuracy with very fast inference. # Comparing with medium_quality_faster_train it uses faster model but explores more hyperparameters. medium_quality_faster_inference={

‘hyperparameters’: {
‘model’: Categorical(‘resnet18’, ‘mobilenetv3_small_100’, ‘resnet18_v1b’), ‘lr’: Categorical(0.01, 0.005, 0.001), ‘batch_size’: Categorical(64, 128), ‘epochs’: Categorical(50, 100), ‘early_stop_patience’: 10 },

‘hyperparameter_tune_kwargs’: {
‘num_trials’: 32, ‘searcher’: ‘random’,

}, ‘time_limit’: 2*3600,

},

hyperparametersdict, default = None

Extra hyperparameters for specific models. Accepted args includes(not limited to): epochs : int, default value based on network

The epochs for model training.

netmx.gluon.Block: The custom network. If defined, the model name in config will be ignored so your custom network will be used for training rather than pulling it from model zoo.
optimizermx.Optimizer: The custom optimizer object. If defined, the optimizer will be ignored in config but this object will be used in training instead.
batch_sizeint: Mini batch size
lrfloat: Trainer learning rate for optimization process.
early_stop_patienceint, default=10: Number of epochs with no improvement after which train is early stopped. Use None to disable.
early_stop_min_deltafloat, default=1e-4: The small delta value to ignore when evaluating the metric. A large delta helps stablize the early stopping strategy against tiny fluctuation, e.g. 0.5->0.49->0.48->0.499->0.500001 is still considered as a good timing for early stopping.
early_stop_baselinefloat, default=None: The minimum(baseline) value to trigger early stopping. For example, with early_stop_baseline=0.5, early stopping won’t be triggered if the metric is less than 0.5 even if plateau is detected. Use None to disable.
early_stop_max_valuefloat, default=None: The max value for metric, early stop training instantly once the max value is achieved. Use None to disable.

You can get the list of accepted hyperparameters in config.yaml saved by this predictor.

**kwargs :

holdout_fracfloat, default = 0.1

The random split ratio for tuning_data if tuning_data==None.

random_stateint, default = None

The random_state(seed) for shuffling data, only used if tuning_data==None. Note that the random_state only affect the splitting process, not model training. If not specified(None), will leave the original random sampling intact.

nthreads_per_trialint, default = (# cpu cores)

Number of CPU threads for each trial, if None, will detect the # cores on current instance.

ngpus_per_trialint, default = (# gpus)

Number of GPUs to use for each trial, if None, will detect the # gpus on current instance.

hyperparameter_tune_kwargs: dict, default = None

num_trialsint, default = 1: The limit of HPO trials that can be performed within time_limit. The HPO process will be terminated when num_trials trials have finished or wall clock time_limit is reached, whichever comes first.
searcherstr, default = ‘random’: Searcher strategy for HPO, ‘random’ by default. Options include: ‘random’ (random search), ‘bayesopt’ (Gaussian process Bayesian optimization), ‘grid’ (grid search).
max_rewardfloat, default = None: The reward threashold for stopping criteria. If max_reward is reached during HPO, the scheduler will terminate earlier to reduce time cost.
scheduler_optionsdict, default = None: Extra options for HPO scheduler, please refer to autogluon.core.Searcher for details.

fit_summary()[source]¶

Return summary of last fit process.

Returns

dict: The summary of last fit process. Major keys are (‘train_acc’, ‘val_acc’, ‘total_time’,…)

classmethod list_models()[source]¶

Get the list of supported model names in model zoo that can be used for image classification.

Returns

tuple of str: A tuple of supported model names in str.

classmethod load(path, verbosity=2)[source]¶

Load previously saved predictor.

Parameters

pathstr: The file name for saved pickle file. If path is a directory, will try to load the file image_predictor.ag in this directory.
verbosityint, default = 2: Verbosity levels range from 0 to 4 and control how much information is printed. Higher levels correspond to more detailed print statements (you can set verbosity = 0 to suppress warnings). If using logging, you can alternatively control amount of information printed via logger.setLevel(L), where L ranges from 0 to 50 (Note: higher values of L correspond to fewer print statements, opposite of verbosity levels)

predict(data, as_pandas=True)[source]¶

Predict images as a whole, return labels(class category).

Parameters

datastr, pd.DataFrame or ndarray: The input, can be str(filepath), pd.DataFrame with ‘image’ column, or raw ndarray input.
as_pandasbool, default = True: Whether to return the output as a pandas object (True) or list of numpy array(s) (False). Pandas object is a DataFrame.

Returns

pd.DataFrame: The returned dataframe will contain labels. If more than one image in input, the returned dataframe will contain images column, and all results are concatenated.

predict_feature(data, as_pandas=True)[source]¶

Predict images visual feature representations, return the features as numpy (1xD) vector.

Parameters

datastr, pd.DataFrame or ndarray: The input, can be str(filepath), pd.DataFrame with ‘image’ column, or raw ndarray input.
as_pandasbool, default = True: Whether to return the output as a pandas object (True) or list of numpy array(s) (False). Pandas object is a DataFrame.

Returns

pd.DataFrame: The returned dataframe will contain image features. If more than one image in input, the returned dataframe will contain images column, and all results are concatenated.

predict_proba(data, as_pandas=True)[source]¶

Predict images as a whole, return the probabilities of each category rather than class-labels.

Parameters

datastr, pd.DataFrame or ndarray: The input, can be str(filepath), pd.DataFrame with ‘image’ column, or raw ndarray input.
as_pandasbool, default = True: Whether to return the output as a pandas object (True) or list of numpy array(s) (False). Pandas object is a DataFrame.

Returns

pd.DataFrame: The returned dataframe will contain probs of each category. If more than one image in input, the returned dataframe will contain images column, and all results are concatenated.

save(path=None)[source]¶

Dump predictor to disk.

Parameters

pathstr, default = None: The path of saved copy. If not specified(None), will automatically save to self.path directory with filename image_predictor.ag

ObjectDetector¶

class autogluon.vision.ObjectDetector(path=None, verbosity=2)[source]¶

AutoGluon Predictor for detecting objects in images

Parameters

pathstr, default = None: The directory for saving logs or intermediate data. If unspecified, will create a sub-directory under current working directory.
verbosityint, default = 2: Verbosity levels range from 0 to 4 and control how much information is printed. Higher levels correspond to more detailed print statements (you can set verbosity = 0 to suppress warnings). If using logging, you can alternatively control amount of information printed via logger.setLevel(L), where L ranges from 0 to 50 (Note: higher values of L correspond to fewer print statements, opposite of verbosity levels)

Attributes

path

Methods

`Dataset`	alias of `gluoncv.auto.data.dataset.ObjectDetectionDataset`
`evaluate`(data)	Evaluate model performance on validation data.
`fit`(train_data[, tuning_data, time_limit, …])	Automatic fit process for object detection.
`fit_summary`()	Return summary of last fit process.
`load`(path[, verbosity])	Load previously saved predictor.
`predict`(data[, as_pandas])	Predict objects in image, return the confidences, bounding boxes of each predicted object.
`save`([path])	Dump predictor to disk.

Dataset¶: alias of gluoncv.auto.data.dataset.ObjectDetectionDataset

evaluate(data)[source]¶

Evaluate model performance on validation data.

Parameters

datapd.DataFrame or iterator: The validation data.

fit(train_data, tuning_data=None, time_limit='auto', presets=None, hyperparameters=None, **kwargs)[source]¶

Automatic fit process for object detection. Tip: if you observe very slow training speed only happening at the first epoch and your overall time budget is not large, you may disable CUDNN_AUTOTUNE by setting the environment variable export MXNET_CUDNN_AUTOTUNE_DEFAULT=0 before running your python script or insert import os; os.environ[‘MXNET_CUDNN_AUTOTUNE_DEFAULT’] = ‘0’ before any code block. The tuning is beneficial in terms of training speed in the long run, but may cost your noticeble overhead at the begining of each trial.

Parameters

train_datapd.DataFrame or str

Training data, can be a dataframe like image dataset. For more details of how to construct a object detection dataset, please checkout: http://preview.d2l.ai/d8/main/object_detection/getting_started.html. If a string is provided, will search for d8 datasets.

tuning_datapd.DataFrame or str, default = None

Holdout tuning data for validation, reserved for model selection and hyperparameter-tuning, can be a dataframe like image dataset. If a string is provided, will search for k8 datasets. If None, the validation dataset will be randomly split from train_data according to holdout_frac.

time_limitint, default = ‘auto’(defaults to 2 hours if no presets detected)

Time limit in seconds, if None, will run until all tuning and training finished. If time_limit is hit during fit, the HPO process will interrupt and return the current best configuration.

presetslist or str or dict, default = [‘medium_quality_faster_train’]

List of preset configurations for various arguments in fit(). Can significantly impact predictive accuracy, memory-footprint, and inference latency of trained models, and various other properties of the returned predictor. It is recommended to specify presets and avoid specifying most other fit() arguments or model hyperparameters prior to becoming familiar with AutoGluon. As an example, to get the most accurate overall predictor (regardless of its efficiency), set presets=’best_quality’. To get good quality with faster inference speed, set presets=’good_quality_faster_inference’ Any user-specified arguments in fit() will override the values used by presets. If specifying a list of presets, later presets will override earlier presets if they alter the same argument. For precise definitions of the provided presets, see file: autogluon/vision/configs/presets_configs.py. Users can specify custom presets by passing in a dictionary of argument values as an element to the list. Available Presets: [‘best_quality’, ‘high_quality_fast_inference’, ‘good_quality_faster_inference’, ‘medium_quality_faster_train’] It is recommended to only use one quality based preset in a given call to fit() as they alter many of the same arguments and are not compatible with each-other.

Note that depending on your specific hardware limitation(# gpu, size of gpu memory…) your mileage may vary a lot, you may choose lower quality presets if necessary, and try to reduce batch_size if OOM(“RuntimeError: CUDA error: out of memory”) happens frequently during the fit.

In-depth Preset Info:

# Best predictive accuracy with little consideration to inference time or model size. Achieve even better results by specifying a large time_limit value. # Recommended for applications that benefit from the best possible model accuracy. best_quality={

‘hyperparameters’: {
‘transfer’: ‘faster_rcnn_fpn_resnet101_v1d_coco’, ‘lr’: Real(1e-5, 1e-3, log=True), ‘batch_size’: Categorical(4, 8), ‘epochs’: 30, ‘early_stop_patience’: 50 },

‘hyperparameter_tune_kwargs’: {
‘num_trials’: 128, ‘searcher’: ‘random’,

}, ‘time_limit’: 24*3600,

},

# Good predictive accuracy with fast inference. # Recommended for applications that require reasonable inference speed and/or model size. good_quality_fast_inference={

‘hyperparameters’: {

‘transfer’: Categorical(‘ssd_512_resnet50_v1_coco’,
‘yolo3_darknet53_coco’, ‘center_net_resnet50_v1b_coco’),

‘lr’: Real(1e-4, 1e-2, log=True), ‘batch_size’: Categorical(8, 16, 32, 64), ‘epochs’: 50, ‘early_stop_patience’: 20 },

‘hyperparameter_tune_kwargs’: {
‘num_trials’: 512, ‘searcher’: ‘random’,

}, ‘time_limit’: 12*3600,

},

# Medium predictive accuracy with very fast inference and very fast training time. # This is the default preset in AutoGluon, but should generally only be used for quick prototyping. medium_quality_faster_train={

‘hyperparameters’: {
‘transfer’: ‘ssd_512_resnet50_v1_coco’, ‘lr’: 0.01, ‘batch_size’: Categorical(8, 16), ‘epochs’: 30, ‘early_stop_patience’: 5 },

‘hyperparameter_tune_kwargs’: {
‘num_trials’: 16, ‘searcher’: ‘random’,

}, ‘time_limit’: 2*3600,

},

# Medium predictive accuracy with very fast inference. # Comparing with medium_quality_faster_train it uses faster model but explores more hyperparameters. medium_quality_faster_inference={

‘hyperparameters’: {
‘transfer’: Categorical(‘center_net_resnet18_v1b_coco’, ‘yolo3_mobilenet1.0_coco’), ‘lr’: Categorical(0.01, 0.005, 0.001), ‘batch_size’: Categorical(32, 64, 128), ‘epochs’: Categorical(30, 50), ‘early_stop_patience’: 10 },

‘hyperparameter_tune_kwargs’: {
‘num_trials’: 32, ‘searcher’: ‘random’,

}, ‘time_limit’: 4*3600,

},

hyperparametersdict, default = None

Extra hyperparameters for specific models. Accepted args includes(not limited to): epochs : int, default value based on network

The epochs for model training.

batch_sizeint: Mini batch size
lrfloat: Trainer learning rate for optimization process.
early_stop_patienceint, default=10: Number of epochs with no improvement after which train is early stopped. Use None to disable.
early_stop_min_deltafloat, default=1e-4: The small delta value to ignore when evaluating the metric. A large delta helps stablize the early stopping strategy against tiny fluctuation, e.g. 0.5->0.49->0.48->0.499->0.500001 is still considered as a good timing for early stopping.
early_stop_baselinefloat, default=None: The minimum(baseline) value to trigger early stopping. For example, with early_stop_baseline=0.5, early stopping won’t be triggered if the metric is less than 0.5 even if plateau is detected. Use None to disable.
early_stop_max_valuefloat, default=None: The max value for metric, early stop training instantly once the max value is achieved. Use None to disable.

You can get the list of accepted hyperparameters in config.yaml saved by this predictor.

**kwargs :

holdout_fracfloat, default = 0.1

The random split ratio for tuning_data if tuning_data==None.

random_stateint, default = None

The random_state(seed) for shuffling data, only used if tuning_data==None. Note that the random_state only affect the splitting process, not model training. If not specified(None), will leave the original random sampling intact.

nthreads_per_trialint, default = (# cpu cores)

Number of CPU threads for each trial, if None, will detect the # cores on current instance.

ngpus_per_trialint, default = (# gpus)

Number of GPUs to use for each trial, if None, will detect the # gpus on current instance.

hyperparameter_tune_kwargs: dict, default = None

num_trialsint, default = 1: The limit of HPO trials that can be performed within time_limit. The HPO process will be terminated when num_trials trials have finished or wall clock time_limit is reached, whichever comes first.
searcherstr, default = ‘random’: Searcher strategy for HPO, ‘random’ by default. Options include: ‘random’ (random search), ‘bayesopt’ (Gaussian process Bayesian optimization), ‘grid’ (grid search).
max_rewardfloat, default = None: The reward threashold for stopping criteria. If max_reward is reached during HPO, the scheduler will terminate earlier to reduce time cost.
scheduler_optionsdict, default = None: Extra options for HPO scheduler, please refer to autogluon.core.Searcher for details.

fit_summary()[source]¶

Return summary of last fit process.

Returns

dict: The summary of last fit process. Major keys are (‘train_map’, ‘val_map’, ‘total_time’,…)

classmethod load(path, verbosity=2)[source]¶

Load previously saved predictor.

Parameters

pathstr: The file name for saved pickle file. If path is a directory, will try to load the file object_detector.ag in this directory.
verbosityint, default = 2: Verbosity levels range from 0 to 4 and control how much information is printed. Higher levels correspond to more detailed print statements (you can set verbosity = 0 to suppress warnings). If using logging, you can alternatively control amount of information printed via logger.setLevel(L), where L ranges from 0 to 50 (Note: higher values of L correspond to fewer print statements, opposite of verbosity levels)

predict(data, as_pandas=True)[source]¶

Predict objects in image, return the confidences, bounding boxes of each predicted object.

Parameters

datastr, pd.DataFrame or ndarray: The input data, can be str(filepath), pd.DataFrame with ‘image’ column, or raw ndarray input.
as_pandasbool, default = True: Whether to return the output as a pandas object (True) or list of numpy array(s) (False). Pandas object is a DataFrame.

Returns

pd.DataFrame: The returned dataframe will contain (pred_score, pred_bbox, pred_id). If more than one image in input, the returned dataframe will contain images column, and all results are concatenated.

save(path=None)[source]¶

Dump predictor to disk.

Parameters

pathstr: The file name of saved copy. If not specified(None), will automatically save to self.path directory with filename object_detector.ag

TextPredictor¶

class autogluon.text.TextPredictor(label, problem_type=None, eval_metric=None, path=None, verbosity=3, warn_if_exist=True)[source]¶

AutoGluon TextPredictor predicts values in a column of a tabular dataset that contains text fields (classification or regression). TabularPredictor can also do this but it uses an ensemble of many types of models and may featurize text. TextPredictor instead directly fits individual Transformer neural network models directly to the raw text (which are also capable of handling additional numeric/categorical columns). We generally recommend TabularPredictor if your table contains numeric/categorical columns and TextPredictor if your table contains only text columns, but you may easily try both. In fact, TabularPredictor.fit(…, hyperparameters=’multimodal’) will train a TextPredictor along with many tabular models and ensemble them together.

Parameters

labelstr

Name of the column that contains the target variable to predict.

problem_typestr, default = None

Type of prediction problem, i.e. is this a binary/multiclass classification or regression problem (options: ‘binary’, ‘multiclass’, ‘regression’). If problem_type = None, the prediction problem type is inferred based on the label-values in provided dataset.

eval_metricfunction or str, default = None

Metric by which predictions will be ultimately evaluated on test data. AutoGluon tunes factors such as hyperparameters, early-stopping, etc. in order to improve this metric on validation data.

If eval_metric = None, it is automatically chosen based on problem_type. Defaults to ‘accuracy’ for binary and multiclass classification, ‘root_mean_squared_error’ for regression.

Otherwise, options for classification:

[‘accuracy’, ‘balanced_accuracy’, ‘f1’, ‘f1_macro’, ‘f1_micro’, ‘f1_weighted’, ‘roc_auc’, ‘roc_auc_ovo_macro’, ‘average_precision’, ‘precision’,

‘precision_macro’, ‘precision_micro’,

‘precision_weighted’, ‘recall’, ‘recall_macro’, ‘recall_micro’,: ‘recall_weighted’, ‘log_loss’, ‘pac_score’]

Options for regression:

[‘root_mean_squared_error’, ‘mean_squared_error’, ‘mean_absolute_error’,: ‘median_absolute_error’, ‘r2’, ‘spearmanr’, ‘pearsonr’]

For more information on these options, see sklearn.metrics: https://scikit-learn.org/stable/modules/classes.html#sklearn-metrics-metrics You can also pass your own evaluation function here as long as it follows formatting of the functions defined in folder autogluon.core.metrics.

pathstr, default = None

Path to directory where models and intermediate outputs should be saved. If unspecified, a time-stamped folder called “AutogluonTextModel/ag-[TIMESTAMP]” will be created in the working directory to store all models. Note: To call fit() twice and save all results of each fit, you must specify different path locations or don’t specify path at all. Otherwise files from first fit() will be overwritten by second fit().

verbosityint, default = 3

Verbosity levels range from 0 to 4 and control how much information is printed. Higher levels correspond to more detailed print statements (you can set verbosity = 0 to suppress warnings). If using logging, you can alternatively control amount of information printed via logger.setLevel(L), where L ranges from 0 to 50 (Note: higher values of L correspond to fewer print statements, opposite of verbosity levels)

warn_if_existbool, default = True

Whether to raise warning if the specified path already exists.

Attributes

backend
class_labels: The original name of the class labels.
class_labels_internal: The internal integer labels.
class_labels_internal_map: The map that projects label names to the internal ids.
label
path
positive_class: Name of the class label that will be mapped to 1.
problem_type
results

Methods

`evaluate`(data[, metrics])	Report the predictive performance evaluated over a given dataset.
`extract_embedding`(data[, as_pandas])	Extract intermediate feature representations of a row from the trained neural network.
`fit`(train_data[, tuning_data, time_limit, …])	Fit Transformer models to predict label column of a data table based on the other columns (which may contain text or numeric/categorical features).
`load`(path[, verbosity])	Load a TextPredictor object previously produced by fit() from file and returns this object.
`predict`(data[, as_pandas])	Use trained model to produce predictions of label column values for new data.
`predict_proba`(data[, as_pandas, as_multiclass])	Use trained model to produce predicted class probabilities rather than class-labels (if task is classification).
`save`(path)	Save this Predictor to file in directory specified by path.

set_verbosity

property class_labels¶

The original name of the class labels.

For example, the tabular data may contain classes equal to “entailment”, “contradiction”, “neutral”. Internally, these will be converted to 0, 1, 2, …

This function returns the original names of these raw labels.

Returns

ret: List that contain the class names. It will be None if the predictor is not solving a classification problem.

property class_labels_internal¶

The internal integer labels.

For example, if the possible labels are [“entailment”, “contradiction”, “neutral”], the internal labels can be [0, 1, 2]

Returns

ret: List that contains the internal integer labels. It will be None if the predictor is not solving a classification problem.

property class_labels_internal_map¶

The map that projects label names to the internal ids. For example, if the internal labels are [“entailment”, “contradiction”, “neutral”] and the internal ids are [0, 1, 2], the label mapping will be {“entailment”: 0, “contradiction”: 1, “neutral”: 2}

Returns

ret: The label mapping dictionary. It will be None if the predictor is not solving a classification problem.

evaluate(data, metrics=None)[source]¶

Report the predictive performance evaluated over a given dataset.

Parameters

datastr or TabularDataset or pandas.DataFrame: This dataset must also contain the label with the same column-name as previously specified. If str is passed, data will be loaded using the str value as the file path.
metricsstr or List[str], default = None: Name of metric or a list of multiple metric names to report (options are the same as for eval_metric). If None, we only return the score for the stored eval_metric.

Returns

metrics_valuesfloat or dict: The metrics computed on the data. This is a single value if there is only one metric and is a dictionary of {metric_name –> value} if there are multiple metrics.

extract_embedding(data, as_pandas=False)[source]¶

Extract intermediate feature representations of a row from the trained neural network.

Parameters

datastr or TabularDataset or pd.DataFrame: The data to extract embeddings for. Should contain same column names as training dataset and follow same format (except for the label column). If str is passed, data will be loaded using the str value as the file path.
as_pandasbool, default = False: Whether to return the output as a pandas DataFrame (True) or numpy array (False).

Returns

Array of embeddings, corresponding to each row in the given data.
It will have shape (#samples, D) where the embedding dimension D is determined by the neural network’s architecture.

fit(train_data, tuning_data=None, time_limit=None, presets=None, hyperparameters=None, column_types=None, num_cpus=None, num_gpus=None, num_trials=None, plot_results=None, holdout_frac=None, seed=0)[source]¶

Fit Transformer models to predict label column of a data table based on the other columns (which may contain text or numeric/categorical features).

Parameters

train_datastr or TabularDataset or pd.DataFrame

Table of the training data, which is similar to a pandas DataFrame. If str is passed, train_data will be loaded using the str value as the file path.

tuning_datastr or TabularDataset or pd.DataFrame, default = None

Another dataset containing validation data reserved for tuning processes such as early stopping and hyperparameter tuning. This dataset should be in the same format as train_data. If str is passed, tuning_data will be loaded using the str value as the file path. Note: final model returned may be fit on tuning_data as well as train_data. Do not provide your evaluation test data here! If tuning_data = None, fit() will automatically hold out some random validation examples from train_data.

time_limitint, default = None

Approximately how long fit() should run for (wallclock time in seconds). If not specified, fit() will run until the model has completed training.

presetsstr, default = None

Presets are pre-registered configurations that control training (hyperparameters and other aspects). It is recommended to specify presets and avoid specifying most other fit() arguments or model hyperparameters prior to becoming familiar with AutoGluon. Print all available presets via autogluon.text.list_presets(). Some notable presets include:

“best_quality”: produce the most accurate overall predictor (regardless of its efficiency).

“medium_quality_faster_train”: produce an accurate predictor but take efficiency into account (this is the default preset).

“lower_quality_fast_train”: produce a predict that is quick to train and make predictions with, even if its accuracy is worse.

hyperparametersdict, default = None

The hyperparameters of the fit() function, which affect the resulting accuracy of the trained predictor. Experienced AutoGluon users can use this argument to specify neural network hyperparameter values/search-spaces as well as which hyperparameter-tuning strategy should be employed. See the “Text Prediction” tutorials for examples.

column_typesdict, default = None

The type of data in each table column can be specified via a dictionary that maps the column name to its data type. For example: column_types = {“item_name”: “text”, “brand”: “text”, “product_description”: “text”, “height”: “numerical”} may be used for a table with columns: “item_name”, “brand”, “product_description”, and “height”. If None, column_types will be automatically inferred from the data. The current supported types are: - “text”: each row in this column contains text (sentence, paragraph, etc.). - “numerical”: each row in this column contains a number. - “categorical”: each row in this column belongs to one of K categories.

num_cpusint, default = None

The number of CPUs to use for each training run (i.e. one hyperparameter-tuning trial).

num_gpusint, default = None

The number of GPUs to use to use for each training run (i.e. one hyperparameter-tuning trial). We recommend at least 1 GPU for TextPredictor as its neural network models are computationally intensive.

num_trialsint, default = None

If hyperparameter-tuning is used, specifies how many HPO trials should be run (assuming time_limit has not been exceeded). By default, this is the provided number of trials in the hyperparameters or presets. If specified here, this value will overwrite the value in hyperparameters[‘tune_kwargs’][‘num_trials’].

plot_resultsbool, default = None

Whether to plot intermediate results from training. If None, will be decided based on the environment in which fit() is run.

holdout_fracfloat, default = None

Fraction of train_data to holdout as tuning data for optimizing hyperparameters (ignored unless tuning_data = None). Default value (if None) is selected based on the number of rows in the training data and whether hyperparameter-tuning is utilized.

seedint, default = 0

The random seed to use for this training run. If None, no seed will be specified and repeated runs will produce different results.

Returns

TextPredictor object. Returns self.

classmethod load(path: str, verbosity: int = None)[source]¶

Load a TextPredictor object previously produced by fit() from file and returns this object. It is highly recommended the predictor be loaded with the exact AutoGluon version it was fit with.

Parameters

pathstr: The path to directory in which this Predictor was previously saved.
verbosityint, default = None: Sets the verbosity level of this Predictor after it is loaded. Valid values range from 0 (least verbose) to 4 (most verbose). If None, logging verbosity is not changed from existing values. Specify larger values to see more information printed when using Predictor during inference, smaller values to see less information. Refer to TextPredictor init for more information.

property positive_class¶

Name of the class label that will be mapped to 1. This is only meaningful for binary classification problems.

It is useful for computing metrics such as F1 which require a positive and negative class. You may refer to https://en.wikipedia.org/wiki/F-score for more details. In binary classification, TextPredictor.predict_proba(as_multiclass=False) returns the estimated probability that each row belongs to the positive class. Will print a warning and return None if called when predictor.problem_type != ‘binary’.

Returns

The positive class name in binary classification or None if the problem is not binary classification.

predict(data, as_pandas=True)[source]¶

Use trained model to produce predictions of label column values for new data.

Parameters

datastr or TabularDataset or pd.DataFrame: The data to make predictions for. Should contain same column names as training Dataset and follow same format (except for the label column). If str is passed, data will be loaded using the str value as the file path.
as_pandasbool, default = True: Whether to return the output as a pd.Series (True) or np.ndarray (False).

Returns

Array of predictions, one corresponding to each row in given dataset.

predict_proba(data, as_pandas=True, as_multiclass=True)[source]¶

Use trained model to produce predicted class probabilities rather than class-labels (if task is classification). If predictor.problem_type is regression, this functions identically to predict, returning the same output.

Parameters

datastr or TabularDataset or pd.DataFrame: The data to make predictions for. Should contain same column names as training dataset and follow same format (except for the label column). If str is passed, data will be loaded using the str value as the file path.
as_pandasbool, default = True: Whether to return the output as a pandas DataFrame(Series) (True) or numpy array (False).
as_multiclassbool, default = True: Whether to return the probability of all labels or just return the probability of the positive class for binary classification problems.

Returns

Array of predicted class-probabilities, corresponding to each row in the given data.
When as_multiclass is True, the output will always have shape (#samples, #classes).
Otherwise, the output will have shape (#samples,)

save(path)[source]¶

Save this Predictor to file in directory specified by path. The relevant files will be saved in two parts:

PATH/text_predictor_assets.json
Contains the configuration information of this Predictor.
PATH/saved_model
Contains the model weights and other features

Note that TextPredictor.fit() already saves the predictor object automatically (we do not recommend modifying the Predictor object yourself as it tracks many trained models).

Parameters

path, str: The path to directory in which to save this Predictor.

Additional Tabular APIs¶

TabularDataset¶

class autogluon.tabular.TabularDataset(data, **kwargs)[source]¶

A dataset in tabular format (with rows = samples, columns = features/variables). This object is essentially a pandas DataFrame (with some extra attributes) and all existing pandas methods can be applied to it. For full list of methods/attributes, see pandas Dataframe documentation: https://pandas.pydata.org/pandas-docs/stable/reference/api/pandas.DataFrame.html

Parameters

datapd.DataFrame or str: If str, path to data file (CSV or Parquet format). If you already have your data in a pd.DataFrame, you can specify it here.

Examples

>>> from autogluon.core.dataset import TabularDataset
>>> train_data = TabularDataset('https://autogluon.s3.amazonaws.com/datasets/Inc/train.csv')
>>> train_data.head(30)
>>> train_data.columns

Attributes

file_path: (str): Path to data file from which this TabularDataset was created. None if data was a pd.DataFrame.
Note: In addition to these attributes, `TabularDataset` also shares all the same attributes and methods of a pandas Dataframe.
For a detailed list, see: https://pandas.pydata.org/pandas-docs/stable/reference/api/pandas.DataFrame.html

FeatureMetadata¶

class autogluon.core.features.feature_metadata.FeatureMetadata(type_map_raw: Dict[str, str], type_group_map_special: Dict[str, List[str]] = None)[source]¶

Feature metadata contains information about features that are not directly apparent in the raw data itself. This enables feature generators to properly process features, and allows downstream models to properly handle features during training and inference.

Parameters

type_map_rawDict[str, str]

Dictionary of feature names to raw types. The values can be anything, but it is generally recommended they be one of:

[‘int’, ‘float’, ‘object’, ‘category’, ‘datetime’]

type_group_map_specialDict[str, List[str]], optional

Dictionary of special types to lists of feature names. The keys can be anything, but it is generally recommended they be one of:

[‘binned’, ‘datetime_as_int’, ‘datetime_as_object’, ‘text’, ‘text_as_category’, ‘text_special’, ‘text_ngram’, ‘image_path’, ‘stack’]

For descriptions of each special feature-type, see: autogluon.core.features.types Feature names that appear in the value lists must also be keys in type_map_raw. Feature names are not required to have special types.

Methods

`add_special_types`(type_map_special[, inplace])	Adds special types to features.
`from_df`(df)	Construct FeatureMetadata based on the inferred feature types of an input `pd.DataFrame`.
`get_features`([valid_raw_types, …])	Returns a list of features held within the feature metadata object after being pruned through the available parameters.
`join_metadata`(metadata[, shared_raw_features])	Join two FeatureMetadata objects together, returning a new FeatureMetadata object
`join_metadatas`(metadata_list[, …])
`keep_features`(features[, inplace])	Removes all features from metadata except for those in features
`remove_features`(features[, inplace])	Removes all features from metadata that are in features
`rename_features`(rename_map[, inplace])	Rename all features from metadata that are keys in rename_map to their values.

get_feature_type_raw
get_feature_types_special
get_type_group_map_raw
get_type_map_special
print_feature_metadata_full
to_dict

add_special_types(type_map_special: Dict[str, List[str]], inplace=False)[source]¶

Adds special types to features.

Parameters

type_map_specialDict[str, List[str]]: Dictionary of feature -> list of special types to add. Features in dictionary must already exist in the FeatureMetadata object.
inplacebool, default False: If True, updates self inplace and returns self. If False, updates a copy of self and returns copy.
Returns
——-
:class:`FeatureMetadata` object.

Examples

>>> from autogluon.core.features.feature_metadata import FeatureMetadata
>>> feature_metadata = FeatureMetadata({'FeatureA': 'int', 'FeatureB': 'object'})
>>> feature_metadata = feature_metadata.add_special_types({'FeatureA': ['MySpecialType'], 'FeatureB': ['MySpecialType', 'text']})

classmethod from_df(df: pandas.core.frame.DataFrame)[source]¶

Construct FeatureMetadata based on the inferred feature types of an input pd.DataFrame.

Parameters

dfpd.DataFrame: DataFrame used to infer FeatureMetadata.

Returns

FeatureMetadata object.

get_features(valid_raw_types: list = None, valid_special_types: list = None, invalid_raw_types: list = None, invalid_special_types: list = None, required_special_types: list = None, required_raw_special_pairs: List[Tuple[str, List[str]]] = None, required_exact=False, required_at_least_one_special=False) → List[str][source]¶

Returns a list of features held within the feature metadata object after being pruned through the available parameters.

Parameters

valid_raw_typeslist, default None: If a feature’s raw type is not in this list, it is pruned. If None, then no features are pruned through this logic.
valid_special_typeslist, default None: If a feature has a special type not in this list, it is pruned. Features without special types are never pruned through this logic. If None, then no features are pruned through this logic.
invalid_raw_typeslist, default None: If a feature’s raw type is in this list, it is pruned. If None, then no features are pruned through this logic.
invalid_special_typeslist, default None: If a feature has a special type in this list, it is pruned. Features without special types are never pruned through this logic. If None, then no features are pruned through this logic.
required_special_typeslist, default None: If a feature does not have all of the special types in this list, it is pruned. Features without special types are pruned through this logic. If None, then no features are pruned through this logic.
required_raw_special_pairsList[Tuple[str, List[str]]], default None: If a feature does not satisfy the (raw_type, special_types) requirement of at least one of the elements in this list, it is pruned. Identical to getting the union of calling get_features(valid_raw_types=[raw_type], required_special_types=special_types) for every element of (raw_type, special_types) in required_raw_special_pairs If raw_type is None, then any feature will satisfy the raw type requirement. If special_types is None, then any feature will satisfy the special type requirement (including those with no special types).
required_exactbool, default False: If True, then if a feature does not have the exact same special types (with no extra special types) as required_special_types, it is pruned. This is also applied to required_raw_special_pairs if specified. Has no effect if required_special_types and required_raw_special_pairs are None.
required_at_least_one_specialbool, default False: If True, then if a feature has zero special types, it is pruned.

Returns

featureslist of feature names in feature metadata that satisfy all checks dictated by the parameters.

join_metadata(metadata, shared_raw_features='error')[source]¶: Join two FeatureMetadata objects together, returning a new FeatureMetadata object

keep_features(features: list, inplace=False)[source]¶: Removes all features from metadata except for those in features

remove_features(features: list, inplace=False)[source]¶: Removes all features from metadata that are in features

rename_features(rename_map: dict, inplace=False)[source]¶: Rename all features from metadata that are keys in rename_map to their values.