autogluon.core.TabularDataset

class autogluon.core.TabularDataset(data, **kwargs)

A dataset in tabular format (rows = samples, columns = features/variables). This object is essentially a pandas DataFrame (with some extra attributes), and all existing pandas methods can be applied to it. For the full list of methods and attributes, see the pandas DataFrame documentation: https://pandas.pydata.org/pandas-docs/stable/reference/api/pandas.DataFrame.html

Parameters:

data (pd.DataFrame or str) – If str, the path to a data file (CSV or Parquet format). If your data is already in a pd.DataFrame, pass it directly.

file_path

Path to the data file from which this TabularDataset was created. None if data was a pd.DataFrame.

Type: str
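The relationship between the data parameter and file_path can be illustrated with a small pandas-only sketch. The helper load_tabular below is hypothetical and is not AutoGluon code. It assumes only what is described above: a str is treated as a path to a CSV or Parquet file and recorded, while a pd.DataFrame is used as-is with no path. Choosing the reader by file extension is also an assumption made for illustration.

```python
import os
import tempfile

import pandas as pd


def load_tabular(data):
    """Hypothetical helper (not AutoGluon code): return (frame, file_path)."""
    if isinstance(data, pd.DataFrame):
        # Data is already in memory, so there is no source file to record.
        return data, None
    if isinstance(data, str):
        # A string is treated as a path; pick the reader by file extension
        # (an assumption of this sketch).
        if data.endswith(".parquet"):
            return pd.read_parquet(data), data
        return pd.read_csv(data), data
    raise TypeError("data must be a pd.DataFrame or a str path")


# Write a tiny CSV so the sketch runs without any external files.
frame = pd.DataFrame({"age": [25, 38], "class": ["<=50K", ">50K"]})
with tempfile.TemporaryDirectory() as tmp_dir:
    csv_path = os.path.join(tmp_dir, "train.csv")
    frame.to_csv(csv_path, index=False)
    from_path, path_used = load_tabular(csv_path)

# Passing the DataFrame directly leaves the recorded path as None.
from_frame, no_path = load_tabular(frame)
```

With a real TabularDataset, the equivalent information would be read from the file_path attribute rather than from a second return value.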

Note

In addition to these attributes, TabularDataset also shares all the attributes and methods of a pandas DataFrame. For a detailed list, see https://pandas.pydata.org/pandas-docs/stable/reference/api/pandas.DataFrame.html

Examples

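>>> from autogluon.core.dataset import TabularDataset
>>> # Load a CSV file by passing its location as a string
>>> train_data = TabularDataset('https://autogluon.s3.amazonaws.com/datasets/Inc/train.csv')
>>> # Inspect the first 30 rows with the inherited pandas method
>>> train_data.head(30)
>>> # List the column names
>>> train_data.columns

__init__(data, **kwargs)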
