autogluon.core.TabularDataset
- class autogluon.core.TabularDataset(data, **kwargs)
A dataset in tabular format (rows are samples, columns are features/variables). This object is essentially a pandas DataFrame with some extra attributes, so all existing pandas methods can be applied to it. For the full list of methods and attributes, see the pandas DataFrame documentation: https://pandas.pydata.org/pandas-docs/stable/reference/api/pandas.DataFrame.html
- Parameters:
data (pd.DataFrame or str) – If str, the path to a data file (CSV or Parquet format). If you already have your data in a pd.DataFrame, you can pass it here directly.
- file_path
Path to the data file from which this TabularDataset was created. None if data was a pd.DataFrame.
- Type:
(str)
- Note
In addition to these attributes, TabularDataset also shares all the attributes and methods of a pandas DataFrame. For a detailed list, see the pandas DataFrame documentation linked above.
Examples
>>> from autogluon.core.dataset import TabularDataset
>>> train_data = TabularDataset('https://autogluon.s3.amazonaws.com/datasets/Inc/train.csv')
>>> train_data.head(30)
>>> train_data.columns
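The example above loads from a file path. As a complementary sketch, the snippet below passes an in-memory pd.DataFrame instead, which is the other input type described under Parameters. The column names and values are made up for illustration, and `build_example` is a hypothetical helper, not part of AutoGluon. The imports are guarded so the snippet does nothing where pandas or AutoGluon is not installed, and `file_path` is read with `getattr` on the assumption that some AutoGluon versions may not expose that attribute.

```python
try:
    import pandas as pd
    from autogluon.core.dataset import TabularDataset
except ImportError:
    # pandas or AutoGluon is not available in this environment.
    pd = None
    TabularDataset = None


def build_example():
    """Hypothetical helper: wrap a small in-memory DataFrame in a TabularDataset.

    Returns None when the required packages are not installed.
    """
    if pd is None or TabularDataset is None:
        return None
    # Illustrative data only; these columns are not from a real dataset.
    df = pd.DataFrame(
        {
            "age": [25, 38, 52],
            "class": ["<=50K", "<=50K", ">50K"],
        }
    )
    return TabularDataset(df)


train_data = build_example()
if train_data is not None:
    # Ordinary pandas methods and attributes work on the result.
    print(train_data.shape)
    print(train_data["age"].mean())
    # Per the attribute description above, file_path is None when the
    # dataset was built from a DataFrame rather than loaded from a file.
    print(getattr(train_data, "file_path", None))
```

Because the object behaves like a DataFrame, it can be sliced, filtered, and sampled with the usual pandas calls before being handed to other AutoGluon components.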
Methods
Attributes