mirror of
https://github.com/deepseek-ai/smallpond
synced 2025-05-10 23:51:20 +00:00
29 lines
421 B
ReStructuredText
29 lines
421 B
ReStructuredText
.. currentmodule:: smallpond.logical.dataset
|
|
|
|
Dataset
|
|
=======
|
|
|
|
Dataset represents a collection of files.
|
|
|
|
To create a dataset:
|
|
|
|
.. code-block:: python
|
|
|
|
dataset = ParquetDataSet("path/to/dataset/*.parquet")
|
|
|
|
DataSets
|
|
--------
|
|
|
|
.. autosummary::
|
|
:toctree: ../generated
|
|
|
|
DataSet
|
|
FileSet
|
|
ParquetDataSet
|
|
CsvDataSet
|
|
JsonDataSet
|
|
ArrowTableDataSet
|
|
PandasDataSet
|
|
PartitionedDataSet
|
|
SqlQueryDataSet
|