pyarrow.dataset.FileSystemFactoryOptions
- class pyarrow.dataset.FileSystemFactoryOptions(partition_base_dir=None, partitioning=None, exclude_invalid_files=None, list selector_ignore_prefixes=None)
Bases: _Weakrefable
Influences the discovery of filesystem paths.
- Parameters:
- partition_base_dir : str, optional
For the purposes of applying the partitioning, paths will be stripped of the partition_base_dir. Files not matching the partition_base_dir prefix will be skipped for partitioning discovery. The ignored files will still be part of the Dataset, but will not have partition information.
- partitioning : Partitioning/PartitioningFactory, optional
Apply the Partitioning to every discovered Fragment. See Partitioning or PartitioningFactory documentation.
- exclude_invalid_files : bool, optional (default True)
If True, invalid files will be excluded (file format specific check). This will incur IO for each file in a serial and single-threaded fashion. Disabling this feature will skip the IO, but unsupported files may be present in the Dataset (resulting in an error at scan time).
- selector_ignore_prefixes : list, optional
When discovering from a Selector (and not from an explicit file list), ignore files and directories matching any of these prefixes. By default this is ['.', '_'].
- __init__(*args, **kwargs)
Methods
__init__(*args, **kwargs)
Attributes
exclude_invalid_files: Whether to exclude invalid files.
partition_base_dir: Base directory to strip paths before applying the partitioning.
partitioning: Partitioning to apply to discovered files.
partitioning_factory: PartitioningFactory to apply to discovered files and discover a Partitioning.
selector_ignore_prefixes: List of prefixes.
- exclude_invalid_files
Whether to exclude invalid files.
- partition_base_dir
Base directory to strip paths before applying the partitioning.
- partitioning
Partitioning to apply to discovered files.
NOTE: setting this property will overwrite partitioning_factory.
- partitioning_factory
PartitioningFactory to apply to discovered files and discover a Partitioning.
NOTE: setting this property will overwrite partitioning.
- selector_ignore_prefixes
List of prefixes. Files matching one of those prefixes will be ignored by the discovery process.

