pyarrow.orc.read_table#

pyarrow.orc.read_table(source,columns=None,filesystem=None)[source]#

Read a Table from an ORC file.

Parameters:
sourcestr,pyarrow.NativeFile, or file-like object

If a string passed, can be a single file name. For file-like objects,only read a single file. Use pyarrow.BufferReader to read a filecontained in a bytes or buffer-like object.

columnslist

If not None, only these columns will be read from the file. A columnname may be a prefix of a nested field, e.g. ‘a’ will select ‘a.b’,‘a.c’, and ‘a.d.e’. Output always follows the ordering of the file andnot thecolumns list. If empty, no columns will be read. Notethat the table will still have the correct num_rows set despite havingno columns.

filesystemFileSystem, defaultNone

If nothing passed, will be inferred based on path.Path will try to be found in the local on-disk filesystem otherwiseit will be parsed as an URI to determine the filesystem.