Feather provides binary columnar serialization for data frames.It is designed to make reading and writing data frames efficient,and to make sharing data across data analysis languages easy.read_feather() can read both the Feather Version 1 (V1), a legacy version available starting in 2016,and the Version 2 (V2), which is the Apache Arrow IPC file format.read_ipc_file() is an alias ofread_feather().
Usage
read_feather(file, col_select=NULL, as_data_frame=TRUE, mmap=TRUE)read_ipc_file(file, col_select=NULL, as_data_frame=TRUE, mmap=TRUE)Arguments
- file
A character file name or URI, connection,
rawvector, anArrow input stream, or aFileSystemwith path (SubTreeFileSystem).If a file name or URI, an ArrowInputStream will be opened andclosed when finished. If an input stream is provided, it will be leftopen.- col_select
A character vector of column names to keep, as in the"select" argument to
data.table::fread(), or atidy selection specificationof columns, as used indplyr::select().- as_data_frame
Should the function return a
tibble(default) oran ArrowTable?- mmap
Logical: whether to memory-map the file (default
TRUE)
Value
Atibble ifas_data_frame isTRUE (the default), or anArrowTable otherwise
See also
FeatherReader andRecordBatchReader for lower-level access to reading Arrow IPC data.
Examples
# We recommend the ".arrow" extension for Arrow IPC files (Feather V2).tf<-tempfile(fileext=".arrow")on.exit(unlink(tf))write_feather(mtcars,tf)df<-read_feather(tf)dim(df)#> [1] 32 11# Can select columnsdf<-read_feather(tf, col_select=starts_with("d"))