Apache Arrow#
Apache Arrow is a universal columnar format and multi-language toolbox for fastdata interchange and in-memory analytics.
The project specifies a language-independent column-oriented memory formatfor flat and hierarchical data, organized for efficient analytic operations onmodern hardware. The project houses an actively developed collection oflibraries in many languages for solving problems related to data transfer andin-memory analytical processing. This includes such topics as:
Zero-copy shared memory and RPC-based data movement
Reading and writing file formats (like CSV, Apache ORC, and Apache Parquet)
In-memory analytics and query processing
To learn how to use Arrow refer to the documentation specific to yourtarget environment.
Read about the Apache Arrow format and its related specifications andprotocols.
Find documentation on building the libraries from source, building thedocumentation, contributing and code reviews, continuous integration,benchmarking, and the release process.
Browse the documentation and source code for Apache Arrow librariesin C++, C GLib, C#, Go, Java, JavaScript, Julia, MATLAB, Python, R,Ruby, Rust, and Swift.
Explore a collection of Apache Arrow recipes in C++, Java, Python,R, and Rust.

