This article has multiple issues. Please helpimprove it or discuss these issues on thetalk page.(Learn how and when to remove these messages) (Learn how and when to remove this message)
|
| Apache ORC | |
|---|---|
| Initial release | 20 February 2013; 12 years ago (2013-02-20)[1] |
| Stable release | |
| Repository | ORC Repository |
| Operating system | Cross-platform |
| Type | Database management system |
| License | Apache License 2.0 |
| Website | orc |
Apache ORC (Optimized Row Columnar) is afree and open-sourcecolumn-oriented data storage format.[3] It is similar to the other columnar-storage file formats available in theHadoop ecosystem such asRCFile andParquet. It is used by most of the data processing frameworksApache Spark,Apache Hive,Apache Flink, andApache Hadoop.
In February 2013, the Optimized Row Columnar (ORC) file format was announced byHortonworks in collaboration withFacebook.[1]A calendar month later, theApache Parquet format was announced, developed byCloudera andTwitter.[4]
Apache ORC format is widely supported includingAmazon Web Services'Glue[5],Google Cloud Platform'sBigQuery,[6] andPandas (software).[7]
| Version | Original release date | Latest version | Release date |
|---|---|---|---|
| Unsupported: 1.0 | 2016-01-25 | 1.0.0 | 2016-01-25 |
| Unsupported: 1.1 | 2016-06-10 | 1.1.2 | 2016-07-08 |
| Unsupported: 1.2 | 2016-08-25 | 1.2.3 | 2016-12-12 |
| Unsupported: 1.3 | 2017-01-23 | 1.3.4 | 2017-10-16 |
| Unsupported: 1.4 | 2017-05-08 | 1.4.5 | 2019-12-09 |
| Unsupported: 1.5 | 2018-05-14 | 1.5.13 | 2021-09-15 |
| Unsupported: 1.6 | 2019-09-03 | 1.6.14 | 2022-04-14 |
| Unsupported: 1.7 | 2021-09-15 | 1.7.8 | 2023-01-21 |
| Supported: 1.8 | 2022-09-03 | 1.8.9 | 2025-05-06 |
| Supported: 1.9 | 2023-06-28 | 1.9.6 | 2025-05-06 |
| Supported: 2.0 | 2024-03-08 | 2.0.5 | 2025-05-06 |
| Latest version:2.1 | 2025-01-09 | 2.1.2 | 2025-05-06 |
Legend: Unsupported Supported Latest version Preview version Future version | |||
{{cite conference}}: CS1 maint: multiple names: authors list (link)