Movatterモバイル変換


[0]ホーム

URL:


Jump to content
WikipediaThe Free Encyclopedia
Search

Apache ORC

From Wikipedia, the free encyclopedia
Column-oriented data storage format
This article has multiple issues. Please helpimprove it or discuss these issues on thetalk page.(Learn how and when to remove these messages)
The topic of this articlemay not meet Wikipedia'snotability guidelines for products and services. Please help to demonstrate the notability of the topic by citingreliable secondary sources that areindependent of the topic and provide significant coverage of it beyond a mere trivial mention. If notability cannot be shown, the article is likely to bemerged,redirected, ordeleted.
Find sources: "Apache ORC" – news ·newspapers ·books ·scholar ·JSTOR
(February 2019) (Learn how and when to remove this message)
This articlemay rely excessively on sourcestoo closely associated with the subject, potentially preventing the article from beingverifiable andneutral. Please helpimprove it by replacing them with more appropriatecitations toreliable, independent sources.(February 2019) (Learn how and when to remove this message)
(Learn how and when to remove this message)
Apache ORC
Initial release20 February 2013; 12 years ago (2013-02-20)[1]
Stable release
2.1.2 / 6 May 2025; 6 months ago (2025-05-06)[2]
RepositoryORC Repository
Operating systemCross-platform
TypeDatabase management system
LicenseApache License 2.0
Websiteorc.apache.org

Apache ORC (Optimized Row Columnar) is afree and open-sourcecolumn-oriented data storage format.[3] It is similar to the other columnar-storage file formats available in theHadoop ecosystem such asRCFile andParquet. It is used by most of the data processing frameworksApache Spark,Apache Hive,Apache Flink, andApache Hadoop.

In February 2013, the Optimized Row Columnar (ORC) file format was announced byHortonworks in collaboration withFacebook.[1]A calendar month later, theApache Parquet format was announced, developed byCloudera andTwitter.[4]

Apache ORC format is widely supported includingAmazon Web Services'Glue[5],Google Cloud Platform'sBigQuery,[6] andPandas (software).[7]

History

[edit]
VersionOriginal release dateLatest versionRelease date
Unsupported: 1.02016-01-251.0.02016-01-25
Unsupported: 1.12016-06-101.1.22016-07-08
Unsupported: 1.22016-08-251.2.32016-12-12
Unsupported: 1.32017-01-231.3.42017-10-16
Unsupported: 1.42017-05-081.4.52019-12-09
Unsupported: 1.52018-05-141.5.132021-09-15
Unsupported: 1.62019-09-031.6.142022-04-14
Unsupported: 1.72021-09-151.7.82023-01-21
Supported: 1.82022-09-031.8.92025-05-06
Supported: 1.92023-06-281.9.62025-05-06
Supported: 2.02024-03-082.0.52025-05-06
Latest version:2.12025-01-092.1.22025-05-06
Legend:
Unsupported
Supported
Latest version
Preview version
Future version

See also

[edit]

References

[edit]
  1. ^abAlan Gates (February 20, 2013)."The Stinger Initiative: Making Apache Hive 100 Times Faster".Hortonworks blog. Archived fromthe original on March 28, 2013.
  2. ^"Apache ORC - Releases". Retrieved15 May 2025.
  3. ^Yin Huai, Siyuan Ma, Rubao Lee, Owen O'Malley, and Xiaodong Zhang (2013)."Understanding Insights into the Basic Structure and Essential Issues of Table Placement Methods in Clusters ". VLDB' 39. pp. 1750–1761.CiteSeerX 10.1.1.406.4342.doi:10.14778/2556549.2556559.{{cite conference}}: CS1 maint: multiple names: authors list (link)
  4. ^Justin Kestelyn (March 13, 2013)."Introducing Parquet: Efficient Columnar Storage for Apache Hadoop".Cloudera blog. Archived fromthe original on September 19, 2016. RetrievedMay 4, 2017.
  5. ^"Using the ORC format in AWS Glue".docs.aws.amazon.com. RetrievedAugust 21, 2024.
  6. ^"Load an ORC file".cloud.google.com/bigquery/docs. RetrievedMay 15, 2025.
  7. ^"pandas.read_orc".pandas.pydata.org. RetrievedMay 15, 2025.
Top-level
projects
Commons
Incubator
Other projects
Attic
Licenses
Retrieved from "https://en.wikipedia.org/w/index.php?title=Apache_ORC&oldid=1323979742"
Categories:
Hidden categories:

[8]ページ先頭

©2009-2025 Movatter.jp