Movatterモバイル変換


[0]ホーム

URL:


Jump to content
WikipediaThe Free Encyclopedia
Search

Tab-separated values

From Wikipedia, the free encyclopedia
Text file format
Tab-separated values
Filename extension.tsv,.tab[1]
Internet media type
text/tab-separated-values
Uniform Type Identifier (UTI)public.tab-separated-values-text[2]
UTI conformationpublic.delimited-values-text[2]
Developed byUniversity of Minnesota InternetGopher Team

Internet Assigned Numbers Authority
Initial releasec. June 1993; 31 years ago (1993-06)
Type of formatDelimiter-separated values format
Container fordatabase information organized as field separated lists
StandardIANAMIME type

Tab-separated values (TSV) is a simple,text-basedfile format for storingtabular data.[3]Records are separated bynewlines, and values within a record are separated bytab characters. The TSV format is thus adelimiter-separated values format, similar tocomma-separated values.

TSV is a simple file format that is widely supported, so it is often used indata exchange to move tabular data between different computer programs that support the format. For example, a TSV file might be used to transfer information from adatabase to aspreadsheet.

Example

[edit]

The head of theIris flower data set can be stored as a TSV using the followingplain text (note that the HTML rendering may convert tabs to spaces):

Sepal length	Sepal width	Petal length	Petal width	Species5.1	3.5	1.4	0.2	I. setosa4.9	3.0	1.4	0.2	I. setosa4.7	3.2	1.3	0.2	I. setosa4.6	3.1	1.5	0.2	I. setosa5.0	3.6	1.4	0.2	I. setosa

The TSV plain text above corresponds to the following tabular data:

Sepal lengthSepal widthPetal lengthPetal widthSpecies
5.13.51.40.2I. setosa
4.93.01.40.2I. setosa
4.73.21.30.2I. setosa
4.63.11.50.2I. setosa
5.03.61.40.2I. setosa

Character escaping

[edit]

TheIANAmedia type standard for TSV achieves simplicity by simply disallowing tabs within fields.[4]

Since the values in the TSV format cannot contain literal tabs ornewline characters, a convention is necessary for lossless conversion of text values with these characters. A common convention is to perform the followingescapes:[5][6]

escape sequencemeaning
\nline feed
\ttab
\rcarriage return
\\backslash

Another common convention is to use the CSV convention fromRFC 4180 and enclose values containing tabs or newlines in double quotes. This can lead to ambiguities.[7][8]

Line endings

[edit]

Records are typically separated by a line feed, as is typical for Unix platforms, or a carriage return and line feed, as is typical for Microsoft platforms. Some programs may expect the latter. The de-facto specification[9] specifies that records are separated by anEOL, but does not specify any specificnewline.

See also

[edit]

References

[edit]
  1. ^U of Edin. Research Data Support Team."Choose the best file formats". University of Edinburgh. § Formats we recommend. Retrieved23 May 2023.
  2. ^ab"tabSeparatedText".Apple Developer Documentation: Uniform Type Identifiers.Apple Inc. Retrieved23 May 2023.
  3. ^"How To Use Tab Separated Value (TSV) files". International Monetary Fund. Retrieved1 February 2023.
  4. ^Lindner 1993.
  5. ^Dusek, Jason (6 May 2014)."Linear TSV: simple, line-oriented, tabular data".Data Protocols - Open Knowledge Foundation (v1.0β ed.).
  6. ^Dolan, Stephen (1 November 2018)."jq Manual".jq. Retrieved23 May 2023.
  7. ^Miller, Rob (22 September 2015).Text Processing with Ruby: Extract Value from the Data That Surrounds You. Pragmatic Bookshelf. p. 94.ISBN 978-1-68050-492-7.
  8. ^Giuseppini, Gabriele; Burnett, Mark (10 February 2005).Microsoft Log Parser Toolkit: A Complete Toolkit for Microsoft's Undocumented Log Analysis Tool. Elsevier. p. 311.ISBN 978-0-08-048939-1.
  9. ^"IANA: text/tab-separated-values".

Sources

[edit]

Further reading

[edit]
Retrieved from "https://en.wikipedia.org/w/index.php?title=Tab-separated_values&oldid=1288577446"
Categories:
Hidden categories:

[8]ページ先頭

©2009-2025 Movatter.jp