Movatterモバイル変換


[0]ホーム

URL:


Jump to content
WikipediaThe Free Encyclopedia
Search

8-bit clean

From Wikipedia, the free encyclopedia
Computer system that correctly handles 8-bit character encodings
icon
This articleneeds additional citations forverification. Please helpimprove this article byadding citations to reliable sources. Unsourced material may be challenged and removed.
Find sources: "8-bit clean" – news ·newspapers ·books ·scholar ·JSTOR
(June 2008) (Learn how and when to remove this message)

Incomputer networking, a system is8-bit clean if it processes8-bitcharacter encodings without altering thehigh bit or treating anybyte as anin-band control code. This property can describe both acommunications protocol and the software and devices that implement such protocols. Although many early email systems only supported 7-bit data, the vast majority of modern email systems are 8-bit clean.

History

[edit]

Until the early 1990s, many programs and data transmission channels werecharacter-oriented and treated some characters likeend-of-text (ETX) ascontrol characters. Others assumed a stream of seven-bit characters, with values between 0 and 127; for example, theASCII standard used only seven bits per character,avoiding an eight-bit representation in order to save on data transmission costs. On computers and data links using8-bit bytes, this left the topbit of each byte free for use as aparity bit,flag bit, or metadata control bit. Seven-bit systems and data links are unable to directly handle more complex character codes which are commonplace in non-English-speaking countries with largeralphabets.

Binary files consisting of 8-bitoctets cannot be transmitted through 7-bit data channels directly. To work around this,binary-to-text encodings have been devised which use only 7-bitASCII characters. Some of these encodings areuuencoding,Ascii85,SREC,BinHex,kermit andMIME'sBase64.EBCDIC-based systems cannot handle all characters used in UUencoded data.[clarification needed (seetalk)] However, the base64 encoding does not have this problem.

SMTP and NNTP

[edit]

Historically, various media were used to transfer messages, some of which only supported 7-bit data, so an 8-bit message had high chances to begarbled during transmission in the 20th century. Some implementations ignored the formal discouraging of 8-bit data and allowed bytes with thehigh bit set to pass through. Such implementations are said to be 8-bit clean. In general, acommunications protocol is said to be 8-bit clean if it correctly passes through the high bit of each byte in the communication process.

Many early communications protocol standards, such asRFC 780,788,821,2821,5321 (forSMTP),RFC 977 (forNNTP) andRFC 1056, were designed to work over such "7-bit" communication links. They specifically require the use of ASCII "transmitted as an 8-bit byte with the high-order bit cleared to zero", and some of these[1] explicitly restrictall data to 7-bit characters.

For the first few decades of email networks (1971 to the early 1990s), most email messages wereplain text in the 7-bit US-ASCII character set.[2]

The RFC 788 definition of SMTP, like its predecessor RFC 780, limits Internet Mail to lines (1000 characters or less) of 7-bit US-ASCII characters.[3][4][5][6]

Later, the format of email messages was redefined in order to support messages that are not entirely US-ASCII text (text messages in character sets other than US-ASCII, and non-text messages, such as audio and images).[6] The header field Content-Transfer-Encoding=binary[a] requires an 8-bit clean transport.

RFC 3977[7] specifies that "NNTP operates over any reliable bi-directional 8-bit-wide data stream channel" and changes the character set for commands toUTF-8. However, RFC 5536[8] still limits the character set to ASCII, including RFC 2047[9] and RFC 2231[10] MIME encoding of non-ASCII data.

The Internet community generally adds features byextension, allowing communication in both directions between upgraded machines and not-yet-upgraded machines, rather than declaring formerly standards-compliant legacy software to be broken and requiring that all software worldwide be upgraded to the latest standard. The recommended way to take advantage of 8-bit clean links between machines is to use the ESMTP (RFC 1869)8BITMIME extension[11][12] for message bodies and the SMTPSMTPUTF8[13] extension for message headers. Despite this, somemail transfer agents, notablyExim andqmail, relay mail to servers that do not advertise 8BITMIME without performing the conversion to 7-bit MIME (typicallyquoted-printable, "Q-P conversion") required byRFC 6152. This "just-send-8" attitude does not, in fact, cause problems in practice because virtually all modern email servers are 8-bit clean.[14]

See also

[edit]

Notes

[edit]
  1. ^The header field Content-Transfer-Encoding=8BIT does not designate 8-bit clean, sinceCRLF has special significance.

References

[edit]
  1. ^RFC 780: Appendix A,RFC 788: 4.5.2.,RFC 821: Appendix B,RFC 1056: 4.
  2. ^ John Beck."Email Explained". 2011.
  3. ^Jonathan B. Postel (November 1981). "4.5.3. SIZES".SIMPLE MAIL TRANSFER PROTOCOL.doi:10.17487/RFC0788.RFC788.The maximum total length of a text line including the <CRLF> is 1000 characters (but not counting the leading dot duplicated for transparency).
  4. ^G. Vaudreuil (February 1993). "2. The Problem".Transition of Internet Mail from Just-Send-8 to 8bit-SMTP/MIME.doi:10.17487/RFC1428.RFC1428.SMTP as defined in RFC 821 limits the sending of Internet Mail to US-ASCII characters.
  5. ^ Dan Sugalski."E-mail with Attachments". "The Perl Journal". Summer 1999. "When mail was standardized way back in 1982 with RFC822, ... The only limits placed on the body were the character set (7-bit ASCII) and the maximum line length (1000 characters)."
  6. ^abN. Freed; N. Borenstein (November 1996). "Abstract".Multipurpose Internet Mail Extensions (MIME) Part One: Format of Internet Message Bodies.doi:10.17487/RFC2045.RFC2045.Multipurpose Internet Mail Extensions, orMIME, redefines the format of messages
  7. ^C. Feather (October 2006).Network News Transfer Protocol (NNTP).doi:10.17487/RFC3977.RFC3977.
  8. ^C. Lindsey; D. Kohn (November 2009). K. Murchison (ed.).Netnews Article Format.doi:10.17487/RFC5536.RFC5536.
  9. ^K. Moore (November 1996).MIME (Multipurpose Internet Mail Extensions) Part Three: Message Header Extensions for Non-ASCII Text. Network Working Group.doi:10.17487/RFC2047.RFC2047.Draft Standard. Updated byRFC 2184 and2231. ObsoletesRFC 1590,1522 and1521.
  10. ^N. Freed; K. Moore (November 1997).MIME Parameter Value and Encoded Word Extensions: Character Sets, Languages, and Continuations.doi:10.17487/RFC2231.RFC2231.
  11. ^Theodore Ts'o;Keith Moore;Mark Crispin (12 September 1994)."8-bit transmission in NNTP".IETF-SMTP mail list. Archived fromthe original on 20 March 2012. Retrieved3 April 2010.
  12. ^"comp.mail.mime FAQ, part 3 'What's ESMTP, and how does it affect MIME?'".Usenet FAQs. 8 August 1997. Archived fromthe original on 18 January 2012. Retrieved3 April 2010.
  13. ^J. Yao; W. Mao (February 2012).SMTP Extension for Internationalized Email.doi:10.17487/RFC8531.RFC8531.
  14. ^"The 8BITMIME extension".
Retrieved from "https://en.wikipedia.org/w/index.php?title=8-bit_clean&oldid=1316160849"
Category:
Hidden categories:

[8]ページ先頭

©2009-2026 Movatter.jp