std::codecvt_mode

From cppreference.com

Defined in header `<codecvt>`
enum codecvt_mode {
    consume_header = 4,
    generate_header = 2,
    little_endian = 1
};	(since C++11) (deprecated in C++17) (removed in C++26)

The facets std::codecvt_utf8, std::codecvt_utf16, and std::codecvt_utf8_utf16 accept an optional value of type std::codecvt_mode as a template argument, which specifies optional features of the Unicode string conversion.

Constants

Defined in header `<codecvt>`
Enumerator	Meaning
`little_endian`	assume the input is in little-endian byte order (applies to UTF-16 input only; the default is big-endian)
`consume_header`	consume the byte order mark, if present at the start of the input sequence, and (in the case of UTF-16) rely on the byte order it specifies for decoding the rest of the input
`generate_header`	output the byte order mark at the start of the output sequence

The recognized byte order marks are:

`0xfe 0xff`	UTF-16 big-endian
`0xff 0xfe`	UTF-16 little-endian
`0xef 0xbb 0xbf`	UTF-8 (no effect on endianness)

If std::consume_header is not selected when reading a file beginning with a byte order mark, the Unicode character U+FEFF (ZERO WIDTH NO-BREAK SPACE) will be read as the first character of the string content.

Example

The following example demonstrates consuming the UTF-8 BOM:

#include <codecvt>
#include <cwchar>
#include <fstream>
#include <iostream>
#include <locale>
#include <string>

int main()
{
    // UTF-8 data with BOM
    std::ofstream{"text.txt"} << "\ufeffz\u6c34\U0001d10b";

    // read the UTF-8 file, skipping the BOM
    std::wifstream fin{"text.txt"};
    fin.imbue(std::locale(fin.getloc(),
                          new std::codecvt_utf8<wchar_t, 0x10ffff, std::consume_header>));

    for (wchar_t c; fin.get(c);)
        std::cout << std::hex << std::showbase << (std::wint_t)c << '\n';
}

Output:

0x7a
0x6c34
0x1d10b

See also

codecvt	converts between character encodings, including UTF-8, UTF-16, UTF-32 (class template)
codecvt_utf8 (C++11)(deprecated in C++17)(removed in C++26)	converts between UTF-8 and UCS-2/UCS-4 (class template)
codecvt_utf16 (C++11)(deprecated in C++17)(removed in C++26)	converts between UTF-16 and UCS-2/UCS-4 (class template)
codecvt_utf8_utf16 (C++11)(deprecated in C++17)(removed in C++26)	converts between UTF-8 and UTF-16 (class template)

Retrieved from "https://en.cppreference.com/mwiki/index.php?title=cpp/locale/codecvt_mode&oldid=182039"
