mbrtoc16

From cppreference.com

Compiler support
Language
Headers
Type support
Program utilities
Variadic function support
Error handling
Dynamic memory management
Strings library
Algorithms
Numerics
Date and time utilities
Input/output support
Localization support
Concurrency support(C11)
Technical Specifications
Symbol index

[edit]

Strings library

Null-terminated byte strings
Null-terminated multibyte strings
Null-terminated wide strings

[edit]

Null-terminated multibyte strings

Functions

Wide/multibyte conversions

mbsinit

(C95)

mbstowcsmbstowcs_s (C11)
btowc (C95)
mbrtowc (C95)
mbsrtowcsmbsrtowcs_s (C95)(C11)
mbrtoc8 (C23)
c8rtomb (C23)
mbrtoc16 (C11)
c16rtomb (C11)
c32rtomb (C11)
mbrtoc32 (C11)

mblen
mbtowc
wctombwctomb_s (C11)
wcstombswcstombs_s (C11)
wctob (C95)
wcrtombwcrtomb_s (C95)(C11)
wcsrtombswcsrtombs_s (C95)(C11)
mbrlen (C95)

Types

mbstate_t (C95)
char8_t (C23)

char16_t (C11)
char32_t (C11)

Defined in header`<uchar.h>`
size_t mbrtoc16( char16_trestrict pc16,constcharrestrict s, size_t n,mbstate_t*restrict ps);		(since C11)

Converts a single code point from its narrow multibyte character representation to its variable-length 16-bit wide character representation (typically, UTF-16).

Ifs is not a null pointer, inspects at mostn bytes of the multibyte character string, beginning with the byte pointed to bys to determine the number of bytes necessary to complete the next multibyte character (including any shift sequences, and taking into account the current multibyte conversion state*ps). If the function determines that the next multibyte character ins is complete and valid, converts it to the corresponding 16-bit wide character and stores it in*pc16 (ifpc16 is not null).

If the multibyte character in*s corresponds to a multi-char16_t sequence (e.g. a surrogate pair in UTF-16), then after the first call to this function,*ps is updated in such a way that the next call tombrtoc16 will write out the additionalchar16_t, without considering*s.

Ifs is a null pointer, the values ofn andpc16 are ignored and the call is equivalent tombrtoc16(NULL,"",1, ps).

If the wide character produced is the null character, the conversion state*ps represents the initial shift state.

If the macro__STDC_UTF_16__ is defined, the 16-bit encoding used by this function is UTF-16; otherwise, it is implementation-defined.The macro is always defined and the encoding is always UTF-16.(since C23) In any case, the multibyte character encoding used by this function is specified by the currently active C locale.

[edit]Parameters

pc16	-	pointer to the location where the resulting 16-bit wide character will be written
s	-	pointer to the multibyte character string used as input
n	-	limit on the number of bytes in s that can be examined
ps	-	pointer to the conversion state object used when interpreting the multibyte string

[edit]Return value

The first of the following that applies:

0 if the character converted froms (and stored in*pc16 if non-null) was the null character.
The number of bytes[1, n] of the multibyte character successfully converted froms.
(size_t)-3 if the nextchar16_t from a multi-char16_t character (e.g. a surrogate pair) has now been written to*pc16. No bytes are processed from the input in this case.
(size_t)-2 if the nextn bytes constitute an incomplete, but so far valid, multibyte character. Nothing is written to*pc16.
(size_t)-1 if encoding error occurs. Nothing is written to*pc16, the valueEILSEQ is stored inerrno and the value of*ps is unspecified.

[edit]Example

On MSVC you may need the/utf-8 compiler flag for UTF_8 to work properly.

Run this code

#include <locale.h>#include <stdio.h>#include <uchar.h> int main(void){setlocale(LC_ALL,"en_US.utf8");constchar in[]= u8"zß水🍌";// or "z\u00df\u6c34\U0001F34C"enum{ in_sz=sizeof in/sizeof*in}; printf("Processing %d UTF-8 code units: [", in_sz);for(int n=0; n< in_sz;++n)printf("%s%02X", n?" ":"",(unsignedchar)in[n]);puts("]");     char16_t out[in_sz];constchar* p_in= in;constchar* end= in+ in_sz;    char16_t* p_out= out;mbstate_t state={0}; for(size_t rc;(rc= mbrtoc16(p_out, p_in, end- p_in,&state));){if(rc==(size_t)-1)// invalid inputbreak;elseif(rc==(size_t)-2)// truncated inputbreak;elseif(rc==(size_t)-3)// UTF-16 high surrogate            p_out+=1;else{            p_in+= rc;            p_out+=1;};} constsize_t out_sz= p_out- out+1;printf("into %zu UTF-16 code units: [", out_sz);for(size_t x=0; x< out_sz;++x)printf("%s%04X", x?" ":"", out[x]);puts("]");}

Output:

Processing 11 UTF-8 code units: [7A C3 9F E6 B0 B4 F0 9F 8D 8C 00]into 6 UTF-16 code units: [007A 00DF 6C34 D83C DF4C 0000]

[edit]References

C23 standard (ISO/IEC 9899:2024):

7.30.1.3 The mbrtoc16 function (p: 408-409)

C11 standard (ISO/IEC 9899:2011):

7.28.1.1 The mbrtoc16 function (p: 398-399)

[edit]See also

c16rtomb (C11)	converts a UTF-16 character to narrow multibyte encoding (function)[edit]
C++ documentation formbrtoc16

Retrieved from "https://en.cppreference.com/mwiki/index.php?title=c/string/multibyte/mbrtoc16&oldid=181013"

Movatterモバイル変換

cppreference.com

Namespaces

Variants

Views

Actions

mbrtoc16

Contents

[edit]Parameters

[edit]Return value

[edit]Example

[edit]References

[edit]See also

Navigation

Toolbox