std::mblen

From cppreference.com

Compiler support
Freestanding and hosted
Language
Standard library
Standard library headers
Named requirements
Feature test macros(C++20)
Language support library
Concepts library(C++20)
Diagnostics library
Memory management library
Metaprogramming library(C++11)
General utilities library
Containers library
Iterators library
Ranges library(C++20)
Algorithms library
Strings library
Text processing library
Numerics library
Date and time library
Input/output library
Filesystem library(C++17)
Concurrency support library(C++11)
Execution control library(C++26)
Technical specifications
Symbols index
External libraries

[edit]

Text processing library

Localization library

Regular expressions library(C++11)

Formatting library(C++20)

Null-terminated sequence utilities

Byte strings

Multibyte strings

Wide strings

Primitive numeric conversions

to_chars (C++17)
to_chars_result (C++17)
from_chars (C++17)
from_chars_result (C++17)
chars_format (C++17)

Text encoding identifications

text_encoding

(C++26)

[edit]

Null-terminated multibyte strings

Functions

Wide/multibyte examination

mblen
mbrlen

mbsinit

Multibyte/wide conversions

mbtowc
mbstowcs
btowc
mbrtowc
mbsrtowcs
wctomb
wcstombs
wctob

wcrtomb
wcsrtombs
mbrtoc8 (C++20)
mbrtoc16 (C++11)
mbrtoc32 (C++11)
c8rtomb (C++20)
c16rtomb (C++11)
c32rtomb (C++11)

Types

mbstate_t

Macros

MB_LEN_MAX MB_CUR_MAX
__STDC_UTF_16__ __STDC_UTF_32__ (C++11)(C++11)

[edit]

Defined in header`<cstdlib>`
int mblen(constchar* s,std::size_t n);

Determines the size, in bytes, of the multibyte character whose first byte is pointed to bys.

Ifs is a null pointer, resets the global conversion state and determines whether shift sequences are used.

This function is equivalent to the callstd::mbtowc(nullptr, s, n), except that conversion state ofstd::mbtowc is unaffected.

[edit]Notes

Each call tomblen updates the internal global conversion state (a static object of typestd::mbstate_t, only known to this function). If the multibyte encoding uses shift states, care must be taken to avoid backtracking or multiple scans. In any case, multiple threads should not callmblen without synchronization:std::mbrlen may be used instead.

[edit]Parameters

s	-	pointer to the multibyte character
n	-	limit on the number of bytes in s that can be examined

[edit]Return value

Ifs is not a null pointer, returns the number of bytes that are contained in the multibyte character or-1 if the first bytes pointed to bys do not form a valid multibyte character or0 ifs is pointing at the null character'\0'.

Ifs is a null pointer, resets its internal conversion state to represent the initial shift state and returns0 if the current multibyte encoding is not state-dependent (does not use shift sequences) or a non-zero value if the current multibyte encoding is state-dependent (uses shift sequences).

[edit]Example

Run this code

#include <clocale>#include <cstdlib>#include <iomanip>#include <iostream>#include <stdexcept>#include <string_view> // the number of characters in a multibyte string is the sum of mblen()'s// note: the simpler approach is std::mbstowcs(nullptr, s.c_str(), s.size())std::size_t strlen_mb(conststd::string_view s){    std::mblen(nullptr,0);// reset the conversion statestd::size_t result=0;constchar* ptr= s.data();for(constchar*const end= ptr+ s.size(); ptr< end;++result){constint next= std::mblen(ptr, end- ptr);if(next==-1)throwstd::runtime_error("strlen_mb(): conversion error");        ptr+= next;}return result;} void dump_bytes(conststd::string_view str){std::cout<<std::hex<<std::uppercase<<std::setfill('0');for(unsignedchar c: str)std::cout<<std::setw(2)<<static_cast<int>(c)<<' ';std::cout<<std::dec<<'\n';} int main(){// allow mblen() to work with UTF-8 multibyte encodingstd::setlocale(LC_ALL,"en_US.utf8");// UTF-8 narrow multibyte encodingconststd::string_view str="z\u00df\u6c34\U0001f34c";// or u8"zß水🍌"std::cout<<std::quoted(str)<<" is "<< strlen_mb(str)<<" characters, but as much as "<< str.size()<<" bytes: ";    dump_bytes(str);}

Possible output:

"zß水🍌" is 4 characters, but as much as 10 bytes: 7A C3 9F E6 B0 B4 F0 9F 8D 8C

[edit]See also

mbtowc	converts the next multibyte character to wide character (function)[edit]
mbrlen	returns the number of bytes in the next multibyte character, given state (function)[edit]
C documentation formblen

Retrieved from "https://en.cppreference.com/mwiki/index.php?title=cpp/string/multibyte/mblen&oldid=153040"

Movatterモバイル変換

cppreference.com

Namespaces

Variants

Views

Actions

std::mblen

Contents

[edit]Notes

[edit]Parameters

[edit]Return value

[edit]Example

[edit]See also

Navigation

Toolbox