Namespaces
Variants
Views
Actions

Difference between revisions of "cpp/string/byte/isalpha"

From cppreference.com
< cpp‎ | string‎ | byte
m (Undo revision 116385 by 83.168.109.73 (talk) back to "or")
m (`not equal to EOF` is necessary, but not sufficient to cause UB.)
Line 11: Line 11:
 
In locales other than {{c|"C"}}, an alphabetic character is a character for which {{lc|std::isupper()}} or {{lc|std::islower()}} returns non-zero or any other character considered alphabetic by the locale. In any case, {{lc|std::iscntrl()}}, {{lc|std::isdigit()}}, {{lc|std::ispunct()}} and {{lc|std::isspace()}} will return zero for this character.
 
In locales other than {{c|"C"}}, an alphabetic character is a character for which {{lc|std::isupper()}} or {{lc|std::islower()}} returns non-zero or any other character considered alphabetic by the locale. In any case, {{lc|std::iscntrl()}}, {{lc|std::isdigit()}}, {{lc|std::ispunct()}} and {{lc|std::isspace()}} will return zero for this character.
  
The behavior is undefined if the value of {{tt|ch}} is not representable as {{c|unsigned char}} or is not equal to {{c|EOF}}.
+
The behavior is undefined if the value of {{tt|ch}} is not representable as {{c|unsigned char}} and is not equal to {{c|EOF}}.
  
 
===Parameters===
 
===Parameters===

Revision as of 11:56, 31 July 2020

Defined in header <cctype>
int isalpha( int ch );

Checks if the given character is an alphabetic character as classified by the currently installed C locale. In the default locale, the following characters are alphabetic:

  • uppercase letters ABCDEFGHIJKLMNOPQRSTUVWXYZ
  • lowercase letters abcdefghijklmnopqrstuvwxyz

In locales other than "C", an alphabetic character is a character for which std::isupper() or std::islower() returns non-zero or any other character considered alphabetic by the locale. In any case, std::iscntrl(), std::isdigit(), std::ispunct() and std::isspace() will return zero for this character.

The behavior is undefined if the value of ch is not representable as unsigned char and is not equal to EOF.

Contents

Parameters

ch - character to classify

Return value

Non-zero value if the character is an alphabetic character, zero otherwise.

Notes

Like all other functions from <cctype>, the behavior of std::isalpha is undefined if the argument's value is neither representable as unsigned char nor equal to EOF. To use these functions safely with plain chars (or signed chars), the argument should first be converted to unsigned char:

bool my_isalpha(char ch)
{
    return std::isalpha(static_cast<unsigned char>(ch));
}

Similarly, they should not be directly used with standard algorithms when the iterator's value type is char or signed char. Instead, convert the value to unsigned char first:

int count_alphas(const std::string& s)
{
    return std::count_if(s.begin(), s.end(),
                      // static_cast<int(*)(int)>(std::isalpha)         // wrong
                      // [](int c){ return std::isalpha(c); }           // wrong
                      // [](char c){ return std::isalpha(c); }          // wrong
                         [](unsigned char c){ return std::isalpha(c); } // correct
                        );
}

Example

Demonstrates the use of isalpha() with different locales (OS-specific).

#include <iostream>
#include <cctype> 
#include <clocale>
 
int main()
{        
    unsigned char c = '\xdf'; // German letter ß in ISO-8859-1
 
    std::cout << "isalpha(\'\\xdf\', default C locale) returned "
               << std::boolalpha << (bool)std::isalpha(c) << '\n';
 
    std::setlocale(LC_ALL, "de_DE.iso88591");
    std::cout << "isalpha(\'\\xdf\', ISO-8859-1 locale) returned "
              << std::boolalpha << (bool)std::isalpha(c) << '\n';
 
}

Output:

isalpha('\xdf', default C locale) returned false
isalpha('\xdf', ISO-8859-1 locale) returned true


See also

checks if a character is classified as alphabetic by a locale
(function template) [edit]
checks if a wide character is alphabetic
(function) [edit]
C documentation for isalpha
ASCII values characters

iscntrl
iswcntrl

isprint
iswprint

isspace
iswspace

isblank
iswblank

isgraph
iswgraph

ispunct
iswpunct

isalnum
iswalnum

isalpha
iswalpha

isupper
iswupper

islower
iswlower

isdigit
iswdigit

isxdigit
iswxdigit

decimal hexadecimal octal
0–8 \x0\x8 \0\10 control codes (NUL, etc.) ≠0 0 0 0 0 0 0 0 0 0 0 0
9 \x9 \11 tab (\t) ≠0 0 ≠0 ≠0 0 0 0 0 0 0 0 0
10–13 \xA\xD \12\15 whitespaces (\n, \v, \f, \r) ≠0 0 ≠0 0 0 0 0 0 0 0 0 0
14–31 \xE\x1F \16\37 control codes ≠0 0 0 0 0 0 0 0 0 0 0 0
32 \x20 \40 space 0 ≠0 ≠0 ≠0 0 0 0 0 0 0 0 0
33–47 \x21\x2F \41\57 !"#$%&'()*+,-./ 0 ≠0 0 0 ≠0 ≠0 0 0 0 0 0 0
48–57 \x30\x39 \60\71 0123456789 0 ≠0 0 0 ≠0 0 ≠0 0 0 0 ≠0 ≠0
58–64 \x3A\x40 \72\100 :;<=>?@ 0 ≠0 0 0 ≠0 ≠0 0 0 0 0 0 0
65–70 \x41\x46 \101\106 ABCDEF 0 ≠0 0 0 ≠0 0 ≠0 ≠0 ≠0 0 0 ≠0
71–90 \x47\x5A \107\132 GHIJKLMNOP
QRSTUVWXYZ
0 ≠0 0 0 ≠0 0 ≠0 ≠0 ≠0 0 0 0
91–96 \x5B\x60 \133\140 [\]^_` 0 ≠0 0 0 ≠0 ≠0 0 0 0 0 0 0
97–102 \x61\x66 \141\146 abcdef 0 ≠0 0 0 ≠0 0 ≠0 ≠0 0 ≠0 0 ≠0
103–122 \x67\x7A \147\172 ghijklmnop
qrstuvwxyz
0 ≠0 0 0 ≠0 0 ≠0 ≠0 0 ≠0 0 0
123–126 \x7B\x7E \172\176 {|}~ 0 ≠0 0 0 ≠0 ≠0 0 0 0 0 0 0
127 \x7F \177 backspace character (DEL) ≠0 0 0 0 0 0 0 0 0 0 0 0