Back to module index
Go to module by name
unicodedata
This module provides access to the Unicode Character Database which
defines character properties for all Unicode characters. The data in
this database is based on the UnicodeData.txt file version
13.0.0 which is publicly available from ftp://ftp.unicode.org/.
The module uses the same names and symbols as defined by the
UnicodeData File Format 13.0.0.
Classes
UCD
bidirectional(self, chr, /)
Returns the bidirectional class assigned to the character chr as string.
If no such value is defined, an empty string is returned.
category(self, chr, /)
Returns the general category assigned to the character chr as string.
combining(self, chr, /)
Returns the canonical combining class assigned to the character chr as integer.
Returns 0 if no combining class is defined.
decimal(...)
Converts a Unicode character into its equivalent decimal value.
Returns the decimal value assigned to the character chr as integer.
If no such value is defined, default is returned, or, if not given,
ValueError is raised.
decomposition(self, chr, /)
Returns the character decomposition mapping assigned to the character chr as string.
An empty string is returned in case no such mapping is defined.
digit(...)
Converts a Unicode character into its equivalent digit value.
Returns the digit value assigned to the character chr as integer.
If no such value is defined, default is returned, or, if not given,
ValueError is raised.
east_asian_width(self, chr, /)
Returns the east asian width assigned to the character chr as string.
is_normalized(self, form, unistr, /)
Return whether the Unicode string unistr is in the normal form 'form'.
Valid values for form are 'NFC', 'NFKC', 'NFD', and 'NFKD'.
lookup(self, name, /)
Look up character by name.
If a character with the given name is found, return the
corresponding character. If not found, KeyError is raised.
mirrored(self, chr, /)
Returns the mirrored property assigned to the character chr as integer.
Returns 1 if the character has been identified as a "mirrored"
character in bidirectional text, 0 otherwise.
name(...)
Returns the name assigned to the character chr as a string.
If no name is defined, default is returned, or, if not given,
ValueError is raised.
normalize(self, form, unistr, /)
Return the normal form 'form' for the Unicode string unistr.
Valid values for form are 'NFC', 'NFKC', 'NFD', and 'NFKD'.
numeric(...)
Converts a Unicode character into its equivalent numeric value.
Returns the numeric value assigned to the character chr as float.
If no such value is defined, default is returned, or, if not given,
ValueError is raised.
unidata_version = <member 'unidata_version' of 'unicodedata.UCD' objects>
Functions
bidirectional
bidirectional(chr, /)
Returns the bidirectional class assigned to the character chr as string.
If no such value is defined, an empty string is returned.
category
category(chr, /)
Returns the general category assigned to the character chr as string.
combining
combining(chr, /)
Returns the canonical combining class assigned to the character chr as integer.
Returns 0 if no combining class is defined.
decimal
decimal(...)
Converts a Unicode character into its equivalent decimal value.
Returns the decimal value assigned to the character chr as integer.
If no such value is defined, default is returned, or, if not given,
ValueError is raised.
decomposition
decomposition(chr, /)
Returns the character decomposition mapping assigned to the character chr as string.
An empty string is returned in case no such mapping is defined.
digit
digit(...)
Converts a Unicode character into its equivalent digit value.
Returns the digit value assigned to the character chr as integer.
If no such value is defined, default is returned, or, if not given,
ValueError is raised.
east_asian_width
east_asian_width(chr, /)
Returns the east asian width assigned to the character chr as string.
is_normalized
is_normalized(form, unistr, /)
Return whether the Unicode string unistr is in the normal form 'form'.
Valid values for form are 'NFC', 'NFKC', 'NFD', and 'NFKD'.
lookup
lookup(name, /)
Look up character by name.
If a character with the given name is found, return the
corresponding character. If not found, KeyError is raised.
mirrored
mirrored(chr, /)
Returns the mirrored property assigned to the character chr as integer.
Returns 1 if the character has been identified as a "mirrored"
character in bidirectional text, 0 otherwise.
name
name(...)
Returns the name assigned to the character chr as a string.
If no name is defined, default is returned, or, if not given,
ValueError is raised.
normalize
normalize(form, unistr, /)
Return the normal form 'form' for the Unicode string unistr.
Valid values for form are 'NFC', 'NFKC', 'NFD', and 'NFKD'.
numeric
numeric(...)
Converts a Unicode character into its equivalent numeric value.
Returns the numeric value assigned to the character chr as float.
If no such value is defined, default is returned, or, if not given,
ValueError is raised.
Other members
ucd_3_2_0 = <unicodedata.UCD object at 0x7ff35ed75210>
ucnhash_CAPI = <capsule object "unicodedata.ucnhash_CAPI" at 0x7ff35ed75cf0>
unidata_version = '13.0.0'