gh-74902: Add Unicode Grapheme Cluster Break algorithm (GH-143076)

Add the unicodedata.iter_graphemes() function to iterate over grapheme
clusters according to rules defined in Unicode Standard Annex #29.

Add unicodedata.grapheme_cluster_break(), unicodedata.indic_conjunct_break()
and unicodedata.extended_pictographic() functions to get the character
properties used by this algorithm.
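
To illustrate what iterating over grapheme clusters means, here is a
minimal sketch in pure Python. It is not the implementation added by this
commit: the helper name simple_clusters is hypothetical, and it approximates
only two UAX #29 rules (no break between CR and LF, no break before a
combining mark or ZERO WIDTH JOINER), using the general category as a
stand-in for the real Grapheme_Cluster_Break property. Hangul syllables,
emoji sequences, regional indicators and Indic conjuncts are not handled.

```python
import unicodedata


def simple_clusters(text):
    """Split text into approximate grapheme clusters.

    Covers only a small subset of UAX #29:
      * GB3: keep CR LF together.
      * GB9/GB9a (approximated): attach combining marks (categories
        Mn, Mc, Me) and ZERO WIDTH JOINER to the preceding character,
        unless that character is a control (category Cc).
    """
    clusters = []
    for ch in text:
        if clusters:
            prev = clusters[-1]
            is_extend = (
                unicodedata.category(ch) in ("Mn", "Mc", "Me")
                or ch == "\u200d"
            )
            keep_crlf = prev[-1] == "\r" and ch == "\n"
            attach = is_extend and unicodedata.category(prev[-1]) != "Cc"
            if keep_crlf or attach:
                clusters[-1] = prev + ch
                continue
        clusters.append(ch)
    return clusters


sample = "e\u0301a\r\nb"  # "e" + COMBINING ACUTE ACCENT, "a", CR LF, "b"
print(simple_clusters(sample))

# On an interpreter that has the new function, the full algorithm can be
# used instead. Assumption: each yielded item converts to the cluster text
# with str(); check the unicodedata documentation for the exact interface.
if hasattr(unicodedata, "iter_graphemes"):
    print([str(g) for g in unicodedata.iter_graphemes(sample)])
```

For the sample string the sketch yields four clusters from six code
points, since the accent stays with its base letter and CR LF is kept as a
single unit.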

Co-authored-by: Guillaume "Vermeille" Sanchez <guillaume.v.sanchez@gmail.com>
Serhiy Storchaka committed
bab1d7a561ab015dd6bb97e255fd12a8ce367edf
Parent: 0e0d51c
Committed by GitHub <noreply@github.com> on 1/14/2026, 2:37:57 PM