Unicode text processing algorithms

E564767

Unicode text processing algorithms are standardized procedures that define how Unicode text is compared, sorted, segmented, normalized, and otherwise manipulated consistently across different systems and languages.

Try in SPARQL Jump to: Surface forms Statements Referenced by

Observed surface forms (1)

Surface form Occurrences
Generic String Encoding Rules 1

Statements (50)

Predicate Object
instanceOf Unicode Standard component
text processing standard
appliesTo Unicode code points
Unicode strings
definedBy Unicode Consortium NERFINISHED
designedFor cross-platform interoperability
multilingual text
documentedIn Unicode Standard Annex #10 NERFINISHED
Unicode Standard Annex #14 NERFINISHED
Unicode Standard Annex #15 NERFINISHED
Unicode Standard Annex #29 NERFINISHED
Unicode Standard Annex #31 NERFINISHED
Unicode Standard Annex #44 NERFINISHED
Unicode Standard Annex #9 NERFINISHED
ensures locale-independent default behavior
stable normalization forms
hasAspect bidirectional text handling
case mapping
collation
grapheme cluster segmentation
identifier processing
line breaking
numeric value processing
script detection
text comparison
text normalization
text segmentation
text sorting
word breaking
hasPurpose enable language-independent text processing
ensure consistent handling of Unicode text across systems
includesAlgorithm Case Mapping Algorithms
Grapheme Cluster Boundary Rules NERFINISHED
Identifier and Pattern Syntax Rules NERFINISHED
Line Breaking Algorithm NERFINISHED
Sentence Boundary Rules NERFINISHED
Unicode Bidirectional Algorithm NERFINISHED
Unicode Collation Algorithm NERFINISHED
Unicode Normalization Algorithm NERFINISHED
Word Boundary Rules
partOf Unicode Standard NERFINISHED
requires Unicode Character Database NERFINISHED
standardizedIn Unicode Standard Annexes NERFINISHED
Unicode Technical Reports NERFINISHED
Unicode Technical Standards NERFINISHED
supports locale-specific tailoring
usedBy databases
operating systems
programming languages
text rendering engines

Referenced by (2)

Full triples — surface form annotated when it differs from this entity's canonical label.

Mark Davis helpedStandardize Unicode text processing algorithms
GSER fullName Unicode text processing algorithms
this entity surface form: Generic String Encoding Rules