Extensions and enhancements to java.text that support Unicode transforms, UnicodeSet, surrogate character utilities, UCA collation, normalization, break iteration (rule-based and dictionary-based), enhanced number formatting, international string searching, and Arabic shaping.
- Unicode Transforms (Transliteration) convert between different representations of Unicode text.
- UnicodeSet provides set operations on Unicode characters and strings; sets are representable as compact expressions.
- Surrogate character utilities (UTF16) provide indexing and substring operations on text containing surrogates.
- UCA collation implements the current Unicode Collation Algorithm. Data for many locale-specific collations is provided.
- Normalization supports standard Unicode normalization forms as well as FCD.
- Break iteration supports character, word, line, sentence, and title case breaks.
- Enhanced number formatting can 'spell out' numbers and adds padding and rounding control.
- International string searching is language-sensitive and uses collation.
- Arabic shaping converts between shaped and unshaped Arabic characters and digits.
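
To make three of the items above concrete (surrogate handling, normalization, and word break iteration), here is a minimal sketch. It is an assumption-laden illustration: it deliberately uses only the JDK's own java.lang and java.text classes so that it runs without this package on the classpath, and it does not show this package's API. The class name TextBasicsSketch and its method names are hypothetical, chosen only for this example.

```java
import java.text.BreakIterator;
import java.text.Normalizer;
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;

// Hypothetical illustration class; JDK-only stand-ins for the concepts listed above.
class TextBasicsSketch {

    // Surrogates: count code points rather than UTF-16 code units,
    // so a supplementary character (one surrogate pair) counts once.
    static int codePointLength(String s) {
        return s.codePointCount(0, s.length());
    }

    // Normalization: compose to NFC, one of the standard Unicode normalization forms.
    static String toNFC(String s) {
        return Normalizer.normalize(s, Normalizer.Form.NFC);
    }

    // Break iteration: walk word boundaries and keep only the segments
    // that start with a letter or digit (skipping spaces and punctuation).
    static List<String> words(String text) {
        BreakIterator it = BreakIterator.getWordInstance(Locale.ROOT);
        it.setText(text);
        List<String> out = new ArrayList<>();
        int start = it.first();
        for (int end = it.next(); end != BreakIterator.DONE; start = end, end = it.next()) {
            String piece = text.substring(start, end);
            if (Character.isLetterOrDigit(piece.codePointAt(0))) {
                out.add(piece);
            }
        }
        return out;
    }

    public static void main(String[] args) {
        String withSurrogates = "a\uD83D\uDE00b";   // 'a', U+1F600, 'b'
        System.out.println(withSurrogates.length());          // UTF-16 code units
        System.out.println(codePointLength(withSurrogates));  // code points
        System.out.println(toNFC("e\u0301").length());        // e + combining acute, composed
        System.out.println(words("Hello, world"));
    }
}
```

The sketch separates the three concerns into small static methods so each can be read against its list item in isolation. Collation, transliteration, spell-out number formatting, and Arabic shaping are left out because the JDK has no direct equivalent for most of them.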