Unicode Utilities Change Log
- Version 2.27
- This release updates the character data to Unicode 5.1 and fixes a bug in the -V option of
uniname as well as couple of other minor bugs.
- Version 2.26
- This release adds unifuzz, a utility that generates test input
for programs expecting Unicode. Unifuzz can generate a random string
of characters, tokens of various potentially problematic characters and sequences,
very long lines, strings with embedded nulls, and ill-formed UTF-8. unirev
is renamed unireverse.
- Version 2.25
- Adds to unidesc the option -r which causes it to list the ranges
detected after reading all input rather than listing them as they are encountered,
and adds to uniname the option -B which causes it to ignore characters
within the Basic Multilingual Plane.
- Version 2.24
- Adds the utility Unirev, a filter which reverses UTF-8 strings.
- Version 2.23
- uniname and unidesc now provide information about the
unofficial ranges within the
Private Use Area registered with the ConScript Unicode Registry.
- Version 2.22
- Adds to uniname the command line option -A, proposed and implemented
by H. Merijn Brand, which causes ASCII whitespace to be skipped.
- Version 2.21
- unidesc and unihist now print detailed error information
on encountering ill-formed UTF-8.
- Version 2.20
- Added to uniname the -a option, which causes it to ignore ASCII characters.
Also fixes bugs introduced into unihist and unidesc caused by bad version of UTF-8
- Version 2.19
- The data has been updated to Unicode version 5.0.
- Version 2.18
- A bug was fixed that incorrectly treated a read interrupted in the middle of
a UTF-8 sequence as a truncated sequence.
- Version 2.17
- The data has been updated to Unicode version 4.1.
- Version 2.16
- Added -l option to print line number.
Also added the shell script utf8lookup, which invokes uniname to
provide an easy way to look up
the character name corresponding to a codepoint from the command line.
In addition to uniname it requires the utility Ascii2binary.
- Version 2.15.1
- Added ifdefs to unihist.c to allow compilation on systems lacking locale.h or
- Version 2.15
- Options were added to unidesc that list the Unicode ranges in numerical order and
alphabetical order respectively.
- Version 2.14
- The code has been thoroughly cleaned up, so that gcc -Wall -pedantic produces
no warnings, and GNU autoconfiguration set up.
- Version 2.13
- uniname can now be asked merely to validate its input as UTF-8.
If the -V flag is given, uniname reads its input and
does nothing unless it encounters invalid UTF-8. If so, it reports its location,
explains why it is invalid, and exits.
- Version 2.12
- A couple of minor errors in the endpoints of Unicode ranges were corrected.
(The errors were in unirange.c and affect uniname and unidesc.)
- Version 2.11
- Added unihist to package.
- Version 2.10
uniname now explains why ill-formed UTF-8 is ill-formed when it encounters it and
(optionally) prints column headers at the top of every screenfull.