This section deals with character set issues that pertain to both the MARC-8 and Unicode environments.  Throughout, references to Unicode imply its UTF-8 encoding form, the only one approved for use in MARC 21.  A reference to a particular Unicode character is expressed in the conventional way, as a hexadecimal number identifying the code point, not as a representation of the individual octets in the UTF-8 transformation.  See Part 3 for a discussion of UTF-8.
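
The distinction drawn above between a code point and its UTF-8 octets can be illustrated with a minimal sketch.  The function name describe is hypothetical and is not part of MARC 21 or of any library; the sketch relies only on Python's built-in ord() and str.encode(), and the sample character (LATIN SMALL LETTER E WITH ACUTE) is chosen purely for illustration.

```python
def describe(ch: str) -> tuple[str, str]:
    """Return (code point notation, UTF-8 octets in hex) for one character.

    Hypothetical helper for illustration only.
    """
    # Conventional notation: "U+" followed by at least four hex digits.
    code_point = f"U+{ord(ch):04X}"
    # The UTF-8 transformation of the same character, octet by octet.
    octets = " ".join(f"{b:02X}" for b in ch.encode("utf-8"))
    return code_point, octets


# One code point, two octets in UTF-8.
print(describe("\u00e9"))  # ('U+00E9', 'C3 A9')
# For basic Latin letters the code point and the single octet coincide.
print(describe("A"))       # ('U+0041', '41')
```

A reference in this text therefore corresponds to the first element of each pair (the code point), never to the octet sequence in the second.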
