Sablotron Guide and Reference

Encodings .Implementation Notes

Summary

Sablotron supports a number of input/output encodings and charsets; either internally or through the iconv library.

Description

Sablotron handles encoding conversions with the help of iconv library. As a standard part of glibc, iconv is automatically available on many Unix-based systems (including Linux). It's available for most other platforms where Sablotron runs (Windows, Solaris, MacOS X, FreeBSD, etc). See .Dependencies for more details on iconv.

With iconv installed on your system, you can use any encoding it supports (that is, almost any encoding whatsoever) for both the input and the output documents.

If iconv is not available, encodings may still be supported internally by Sablotron. At present, the list is of such encodings is rather short: UTF-8, UTF-16, ASCII, iso-8859-1, iso-8859-2 and windows-1250 on input, UTF-8 only on output.

Lastly, a user has the option to implement a custom encoding conversion handler, which will be asked to perform any unsupported conversion. See .Handlers for details.

The default input and output encoding is in all cases UTF-8.

See Also

Groups [ .Implementation Notes ]
Entries [ .Dependencies | .Handlers ]

© 2002-2003 Ginger Alliance
revision 03-06-30
This page was generated by APIDOC