[Top] [Contents] [Index] [ ? ]

The recode reference manual

This recoding library converts files between various coded character sets and surface encodings. When this cannot be achieved exactly, it may get rid of the offending characters or fall back on approximations. The library recognises or produces more than 300 different character sets and is able to convert files between almost any pair. Most RFC 1345 character sets, and all libiconv character sets, are supported. The recode program is a handy front-end to the library.

The current recode release is 3.6.

1. Quick Tutorial  
2. Terminology and purpose  
3. How to use this program  
4. A recoding library  
5. The universal charset  
6. The iconv library  
7. Tabular sources (RFC 1345)  
8. ASCII and some derivatives  
9. Some IBM or Microsoft charsets  
10. Charsets for CDC machines  
11. Other micro-computer charsets  
12. Various other charsets  
13. All about surfaces  
14. Internal aspects  
Concept Index  
Option Index  
Library Index  
Charset and Surface Index  
-- The Detailed Node Listing ---
Terminology and purpose
2.1 Overview of charsets  
2.2 Overview of surfaces  
2.3 Contributions and bug reports  
How to use this program
3.1 Synopsis of recode call  
3.2 The request parameter  
3.3 Asking for various lists  
3.4 Controlling how files are recoded  
3.5 Reversibility issues  
3.6 Selecting sequencing methods  
3.7 Using mixed charset input  
3.8 Using recode within Emacs  
3.9 Debugging considerations  
A recoding library
4.1 Outer level functions  
4.2 Request level functions  
4.3 Task level functions  
4.4 Charset level functions  
4.5 Handling errors  
The universal charset
5.1 Universal Character Set, 2 bytes  
5.2 Universal Character Set, 4 bytes  
5.3 Universal Transformation Format, 7 bits  
5.4 Universal Transformation Format, 8 bits  
5.5 Universal Transformation Format, 16 bits  
5.6 Frequency count of characters  
5.7 Fully interpreted UCS dump  
ASCII and some derivatives
8.1 Usual ASCII  
8.2 ASCII extended by Latin Alphabets  
8.3 ASCII 7-bits, BS to overstrike  
8.4 ASCII without diacritics nor underline  
Some IBM or Microsoft charsets
9.1 EBCDIC code  EBCDIC codes
9.2 IBM's PC code  
9.3 Unisys' Icon code  
Charsets for CDC machines
10.1 Control Data's Display Code  
10.2 ASCII 6/12 from NOS  
10.3 ASCII "bang bang"  
Other micro-computer charsets
11.1 Apple's Macintosh code  
11.2 Atari ST code  
Various other charsets
12.1 World Wide Web representations  
12.2 LaTeX macro calls  
12.3 GNU project documentation files  
12.4 Vietnamese charsets  
12.5 African charsets  
12.6 Cyrillic and other charsets  
12.7 Easy French conventions  
12.8 Mule as a multiplexed charset  
All about surfaces
13.1 Permuting groups of bytes  
13.2 Representation for end of lines  
13.3 MIME contents encodings  
13.4 Interpreted character dumps  
13.5 Artificial data for testing  
Internal aspects
14.1 Overall organisation  
14.2 Adding new charsets  
14.3 Adding new surfaces  
14.4 Comments on the library design  



This document was generated by root on June, 20 2004 using texi2html