3 058b0118 2005-01-03 devnull tcs \- translate character sets
4 058b0118 2005-01-03 devnull .SH SYNOPSIS
18 058b0118 2005-01-03 devnull .I file ...
20 058b0118 2005-01-03 devnull .SH DESCRIPTION
22 058b0118 2005-01-03 devnull interprets the named
23 058b0118 2005-01-03 devnull .I file(s)
24 058b0118 2005-01-03 devnull (standard input default) as a stream of characters from the
26 058b0118 2005-01-03 devnull character set or format, converts them to runes,
27 058b0118 2005-01-03 devnull and then converts them into a stream of characters from the
29 058b0118 2005-01-03 devnull character set or format on the standard output.
30 058b0118 2005-01-03 devnull The default value for
35 058b0118 2005-01-03 devnull .BR utf ,
38 058b0118 2005-01-03 devnull encoding described in
39 058b0118 2005-01-03 devnull .IR utf (7).
42 058b0118 2005-01-03 devnull option lists the character sets known to
43 058b0118 2005-01-03 devnull .IR tcs .
44 058b0118 2005-01-03 devnull Processing continues in the face of conversion errors (the
46 058b0118 2005-01-03 devnull option prevents reporting of these errors).
49 058b0118 2005-01-03 devnull option forces the output to contain only correctly converted characters;
50 058b0118 2005-01-03 devnull otherwise,
52 058b0118 2005-01-03 devnull characters will be substituted for
54 058b0118 2005-01-03 devnull encoding errors and
55 058b0118 2005-01-03 devnull .B 0xFFFD
56 058b0118 2005-01-03 devnull characters will be substituted for unknown characters.
60 058b0118 2005-01-03 devnull option generates various diagnostic and summary information on standard error,
61 058b0118 2005-01-03 devnull or makes the
63 058b0118 2005-01-03 devnull output more verbose.
66 058b0118 2005-01-03 devnull recognizes an ever-changing list of character sets.
67 058b0118 2005-01-03 devnull In particular, it supports a variety of Russian and Japanese encodings.
68 058b0118 2005-01-03 devnull Some of the supported encodings are
69 058b0118 2005-01-03 devnull .TF jis-kanji
72 058b0118 2005-01-03 devnull The Plan 9
74 058b0118 2005-01-03 devnull encoding, known by ISO as UTF-8
77 058b0118 2005-01-03 devnull The deprecated original
79 058b0118 2005-01-03 devnull encoding from ISO 10646
82 058b0118 2005-01-03 devnull 7-bit ASCII
84 058b0118 2005-01-03 devnull .B 8859-1
85 058b0118 2005-01-03 devnull Latin-1 (Western European)
87 058b0118 2005-01-03 devnull .B 8859-2
88 058b0118 2005-01-03 devnull Latin-2 (Czech .. Slovak)
90 058b0118 2005-01-03 devnull .B 8859-3
91 058b0118 2005-01-03 devnull Latin-3 (Dutch .. Turkish)
93 058b0118 2005-01-03 devnull .B 8859-4
94 058b0118 2005-01-03 devnull Latin-4 (Scandinavian)
96 058b0118 2005-01-03 devnull .B 8859-5
97 058b0118 2005-01-03 devnull Part 5 (Cyrillic)
99 058b0118 2005-01-03 devnull .B 8859-6
100 058b0118 2005-01-03 devnull Part 6 (Arabic)
102 058b0118 2005-01-03 devnull .B 8859-7
103 058b0118 2005-01-03 devnull Part 7 (Greek)
105 058b0118 2005-01-03 devnull .B 8859-8
106 058b0118 2005-01-03 devnull Part 8 (Hebrew)
108 058b0118 2005-01-03 devnull .B 8859-9
109 058b0118 2005-01-03 devnull Latin-5 (Finnish .. Portuguese)
112 058b0118 2005-01-03 devnull KOI-8 (GOST 19769-74)
114 058b0118 2005-01-03 devnull .B jis-kanji
115 058b0118 2005-01-03 devnull ISO 2022-JP
118 058b0118 2005-01-03 devnull EUC-JX: JIS 0208
120 058b0118 2005-01-03 devnull .B ms-kanji
121 058b0118 2005-01-03 devnull Microsoft, or Shift-JIS
124 058b0118 2005-01-03 devnull (from only) guesses among ISO 2022-JP, EUC, or Shift-JIS
127 058b0118 2005-01-03 devnull Chinese national standard (GB2312-80)
130 058b0118 2005-01-03 devnull Big 5 (HKU version)
132 058b0118 2005-01-03 devnull .B unicode
133 058b0118 2005-01-03 devnull Unicode Standard 1.0
136 058b0118 2005-01-03 devnull Thai character set plus
137 058b0118 2005-01-03 devnull .SM ASCII
138 058b0118 2005-01-03 devnull (TIS 620-1986)
140 058b0118 2005-01-03 devnull .B msdos
141 058b0118 2005-01-03 devnull IBM PC: CP 437
143 058b0118 2005-01-03 devnull .B atari
144 058b0118 2005-01-03 devnull Atari-ST character set
145 058b0118 2005-01-03 devnull .SH EXAMPLES
147 058b0118 2005-01-03 devnull .B tcs -f 8859-1
148 058b0118 2005-01-03 devnull Convert 8859-1 (Latin-1) characters into
152 058b0118 2005-01-03 devnull .B tcs -s -f jis
153 058b0118 2005-01-03 devnull Convert characters encoded in one of several shift JIS encodings into
156 058b0118 2005-01-03 devnull Unknown Kanji will be converted into
157 058b0118 2005-01-03 devnull .B 0xFFFD
158 058b0118 2005-01-03 devnull characters.
160 058b0118 2005-01-03 devnull .B tcs -lv
161 058b0118 2005-01-03 devnull Print an up-to-date list of the supported character sets.
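The Latin-1 example above can be checked from a shell. The following is only a sketch: it assumes tcs is installed and on the search path, and the function name latin1_to_utf is hypothetical, introduced here for illustration. It feeds the single Latin-1 byte 0xE9 (e-acute) through the conversion and prints the resulting bytes in hexadecimal; if tcs is not available it prints "skipped" instead.

```shell
#!/bin/sh
# Hypothetical helper (not part of tcs): convert Latin-1 bytes on
# standard input to the default output encoding, utf.
latin1_to_utf() {
    tcs -f 8859-1
}

if command -v tcs >/dev/null 2>&1; then
    # Octal 351 is byte 0xE9, e-acute in Latin-1.
    # od prints the converted bytes in hex; tr strips spaces and newlines.
    result=$(printf '\351' | latin1_to_utf | od -An -tx1 | tr -d ' \n')
else
    # Assumption: tcs may be absent on this system.
    result=skipped
fi
echo "$result"
```

With a working tcs, the expected value is c3a9, the two-byte UTF-8 form of U+00E9.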
162 058b0118 2005-01-03 devnull .SH SOURCE
163 c3674de4 2005-01-11 devnull .B \*9/src/cmd/tcs
164 058b0118 2005-01-03 devnull .SH SEE ALSO
165 058b0118 2005-01-03 devnull .IR ascii (1),
166 058b0118 2005-01-03 devnull .IR rune (3),
167 058b0118 2005-01-03 devnull .IR utf (7).