Blame


1 058b0118 2005-01-03 devnull .TH TCS 1
2 058b0118 2005-01-03 devnull .SH NAME
3 058b0118 2005-01-03 devnull tcs \- translate character sets
4 058b0118 2005-01-03 devnull .SH SYNOPSIS
5 058b0118 2005-01-03 devnull .B tcs
6 058b0118 2005-01-03 devnull [
7 058b0118 2005-01-03 devnull .B -slcv
8 058b0118 2005-01-03 devnull ]
9 058b0118 2005-01-03 devnull [
10 058b0118 2005-01-03 devnull .B -f
11 058b0118 2005-01-03 devnull .I ics
12 058b0118 2005-01-03 devnull ]
13 058b0118 2005-01-03 devnull [
14 058b0118 2005-01-03 devnull .B -t
15 058b0118 2005-01-03 devnull .I ocs
16 058b0118 2005-01-03 devnull ]
17 058b0118 2005-01-03 devnull [
18 058b0118 2005-01-03 devnull .I file ...
19 058b0118 2005-01-03 devnull ]
20 058b0118 2005-01-03 devnull .SH DESCRIPTION
21 058b0118 2005-01-03 devnull .I Tcs
22 058b0118 2005-01-03 devnull interprets the named
23 058b0118 2005-01-03 devnull .I file(s)
24 058b0118 2005-01-03 devnull (standard input by default) as a stream of characters from the
25 058b0118 2005-01-03 devnull .I ics
26 058b0118 2005-01-03 devnull character set or format, converts them to runes,
27 058b0118 2005-01-03 devnull and then converts them into a stream of characters from the
28 058b0118 2005-01-03 devnull .I ocs
29 058b0118 2005-01-03 devnull character set or format on the standard output.
30 058b0118 2005-01-03 devnull The default value for
31 058b0118 2005-01-03 devnull .I ics
32 058b0118 2005-01-03 devnull and
33 058b0118 2005-01-03 devnull .I ocs
34 058b0118 2005-01-03 devnull is
35 058b0118 2005-01-03 devnull .BR utf ,
36 058b0118 2005-01-03 devnull the
37 058b0118 2005-01-03 devnull .SM UTF
38 058b0118 2005-01-03 devnull encoding described in
39 d32deab1 2020-08-16 rsc .MR utf (7) .
40 058b0118 2005-01-03 devnull The
41 058b0118 2005-01-03 devnull .B -l
42 058b0118 2005-01-03 devnull option lists the character sets known to
43 058b0118 2005-01-03 devnull .IR tcs .
44 058b0118 2005-01-03 devnull Processing continues in the face of conversion errors (the
45 058b0118 2005-01-03 devnull .B -s
46 058b0118 2005-01-03 devnull option prevents reporting of these errors).
47 058b0118 2005-01-03 devnull The
48 058b0118 2005-01-03 devnull .B -c
49 058b0118 2005-01-03 devnull option forces the output to contain only correctly converted characters;
50 058b0118 2005-01-03 devnull otherwise,
51 058b0118 2005-01-03 devnull .B 0x80
52 058b0118 2005-01-03 devnull characters will be substituted for
53 058b0118 2005-01-03 devnull .SM UTF
54 058b0118 2005-01-03 devnull encoding errors and
55 058b0118 2005-01-03 devnull .B 0xFFFD
56 058b0118 2005-01-03 devnull characters will be substituted for unknown characters.
57 058b0118 2005-01-03 devnull .PP
58 058b0118 2005-01-03 devnull The
59 058b0118 2005-01-03 devnull .B -v
60 058b0118 2005-01-03 devnull option generates various diagnostic and summary information on standard error,
61 058b0118 2005-01-03 devnull or makes the
62 058b0118 2005-01-03 devnull .B -l
63 058b0118 2005-01-03 devnull output more verbose.
64 058b0118 2005-01-03 devnull .PP
65 058b0118 2005-01-03 devnull .I Tcs
66 058b0118 2005-01-03 devnull recognizes an ever-changing list of character sets.
67 058b0118 2005-01-03 devnull In particular, it supports a variety of Russian and Japanese encodings.
68 058b0118 2005-01-03 devnull Some of the supported encodings are
69 058b0118 2005-01-03 devnull .TF jis-kanji
70 058b0118 2005-01-03 devnull .TP
71 058b0118 2005-01-03 devnull .B utf
72 058b0118 2005-01-03 devnull The Plan 9
73 058b0118 2005-01-03 devnull .SM UTF
74 058b0118 2005-01-03 devnull encoding, known by ISO as UTF-8
75 058b0118 2005-01-03 devnull .TP
76 058b0118 2005-01-03 devnull .B utf1
77 058b0118 2005-01-03 devnull The deprecated original
78 058b0118 2005-01-03 devnull .SM UTF
79 058b0118 2005-01-03 devnull encoding from ISO 10646
80 058b0118 2005-01-03 devnull .TP
81 058b0118 2005-01-03 devnull .B ascii
82 058b0118 2005-01-03 devnull 7-bit ASCII
83 058b0118 2005-01-03 devnull .TP
84 058b0118 2005-01-03 devnull .B 8859-1
85 058b0118 2005-01-03 devnull Latin-1 (Western European)
86 058b0118 2005-01-03 devnull .TP
87 058b0118 2005-01-03 devnull .B 8859-2
88 058b0118 2005-01-03 devnull Latin-2 (Czech .. Slovak)
89 058b0118 2005-01-03 devnull .TP
90 058b0118 2005-01-03 devnull .B 8859-3
91 058b0118 2005-01-03 devnull Latin-3 (Dutch .. Turkish)
92 058b0118 2005-01-03 devnull .TP
93 058b0118 2005-01-03 devnull .B 8859-4
94 058b0118 2005-01-03 devnull Latin-4 (Scandinavian)
95 058b0118 2005-01-03 devnull .TP
96 058b0118 2005-01-03 devnull .B 8859-5
97 058b0118 2005-01-03 devnull Part 5 (Cyrillic)
98 058b0118 2005-01-03 devnull .TP
99 058b0118 2005-01-03 devnull .B 8859-6
100 058b0118 2005-01-03 devnull Part 6 (Arabic)
101 058b0118 2005-01-03 devnull .TP
102 058b0118 2005-01-03 devnull .B 8859-7
103 058b0118 2005-01-03 devnull Part 7 (Greek)
104 058b0118 2005-01-03 devnull .TP
105 058b0118 2005-01-03 devnull .B 8859-8
106 058b0118 2005-01-03 devnull Part 8 (Hebrew)
107 058b0118 2005-01-03 devnull .TP
108 058b0118 2005-01-03 devnull .B 8859-9
109 058b0118 2005-01-03 devnull Latin-5 (Finnish .. Portuguese)
110 058b0118 2005-01-03 devnull .TP
111 058b0118 2005-01-03 devnull .B koi8
112 058b0118 2005-01-03 devnull KOI-8 (GOST 19769-74)
113 058b0118 2005-01-03 devnull .TP
114 058b0118 2005-01-03 devnull .B jis-kanji
115 058b0118 2005-01-03 devnull ISO 2022-JP
116 058b0118 2005-01-03 devnull .TP
117 058b0118 2005-01-03 devnull .B ujis
118 058b0118 2005-01-03 devnull EUC-JX: JIS 0208
119 058b0118 2005-01-03 devnull .TP
120 058b0118 2005-01-03 devnull .B ms-kanji
121 058b0118 2005-01-03 devnull Microsoft, or Shift-JIS
122 058b0118 2005-01-03 devnull .TP
123 058b0118 2005-01-03 devnull .B jis
124 058b0118 2005-01-03 devnull (from only) guesses between ISO 2022-JP, EUC, or Shift-JIS
125 058b0118 2005-01-03 devnull .TP
126 058b0118 2005-01-03 devnull .B gb
127 058b0118 2005-01-03 devnull Chinese national standard (GB2312-80)
128 058b0118 2005-01-03 devnull .TP
129 058b0118 2005-01-03 devnull .B big5
130 058b0118 2005-01-03 devnull Big 5 (HKU version)
131 058b0118 2005-01-03 devnull .TP
132 058b0118 2005-01-03 devnull .B unicode
133 058b0118 2005-01-03 devnull Unicode Standard 1.0
134 058b0118 2005-01-03 devnull .TP
135 058b0118 2005-01-03 devnull .B tis
136 058b0118 2005-01-03 devnull Thai character set plus
137 058b0118 2005-01-03 devnull .SM ASCII
138 058b0118 2005-01-03 devnull (TIS 620-1986)
139 058b0118 2005-01-03 devnull .TP
140 058b0118 2005-01-03 devnull .B msdos
141 058b0118 2005-01-03 devnull IBM PC: CP 437
142 058b0118 2005-01-03 devnull .TP
143 058b0118 2005-01-03 devnull .B atari
144 058b0118 2005-01-03 devnull Atari-ST character set
145 058b0118 2005-01-03 devnull .SH EXAMPLES
146 058b0118 2005-01-03 devnull .TP
147 058b0118 2005-01-03 devnull .B tcs -f 8859-1
148 058b0118 2005-01-03 devnull Convert 8859-1 (Latin-1) characters into
149 058b0118 2005-01-03 devnull .SM UTF
150 058b0118 2005-01-03 devnull format.
151 058b0118 2005-01-03 devnull .TP
152 058b0118 2005-01-03 devnull .B tcs -s -f jis
153 058b0118 2005-01-03 devnull Convert characters encoded in one of several JIS encodings into
154 058b0118 2005-01-03 devnull .SM UTF
155 058b0118 2005-01-03 devnull format.
156 058b0118 2005-01-03 devnull Unknown Kanji will be converted into
157 058b0118 2005-01-03 devnull .B 0xFFFD
158 058b0118 2005-01-03 devnull characters.
159 058b0118 2005-01-03 devnull .TP
160 058b0118 2005-01-03 devnull .B tcs -lv
161 058b0118 2005-01-03 devnull Print an up-to-date list of the supported character sets.
162 058b0118 2005-01-03 devnull .SH SOURCE
163 c3674de4 2005-01-11 devnull .B \*9/src/cmd/tcs
164 058b0118 2005-01-03 devnull .SH SEE ALSO
165 058b0118 2005-01-03 devnull .IR ascii (1),
166 058b0118 2005-01-03 devnull .IR rune (3),
167 d32deab1 2020-08-16 rsc .MR utf (7) .