Blob
1 .TH TCS 12 .SH NAME3 tcs \- translate character sets4 .SH SYNOPSIS5 .B tcs6 [7 .B -slcv8 ]9 [10 .B -f11 .I ics12 ]13 [14 .B -t15 .I ocs16 ]17 [18 .I file ...19 ]20 .SH DESCRIPTION21 .I Tcs22 interprets the named23 .I file(s)24 (standard input default) as a stream of characters from the25 .I ics26 character set or format, converts them to runes,27 and then converts them into a stream of characters from the28 .I ocs29 character set or format on the standard output.30 The default value for31 .I ics32 and33 .I ocs34 is35 .BR utf ,36 the37 .SM UTF38 encoding described in39 .IR utf (7).40 The41 .B -l42 option lists the character sets known to43 .IR tcs .44 Processing continues in the face of conversion errors (the45 .B -s46 option prevents reporting of these errors).47 The48 .B -c49 option forces the output to contain only correctly converted characters;50 otherwise,51 .B 0x8052 characters will be substituted for53 .SM UTF54 encoding errors and55 .B 0xFFFD56 characters will substituted for unknown characters.57 .PP58 The59 .B -v60 option generates various diagnostic and summary information on standard error,61 or makes the62 .B -l63 output more verbose.64 .PP65 .I Tcs66 recognizes an ever changing list of character sets.67 In particular, it supports a variety of Russian and Japanese encodings.68 Some of the supported encodings are69 .TF jis-kanji70 .TP71 .B utf72 The Plan 973 .SM UTF74 encoding, known by ISO as UTF-875 .TP76 .B utf177 The deprecated original78 .SM UTF79 encoding from ISO 1064680 .TP81 .B ascii82 7-bit ASCII83 .TP84 .B 8859-185 Latin-1 (Central European)86 .TP87 .B 8859-288 Latin-2 (Czech .. Slovak)89 .TP90 .B 8859-391 Latin-3 (Dutch .. Turkish)92 .TP93 .B 8859-494 Latin-4 (Scandinavian)95 .TP96 .B 8859-597 Part 5 (Cyrillic)98 .TP99 .B 8859-6100 Part 6 (Arabic)101 .TP102 .B 8859-7103 Part 7 (Greek)104 .TP105 .B 8859-8106 Part 8 (Hebrew)107 .TP108 .B 8859-9109 Latin-5 (Finnish .. Portuguese)110 .TP111 .B koi8112 KOI-8 (GOST 19769-74)113 .TP114 .B jis-kanji115 ISO 2022-JP116 .TP117 .B ujis118 EUC-JX: JIS 0208119 .TP120 .B ms-kanji121 Microsoft, or Shift-JIS122 .TP123 .B jis124 (from only) guesses between ISO 2022-JP, EUC or Shift-Jis125 .TP126 .B gb127 Chinese national standard (GB2312-80)128 .TP129 .B big5130 Big 5 (HKU version)131 .TP132 .B unicode133 Unicode Standard 1.0134 .TP135 .B tis136 Thai character set plus137 .SM ASCII138 (TIS 620-1986)139 .TP140 .B msdos141 IBM PC: CP 437142 .TP143 .B atari144 Atari-ST character set145 .SH EXAMPLES146 .TP147 .B tcs -f 8859-1148 Convert 8859-1 (Latin-1) characters into149 .SM UTF150 format.151 .TP152 .B tcs -s -f jis153 Convert characters encoded in one of several shift JIS encodings into154 .SM UTF155 format.156 Unknown Kanji will be converted into157 .B 0xFFFD158 characters.159 .TP160 .B tcs -lv161 Print an up to date list of the supported character sets.162 .SH SOURCE163 .B \*9/src/cmd/tcs164 .SH SEE ALSO165 .IR ascii (1),166 .IR rune (3),167 .IR utf (7).