Blame


1 a31db67d 2004-04-21 devnull .tr -\(hy
2 a31db67d 2004-04-21 devnull .TL
3 a31db67d 2004-04-21 devnull Hello World
4 a31db67d 2004-04-21 devnull .br
5 a31db67d 2004-04-21 devnull or
6 a31db67d 2004-04-21 devnull .br
7 a31db67d 2004-04-21 devnull Καλημέρα κόσμε
8 a31db67d 2004-04-21 devnull .br
9 a31db67d 2004-04-21 devnull or
10 a31db67d 2004-04-21 devnull .br
11 a31db67d 2004-04-21 devnull こんにちは 世界
12 a31db67d 2004-04-21 devnull .AU
13 a31db67d 2004-04-21 devnull Rob Pike
14 a31db67d 2004-04-21 devnull Ken Thompson
15 a31db67d 2004-04-21 devnull .AI
16 a31db67d 2004-04-21 devnull .MH
17 a31db67d 2004-04-21 devnull .AB
18 a31db67d 2004-04-21 devnull Plan 9 from Bell Labs has recently been converted from ASCII
19 a31db67d 2004-04-21 devnull to an ASCII-compatible variant of Unicode, a 16-bit character set.
20 a31db67d 2004-04-21 devnull In this paper we explain the reasons for the change,
21 a31db67d 2004-04-21 devnull describe the character set and representation we chose,
22 a31db67d 2004-04-21 devnull and present the programming models and software changes
23 a31db67d 2004-04-21 devnull that support the new text format.
24 a31db67d 2004-04-21 devnull Although we stopped short of full internationalization\(emfor
25 a31db67d 2004-04-21 devnull example, system error messages are in Unixese, not Japanese\(emwe
26 a31db67d 2004-04-21 devnull believe Plan 9 is the first system to treat the representation
27 a31db67d 2004-04-21 devnull of all major languages on a uniform, equal footing throughout all its
28 a31db67d 2004-04-21 devnull software.
29 a31db67d 2004-04-21 devnull .AE
30 a31db67d 2004-04-21 devnull .SH
31 a31db67d 2004-04-21 devnull Introduction
32 a31db67d 2004-04-21 devnull .PP
33 a31db67d 2004-04-21 devnull The world is multilingual but most computer systems
34 a31db67d 2004-04-21 devnull are based on English and ASCII or worse.
35 a31db67d 2004-04-21 devnull The pending release of Plan 9 [Pike90], a new distributed operating
36 a31db67d 2004-04-21 devnull system from Bell Laboratories, seemed a good occasion
37 a31db67d 2004-04-21 devnull to correct this chauvinism.
38 a31db67d 2004-04-21 devnull It is easier to make such deep changes when building new systems than
39 a31db67d 2004-04-21 devnull by retrofitting old ones.
40 a31db67d 2004-04-21 devnull .PP
41 a31db67d 2004-04-21 devnull The ANSI C standard [ANSIC] contains some guidance on the matter of
42 a31db67d 2004-04-21 devnull `wide' and `multi-byte' characters but falls far short of
43 a31db67d 2004-04-21 devnull solving the myriad associated problems.
44 a31db67d 2004-04-21 devnull We could find no literature on how to convert a
45 a31db67d 2004-04-21 devnull .I system
46 a31db67d 2004-04-21 devnull to larger character sets, although some individual
47 a31db67d 2004-04-21 devnull .I programs
48 a31db67d 2004-04-21 devnull have been converted.
49 a31db67d 2004-04-21 devnull This paper reports what we discovered as we
50 a31db67d 2004-04-21 devnull explored the problem of representing multilingual