From mboxrd@z Thu Jan 1 00:00:00 1970 Date: Tue, 21 Jun 2011 12:56:26 +0200 From: tlaronde@polynum.com To: Fans of the OS Plan 9 from Bell Labs <9fans@9fans.net> Message-ID: <20110621105626.GA536@polynum.com> References: <20110616121700.GA9131@polynum.com> <9556bc097d90b774c37c16af5a7c20eb@brasstown.quanstro.net> <20110619163458.GA424@polynum.com> <3c7e401c771bdd0d9bd8950ceb60eb9e@ladd.quanstro.net> <20110620111845.GA540@polynum.com> <76aac2169637c7af09dcd0b368aa0c7a@ladd.quanstro.net> Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <76aac2169637c7af09dcd0b368aa0c7a@ladd.quanstro.net> User-Agent: Mutt/1.4.2.3i Subject: Re: [9fans] [RFC] fonts and unicode/utf [TeX] Topicbox-Message-UUID: f3263450-ead6-11e9-9d60-3106f5b1d025 On Mon, Jun 20, 2011 at 05:53:25PM -0400, erik quanstrom wrote: > > i'm not sure what the hard part is. just front the normal input > function with one that calls chartorune and rejects anything above > codepoint 255. that can't be more than 10 lines of code. > > that way there is no possibility of latin1 nonsense breaking previously- > functional .tex files, and you don't have to change any assumptions > in the code. (it might be better later on to operate directly on utf-8 > rather than some sort of wide character format like a rune, but that > can't break existing .tex files.) Yes, "casting" to byte can do and this is almost trivial since the input is buffered and handled via libweb (in kerTeX). But this will disallow use of TeX for non ASCII, non latin1... It seems to me better to document, and let user convert his files via tcs(1) to feed TeX. Alternative solution would be to introduce some TEX_ENCODING env variable to let input/output in TeX doing the conversion. But on Plan9 this seems to me simply ugly... to reintroduce by the window what was thrown out by the door... To be noted that at the moment I do not change _anything_ in the TeX code. The "latin1" is just the "encoding" of the fontes derived from the PS core ones (the same can be made with Computer Modern via virtual fonts to allow to the use directly of accented letters). -- Thierry Laronde http://www.kergis.com/ Key fingerprint = 0FF7 E906 FBAF FE95 FD89 250D 52B1 AE95 6006 F40C