From mboxrd@z Thu Jan  1 00:00:00 1970
Message-Id: <4E085DBF.94AB.00CC.0@wlu.ca>
Date: Mon, 27 Jun 2011 10:38:55 -0400
From: "Karljurgen Feuerherm" <kfeuerherm@wlu.ca>
To: <9fans@9fans.net>
References: <20110621105626.GA536@polynum.com>
	<iu357j$o3k$1@dough.gmane.org> <20110625065017.GA638@polynum.com>
	<522e1e2a38aa18c291305563d362abfe@ladd.quanstro.net>
	<20110625150327.GA425@polynum.com> <iu52m9$a54$1@dough.gmane.org>
	<20110625171134.GA3661@polynum.com>
	<BANLkTikoagmZ41qpH8Zqf5xw_btH1iP7Vg@mail.gmail.com>
	<20110626075745.GA395@polynum.com>
	<BANLkTi=WQCj2vL0j=G4FW08FDy_KrYpDMQ@mail.gmail.com>
	<20110627114856.GA7099@polynum.com>
	<9308c52f360f6274e0730399741278ce@ladd.quanstro.net>
In-Reply-To: <9308c52f360f6274e0730399741278ce@ladd.quanstro.net>
Mime-Version: 1.0
Content-Type: multipart/alternative; boundary="=__Part547B1FEF.0__="
Subject: Re: [9fans] [RFC] fonts and unicode/utf [TeX]
Topicbox-Message-UUID: f6e2b24e-ead6-11e9-9d60-3106f5b1d025

--=__Part547B1FEF.0__=
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: quoted-printable

Thanks for bringing up Sumerian (better: Sumero-Akkadian Cuneiform). I
was thinking along exactly those lines. For me at least, solutions that
satisfy 'the majority' are no solutions at all. And obviously, I'm not
alone.

(Though it could well be that I missed the intent of Thierry's comment
and am barking up the wrong tree.)

K

>>> erik quanstrom <quanstro@quanstro.net> 06/27/11 8:36 AM >>>
> But I don't want to have the obligation to "know" 65536 signs to
> express what I want to express. I'm sorry, but I think that the
> vast majority (remember that for latin1/latin2 accented letters
> are just variants so need less "user memory" than plain different
> characters) can do with blocks of (fewer than) 256 signs, and switch
> fonts when "speaking" about special things (the switch can be
> automatic by the way). As far as TeX is concerned, all the control
> codepoints (positions) are useless in the fonts. There is still
> available room even for the latin1 encoded tfm built for (next)
> kerTeX from PostScript core.

there are currently 0x10ffff+1 codepoints (1114112), not 65536,
but only 23669 + the large chinese blocks are currently defined.
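
As a quick check of that arithmetic, here is a minimal Python sketch
(the constant and function names are illustrative, not from this
thread). It assumes the usual layout of the codespace as 17 planes of
0x10000 codepoints each, and compares that with sys.maxunicode, the
highest codepoint Python accepts.

```python
import sys

PLANE_SIZE = 0x10000   # codepoints per plane (65536)
PLANE_COUNT = 17       # plane 0 (the BMP) plus 16 supplementary planes


def codespace_size():
    """Number of codepoints in the range [0, 0x10ffff]."""
    return PLANE_SIZE * PLANE_COUNT


print(codespace_size())       # size of the whole codespace
print(sys.maxunicode + 1)     # the same count, as Python sees it
```

So 65536 is the size of a single plane, not of the whole codespace.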

but anyway, i think you are missing the point.  every one of those
codepoints is used, or was used in human written communication.
the fact that you or i probably don't know them all is beside the
point entirely.

there are 600000 words in the oxford english dictionary.  i don't
know them all.  let's suppose i had the power to eliminate all
the ones that i don't know.  wouldn't that be a horrible idea?
then i would not be able to learn any new words.  odious.

so with unicode.  if you strip out all the languages you don't know
by restricting yourself to the latin1 codepoints [0, 256), then you
can't easily add, say, greek or sumerian codepoints should you or
anyone else need them.

since, as you can see, there is a 1:1 identity mapping between latin1
and unicode codepoints [0, 256), i don't see why one wouldn't
give oneself the option to increase this subset to cover more ground.
i use alphas, arrows, math symbols, etc. quite often in code.  and
even more often when i used to use tex.  it's really quite a drag to
read \alpha instead of “α.”
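
The identity mapping can be checked directly. The following is only a
sketch in Python, with made-up helper names (latin1_matches_unicode,
widen); it relies on nothing beyond the built-in "latin-1" codec, and
shows that latin1 bytes widen to unicode without any lookup table, so
text that starts out as latin1 can later take a greek letter.

```python
def latin1_matches_unicode():
    """True if every latin1 byte decodes to the codepoint of the same number."""
    return all(ord(bytes([b]).decode("latin-1")) == b for b in range(256))


def widen(latin1_bytes):
    """Turn latin1 bytes into a unicode string; no lookup table is needed."""
    return "".join(chr(b) for b in latin1_bytes)


# latin1 text first, then a greek alpha (U+03B1) appended to it.
text = widen(b"caf\xe9") + " \u03b1"
print(latin1_matches_unicode(), text)
```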

> Does a whole Unicode "Times-Roman" font make sense? Ideograms in
> "Times-Roman"?

i get confused on terms.  i think the right term is typeface.
extended font collections of a given typeface covering very
wide sections of unicode do exist and are sold by the major
font vendors.

i don't think that it's too hard to imagine that one can make
most symbols look compatible enough.  in fact, i'm using a font
with ~32000 glyphs on my plan 9 terminal right now.

and there's no penalty for having that many glyphs.  it just
means that my font file has a couple hundred subfonts.  these
are only opened if needed.  typically only 3 subfonts are open
at any one time.
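
To illustrate the idea of opening subfonts only on demand, here is a
rough Python sketch. It is not the Plan 9 draw library code: the
LazyFont class, the loader function, and the subfont names and ranges
are all assumptions made up for this example. It only shows a font
that maps codepoint ranges to subfonts and loads one the first time a
codepoint in its range is asked for.

```python
class LazyFont:
    """Maps codepoint ranges to subfont names; loads a subfont on first use."""

    def __init__(self, ranges, load):
        self.ranges = sorted(ranges)   # (lo, hi, name) tuples, hi inclusive
        self.load = load               # hypothetical loader: name -> subfont data
        self.open_subfonts = {}        # name -> data, only for subfonts used so far

    def subfont_for(self, codepoint):
        """Return the subfont covering codepoint, loading it if necessary."""
        for lo, hi, name in self.ranges:
            if lo <= codepoint <= hi:
                if name not in self.open_subfonts:
                    self.open_subfonts[name] = self.load(name)
                return self.open_subfonts[name]
        return None  # no subfont covers this codepoint


# Made-up ranges and names, for illustration only.
font = LazyFont(
    [
        (0x0000, 0x00FF, "latin1"),
        (0x0370, 0x03FF, "greek"),
        (0x12000, 0x123FF, "cuneiform"),
    ],
    load=lambda name: "<glyphs for %s>" % name,
)

for ch in "tex \u03b1":
    font.subfont_for(ord(ch))

# Only the subfonts the text actually touched have been loaded.
print(sorted(font.open_subfonts))
```

In this sketch the cuneiform subfont stays unloaded until some text
actually uses a codepoint in its range, so declaring it costs only one
entry in the range list.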

- erik



--=__Part547B1FEF.0__=--