The Unix Heritage Society mailing list
 help / color / mirror / Atom feed
* [TUHS] Earliest Non-English UNIX?
@ 2026-01-27 23:46 segaloco via TUHS
  2026-01-28  0:11 ` [TUHS] " Alan Coopersmith via TUHS
                   ` (2 more replies)
  0 siblings, 3 replies; 4+ messages in thread
From: segaloco via TUHS @ 2026-01-27 23:46 UTC (permalink / raw)
  To: The Eunuchs Hysterical Society

I recently got a Japanese copy of the SVR3 Programmer's Reference Manual as a reference on learning technical translation between Japanese and English.  It got me wondering on the subject of Japanese UNIX again, and I recall some prior discussion about codepages and eventual UTF-8 use for this and other regional porting efforts, but UNIX use in non-English settings certainly predates UTF-8.  From what I know of the Japanese situation, at least AT&T, Sony, and Sun were prominent players in the Japanese UNIX market early on.

This has me curious more generally, are there good records on how UNIX became cosmopolitan, what non-English (and especially non-Latin) languages it was first translated into, and how involved, if at all, Bell Labs and UCB (as opposed to post divestiture ATTIS/USL) were in early efforts to localize UNIX outside of the anglophone world.

Possibly related, the typo(I) page from V3 makes a quip about disabling the "English" mode to instead target other languages such as Urdu.  Urdu is written in an Arabic-derived script rather than Latin.  Was this just a "hey this is possible" quip or was that to suggest there was experience already by V3 of playing with non-Latin scripts in UNIX?

- Matt G.

^ permalink raw reply	[flat|nested] 4+ messages in thread

* [TUHS] Re: Earliest Non-English UNIX?
  2026-01-27 23:46 [TUHS] Earliest Non-English UNIX? segaloco via TUHS
@ 2026-01-28  0:11 ` Alan Coopersmith via TUHS
  2026-01-28 10:19 ` Edouard Klein via TUHS
  2026-01-28 15:07 ` Martin via TUHS
  2 siblings, 0 replies; 4+ messages in thread
From: Alan Coopersmith via TUHS @ 2026-01-28  0:11 UTC (permalink / raw)
  To: tuhs

On 1/27/26 15:46, segaloco via TUHS wrote:
> but UNIX use in non-English settings certainly predates UTF-8

For SunOS/Solaris, before Unicode, it was primarily ISO-8859-* for the
European languages, and Extended Unix Code (EUC) for the CJK set.

See https://en.wikipedia.org/wiki/Extended_Unix_Code and the EUC section
of _Creating Worldwide Software: Solaris International Developer's Guide_
(available at https://archive.org/details/creatingworldwid0000tuth )
for details on EUC.

While we've deprecated those locales now in favor of the UTF-8 locales
in Solaris 11.0 and later, they're still around, and the character sets
supported in things like iconv to allow translating documents, and we
know there are still some customers using the EUC locales today.

-- 
         -Alan Coopersmith-                 alan.coopersmith@oracle.com
          Oracle Solaris Engineering - https://blogs.oracle.com/solaris

^ permalink raw reply	[flat|nested] 4+ messages in thread

* [TUHS] Re: Earliest Non-English UNIX?
  2026-01-27 23:46 [TUHS] Earliest Non-English UNIX? segaloco via TUHS
  2026-01-28  0:11 ` [TUHS] " Alan Coopersmith via TUHS
@ 2026-01-28 10:19 ` Edouard Klein via TUHS
  2026-01-28 15:07 ` Martin via TUHS
  2 siblings, 0 replies; 4+ messages in thread
From: Edouard Klein via TUHS @ 2026-01-28 10:19 UTC (permalink / raw)
  To: segaloco via TUHS

For reference the earlier thread was

https://www.tuhs.org/pipermail/tuhs/2023-March/028243.html
"[TUHS] Bell Foreign-Language UNIX Efforts"

in which I posted a history of France's Bull efforts to internationalize
UNIX in French, and also in Russian (!) for siberian gas fields:

https://www.tuhs.org/pipermail/tuhs/2023-March/028248.html

But that was quite a bit later (late 80s, early 90s) than what Matt is
asking here.

The whole thread is worth a read. Looking forward to reading this one as
well to see if more comes up.

Cheers,

Edouard.

segaloco via TUHS <tuhs@tuhs.org> writes:

> I recently got a Japanese copy of the SVR3 Programmer's Reference Manual as a
> reference on learning technical translation between Japanese and English. It got
> me wondering on the subject of Japanese UNIX again, and I recall some prior
> discussion about codepages and eventual UTF-8 use for this and other regional
> porting efforts, but UNIX use in non-English settings certainly predates UTF-8.
> From what I know of the Japanese situation, at least AT&T, Sony, and Sun were
> prominent players in the Japanese UNIX market early on.
>
> This has me curious more generally, are there good records on how UNIX became
> cosmopolitan, what non-English (and especially non-Latin) languages it was first
> translated into, and how involved, if at all, Bell Labs and UCB (as opposed to
> post divestiture ATTIS/USL) were in early efforts to localize UNIX outside of
> the anglophone world.
>
> Possibly related, the typo(I) page from V3 makes a quip about disabling the
> "English" mode to instead target other languages such as Urdu. Urdu is written
> in an Arabic-derived script rather than Latin. Was this just a "hey this is
> possible" quip or was that to suggest there was experience already by V3 of
> playing with non-Latin scripts in UNIX?
>
> - Matt G.

^ permalink raw reply	[flat|nested] 4+ messages in thread

* [TUHS] Re: Earliest Non-English UNIX?
  2026-01-27 23:46 [TUHS] Earliest Non-English UNIX? segaloco via TUHS
  2026-01-28  0:11 ` [TUHS] " Alan Coopersmith via TUHS
  2026-01-28 10:19 ` Edouard Klein via TUHS
@ 2026-01-28 15:07 ` Martin via TUHS
  2 siblings, 0 replies; 4+ messages in thread
From: Martin via TUHS @ 2026-01-28 15:07 UTC (permalink / raw)
  To: tuhs

Hello,
I would like to mention here not a UNIX per-se but heavily inspired by 
it, polish Operating System called CROOK written entirely in 
K-202/MERA-400 assembly language during 1970s and 1980s in Gdansk 
University of Technology: https://mera400.pl/Galeria
CROOK design was dictated by the same universal needs (multitasking, 
multiuser support, a hierarchical file system, etc) that shaped UNIX. It 
was build to work with both older peripherals (like tape readers) and 
newer devices produced for the MERA-400 by companies like Amepol. 
Development even influenced the design of the MX-16 processor upgrade 
for the MERA-400.

A major digital archaeology project successfully recovered CROOK and 
other software from original magnetic tapes, i.e:
https://hackaday.com/2017/03/03/raiders-of-the-lost-os-reclaiming-a-piece-of-polish-it-history/#more-245582 

Team led by Jakub Filipowicz ( https://github.com/jakubfi ) preserved 
this key piece of Polish IT history and actively documenting now all the 
archaeology efforts related with hardware/software restoration of MERA400:
https://www.youtube.com/@MERA400
The recovered disk images of CROOK-5 are now available for use in the 
EM400 emulator, i.e.:
https://mera400.pl/CROOK-5_-_obraz_dysku

Some basic specs of the machine:
- 16-bit asynchronous CPU, up to 1 million operations per second
- 48-bit hardware floating point
- 121 instructions
- 1Mword (2MB) total address space
- memory segmentation and paging: dedicated memory block for OS + 15 
blocks for user programs
- 32 interrupts (maskable, 10 priority levels)


Kind Regards!
Martin

^ permalink raw reply	[flat|nested] 4+ messages in thread

end of thread, other threads:[~2026-01-28 15:07 UTC | newest]

Thread overview: 4+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2026-01-27 23:46 [TUHS] Earliest Non-English UNIX? segaloco via TUHS
2026-01-28  0:11 ` [TUHS] " Alan Coopersmith via TUHS
2026-01-28 10:19 ` Edouard Klein via TUHS
2026-01-28 15:07 ` Martin via TUHS

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).