* mandoc and UTF-8 support.
@ 2011-05-11 20:42 Kristaps Dzonsons
[not found] ` <20110515.194157.759326979137963596.hrs@ec.ss.titech.ac.jp>
0 siblings, 1 reply; 2+ messages in thread
From: Kristaps Dzonsons @ 2011-05-11 20:42 UTC (permalink / raw)
To: hrs, discuss
[-- Attachment #1: Type: text/plain, Size: 877 bytes --]
Hiroki (note CC'd to the mdocml mailing list),
This regards today's conversation about mdocml and Japanese characters
at the FreeBSD summit.
Enclosed is a screenshot of a locally-modified mandoc producing
on-terminal UTF-8 glyphs for kanji (I have NO IDEA what these characters
mean, I just picked them from the Unicode reference). I hacked this in
as a demonstrandum that it's possible to have UTF-8 output without much
effort.
I used the groff \U'xxxx' input escape sequence to specify Unicode
input. Unfortunately, this doesn't seem to be officially supported by
groff.
http://lists.gnu.org/archive/html/groff/2000-04/msg00036.html
My question is this: do you know of the most reliable to feed groff
Unicode codepoints? I'm not sure when a -Tutf8 will exist for mandoc,
but the screenshot demonstrates that it's in principle possible.
Thanks,
Kristaps
[-- Attachment #2: screen.png --]
[-- Type: image/png, Size: 24546 bytes --]
^ permalink raw reply [flat|nested] 2+ messages in thread
* Re: mandoc and UTF-8 support.
[not found] ` <20110515.194157.759326979137963596.hrs@ec.ss.titech.ac.jp>
@ 2011-05-15 15:31 ` Kristaps Dzonsons
0 siblings, 0 replies; 2+ messages in thread
From: Kristaps Dzonsons @ 2011-05-15 15:31 UTC (permalink / raw)
To: Hiroki Sato; +Cc: discuss
> kr> Hiroki (note CC'd to the mdocml mailing list),
> kr>
> kr> This regards today's conversation about mdocml and Japanese characters
> kr> at the FreeBSD summit.
> kr>
> kr> Enclosed is a screenshot of a locally-modified mandoc producing
> kr> on-terminal UTF-8 glyphs for kanji (I have NO IDEA what these
> kr> characters mean, I just picked them from the Unicode reference). I
> kr> hacked this in as a demonstrandum that it's possible to have UTF-8
> kr> output without much effort.
>
> Great, it seems to work.
>
> kr> I used the groff \U'xxxx' input escape sequence to specify Unicode
> kr> input. Unfortunately, this doesn't seem to be officially supported by
> kr> groff.
> kr>
> kr> http://lists.gnu.org/archive/html/groff/2000-04/msg00036.html
> kr>
> kr> My question is this: do you know of the most reliable to feed groff
> kr> Unicode codepoints? I'm not sure when a -Tutf8 will exist for mandoc,
> kr> but the screenshot demonstrates that it's in principle possible.
>
> I tried UTF-8 characters directly only but it was a long time ago so
> I don't remember the details. I will check the stock version of
> groff again and give mandoc a try, then contact a manual page
> maintainer in FreeBSD project about how he feels migration from groff
> to mandoc, and get back to you.
>
> And, what do I do to subscribe this mailing-list? I could not find
> information on in at bsd.lv page.
Hiroki,
Send mail to discuss+subscribe@mdocml.bsd.lv to subscribe.
Regarding groff, their underlying Unicode input method is detailed here:
http://mdocml.bsd.lv/archives/tech/0368.html
groff also has support for arbitrary encodings by piping the output of
preconv (a groff preprocessor translating multi-byte characters, like
UTF8, into the groff escapes):
http://manpages.ubuntu.com/manpages/maverick/man1/preconv.1.html
This part I'll consider later.
I'd like to have wide-character support by the next version of mandoc;
however, I'll post versions here for testing when they're available.
Thanks again,
Kristaps
--
To unsubscribe send an email to discuss+unsubscribe@mdocml.bsd.lv
^ permalink raw reply [flat|nested] 2+ messages in thread
end of thread, other threads:[~2011-05-15 15:31 UTC | newest]
Thread overview: 2+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2011-05-11 20:42 mandoc and UTF-8 support Kristaps Dzonsons
[not found] ` <20110515.194157.759326979137963596.hrs@ec.ss.titech.ac.jp>
2011-05-15 15:31 ` Kristaps Dzonsons
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).