discuss@mandoc.bsd.lv
 help / color / mirror / Atom feed
* mandoc and UTF-8 support.
@ 2011-05-11 20:42 Kristaps Dzonsons
       [not found] ` <20110515.194157.759326979137963596.hrs@ec.ss.titech.ac.jp>
  0 siblings, 1 reply; 2+ messages in thread
From: Kristaps Dzonsons @ 2011-05-11 20:42 UTC (permalink / raw)
  To: hrs, discuss

[-- Attachment #1: Type: text/plain, Size: 877 bytes --]

Hiroki (note CC'd to the mdocml mailing list),

This regards today's conversation about mdocml and Japanese characters 
at the FreeBSD summit.

Enclosed is a screenshot of a locally-modified mandoc producing 
on-terminal UTF-8 glyphs for kanji (I have NO IDEA what these characters 
mean, I just picked them from the Unicode reference).  I hacked this in 
as a demonstrandum that it's possible to have UTF-8 output without much 
effort.

I used the groff \U'xxxx' input escape sequence to specify Unicode 
input.  Unfortunately, this doesn't seem to be officially supported by 
groff.

   http://lists.gnu.org/archive/html/groff/2000-04/msg00036.html

My question is this: do you know of the most reliable to feed groff 
Unicode codepoints?  I'm not sure when a -Tutf8 will exist for mandoc, 
but the screenshot demonstrates that it's in principle possible.

Thanks,

Kristaps

[-- Attachment #2: screen.png --]
[-- Type: image/png, Size: 24546 bytes --]

^ permalink raw reply	[flat|nested] 2+ messages in thread

* Re: mandoc and UTF-8 support.
       [not found] ` <20110515.194157.759326979137963596.hrs@ec.ss.titech.ac.jp>
@ 2011-05-15 15:31   ` Kristaps Dzonsons
  0 siblings, 0 replies; 2+ messages in thread
From: Kristaps Dzonsons @ 2011-05-15 15:31 UTC (permalink / raw)
  To: Hiroki Sato; +Cc: discuss

> kr>  Hiroki (note CC'd to the mdocml mailing list),
> kr>
> kr>  This regards today's conversation about mdocml and Japanese characters
> kr>  at the FreeBSD summit.
> kr>
> kr>  Enclosed is a screenshot of a locally-modified mandoc producing
> kr>  on-terminal UTF-8 glyphs for kanji (I have NO IDEA what these
> kr>  characters mean, I just picked them from the Unicode reference).  I
> kr>  hacked this in as a demonstrandum that it's possible to have UTF-8
> kr>  output without much effort.
>
>   Great, it seems to work.
>
> kr>  I used the groff \U'xxxx' input escape sequence to specify Unicode
> kr>  input.  Unfortunately, this doesn't seem to be officially supported by
> kr>  groff.
> kr>
> kr>    http://lists.gnu.org/archive/html/groff/2000-04/msg00036.html
> kr>
> kr>  My question is this: do you know of the most reliable to feed groff
> kr>  Unicode codepoints?  I'm not sure when a -Tutf8 will exist for mandoc,
> kr>  but the screenshot demonstrates that it's in principle possible.
>
>   I tried UTF-8 characters directly only but it was a long time ago so
>   I don't remember the details.  I will check the stock version of
>   groff again and give mandoc a try, then contact a manual page
>   maintainer in FreeBSD project about how he feels migration from groff
>   to mandoc, and get back to you.
>
>   And, what do I do to subscribe this mailing-list?  I could not find
>   information on in at bsd.lv page.

Hiroki,

Send mail to discuss+subscribe@mdocml.bsd.lv to subscribe.

Regarding groff, their underlying Unicode input method is detailed here:

  http://mdocml.bsd.lv/archives/tech/0368.html

groff also has support for arbitrary encodings by piping the output of 
preconv (a groff preprocessor translating multi-byte characters, like 
UTF8, into the groff escapes):

  http://manpages.ubuntu.com/manpages/maverick/man1/preconv.1.html

This part I'll consider later.

I'd like to have wide-character support by the next version of mandoc; 
however, I'll post versions here for testing when they're available.

Thanks again,

Kristaps
--
 To unsubscribe send an email to discuss+unsubscribe@mdocml.bsd.lv

^ permalink raw reply	[flat|nested] 2+ messages in thread

end of thread, other threads:[~2011-05-15 15:31 UTC | newest]

Thread overview: 2+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2011-05-11 20:42 mandoc and UTF-8 support Kristaps Dzonsons
     [not found] ` <20110515.194157.759326979137963596.hrs@ec.ss.titech.ac.jp>
2011-05-15 15:31   ` Kristaps Dzonsons

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).