ntg-context - mailing list for ConTeXt users
 help / color / mirror / Atom feed
From: Mojca Miklavec <mojca.miklavec.lists@gmail.com>
To: mailing list for ConTeXt users <ntg-context@ntg.nl>
Subject: How to sort in Lua using locale?
Date: Mon, 12 Jul 2010 12:30:38 +0200	[thread overview]
Message-ID: <AANLkTimiGHyHw8EUUgaLFijRGvAtskE083-5pxDdfZBo@mail.gmail.com> (raw)

Hello,

I would like to create a document using some functionality that is
pretty close to creating an index, but the built-in index creation
code doesn't suffice. My main question is: how do I sort strings using
the built-in sorting rules for the current language?

Here is an example with a table of names:

names = {
    {first="Hans", last="Hagen"},
    {first="Hartmut", last="Henkel"},
    {first="Taco", last="Hoekwater"},
    {first="Caron", last="Haček"}
    {first="Ančka", last="Car"}
    {first="Ana", last="Car"}
    {first="Anita", last="Car"}
    {first="Matija", last="Čop"}
}

and I want to print out the following:

\section{C}
\person{Car}{Ana}
\person{Car}{Ančka}
\person{Car}{Anita}

\section{Č}
\person{Čop}{Matija}

\section{H}
\person{Haček}{Caron}
\person{Hagen}{Hans}
\person{Henkel}{Hartmut}
\person{Hoekwater}{Taco}

The following code works for English and needs a fix to respect local
sorting rules. I would be grateful for any hint before reinventing the
wheel (the hint can also be a pointer to the right line of ConTeXt
source code that does the sorting; I was a bit lost in the code).

\mainlanguage[sl]
\setuphead[section][style=\bf,before=\blank,after=,number=no]
\def\person#1#2{\hbox{{\sc #1}, #2}}

\starttext

Code that needs fixing:

\startluacode

names = {
	{first="Hans",    last="Hagen"},
	{first="Hartmut", last="Henkel"},
	{first="Taco",    last="Hoekwater"},
	{first="Caron",   last="Haček"},
	{first="Ančka",   last="Car"},
	{first="Ana",     last="Car"},
	{first="Anita",   last="Car"},
	{first="Matija",  last="Čop"},
}

-- this should sort with local sorting rules; how?
function sort_names(a,b)
	if (a.last == b.last) then
		return a.first<b.first
	else
		return a.last<b.last
	end
end

-- this is extremely ugly and needs to be fixed
-- with some UTF-8 library
function first_letter(str)
	if (string.byte(str,1)>127) then
		return string.sub(str,1,2)
	else
		return string.sub(str,1,1)
	end
end

table.sort(names, sort_names)

for k,v in pairs(names) do
	-- nested tables would also be an option; but are not required
	if (k==1) or (k>1 and (first_letter(names[k-1].last) ~=
first_letter(v.last))) then
		tex.print(string.format('\\section{%s}', first_letter(v.last)))
	end
	tex.print(string.format('\\person{%s}{%s}', v.last, v.first))
end

\stopluacode

\crlf\crlf

Desired result:

\section{C}
\person{Car}{Ana}
\person{Car}{Ančka}
\person{Car}{Anita}

\section{Č}
\person{Čop}{Matija}

\section{H}
\person{Haček}{Caron}
\person{Hagen}{Hans}
\person{Henkel}{Hartmut}
\person{Hoekwater}{Taco}

\stoptext

Thanks a lot,
   Mojca
___________________________________________________________________________________
If your question is of interest to others as well, please add an entry to the Wiki!

maillist : ntg-context@ntg.nl / http://www.ntg.nl/mailman/listinfo/ntg-context
webpage  : http://www.pragma-ade.nl / http://tex.aanhet.net
archive  : http://foundry.supelec.fr/projects/contextrev/
wiki     : http://contextgarden.net
___________________________________________________________________________________

             reply	other threads:[~2010-07-12 10:30 UTC|newest]

Thread overview: 2+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2010-07-12 10:30 Mojca Miklavec [this message]
2010-07-12 12:28 ` Hans Hagen

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=AANLkTimiGHyHw8EUUgaLFijRGvAtskE083-5pxDdfZBo@mail.gmail.com \
    --to=mojca.miklavec.lists@gmail.com \
    --cc=ntg-context@ntg.nl \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).