mailing list of musl libc
 help / color / mirror / code / Atom feed
From: Wermut <wermut@gmail.com>
To: "musl@lists.openwall.com" <musl@lists.openwall.com>
Subject: Re: Call for locales maintainer & contributors
Date: Sun, 27 Jul 2014 21:43:34 +0200	[thread overview]
Message-ID: <CALONj1dky0eZGj7ZGQyevmDR-N5=ZHJbT6Am5dNgKR5wZ2g5=g@mail.gmail.com> (raw)
In-Reply-To: <20140727032758.GT4038@brightrain.aerifal.cx>

[-- Attachment #1: Type: text/plain, Size: 3860 bytes --]

Hi

With the exception of the musl translation itself, I think the most
parts are doable. My problem at the moment is, that I am not a C hero
like you guys and don't know exactly how these locale files should
look like (file format, content). As a consequence fully answering
your questions is non trivial at the moment. I can do some research
and do some dirty work, but first I would need a sample locale file in
the musl format, or some documentation to get kick started. I have
worked in the past and even created locale files for glibc and cldr,
so I am at least not a complete newbie on the topic.

Unfortunately I have not enough time to act as a maintainer, but I
could periodically help out if someone stands up and take the lead.

For the translation of musl itself: Do you plan to add a *.pot file to
the musl repository?

Regards
Kevin

On Sun, Jul 27, 2014 at 5:27 AM, Rich Felker <dalias@libc.org> wrote:
> On Sat, Jul 26, 2014 at 11:27:38PM +0200, Wermut wrote:
>> Hi
>>
>> I don't like the idea of an entirely new tree of locale data written
>> from scratch. Glibc has one (with a lot of unmaintained data) and then
>> there is also the CLDR repository which aims to be the central source
>> for such data, maintained by unicode. The CLDR data is also used as a
>> basis for the Microsoft and Apple locale files and is often maintained
>> by national language experts. What I could offer is an effort to write
>> some magic code that imports the actual CLDR data and converts the
>> relevant information to the musl formatted ones. The CLDR data is
>> freely available from: http://cldr.unicode.org/index/downloads
>
> I have no objection to using data from CLDR if there's no restrictive
> license, but at first glance it looks like most of the data is outside
> the scope of the C/POSIX locale system. What we need is:

CLDR license (botom of the page): http://unicode.org/copyright.html I
my eyes this is a BSD like license. If somebody thinks the license is
not OK, please say so. Copy is attached to this mail.

>
> 1. Weekday and month names (full and abbreviated) - these should
>    almost certainly be available from CLDR or other public sources.
>
> 2. Time format strings for strftime - unless CLDR has C-oriented data
>    like that, these might not be available in a form that's easy to
>    automatically adapt. Research on this topic is welcome.
>
> 3. Regexes for yes and no responses - seems unlikely to be in CLDR,
>    but again I'd be happy for someone to prove me wrong.
>
> 4. Translations of the message strings in libc. Note that musl's
>    strings already deviate some from the legacy strings used on glibc
>    and other systems. For example the strerror strings are adjusted to
>    align more closely with the POSIX description and the actual
>    situations they arise in than the legacy strings (like "Not a
>    typewriter"). I'd like to aim to have our translated strings
>    equally modernized. And before really spending a lot of work on
>    these we should review the English strings again for possible
>    improvements and missing messages (I think some newer error codes
>    may be missing).
>
> 5. Collation rules - these almost certainly can come from Unicode/CLDR
>    but musl does not even support collation yet.
>
> 6. Monetary formatting and currency names - these almost certain can
>    come from CLDR or other public sources, but again the code to use
>    the data isn't there yet.
>
>> Contribution is not completely open, but you normally interested
>> people get access if they want to. I got mine within a week.
>>
>> This is only a suggestion open to discussion. What do you guys think about it?
>
> Overall I like it. But I think we still need a maintainer to manage
> pulling the data, maintaining string translations for messages, etc.
> Any comments on my items 1-6 above?
>
> Rich

[-- Attachment #2: unicode-license.txt --]
[-- Type: text/plain, Size: 2933 bytes --]

UNICODE, INC. LICENSE AGREEMENT - DATA FILES AND SOFTWARE

    Unicode Data Files include all data files under the directories
http://www.unicode.org/Public/, http://www.unicode.org/reports/, and
http://www.unicode.org/cldr/data/. Unicode Data Files do not include PDF
online code charts under the directory http://www.unicode.org/Public/.
Software includes any source code published in the Unicode Standard or under
the directories http://www.unicode.org/Public/,
http://www.unicode.org/reports/, and http://www.unicode.org/cldr/data/.

    NOTICE TO USER: Carefully read the following legal agreement. BY
DOWNLOADING, INSTALLING, COPYING OR OTHERWISE USING UNICODE INC.'S DATA FILES
("DATA FILES"), AND/OR SOFTWARE ("SOFTWARE"), YOU UNEQUIVOCALLY ACCEPT, AND
AGREE TO BE BOUND BY, ALL OF THE TERMS AND CONDITIONS OF THIS AGREEMENT. IF
YOU DO NOT AGREE, DO NOT DOWNLOAD, INSTALL, COPY, DISTRIBUTE OR USE THE DATA
FILES OR SOFTWARE.

    COPYRIGHT AND PERMISSION NOTICE

    Copyright © 1991-2014 Unicode, Inc. All rights reserved. Distributed under
the Terms of Use in http://www.unicode.org/copyright.html.

    Permission is hereby granted, free of charge, to any person obtaining a
copy of the Unicode data files and any associated documentation (the "Data
Files") or Unicode software and any associated documentation (the "Software")
to deal in the Data Files or Software without restriction, including without
limitation the rights to use, copy, modify, merge, publish, distribute, and/or
sell copies of the Data Files or Software, and to permit persons to whom the
Data Files or Software are furnished to do so, provided that (a) the above
copyright notice(s) and this permission notice appear with all copies of the
Data Files or Software, (b) both the above copyright notice(s) and this
permission notice appear in associated documentation, and (c) there is clear
notice in each modified Data File or in the Software as well as in the
documentation associated with the Data File(s) or Software that the data or
software has been modified.

    THE DATA FILES AND SOFTWARE ARE PROVIDED "AS IS", WITHOUT WARRANTY OF ANY
KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF
MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT OF THIRD
PARTY RIGHTS. IN NO EVENT SHALL THE COPYRIGHT HOLDER OR HOLDERS INCLUDED IN
THIS NOTICE BE LIABLE FOR ANY CLAIM, OR ANY SPECIAL INDIRECT OR CONSEQUENTIAL
DAMAGES, OR ANY DAMAGES WHATSOEVER RESULTING FROM LOSS OF USE, DATA OR
PROFITS, WHETHER IN AN ACTION OF CONTRACT, NEGLIGENCE OR OTHER TORTIOUS
ACTION, ARISING OUT OF OR IN CONNECTION WITH THE USE OR PERFORMANCE OF THE
DATA FILES OR SOFTWARE.

    Except as contained in this notice, the name of a copyright holder shall
not be used in advertising or otherwise to promote the sale, use or other
dealings in these Data Files or Software without prior written authorization
of the copyright holder.

  reply	other threads:[~2014-07-27 19:43 UTC|newest]

Thread overview: 8+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2014-07-24 19:32 Rich Felker
2014-07-26 13:27 ` Olivier Goudron
2014-07-26 21:27   ` Wermut
2014-07-27  3:27     ` Rich Felker
2014-07-27 19:43       ` Wermut [this message]
2014-07-27 20:41         ` Rich Felker
2014-07-28  3:12         ` Isaac Dunham
2014-07-28  3:28           ` Rich Felker

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to='CALONj1dky0eZGj7ZGQyevmDR-N5=ZHJbT6Am5dNgKR5wZ2g5=g@mail.gmail.com' \
    --to=wermut@gmail.com \
    --cc=musl@lists.openwall.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
Code repositories for project(s) associated with this public inbox

	https://git.vuxu.org/mirror/musl/

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).