ntg-context - mailing list for ConTeXt users
 help / color / mirror / Atom feed
* Problem with encoding
@ 2010-12-21  8:34 Procházka Lukáš Ing. - Pontex s. r. o.
  2010-12-21  9:29 ` Mojca Miklavec
  0 siblings, 1 reply; 16+ messages in thread
From: Procházka Lukáš Ing. - Pontex s. r. o. @ 2010-12-21  8:34 UTC (permalink / raw)
  To: ConTeXt

[-- Attachment #1: Type: text/plain, Size: 5395 bytes --]

Hello,

this example stopped working. I don't know exactly when, maybe with the newest Ctx beta. Let's have two sources:

--- t.mkiv
\mainlanguage[cz]
\enableregime[cp1250]

\starttext
   \input t2.mkiv

   AáSš
\stoptext
---

And:

--- t2.mkiv
AáSš
---

The t2.mkiv is included by the main file t.mkiv.

The strange thing is that chars with diacritics being used in the included file are successfully processed, whilst those in the main file are not.

Here's the process log:

---
MTXrun | run 1: luatex --fmt="c:/ConTeXt/tex/texmf-cache/luatex-cache/context/f53042fa2e1c106bc7e3383ec8c3a00c/formats/cont-en" --lua="c:/ConTeXt/tex/texmf-cache/luatex-cache/context/f53042fa2e1c106bc7e3383ec8c3a00c/formats/cont-en.lui" --backend=pdf "C:/Lukas/Jobs/Holesov.PDPS/SO_201/Texts/T/t.mkiv"This is LuaTeX, Version beta-0.64.0-2010111223 (rev 3956)
  \write18 enabled.
(C:/Lukas/Jobs/Holesov.PDPS/SO_201/Texts/T/t.mkiv

ConTeXt  ver: 2010.11.27 14:27 MKIV  fmt: 2010.11.29  int: english/english

system          : cont-new loaded
(c:/ConTeXt/tex/texmf-context/tex/context/base/cont-new.tex
systems         : beware: some patches loaded from cont-new.tex
(c:/ConTeXt/tex/texmf-context/tex/context/base/cont-new.mkiv))
system          : cont-fil.mkiv loaded
(c:/ConTeXt/tex/texmf-context/tex/context/base/cont-fil.mkiv
loading         : ConTeXt File Synonyms
)
system          : cont-sys.rme loaded
(c:/ConTeXt/tex/texmf-context/tex/context/user/cont-sys.rme (c:/ConTeXt/tex/texmf-context/tex/context/base/type-def.mkiv) (c:/ConTeXt/tex/texmf-context/tex/context/base/type-lua.mkiv) (c:/ConTeXt/tex/texmf-context/tex/context/base/type-siz.mkiv) (c:/ConTeXt/tex/texmf-context/tex/context/base/type-otf.mkiv))
system          : cont-err loaded
(c:/ConTeXt/tex/texmf-context/tex/context/base/cont-err.tex
systems         : no file 'cont-sys.tex', using 'cont-sys.rme' instead
)
system          : t.top loaded
(t.top
){c:/ConTeXt/tex/texmf/fonts/map/dvips/lm/lm-math.map}{c:/ConTeXt/tex/texmf/fonts/map/dvips/lm/lm-rm.map}{c:/ConTeXt/tex/texmf-context/fonts/map/pdftex/context/mkiv-base.map}
bodyfont        : 12pt rm is loaded
fonts           : preloading latin modern fonts (first stage)
language        : language en is active
publications    : loading formatting style from bxml-apa
(c:/ConTeXt/tex/texmf-context/tex/context/base/bxml-apa.mkiv)
systems         : begin file C:/Lukas/Jobs/Holesov.PDPS/SO_201/Texts/T/t.mkiv at line 4
(t2.mkiv)
! String contains an invalid utf-8 sequence.
l.7   A
        ßSÜ
backends        > using xmp file 'c:/ConTeXt/tex/texmf-context/tex/context/base/lpdf-pdx.xml'
pages           > flushing realpage 1, userpage 1, subpage 1
systems         : end file C:/Lukas/Jobs/Holesov.PDPS/SO_201/Texts/T/t.mkiv at line 8
  )<c:/ConTeXt/tex/texmf/fonts/opentype/public/lm/lmroman12-regular.otf>
mkiv lua stats : used config file          - c:/ConTeXt/tex/texmf/web2c/texmfcnf.lua
mkiv lua stats : used cache path           - c:/ConTeXt/tex/texmf-cache/luatex-cache/context/f53042fa2e1c106bc7e3383ec8c3a00c
mkiv lua stats : input load time           - 0.016 seconds
mkiv lua stats : stored bytecode data      - 257 modules, 56 tables, 313 chunks
mkiv lua stats : cleaned up reserved nodes - 33 nodes, 9 lists of 411
mkiv lua stats : node memory usage         - 19 glue_spec
mkiv lua stats : node list callback tasks  - 6 unique task lists, 5 instances (re)created, 329 calls
mkiv lua stats : used backend              - pdf (backend for directly generating pdf output)
mkiv lua stats : loaded patterns           - cs::23 en::2
mkiv lua stats : callbacks                 - direct: 481, indirect: 4508, total: 4989
mkiv lua stats : lxml preparation time     - 0.000 seconds, 0 nodes, 11 lpath calls, 0 cached calls
mkiv lua stats : result saved in file      - t.pdf
mkiv lua stats : loaded fonts              - 33 files: stmary10.afm lmmono12-regular.otf lmmono8-regular.otf lmmono9-regular.otf lmroman12-bold.otf lmroman12-regular.otf lmroman7-bold.otf lmroman7-regular.otf lmroman9-bold.otf lmroman9-regular.otf lmsans12-regular.otf lmsans8-regular.otf lmsans9-regular.otf eufb10.tfm eufb7.tfm eufm10.tfm eufm7.tfm msam10.tfm msam7.tfm msbm10.tfm msbm7.tfm lmex10.tfm lmmi12.tfm lmmi7.tfm lmmi9.tfm lmmib10.tfm lmmib7.tfm lmsy10.tfm lmsy7.tfm lmsy9.tfm rm-lmr12.tfm rm-lmr7.tf
m rm-lmr9.tfm
mkiv lua stats : fonts load time           - 0.266 seconds
mkiv lua stats : luatex banner             - this is luatex, version beta-0.64.0-2010111223 (web2c 2010) (rev 3956)
mkiv lua stats : control sequences         - 29979 of 165536
mkiv lua stats : current memory usage      - 32 MB (ctx: 32 MB)
mkiv lua stats : runtime                   - 0.813 seconds, 1 processed pages, 1 shipped pages, 1.230 pages/second

MTXrun | fatal error: return code: 1
---

So the example doesn't work with the Ctx version from 27.11.2010. But it does with one from 20.11.2010 (tested on another computer).

So is it possible to make Ctx work even when diacritics and ANSI encoding is used in the main file, as it used to be?

All files use ANSI encoding (one char = one byte); WinXP.

Best regards,

Lukas


-- 
Ing. Lukáš Procházka [mailto:LPr@pontex.cz]
Pontex s. r. o.      [mailto:pontex@pontex.cz] [http://www.pontex.cz]
Bezová 1658
147 14 Praha 4

Tel: +420 244 062 238
Fax: +420 244 461 038

[-- Attachment #2: t.log --]
[-- Type: application/octet-stream, Size: 4699 bytes --]

[-- Attachment #3: t.mkiv --]
[-- Type: application/octet-stream, Size: 95 bytes --]

[-- Attachment #4: t2.mkiv --]
[-- Type: application/octet-stream, Size: 6 bytes --]

[-- Attachment #5: Type: text/plain, Size: 486 bytes --]

___________________________________________________________________________________
If your question is of interest to others as well, please add an entry to the Wiki!

maillist : ntg-context@ntg.nl / http://www.ntg.nl/mailman/listinfo/ntg-context
webpage  : http://www.pragma-ade.nl / http://tex.aanhet.net
archive  : http://foundry.supelec.fr/projects/contextrev/
wiki     : http://contextgarden.net
___________________________________________________________________________________

^ permalink raw reply	[flat|nested] 16+ messages in thread

* Re: Problem with encoding
  2010-12-21  8:34 Problem with encoding Procházka Lukáš Ing. - Pontex s. r. o.
@ 2010-12-21  9:29 ` Mojca Miklavec
  2010-12-21  9:37   ` Procházka Lukáš Ing. - Pontex s. r. o.
  2010-12-21  9:43   ` Arthur Reutenauer
  0 siblings, 2 replies; 16+ messages in thread
From: Mojca Miklavec @ 2010-12-21  9:29 UTC (permalink / raw)
  To: mailing list for ConTeXt users

2010/12/21 Procházka Lukáš Ing. - Pontex s. r. o. wrote:
>
> \mainlanguage[cz]

(unrelated to your question, but ...)

It should probably be cs, not cz.

Isn't "cz" country code and "cs" language code? The "cz" is left for
backward compatibility reasons.

Mojca
___________________________________________________________________________________
If your question is of interest to others as well, please add an entry to the Wiki!

maillist : ntg-context@ntg.nl / http://www.ntg.nl/mailman/listinfo/ntg-context
webpage  : http://www.pragma-ade.nl / http://tex.aanhet.net
archive  : http://foundry.supelec.fr/projects/contextrev/
wiki     : http://contextgarden.net
___________________________________________________________________________________

^ permalink raw reply	[flat|nested] 16+ messages in thread

* Re: Problem with encoding
  2010-12-21  9:29 ` Mojca Miklavec
@ 2010-12-21  9:37   ` Procházka Lukáš Ing. - Pontex s. r. o.
  2010-12-21 10:05     ` Hans Hagen
                       ` (2 more replies)
  2010-12-21  9:43   ` Arthur Reutenauer
  1 sibling, 3 replies; 16+ messages in thread
From: Procházka Lukáš Ing. - Pontex s. r. o. @ 2010-12-21  9:37 UTC (permalink / raw)
  To: mailing list for ConTeXt users

Hello,

On Tue, 21 Dec 2010 10:29:13 +0100, Mojca Miklavec <mojca.miklavec.lists@gmail.com> wrote:

> 2010/12/21 Procházka Lukáš Ing. - Pontex s. r. o. wrote:
>>
>> \mainlanguage[cz]
>
> It should probably be cs, not cz.

I'm getting the same error message even with "cs".

> Isn't "cz" country code and "cs" language code?

BTW: What is the difference what the COUNTRY code and the LANGUAGE code affects?

Lukas

> The "cz" is left for
> backward compatibility reasons.
>
> Mojca


-- 
Ing. Lukáš Procházka [mailto:LPr@pontex.cz]
Pontex s. r. o.      [mailto:pontex@pontex.cz] [http://www.pontex.cz]
Bezová 1658
147 14 Praha 4

Tel: +420 244 062 238
Fax: +420 244 461 038

___________________________________________________________________________________
If your question is of interest to others as well, please add an entry to the Wiki!

maillist : ntg-context@ntg.nl / http://www.ntg.nl/mailman/listinfo/ntg-context
webpage  : http://www.pragma-ade.nl / http://tex.aanhet.net
archive  : http://foundry.supelec.fr/projects/contextrev/
wiki     : http://contextgarden.net
___________________________________________________________________________________


^ permalink raw reply	[flat|nested] 16+ messages in thread

* Re: Problem with encoding
  2010-12-21  9:29 ` Mojca Miklavec
  2010-12-21  9:37   ` Procházka Lukáš Ing. - Pontex s. r. o.
@ 2010-12-21  9:43   ` Arthur Reutenauer
  1 sibling, 0 replies; 16+ messages in thread
From: Arthur Reutenauer @ 2010-12-21  9:43 UTC (permalink / raw)
  To: Mailing list for ConTeXt users

> Isn't "cz" country code and "cs" language code?

  Indeed -- although it doesn't make much sense that "cs" be the
language code for a language name spelt "Czech" in English and "česky"
in Czech (I suspect it's related to the fact that "cs" was also the
country code for Czechoslovakia in its time, so that the same code was
used for the "main" language of that country, so to say).

	Arthur
___________________________________________________________________________________
If your question is of interest to others as well, please add an entry to the Wiki!

maillist : ntg-context@ntg.nl / http://www.ntg.nl/mailman/listinfo/ntg-context
webpage  : http://www.pragma-ade.nl / http://tex.aanhet.net
archive  : http://foundry.supelec.fr/projects/contextrev/
wiki     : http://contextgarden.net
___________________________________________________________________________________

^ permalink raw reply	[flat|nested] 16+ messages in thread

* Re: Problem with encoding
  2010-12-21  9:37   ` Procházka Lukáš Ing. - Pontex s. r. o.
@ 2010-12-21 10:05     ` Hans Hagen
  2010-12-21 10:11     ` Arthur Reutenauer
  2010-12-21 10:14     ` Mojca Miklavec
  2 siblings, 0 replies; 16+ messages in thread
From: Hans Hagen @ 2010-12-21 10:05 UTC (permalink / raw)
  To: mailing list for ConTeXt users

On 21-12-2010 10:37, Procházka Lukáš Ing. - Pontex s. r. o. wrote:
> Hello,
>
> On Tue, 21 Dec 2010 10:29:13 +0100, Mojca Miklavec
> <mojca.miklavec.lists@gmail.com> wrote:
>
>> 2010/12/21 Procházka Lukáš Ing. - Pontex s. r. o. wrote:
>>>
>>> \mainlanguage[cz]
>>
>> It should probably be cs, not cz.

one can always use 'czech'

> I'm getting the same error message even with "cs".

Regimes were dealt with at the file level but it has to be at the line 
level for in-document switching.

I uploaded a beta:

- m-database fix
- regimes fix
- split numbering fix

Hans

-----------------------------------------------------------------
                                           Hans Hagen | PRAGMA ADE
               Ridderstraat 27 | 8061 GH Hasselt | The Netherlands
     tel: 038 477 53 69 | voip: 087 875 68 74 | www.pragma-ade.com
                                              | www.pragma-pod.nl
-----------------------------------------------------------------
___________________________________________________________________________________
If your question is of interest to others as well, please add an entry to the Wiki!

maillist : ntg-context@ntg.nl / http://www.ntg.nl/mailman/listinfo/ntg-context
webpage  : http://www.pragma-ade.nl / http://tex.aanhet.net
archive  : http://foundry.supelec.fr/projects/contextrev/
wiki     : http://contextgarden.net
___________________________________________________________________________________


^ permalink raw reply	[flat|nested] 16+ messages in thread

* Re: Problem with encoding
  2010-12-21  9:37   ` Procházka Lukáš Ing. - Pontex s. r. o.
  2010-12-21 10:05     ` Hans Hagen
@ 2010-12-21 10:11     ` Arthur Reutenauer
  2010-12-21 10:17       ` Procházka Lukáš Ing. - Pontex s. r. o.
  2010-12-21 11:19       ` Philipp Gesang
  2010-12-21 10:14     ` Mojca Miklavec
  2 siblings, 2 replies; 16+ messages in thread
From: Arthur Reutenauer @ 2010-12-21 10:11 UTC (permalink / raw)
  To: Mailing list for ConTeXt users

> BTW: What is the difference what the COUNTRY code and the LANGUAGE code affects?

  In your case, you should use a language code since you want to set a
language.  ConTeXt has used different sets of codes in the past, but we
now try to follow IETF recommendation "Tag for Identifying Languages"
(http://tools.ietf.org/html/bcp47), because it's the only one that can
be precise enough for our needs (at least among the tagging systems I
know of).  For example, you can distinguish between British English and
American English by appending a country code to the language code (hence
"en-gb" and "en-us", respectively).  Using a country code to identify a
language is generally a bad idea and should be discouraged, since that's
not what they're meant for, and it can lead to confusion: for example,
you could use the code "uk" to identify English as spoken in the United
Kingdom, but that's actually the language code for Ukrainian, which has
been an actual problem for ConTeXt in the post (in addition to that,
"uk" is not even the proper language code for the United Kingdom: it's
"gb", as written above; the reason why "uk" is used as a DNS top-level
domain is not clear and has lead, alas, to even more confusion).

  The authorities that decide upon language and country codes are
different committees of the ISO; the ISO standard for language codes is
ISO 639 (with different parts), and the one for country codes is ISO 3166
(again, with different parts; the two-letter codes of ISO 3166-1 are
generally rather well-known because they're the ones being used for DNS
top-level domains -- with some exceptions, see above).

	Arthur
___________________________________________________________________________________
If your question is of interest to others as well, please add an entry to the Wiki!

maillist : ntg-context@ntg.nl / http://www.ntg.nl/mailman/listinfo/ntg-context
webpage  : http://www.pragma-ade.nl / http://tex.aanhet.net
archive  : http://foundry.supelec.fr/projects/contextrev/
wiki     : http://contextgarden.net
___________________________________________________________________________________


^ permalink raw reply	[flat|nested] 16+ messages in thread

* Re: Problem with encoding
  2010-12-21  9:37   ` Procházka Lukáš Ing. - Pontex s. r. o.
  2010-12-21 10:05     ` Hans Hagen
  2010-12-21 10:11     ` Arthur Reutenauer
@ 2010-12-21 10:14     ` Mojca Miklavec
  2010-12-21 10:35       ` Arthur Reutenauer
  2 siblings, 1 reply; 16+ messages in thread
From: Mojca Miklavec @ 2010-12-21 10:14 UTC (permalink / raw)
  To: mailing list for ConTeXt users

2010/12/21 Procházka Lukáš Ing. - Pontex s. r. o. <LPr@pontex.cz>:
>
>> It should probably be cs, not cz.
>
> I'm getting the same error message even with "cs".

As I said: my comment was unrelated to your problem.

>> Isn't "cz" country code and "cs" language code?
>
> BTW: What is the difference what the COUNTRY code and the LANGUAGE code
> affects?

If you were a citizen of Austria or Switzerlad, you would not be able to use
    \mainlanguage[at] or \mainlanguage[ch]
You need to use [de] instead (which happens to be the same as country
code for Germany, but that is just "a coincidence").

Country codes usually have no meaning to ConTeXt. In the case of Czech
it is just for the sake of backward-compatibility that cz works.

Mojca
___________________________________________________________________________________
If your question is of interest to others as well, please add an entry to the Wiki!

maillist : ntg-context@ntg.nl / http://www.ntg.nl/mailman/listinfo/ntg-context
webpage  : http://www.pragma-ade.nl / http://tex.aanhet.net
archive  : http://foundry.supelec.fr/projects/contextrev/
wiki     : http://contextgarden.net
___________________________________________________________________________________

^ permalink raw reply	[flat|nested] 16+ messages in thread

* Re: Problem with encoding
  2010-12-21 10:11     ` Arthur Reutenauer
@ 2010-12-21 10:17       ` Procházka Lukáš Ing. - Pontex s. r. o.
  2010-12-21 11:19       ` Philipp Gesang
  1 sibling, 0 replies; 16+ messages in thread
From: Procházka Lukáš Ing. - Pontex s. r. o. @ 2010-12-21 10:17 UTC (permalink / raw)
  To: Mailing list for ConTeXt users

... OK, thanks all for the answers and fixes.

Lukas


On Tue, 21 Dec 2010 11:11:33 +0100, Arthur Reutenauer <arthur.reutenauer@normalesup.org> wrote:

>> BTW: What is the difference what the COUNTRY code and the LANGUAGE code affects?
>
>   In your case, you should use a language code since you want to set a
> language.  ConTeXt has used different sets of codes in the past, but we
> now try to follow IETF recommendation "Tag for Identifying Languages"
> (http://tools.ietf.org/html/bcp47), because it's the only one that can
> be precise enough for our needs (at least among the tagging systems I
> know of).  For example, you can distinguish between British English and
> American English by appending a country code to the language code (hence
> "en-gb" and "en-us", respectively).  Using a country code to identify a
> language is generally a bad idea and should be discouraged, since that's
> not what they're meant for, and it can lead to confusion: for example,
> you could use the code "uk" to identify English as spoken in the United
> Kingdom, but that's actually the language code for Ukrainian, which has
> been an actual problem for ConTeXt in the post (in addition to that,
> "uk" is not even the proper language code for the United Kingdom: it's
> "gb", as written above; the reason why "uk" is used as a DNS top-level
> domain is not clear and has lead, alas, to even more confusion).
>
>   The authorities that decide upon language and country codes are
> different committees of the ISO; the ISO standard for language codes is
> ISO 639 (with different parts), and the one for country codes is ISO 3166
> (again, with different parts; the two-letter codes of ISO 3166-1 are
> generally rather well-known because they're the ones being used for DNS
> top-level domains -- with some exceptions, see above).
>
> 	Arthur


-- 
Ing. Lukáš Procházka [mailto:LPr@pontex.cz]
Pontex s. r. o.      [mailto:pontex@pontex.cz] [http://www.pontex.cz]
Bezová 1658
147 14 Praha 4

Tel: +420 244 062 238
Fax: +420 244 461 038

___________________________________________________________________________________
If your question is of interest to others as well, please add an entry to the Wiki!

maillist : ntg-context@ntg.nl / http://www.ntg.nl/mailman/listinfo/ntg-context
webpage  : http://www.pragma-ade.nl / http://tex.aanhet.net
archive  : http://foundry.supelec.fr/projects/contextrev/
wiki     : http://contextgarden.net
___________________________________________________________________________________


^ permalink raw reply	[flat|nested] 16+ messages in thread

* Re: Problem with encoding
  2010-12-21 10:14     ` Mojca Miklavec
@ 2010-12-21 10:35       ` Arthur Reutenauer
  0 siblings, 0 replies; 16+ messages in thread
From: Arthur Reutenauer @ 2010-12-21 10:35 UTC (permalink / raw)
  To: Mailing list for ConTeXt users

> If you were a citizen of Austria or Switzerlad, you would not be able to use
>     \mainlanguage[at] or \mainlanguage[ch]

  As it happens, "at" is not the ISO 639 code of any language (and it's
doubtful it will be in the near future), but "ch" is the code for
Chamorro (http://www.ethnologue.com/show_language.asp?code=cha), so you
would indeed have problems :-)

	Arthur
___________________________________________________________________________________
If your question is of interest to others as well, please add an entry to the Wiki!

maillist : ntg-context@ntg.nl / http://www.ntg.nl/mailman/listinfo/ntg-context
webpage  : http://www.pragma-ade.nl / http://tex.aanhet.net
archive  : http://foundry.supelec.fr/projects/contextrev/
wiki     : http://contextgarden.net
___________________________________________________________________________________


^ permalink raw reply	[flat|nested] 16+ messages in thread

* Re: Problem with encoding
  2010-12-21 10:11     ` Arthur Reutenauer
  2010-12-21 10:17       ` Procházka Lukáš Ing. - Pontex s. r. o.
@ 2010-12-21 11:19       ` Philipp Gesang
  2010-12-21 11:21         ` Philipp Gesang
                           ` (3 more replies)
  1 sibling, 4 replies; 16+ messages in thread
From: Philipp Gesang @ 2010-12-21 11:19 UTC (permalink / raw)
  To: Mailing list for ConTeXt users


[-- Attachment #1.1: Type: text/plain, Size: 2740 bytes --]

On 2010-12-21 <11:11:33>, Arthur Reutenauer wrote:
> > BTW: What is the difference what the COUNTRY code and the LANGUAGE code affects?
> 
>   In your case, you should use a language code since you want to set a
> language.  ConTeXt has used different sets of codes in the past, but we
> now try to follow IETF recommendation "Tag for Identifying Languages"
> (http://tools.ietf.org/html/bcp47), because it's the only one that can
> be precise enough for our needs (at least among the tagging systems I
> know of).  For example, you can distinguish between British English and
> American English by appending a country code to the language code (hence
> "en-gb" and "en-us", respectively).  Using a country code to identify a
> language is generally a bad idea and should be discouraged, since that's
> not what they're meant for, and it can lead to confusion: for example,
> you could use the code "uk" to identify English as spoken in the United
> Kingdom, but that's actually the language code for Ukrainian, which has
> been an actual problem for ConTeXt in the post (in addition to that,
> "uk" is not even the proper language code for the United Kingdom: it's
> "gb", as written above; the reason why "uk" is used as a DNS top-level
> domain is not clear and has lead, alas, to even more confusion).

But context uses some non-standard codes also: “deo” instead of
“de-1901” for sane German orthography, and “agr” for ancient
Attic instead of “grc” or “el-polyton”. (Testfile appended.)

Philipp

PS: nice tools
    http://people.w3.org/rishida/utils/subtags/
    http://unicode.org/cldr/utility/languageid.jsp



>   The authorities that decide upon language and country codes are
> different committees of the ISO; the ISO standard for language codes is
> ISO 639 (with different parts), and the one for country codes is ISO 3166
> (again, with different parts; the two-letter codes of ISO 3166-1 are
> generally rather well-known because they're the ones being used for DNS
> top-level domains -- with some exceptions, see above).
> 
> 	Arthur
> ___________________________________________________________________________________
> If your question is of interest to others as well, please add an entry to the Wiki!
> 
> maillist : ntg-context@ntg.nl / http://www.ntg.nl/mailman/listinfo/ntg-context
> webpage  : http://www.pragma-ade.nl / http://tex.aanhet.net
> archive  : http://foundry.supelec.fr/projects/contextrev/
> wiki     : http://contextgarden.net
> ___________________________________________________________________________________

-- 
()  ascii ribbon campaign - against html e-mail
/\  www.asciiribbon.org   - against proprietary attachments

[-- Attachment #1.2: Type: application/pgp-signature, Size: 198 bytes --]

[-- Attachment #2: Type: text/plain, Size: 486 bytes --]

___________________________________________________________________________________
If your question is of interest to others as well, please add an entry to the Wiki!

maillist : ntg-context@ntg.nl / http://www.ntg.nl/mailman/listinfo/ntg-context
webpage  : http://www.pragma-ade.nl / http://tex.aanhet.net
archive  : http://foundry.supelec.fr/projects/contextrev/
wiki     : http://contextgarden.net
___________________________________________________________________________________

^ permalink raw reply	[flat|nested] 16+ messages in thread

* Re: Problem with encoding
  2010-12-21 11:19       ` Philipp Gesang
@ 2010-12-21 11:21         ` Philipp Gesang
  2010-12-21 11:44         ` Thomas Schmitz
                           ` (2 subsequent siblings)
  3 siblings, 0 replies; 16+ messages in thread
From: Philipp Gesang @ 2010-12-21 11:21 UTC (permalink / raw)
  To: Mailing list for ConTeXt users


[-- Attachment #1.1.1: Type: text/plain, Size: 294 bytes --]

On 2010-12-21 <12:19:15>, Philipp Gesang wrote:

> Attic instead of “grc” or “el-polyton”. (Testfile appended.)

Sorry for the mess, test file *now* appended …


-- 
()  ascii ribbon campaign - against html e-mail
/\  www.asciiribbon.org   - against proprietary attachments

[-- Attachment #1.1.2: deo.tex --]
[-- Type: text/x-tex, Size: 3215 bytes --]

% http://unicode.org/cldr/utility/languageid.jsp?a=de-1901&l=en
\setuppapersize [XY][A4] \setuppaper [nx=2,ny=2] \setuparranging [XY]

\usetypescriptfile[type-cmu] \setupbodyfont[computer-modern-unicode,10pt]

\mainlanguage[de-1901]
%\mainlanguage[deo]

\starttext

\startcolumns[n=2]

\dorecurse{4}{%
\bgroup
  %\language[deo]    % currently used language code, context only
  \language[de-1901] % BCP-47
  Seitens der Kritik habe ich auch gar nichts anderes erwartet, als daß 
  meist von „Szene“ gesprochen wird.  Die Ursache dafür ist eben in der 
  Tradition des deutschen Pathos zu ermitteln, welches den symphonischen
  Begriff im allgemeinen stempelt.  Ebendeshalb habe ich diesen Aufsatz
  über meine Sinfonie geschrieben, um den Beweis zu erbringen, daß ich das 
  Pathos gänzlich ablehne, was auch eine entsprechende Verringerung des 
  gesamten Orchesterapparats bedeutet und daß ich mich mit dem Begriff
  „Sinfonie“ gänzlich anders auseinandersetze als die übrigen Komponisten,
  welche traditiongemäß fruchtlos fortsetzen, wo tatsächlich Bruckner und 
  Mahler bereits aufhörten.  Als Komponist dieses Werkes kann ich mich
  jederzeit rechtfertigen und verantworte voll und ganz den Titel
  „Sinfonie“.

  Im übrigen möchte ich bemerken, daß die heute in Deutschland usuelle
  „Neue Sachlichkeit“ keineswegs der Wahrheit und Notwendigkeit
  entspringt, da der Deutsche nie aus seiner Sentimentalität herauskann.
  Die augenblickliche Bach- und Händelrennaissance ist nichts anderes als 
  ein Zeichen von Impotenz, welche mit dem Schlagwort „Neue Sachlichkeit“
  bemäntelt wird.  Der Fall Strawinsky ist dafür eklatant und der nächste
  fall wird die Erstarrung Hindemiths werden, falls dieser nicht
  rechtzeitig aus der Schablone herausfindet.

  {\em Erwin Schulhoff} 1928\par
\egroup

\bgroup
  %\language[agr] 
  \language[grc]
  τούτων δὲ ὑπαρχόντων αὖ πᾶσα ἀνάγκη τόνδε τὸν κόσμον εἰκόνα
  τινὸς εἶναι. μέγιστον δὴ παντὸς ἄρξασθαι κατὰ φύσιν ἀρχήν. ὧδε
  οὖν περί τε εἰκόνος καὶ περὶ τοῦ παραδείγματος αὐτῆς
  διοριστέον, ὡς ἄρα τοὺς λόγους, ὧνπέρ εἰσιν ἐξηγηταί, τούτων
  αὐτῶν καὶ συγγενεῖς ὄντας· τοῦ μὲν οὖν μονίμου καὶ βεβαίου καὶ
  μετὰ νοῦ καταφανοῦς μονίμους καὶ ἀμεταπτώτους –καθ᾽ ὅσον οἷόν
  τε καὶ ἀνελέγκτοις προσήκει λόγοις εἶναι καὶ ἀνικήτοις, τούτου
  δεῖ μηδὲν ἐλλείπειν– τοὺς δὲ τοῦ πρὸς μὲν ἐκεῖνο
  ἀπεικασθέντος, ὄντος δὲ εἰκόνος εἰκότας ἀνὰ λόγον τε ἐκείνων
  ὄντας· ὅτιπερ πρὸς γένεσιν οὐσία, τοῦτο πρὸς πίστιν ἀλήθεια.

  {\language[deo]\em Plat. Tim. 29bc}\par
\egroup
}

\stopcolumns

\stoptext

[-- Attachment #1.2: Type: application/pgp-signature, Size: 198 bytes --]

[-- Attachment #2: Type: text/plain, Size: 486 bytes --]

___________________________________________________________________________________
If your question is of interest to others as well, please add an entry to the Wiki!

maillist : ntg-context@ntg.nl / http://www.ntg.nl/mailman/listinfo/ntg-context
webpage  : http://www.pragma-ade.nl / http://tex.aanhet.net
archive  : http://foundry.supelec.fr/projects/contextrev/
wiki     : http://contextgarden.net
___________________________________________________________________________________

^ permalink raw reply	[flat|nested] 16+ messages in thread

* Re: Problem with encoding
  2010-12-21 11:19       ` Philipp Gesang
  2010-12-21 11:21         ` Philipp Gesang
@ 2010-12-21 11:44         ` Thomas Schmitz
  2010-12-21 12:23           ` Arthur Reutenauer
  2010-12-21 12:17         ` Arthur Reutenauer
  2010-12-21 12:37         ` Hans Hagen
  3 siblings, 1 reply; 16+ messages in thread
From: Thomas Schmitz @ 2010-12-21 11:44 UTC (permalink / raw)
  To: mailing list for ConTeXt users

On Tue, 21 Dec 2010 12:19:15 +0100
  Philipp Gesang <pgesang@ix.urz.uni-heidelberg.de> wrote:

> But context uses some non-standard codes also: “deo” 
>instead of
> “de-1901” for sane German orthography, and “agr” for 
>ancient
> Attic instead of “grc” or “el-polyton”. (Testfile 
>appended.)
> 
Yes, we should make grc a synonym of agr. el-polyton is, 
AFAIK, modern polytonic Greek, which is not the same.

Thomas
___________________________________________________________________________________
If your question is of interest to others as well, please add an entry to the Wiki!

maillist : ntg-context@ntg.nl / http://www.ntg.nl/mailman/listinfo/ntg-context
webpage  : http://www.pragma-ade.nl / http://tex.aanhet.net
archive  : http://foundry.supelec.fr/projects/contextrev/
wiki     : http://contextgarden.net
___________________________________________________________________________________


^ permalink raw reply	[flat|nested] 16+ messages in thread

* Re: Problem with encoding
  2010-12-21 11:19       ` Philipp Gesang
  2010-12-21 11:21         ` Philipp Gesang
  2010-12-21 11:44         ` Thomas Schmitz
@ 2010-12-21 12:17         ` Arthur Reutenauer
  2010-12-21 12:37         ` Hans Hagen
  3 siblings, 0 replies; 16+ messages in thread
From: Arthur Reutenauer @ 2010-12-21 12:17 UTC (permalink / raw)
  To: Mailing list for ConTeXt users

> But context uses some non-standard codes also: “deo” instead of
> “de-1901” for sane German orthography, and “agr” for ancient
> Attic instead of “grc” or “el-polyton”. (Testfile appended.)

  Yes, that's what I said: ConTeXt has used non-standard codes in the
past, now we try to follow BCP 47, while attempting to stay
backward-compatible, and providing easy-to-remember names for the user
as well.  But of course, it's always easier to write a three-line
sentence that contains one troll and one severe inaccuracy.

>     http://unicode.org/cldr/utility/languageid.jsp

  Note that Unicode Language Identifiers are not the same as BCP 47
tags, even if they're very close in their goals and shape.

	Arthur
___________________________________________________________________________________
If your question is of interest to others as well, please add an entry to the Wiki!

maillist : ntg-context@ntg.nl / http://www.ntg.nl/mailman/listinfo/ntg-context
webpage  : http://www.pragma-ade.nl / http://tex.aanhet.net
archive  : http://foundry.supelec.fr/projects/contextrev/
wiki     : http://contextgarden.net
___________________________________________________________________________________

^ permalink raw reply	[flat|nested] 16+ messages in thread

* Re: Problem with encoding
  2010-12-21 11:44         ` Thomas Schmitz
@ 2010-12-21 12:23           ` Arthur Reutenauer
  2010-12-21 12:35             ` Hans Hagen
  0 siblings, 1 reply; 16+ messages in thread
From: Arthur Reutenauer @ 2010-12-21 12:23 UTC (permalink / raw)
  To: Mailing list for ConTeXt users

> Yes, we should make grc a synonym of agr. el-polyton is, AFAIK, modern 
> polytonic Greek, which is not the same.

  Obviously so.  Amusingly, ISO states that "el" is meant for the Greek
language after the Fall of Constantinople(*), which assigns a rather long
period to Ancient Greek.

  Note that for all I know, there is no way to distinguish katharevousa
from demotic; but it would certainly be added to the IETF registry if
someone applied for it.

	Arthur

(*) Of course it doesn't say "after the Fall of Constanople" that way,
it says "from 1453 onwards" (http://www.sil.org/iso639-3/documentation.asp?id=ell)
but that's clearly what is meant.
___________________________________________________________________________________
If your question is of interest to others as well, please add an entry to the Wiki!

maillist : ntg-context@ntg.nl / http://www.ntg.nl/mailman/listinfo/ntg-context
webpage  : http://www.pragma-ade.nl / http://tex.aanhet.net
archive  : http://foundry.supelec.fr/projects/contextrev/
wiki     : http://contextgarden.net
___________________________________________________________________________________


^ permalink raw reply	[flat|nested] 16+ messages in thread

* Re: Problem with encoding
  2010-12-21 12:23           ` Arthur Reutenauer
@ 2010-12-21 12:35             ` Hans Hagen
  0 siblings, 0 replies; 16+ messages in thread
From: Hans Hagen @ 2010-12-21 12:35 UTC (permalink / raw)
  To: Mailing list for ConTeXt users

On 21-12-2010 1:23, Arthur Reutenauer wrote:

> (*) Of course it doesn't say "after the Fall of Constanople" that way,
> it says "from 1453 onwards" (http://www.sil.org/iso639-3/documentation.asp?id=ell)
> but that's clearly what is meant.

given some historic disputes on tex mailing list we should be glad that 
they avoided a reference to macedonian times

Hans

-----------------------------------------------------------------
                                           Hans Hagen | PRAGMA ADE
               Ridderstraat 27 | 8061 GH Hasselt | The Netherlands
     tel: 038 477 53 69 | voip: 087 875 68 74 | www.pragma-ade.com
                                              | www.pragma-pod.nl
-----------------------------------------------------------------
___________________________________________________________________________________
If your question is of interest to others as well, please add an entry to the Wiki!

maillist : ntg-context@ntg.nl / http://www.ntg.nl/mailman/listinfo/ntg-context
webpage  : http://www.pragma-ade.nl / http://tex.aanhet.net
archive  : http://foundry.supelec.fr/projects/contextrev/
wiki     : http://contextgarden.net
___________________________________________________________________________________


^ permalink raw reply	[flat|nested] 16+ messages in thread

* Re: Problem with encoding
  2010-12-21 11:19       ` Philipp Gesang
                           ` (2 preceding siblings ...)
  2010-12-21 12:17         ` Arthur Reutenauer
@ 2010-12-21 12:37         ` Hans Hagen
  3 siblings, 0 replies; 16+ messages in thread
From: Hans Hagen @ 2010-12-21 12:37 UTC (permalink / raw)
  To: mailing list for ConTeXt users; +Cc: Philipp Gesang

On 21-12-2010 12:19, Philipp Gesang wrote:

> But context uses some non-standard codes also: “deo” instead of

deo = de old (and no one objected at that point)

when context was dutch only we used 'du' as that's the abbreviation used 
in schools

Hans

-----------------------------------------------------------------
                                           Hans Hagen | PRAGMA ADE
               Ridderstraat 27 | 8061 GH Hasselt | The Netherlands
     tel: 038 477 53 69 | voip: 087 875 68 74 | www.pragma-ade.com
                                              | www.pragma-pod.nl
-----------------------------------------------------------------
___________________________________________________________________________________
If your question is of interest to others as well, please add an entry to the Wiki!

maillist : ntg-context@ntg.nl / http://www.ntg.nl/mailman/listinfo/ntg-context
webpage  : http://www.pragma-ade.nl / http://tex.aanhet.net
archive  : http://foundry.supelec.fr/projects/contextrev/
wiki     : http://contextgarden.net
___________________________________________________________________________________

^ permalink raw reply	[flat|nested] 16+ messages in thread

end of thread, other threads:[~2010-12-21 12:37 UTC | newest]

Thread overview: 16+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2010-12-21  8:34 Problem with encoding Procházka Lukáš Ing. - Pontex s. r. o.
2010-12-21  9:29 ` Mojca Miklavec
2010-12-21  9:37   ` Procházka Lukáš Ing. - Pontex s. r. o.
2010-12-21 10:05     ` Hans Hagen
2010-12-21 10:11     ` Arthur Reutenauer
2010-12-21 10:17       ` Procházka Lukáš Ing. - Pontex s. r. o.
2010-12-21 11:19       ` Philipp Gesang
2010-12-21 11:21         ` Philipp Gesang
2010-12-21 11:44         ` Thomas Schmitz
2010-12-21 12:23           ` Arthur Reutenauer
2010-12-21 12:35             ` Hans Hagen
2010-12-21 12:17         ` Arthur Reutenauer
2010-12-21 12:37         ` Hans Hagen
2010-12-21 10:14     ` Mojca Miklavec
2010-12-21 10:35       ` Arthur Reutenauer
2010-12-21  9:43   ` Arthur Reutenauer

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).