ntg-context - mailing list for ConTeXt users
 help / color / mirror / Atom feed
* Problem with encoding, XML and writetolist...
@ 2006-02-28  7:24 Martin Kolařík
  2006-02-28  9:15 ` Hans Hagen
  2006-04-03 14:25 ` Mojca Miklavec
  0 siblings, 2 replies; 27+ messages in thread
From: Martin Kolařík @ 2006-02-28  7:24 UTC (permalink / raw)


Hi all,

I have problem with chars encoding now; I have updated ConTeXt sometimes in
January; before the time I was using old (year) version. In this
prehistorical version the problem was not present.

What is the stuff? The following code generates content with titles without
spaces -- accented characters are replaced with \char XXX variant, so during
reading TUI line TeX gobbles a space after XXX as a part of <number> syntax
rule.

Can I do something? With encoding? With XML?

Thank you very much, have a nice day, Martin

Code:

\defineXMLenvironment
  [chapter]
  {\defineXMLsave[title]}
  {\expanded
     {\chapter\XMLflush{title}}}

\starttext

\startbuffer

<chapter>
  <title>Ahojá hojč kuna</title>
</chapter>

\stopbuffer

\processXMLbuffer

\completecontent[criterium=all]

\stoptext

^ permalink raw reply	[flat|nested] 27+ messages in thread
* Re: Problem with encoding, XML and writetolist...
@ 2006-04-05 11:59 Martin Kolařík
  2006-04-06  9:07 ` Hans Hagen
  0 siblings, 1 reply; 27+ messages in thread
From: Martin Kolařík @ 2006-04-05 11:59 UTC (permalink / raw)


[-- Attachment #1: Type: text/plain, Size: 4301 bytes --]

Hi Hans and Vit,

I've attached pdf and tex made with "texexec 8bit.tex" using the last beta (with cp1250 regime). The version chapterB is what I am using now, chapterA version is straightforward but not functional.

>From this my ideas from last mail came -- when I am expanding \XMLflush exactly and controlled, result is OK (the expansion stops before active characters start to expand.

IMHO, the solution on character level is something different (although it works :-); yes, I did not study Vit's problems from year before. I never had a problems with simple TeX (\chapter{čřžý}), only with XML :-), so for me it would be sufficient solve XML level. Of course, if it is stuff of other level, it cannot be solved in XML.

Thanks, Martin

-----Original Message-----
From: ntg-context-bounces@ntg.nl [mailto:ntg-context-bounces@ntg.nl]On
Behalf Of Hans Hagen
Sent: Wednesday, April 05, 2006 1:03 PM
To: mailing list for ConTeXt users
Subject: Re: [NTG-context] Problem with encoding, XML and writetolist...


Vit Zyka wrote:
> Martin Kolařík wrote:
>   
>> Hi Hans,
>>
>> I've analyzed the situation using different combinations of regimes and encodings, including switching off handling (\donothandleactivecharacter) and it seems to me, that every text with chars > 127, which is fully expanded (as in 8bit.tex), must cause difficulties, because TOC entry read back from TUO is scanned independently and thus spaces are gobbled.
>>     
>
> Hi Martin,
>
> we have discussed this problem a year ago. In that time I solve it just 
> writing '{}' in each title where was needed, like
>
>    \chapter{Dobré{} ráno}
>
> Now I have cooked an automatic solution. It is not optimal (effective) 
> but working. The only needs is to redefine
>
>    \def\numcharacter#1{\char#1{}}
>    \let\dochar\numcharacter
>
> and to load appropriate encoding file once more. Since it is banned a 
> new macro \reuseencoding was introduce:
>
> For Hans:
> It would be nice if this feature was enabled in a standard distro. 
> Perhaps you will find some more optimal solution...
>
> -------------------------------------------------------
> \unprotected
> \def\doreuseencoding#1%
>    {\letvalue{\c!file\f!encodingprefix#1}\empty
>     \makeshortfilename[\f!encodingprefix#1]%
>     \startreadingfile
>     \readsysfile\shortfilename
>       {\showmessage\m!encodings2{#1}}
>       {\showmessage\m!encodings3{#1}}%
>     \stopreadingfile}
> \def\reuseencoding[#1]%
>    {\processcommalist[#1]\doreuseencoding}
>
>
> \def\numcharacter#1{\char#1{}}
> \let\dochar\numcharacter
>
> \reuseencoding[ec]
>
> \enableregime[latin2]
>
> \starttext
> \writetolist[chapter]{1}{Dobré ráno}
> \expanded{\writetolist[chapter]{2}{Dobré odpoledne}}
>
> \placecontent
> \stoptext
>   
the {} may break liguature building (and maybe kerning) ... taco knows ...

I got a brainwave ... 

\enableregime[latin2]

\starttext

\def\numcharacter#1{\rawcharacter{#1}}%
\let\dochar\numcharacter

\writetolist[chapter]{1}{Dobré ráno}

\expanded{\writetolist[chapter]{2}{Dobré odpoledne}}

\placelist[chapter][criterium=text]

test

\typefile{test.tuo}

\stoptext

this can be sped up (a bit) with 

% \dostepwiserecurse{0}{255}{1}
%   {\setevalue{rchr:\recurselevel}{\rawcharacter\recurselevel}}

% \def\rawcharacter#1{\getvalue{rchr:#1}}

this means that all chars are round tripped to themselves in output files! 

raw characters are not active and they are therefore equivalent to \char  

it's enough then to put 

\def\numcharacter#1{\rawcharacter{#1}}%
\let\dochar\numcharacter

into cont-new.tex 

(i will discuss this in more detail with taco later) 

Hans 



-----------------------------------------------------------------
                                          Hans Hagen | PRAGMA ADE
              Ridderstraat 27 | 8061 GH Hasselt | The Netherlands
     tel: 038 477 53 69 | fax: 038 477 53 74 | www.pragma-ade.com
                                             | www.pragma-pod.nl
-----------------------------------------------------------------

_______________________________________________
ntg-context mailing list
ntg-context@ntg.nl
http://www.ntg.nl/mailman/listinfo/ntg-context

[-- Attachment #2: 8bit.pdf --]
[-- Type: application/pdf, Size: 27436 bytes --]

[-- Attachment #3: 8bit.tex --]
[-- Type: application/octet-stream, Size: 757 bytes --]

[-- Attachment #4: Type: text/plain, Size: 139 bytes --]

_______________________________________________
ntg-context mailing list
ntg-context@ntg.nl
http://www.ntg.nl/mailman/listinfo/ntg-context

^ permalink raw reply	[flat|nested] 27+ messages in thread
* Re: Problem with encoding, XML and writetolist...
@ 2006-04-02 21:20 Martin Kolařík
  0 siblings, 0 replies; 27+ messages in thread
From: Martin Kolařík @ 2006-04-02 21:20 UTC (permalink / raw)


Hi Hans,

the test file is attached,

thanks,

Martin

> -----Original Message-----
> From: ntg-context-bounces@ntg.nl [mailto:ntg-context-bounces@ntg.nl]On
> Behalf Of Hans Hagen
> Sent: Friday, March 31, 2006 7:01 PM
> To: martin.kolarik@email.cz; mailing list for ConTeXt users
> Subject: Re: [NTG-context] Problem with encoding, XML and writetolist...
>
>
> Martin Kolařík wrote:
> > Hi Hans,
> >
> > isn't there some news with the stuff?
> >
> small test file needed to refresh my memory
>
> Hans
>
> -----------------------------------------------------------------
>                                           Hans Hagen | PRAGMA ADE
>               Ridderstraat 27 | 8061 GH Hasselt | The Netherlands
>      tel: 038 477 53 69 | fax: 038 477 53 74 | www.pragma-ade.com
>                                              | www.pragma-pod.nl
> -----------------------------------------------------------------
>
> _______________________________________________
> ntg-context mailing list
> ntg-context@ntg.nl
> http://www.ntg.nl/mailman/listinfo/ntg-context
>

_______________________________________________
ntg-context mailing list
ntg-context@ntg.nl
http://www.ntg.nl/mailman/listinfo/ntg-context

^ permalink raw reply	[flat|nested] 27+ messages in thread
* Problem with encoding, XML and writetolist...
@ 2006-02-27 11:58 Martin Kolařík
  0 siblings, 0 replies; 27+ messages in thread
From: Martin Kolařík @ 2006-02-27 11:58 UTC (permalink / raw)


Hi Hans,

I have problem with chars encoding now; I have updated ConTeXt sometimes in
January; before the time I was using old (year) version. In this
prehistorical version the problem was not present.

What is the stuff? The following code generates content with titles without
spaces -- accented characters are replaced with \char XXX variant, so during
reading TUI line TeX gobbles a space after XXX as a part of <number> syntax
rule.

Can I do something? With encoding? With XML?

Thank you very much, have a nice day, Martin

Code:

\defineXMLenvironment
  [chapter]
  {\defineXMLsave[title]}
  {\expanded
     {\chapter\XMLflush{title}}}

\starttext

\startbuffer

<chapter>
  <title>Ahojá hojč kuna</title>
</chapter>

\stopbuffer

\processXMLbuffer

\completecontent[criterium=all]

\stoptext

^ permalink raw reply	[flat|nested] 27+ messages in thread

end of thread, other threads:[~2006-04-06  9:07 UTC | newest]

Thread overview: 27+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2006-02-28  7:24 Problem with encoding, XML and writetolist Martin Kolařík
2006-02-28  9:15 ` Hans Hagen
2006-02-28  9:33   ` Martin Kolařík
2006-03-31  8:42     ` Martin Kolařík
2006-03-31 17:01       ` Hans Hagen
2006-04-02 21:17         ` Martin Kolařík
2006-04-03  8:36           ` Hans Hagen
2006-04-03  8:57             ` Martin Kolařík
2006-04-03  9:56               ` Hans Hagen
2006-04-03 10:36                 ` Martin Kolařík
2006-04-03 12:33                   ` Mojca Miklavec
2006-04-03 12:39                     ` Martin Kolařík
2006-04-03 11:18                 ` Vit Zyka
2006-04-03 11:45                   ` Martin Kolařík
2006-04-04 20:18                 ` Martin Kolařík
2006-04-04 20:25                 ` Martin Kolařík
2006-04-05  8:17                   ` Hans Hagen
2006-04-05  9:54                   ` Vit Zyka
2006-04-05 11:03                     ` Hans Hagen
2006-04-05 12:18                       ` Vit Zyka
2006-04-05 14:09                         ` Hans Hagen
2006-04-03 14:25 ` Mojca Miklavec
2006-04-03 14:41   ` Martin Kolařík
  -- strict thread matches above, loose matches on Subject: below --
2006-04-05 11:59 Martin Kolařík
2006-04-06  9:07 ` Hans Hagen
2006-04-02 21:20 Martin Kolařík
2006-02-27 11:58 Martin Kolařík

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).