ntg-context - mailing list for ConTeXt users
 help / color / mirror / Atom feed
* lpeg pattern in function
@ 2021-08-06 20:58 Marcus Vinicius Mesquita via ntg-context
  2021-08-07  8:37 ` Hans Hagen via ntg-context
  0 siblings, 1 reply; 3+ messages in thread
From: Marcus Vinicius Mesquita via ntg-context @ 2021-08-06 20:58 UTC (permalink / raw)
  To: mailing list for ConTeXt users; +Cc: Marcus Vinicius Mesquita


[-- Attachment #1.1: Type: text/plain, Size: 940 bytes --]

Dear list,
in the mwe below, the expected result is ok for most entries but fails when
the word contains the letters ó or ô.
We get zoolco instead of zoológico, and termtro instead of termômetro. What
am I doing wrong?

mwe:

\def\stripnumber#1%
        {\cldcontext{lpeg.match(lpeg.stripper("[¹²³⁴⁵⁶⁷⁸⁹⁰]"), [==[#1]==])}}

\starttext

\stripnumber{árbitro⁶}
\stripnumber{ébano¹}
\stripnumber{ícone⁸}
\stripnumber{zoológico⁰}
\stripnumber{eletroacústico⁹}
\stripnumber{trânsfuga⁷}
\stripnumber{farmacêutico¹}
\stripnumber{maître²}
\stripnumber{termômetro³}
\stripnumber{noûs⁴}

\stoptext
-- 
Todas as coisas fatigam o corpo, salvo a música, que não fatiga nem o corpo
nem seus membros, por ser descanso da alma, primavera do coração, distração
do aflito, entretenimento do solitário, e viático do viajante.

Kunnâsh al-Hâ'ik (Cancioneiro de al-Hâ'ik)

[-- Attachment #1.2: Type: text/html, Size: 1344 bytes --]

[-- Attachment #2: Type: text/plain, Size: 493 bytes --]

___________________________________________________________________________________
If your question is of interest to others as well, please add an entry to the Wiki!

maillist : ntg-context@ntg.nl / http://www.ntg.nl/mailman/listinfo/ntg-context
webpage  : http://www.pragma-ade.nl / http://context.aanhet.net
archive  : https://bitbucket.org/phg/context-mirror/commits/
wiki     : http://contextgarden.net
___________________________________________________________________________________

^ permalink raw reply	[flat|nested] 3+ messages in thread

* Re: lpeg pattern in function
  2021-08-06 20:58 lpeg pattern in function Marcus Vinicius Mesquita via ntg-context
@ 2021-08-07  8:37 ` Hans Hagen via ntg-context
  2021-08-07  9:40   ` Marcus Vinicius Mesquita via ntg-context
  0 siblings, 1 reply; 3+ messages in thread
From: Hans Hagen via ntg-context @ 2021-08-07  8:37 UTC (permalink / raw)
  To: mailing list for ConTeXt users; +Cc: Hans Hagen

On 8/6/2021 10:58 PM, Marcus Vinicius Mesquita via ntg-context wrote:
> Dear list,
> in the mwe below, the expected result is ok for most entries but fails 
> when the word contains the letters ó or ô.
> We get zoolco instead of zoológico, and termtro instead of termômetro. 
> What am I doing wrong?
> 
> mwe:
> 
> \def\stripnumber#1%
>          {\cldcontext{lpeg.match(lpeg.stripper("[¹²³⁴⁵⁶⁷⁸⁹⁰]"), 
> [==[#1]==])}}
> 
> \starttext
> 
> \stripnumber{árbitro⁶}
> \stripnumber{ébano¹}
> \stripnumber{ícone⁸}
> \stripnumber{zoológico⁰}
> \stripnumber{eletroacústico⁹}
> \stripnumber{trânsfuga⁷}
> \stripnumber{farmacêutico¹}
> \stripnumber{maître²}
> \stripnumber{termômetro³}
> \stripnumber{noûs⁴}
> 
> \stoptext


\def\stripnumber#1%
   {\cldcontext{lpeg.match(lpeg.stripper(lpeg.US("¹²³⁴⁵⁶⁷⁸⁹⁰")), 
[==[#1]==])}}

(US -> utf set) 	

or

\startluacode
     local s = lpeg.stripper(lpeg.US("¹²³⁴⁵⁶⁷⁸⁹⁰"))
     function document.StripNumber(str)
         context(lpeg.match(s, str))
     end
\stopluacode

\def\stripnumber#1{\ctxlua{document.StripNumber([==[#1]==])}}

or you can go fancy:

\startluacode
     local p_strip = lpeg.stripper(lpeg.US("¹²³⁴⁵⁶⁷⁸⁹⁰"))

     interfaces.implement {
         name      = "StripNumber",
         public    = true,
         arguments = "string",
         actions   = function(str)
             context(lpeg.match(p_strip, str))
         end
     }
\stopluacode

\StripNumber{zoológico⁰}

There are more efficient variants but i guess it's good enough.

Hans

-----------------------------------------------------------------
                                           Hans Hagen | PRAGMA ADE
               Ridderstraat 27 | 8061 GH Hasselt | The Netherlands
        tel: 038 477 53 69 | www.pragma-ade.nl | www.pragma-pod.nl
-----------------------------------------------------------------
___________________________________________________________________________________
If your question is of interest to others as well, please add an entry to the Wiki!

maillist : ntg-context@ntg.nl / http://www.ntg.nl/mailman/listinfo/ntg-context
webpage  : http://www.pragma-ade.nl / http://context.aanhet.net
archive  : https://bitbucket.org/phg/context-mirror/commits/
wiki     : http://contextgarden.net
___________________________________________________________________________________

^ permalink raw reply	[flat|nested] 3+ messages in thread

* Re: lpeg pattern in function
  2021-08-07  8:37 ` Hans Hagen via ntg-context
@ 2021-08-07  9:40   ` Marcus Vinicius Mesquita via ntg-context
  0 siblings, 0 replies; 3+ messages in thread
From: Marcus Vinicius Mesquita via ntg-context @ 2021-08-07  9:40 UTC (permalink / raw)
  To: Hans Hagen; +Cc: Marcus Vinicius Mesquita, mailing list for ConTeXt users


[-- Attachment #1.1: Type: text/plain, Size: 2556 bytes --]

Thank you, Hans. Very nice indeed your solutions.

Marcus Vinicius

On Sat, Aug 7, 2021 at 5:37 AM Hans Hagen <j.hagen@xs4all.nl> wrote:

> On 8/6/2021 10:58 PM, Marcus Vinicius Mesquita via ntg-context wrote:
> > Dear list,
> > in the mwe below, the expected result is ok for most entries but fails
> > when the word contains the letters ó or ô.
> > We get zoolco instead of zoológico, and termtro instead of termômetro.
> > What am I doing wrong?
> >
> > mwe:
> >
> > \def\stripnumber#1%
> >          {\cldcontext{lpeg.match(lpeg.stripper("[¹²³⁴⁵⁶⁷⁸⁹⁰]"),
> > [==[#1]==])}}
> >
> > \starttext
> >
> > \stripnumber{árbitro⁶}
> > \stripnumber{ébano¹}
> > \stripnumber{ícone⁸}
> > \stripnumber{zoológico⁰}
> > \stripnumber{eletroacústico⁹}
> > \stripnumber{trânsfuga⁷}
> > \stripnumber{farmacêutico¹}
> > \stripnumber{maître²}
> > \stripnumber{termômetro³}
> > \stripnumber{noûs⁴}
> >
> > \stoptext
>
>
> \def\stripnumber#1%
>    {\cldcontext{lpeg.match(lpeg.stripper(lpeg.US("¹²³⁴⁵⁶⁷⁸⁹⁰")),
> [==[#1]==])}}
>
> (US -> utf set)
>
> or
>
> \startluacode
>      local s = lpeg.stripper(lpeg.US("¹²³⁴⁵⁶⁷⁸⁹⁰"))
>      function document.StripNumber(str)
>          context(lpeg.match(s, str))
>      end
> \stopluacode
>
> \def\stripnumber#1{\ctxlua{document.StripNumber([==[#1]==])}}
>
> or you can go fancy:
>
> \startluacode
>      local p_strip = lpeg.stripper(lpeg.US("¹²³⁴⁵⁶⁷⁸⁹⁰"))
>
>      interfaces.implement {
>          name      = "StripNumber",
>          public    = true,
>          arguments = "string",
>          actions   = function(str)
>              context(lpeg.match(p_strip, str))
>          end
>      }
> \stopluacode
>
> \StripNumber{zoológico⁰}
>
> There are more efficient variants but i guess it's good enough.
>
> Hans
>
> -----------------------------------------------------------------
>                                            Hans Hagen | PRAGMA ADE
>                Ridderstraat 27 | 8061 GH Hasselt | The Netherlands
>         tel: 038 477 53 69 | www.pragma-ade.nl | www.pragma-pod.nl
> -----------------------------------------------------------------
>


-- 
Todas as coisas fatigam o corpo, salvo a música, que não fatiga nem o corpo
nem seus membros, por ser descanso da alma, primavera do coração, distração
do aflito, entretenimento do solitário, e viático do viajante.

Kunnâsh al-Hâ'ik (Cancioneiro de al-Hâ'ik)

[-- Attachment #1.2: Type: text/html, Size: 3502 bytes --]

[-- Attachment #2: Type: text/plain, Size: 493 bytes --]

___________________________________________________________________________________
If your question is of interest to others as well, please add an entry to the Wiki!

maillist : ntg-context@ntg.nl / http://www.ntg.nl/mailman/listinfo/ntg-context
webpage  : http://www.pragma-ade.nl / http://context.aanhet.net
archive  : https://bitbucket.org/phg/context-mirror/commits/
wiki     : http://contextgarden.net
___________________________________________________________________________________

^ permalink raw reply	[flat|nested] 3+ messages in thread

end of thread, other threads:[~2021-08-07  9:40 UTC | newest]

Thread overview: 3+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2021-08-06 20:58 lpeg pattern in function Marcus Vinicius Mesquita via ntg-context
2021-08-07  8:37 ` Hans Hagen via ntg-context
2021-08-07  9:40   ` Marcus Vinicius Mesquita via ntg-context

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).