* lpeg pattern in function
@ 2021-08-06 20:58 Marcus Vinicius Mesquita via ntg-context
2021-08-07 8:37 ` Hans Hagen via ntg-context
0 siblings, 1 reply; 3+ messages in thread
From: Marcus Vinicius Mesquita via ntg-context @ 2021-08-06 20:58 UTC (permalink / raw)
To: mailing list for ConTeXt users; +Cc: Marcus Vinicius Mesquita
[-- Attachment #1.1: Type: text/plain, Size: 940 bytes --]
Dear list,
in the mwe below, the expected result is ok for most entries but fails when
the word contains the letters ó or ô.
We get zoolco instead of zoológico, and termtro instead of termômetro. What
am I doing wrong?
mwe:
\def\stripnumber#1%
{\cldcontext{lpeg.match(lpeg.stripper("[¹²³⁴⁵⁶⁷⁸⁹⁰]"), [==[#1]==])}}
\starttext
\stripnumber{árbitro⁶}
\stripnumber{ébano¹}
\stripnumber{ícone⁸}
\stripnumber{zoológico⁰}
\stripnumber{eletroacústico⁹}
\stripnumber{trânsfuga⁷}
\stripnumber{farmacêutico¹}
\stripnumber{maître²}
\stripnumber{termômetro³}
\stripnumber{noûs⁴}
\stoptext
--
Todas as coisas fatigam o corpo, salvo a música, que não fatiga nem o corpo
nem seus membros, por ser descanso da alma, primavera do coração, distração
do aflito, entretenimento do solitário, e viático do viajante.
Kunnâsh al-Hâ'ik (Cancioneiro de al-Hâ'ik)
[-- Attachment #1.2: Type: text/html, Size: 1344 bytes --]
[-- Attachment #2: Type: text/plain, Size: 493 bytes --]
___________________________________________________________________________________
If your question is of interest to others as well, please add an entry to the Wiki!
maillist : ntg-context@ntg.nl / http://www.ntg.nl/mailman/listinfo/ntg-context
webpage : http://www.pragma-ade.nl / http://context.aanhet.net
archive : https://bitbucket.org/phg/context-mirror/commits/
wiki : http://contextgarden.net
___________________________________________________________________________________
^ permalink raw reply [flat|nested] 3+ messages in thread
* Re: lpeg pattern in function
2021-08-06 20:58 lpeg pattern in function Marcus Vinicius Mesquita via ntg-context
@ 2021-08-07 8:37 ` Hans Hagen via ntg-context
2021-08-07 9:40 ` Marcus Vinicius Mesquita via ntg-context
0 siblings, 1 reply; 3+ messages in thread
From: Hans Hagen via ntg-context @ 2021-08-07 8:37 UTC (permalink / raw)
To: mailing list for ConTeXt users; +Cc: Hans Hagen
On 8/6/2021 10:58 PM, Marcus Vinicius Mesquita via ntg-context wrote:
> Dear list,
> in the mwe below, the expected result is ok for most entries but fails
> when the word contains the letters ó or ô.
> We get zoolco instead of zoológico, and termtro instead of termômetro.
> What am I doing wrong?
>
> mwe:
>
> \def\stripnumber#1%
> {\cldcontext{lpeg.match(lpeg.stripper("[¹²³⁴⁵⁶⁷⁸⁹⁰]"),
> [==[#1]==])}}
>
> \starttext
>
> \stripnumber{árbitro⁶}
> \stripnumber{ébano¹}
> \stripnumber{ícone⁸}
> \stripnumber{zoológico⁰}
> \stripnumber{eletroacústico⁹}
> \stripnumber{trânsfuga⁷}
> \stripnumber{farmacêutico¹}
> \stripnumber{maître²}
> \stripnumber{termômetro³}
> \stripnumber{noûs⁴}
>
> \stoptext
\def\stripnumber#1%
{\cldcontext{lpeg.match(lpeg.stripper(lpeg.US("¹²³⁴⁵⁶⁷⁸⁹⁰")),
[==[#1]==])}}
(US -> utf set)
or
\startluacode
local s = lpeg.stripper(lpeg.US("¹²³⁴⁵⁶⁷⁸⁹⁰"))
function document.StripNumber(str)
context(lpeg.match(s, str))
end
\stopluacode
\def\stripnumber#1{\ctxlua{document.StripNumber([==[#1]==])}}
or you can go fancy:
\startluacode
local p_strip = lpeg.stripper(lpeg.US("¹²³⁴⁵⁶⁷⁸⁹⁰"))
interfaces.implement {
name = "StripNumber",
public = true,
arguments = "string",
actions = function(str)
context(lpeg.match(p_strip, str))
end
}
\stopluacode
\StripNumber{zoológico⁰}
There are more efficient variants but i guess it's good enough.
Hans
-----------------------------------------------------------------
Hans Hagen | PRAGMA ADE
Ridderstraat 27 | 8061 GH Hasselt | The Netherlands
tel: 038 477 53 69 | www.pragma-ade.nl | www.pragma-pod.nl
-----------------------------------------------------------------
___________________________________________________________________________________
If your question is of interest to others as well, please add an entry to the Wiki!
maillist : ntg-context@ntg.nl / http://www.ntg.nl/mailman/listinfo/ntg-context
webpage : http://www.pragma-ade.nl / http://context.aanhet.net
archive : https://bitbucket.org/phg/context-mirror/commits/
wiki : http://contextgarden.net
___________________________________________________________________________________
^ permalink raw reply [flat|nested] 3+ messages in thread
* Re: lpeg pattern in function
2021-08-07 8:37 ` Hans Hagen via ntg-context
@ 2021-08-07 9:40 ` Marcus Vinicius Mesquita via ntg-context
0 siblings, 0 replies; 3+ messages in thread
From: Marcus Vinicius Mesquita via ntg-context @ 2021-08-07 9:40 UTC (permalink / raw)
To: Hans Hagen; +Cc: Marcus Vinicius Mesquita, mailing list for ConTeXt users
[-- Attachment #1.1: Type: text/plain, Size: 2556 bytes --]
Thank you, Hans. Very nice indeed your solutions.
Marcus Vinicius
On Sat, Aug 7, 2021 at 5:37 AM Hans Hagen <j.hagen@xs4all.nl> wrote:
> On 8/6/2021 10:58 PM, Marcus Vinicius Mesquita via ntg-context wrote:
> > Dear list,
> > in the mwe below, the expected result is ok for most entries but fails
> > when the word contains the letters ó or ô.
> > We get zoolco instead of zoológico, and termtro instead of termômetro.
> > What am I doing wrong?
> >
> > mwe:
> >
> > \def\stripnumber#1%
> > {\cldcontext{lpeg.match(lpeg.stripper("[¹²³⁴⁵⁶⁷⁸⁹⁰]"),
> > [==[#1]==])}}
> >
> > \starttext
> >
> > \stripnumber{árbitro⁶}
> > \stripnumber{ébano¹}
> > \stripnumber{ícone⁸}
> > \stripnumber{zoológico⁰}
> > \stripnumber{eletroacústico⁹}
> > \stripnumber{trânsfuga⁷}
> > \stripnumber{farmacêutico¹}
> > \stripnumber{maître²}
> > \stripnumber{termômetro³}
> > \stripnumber{noûs⁴}
> >
> > \stoptext
>
>
> \def\stripnumber#1%
> {\cldcontext{lpeg.match(lpeg.stripper(lpeg.US("¹²³⁴⁵⁶⁷⁸⁹⁰")),
> [==[#1]==])}}
>
> (US -> utf set)
>
> or
>
> \startluacode
> local s = lpeg.stripper(lpeg.US("¹²³⁴⁵⁶⁷⁸⁹⁰"))
> function document.StripNumber(str)
> context(lpeg.match(s, str))
> end
> \stopluacode
>
> \def\stripnumber#1{\ctxlua{document.StripNumber([==[#1]==])}}
>
> or you can go fancy:
>
> \startluacode
> local p_strip = lpeg.stripper(lpeg.US("¹²³⁴⁵⁶⁷⁸⁹⁰"))
>
> interfaces.implement {
> name = "StripNumber",
> public = true,
> arguments = "string",
> actions = function(str)
> context(lpeg.match(p_strip, str))
> end
> }
> \stopluacode
>
> \StripNumber{zoológico⁰}
>
> There are more efficient variants but i guess it's good enough.
>
> Hans
>
> -----------------------------------------------------------------
> Hans Hagen | PRAGMA ADE
> Ridderstraat 27 | 8061 GH Hasselt | The Netherlands
> tel: 038 477 53 69 | www.pragma-ade.nl | www.pragma-pod.nl
> -----------------------------------------------------------------
>
--
Todas as coisas fatigam o corpo, salvo a música, que não fatiga nem o corpo
nem seus membros, por ser descanso da alma, primavera do coração, distração
do aflito, entretenimento do solitário, e viático do viajante.
Kunnâsh al-Hâ'ik (Cancioneiro de al-Hâ'ik)
[-- Attachment #1.2: Type: text/html, Size: 3502 bytes --]
[-- Attachment #2: Type: text/plain, Size: 493 bytes --]
___________________________________________________________________________________
If your question is of interest to others as well, please add an entry to the Wiki!
maillist : ntg-context@ntg.nl / http://www.ntg.nl/mailman/listinfo/ntg-context
webpage : http://www.pragma-ade.nl / http://context.aanhet.net
archive : https://bitbucket.org/phg/context-mirror/commits/
wiki : http://contextgarden.net
___________________________________________________________________________________
^ permalink raw reply [flat|nested] 3+ messages in thread
end of thread, other threads:[~2021-08-07 9:40 UTC | newest]
Thread overview: 3+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2021-08-06 20:58 lpeg pattern in function Marcus Vinicius Mesquita via ntg-context
2021-08-07 8:37 ` Hans Hagen via ntg-context
2021-08-07 9:40 ` Marcus Vinicius Mesquita via ntg-context
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).