From: Hans Hagen via ntg-context <ntg-context@ntg.nl>
To: mailing list for ConTeXt users <ntg-context@ntg.nl>
Cc: Hans Hagen <j.hagen@xs4all.nl>
Subject: Re: lpeg pattern in function
Date: Sat, 7 Aug 2021 10:37:27 +0200 [thread overview]
Message-ID: <1f23a352-59e5-00d6-4610-c0982f1a4e62@xs4all.nl> (raw)
In-Reply-To: <CAK9ODgThOBeRVTS3FWKmvUHKSQc_TrPmjGRi-MJO4tY2_gLvyA@mail.gmail.com>
On 8/6/2021 10:58 PM, Marcus Vinicius Mesquita via ntg-context wrote:
> Dear list,
> in the mwe below, the expected result is ok for most entries but fails
> when the word contains the letters ó or ô.
> We get zoolco instead of zoológico, and termtro instead of termômetro.
> What am I doing wrong?
>
> mwe:
>
> \def\stripnumber#1%
> {\cldcontext{lpeg.match(lpeg.stripper("[¹²³⁴⁵⁶⁷⁸⁹⁰]"),
> [==[#1]==])}}
>
> \starttext
>
> \stripnumber{árbitro⁶}
> \stripnumber{ébano¹}
> \stripnumber{ícone⁸}
> \stripnumber{zoológico⁰}
> \stripnumber{eletroacústico⁹}
> \stripnumber{trânsfuga⁷}
> \stripnumber{farmacêutico¹}
> \stripnumber{maître²}
> \stripnumber{termômetro³}
> \stripnumber{noûs⁴}
>
> \stoptext
\def\stripnumber#1%
{\cldcontext{lpeg.match(lpeg.stripper(lpeg.US("¹²³⁴⁵⁶⁷⁸⁹⁰")),
[==[#1]==])}}
(US -> utf set)
or
\startluacode
local s = lpeg.stripper(lpeg.US("¹²³⁴⁵⁶⁷⁸⁹⁰"))
function document.StripNumber(str)
context(lpeg.match(s, str))
end
\stopluacode
\def\stripnumber#1{\ctxlua{document.StripNumber([==[#1]==])}}
or you can go fancy:
\startluacode
local p_strip = lpeg.stripper(lpeg.US("¹²³⁴⁵⁶⁷⁸⁹⁰"))
interfaces.implement {
name = "StripNumber",
public = true,
arguments = "string",
actions = function(str)
context(lpeg.match(p_strip, str))
end
}
\stopluacode
\StripNumber{zoológico⁰}
There are more efficient variants but i guess it's good enough.
Hans
-----------------------------------------------------------------
Hans Hagen | PRAGMA ADE
Ridderstraat 27 | 8061 GH Hasselt | The Netherlands
tel: 038 477 53 69 | www.pragma-ade.nl | www.pragma-pod.nl
-----------------------------------------------------------------
___________________________________________________________________________________
If your question is of interest to others as well, please add an entry to the Wiki!
maillist : ntg-context@ntg.nl / http://www.ntg.nl/mailman/listinfo/ntg-context
webpage : http://www.pragma-ade.nl / http://context.aanhet.net
archive : https://bitbucket.org/phg/context-mirror/commits/
wiki : http://contextgarden.net
___________________________________________________________________________________
next prev parent reply other threads:[~2021-08-07 8:37 UTC|newest]
Thread overview: 3+ messages / expand[flat|nested] mbox.gz Atom feed top
2021-08-06 20:58 Marcus Vinicius Mesquita via ntg-context
2021-08-07 8:37 ` Hans Hagen via ntg-context [this message]
2021-08-07 9:40 ` Marcus Vinicius Mesquita via ntg-context
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=1f23a352-59e5-00d6-4610-c0982f1a4e62@xs4all.nl \
--to=ntg-context@ntg.nl \
--cc=j.hagen@xs4all.nl \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).