From: Hans Hagen via ntg-context <ntg-context@ntg.nl>
To: mailing list for ConTeXt users <ntg-context@ntg.nl>
Cc: Hans Hagen <j.hagen@xs4all.nl>
Subject: Re: lpeg pattern in function
Date: Sat, 7 Aug 2021 10:37:27 +0200
Message-ID: <1f23a352-59e5-00d6-4610-c0982f1a4e62@xs4all.nl>
In-Reply-To: <CAK9ODgThOBeRVTS3FWKmvUHKSQc_TrPmjGRi-MJO4tY2_gLvyA@mail.gmail.com>

On 8/6/2021 10:58 PM, Marcus Vinicius Mesquita via ntg-context wrote:
> Dear list,
> in the mwe below, the expected result is ok for most entries but fails 
> when the word contains the letters ó or ô.
> We get zoolco instead of zoológico, and termtro instead of termômetro. 
> What am I doing wrong?
> 
> mwe:
> 
> \def\stripnumber#1%
>          {\cldcontext{lpeg.match(lpeg.stripper("[¹²³⁴⁵⁶⁷⁸⁹⁰]"), 
> [==[#1]==])}}
> 
> \starttext
> 
> \stripnumber{árbitro⁶}
> \stripnumber{ébano¹}
> \stripnumber{ícone⁸}
> \stripnumber{zoológico⁰}
> \stripnumber{eletroacústico⁹}
> \stripnumber{trânsfuga⁷}
> \stripnumber{farmacêutico¹}
> \stripnumber{maître²}
> \stripnumber{termômetro³}
> \stripnumber{noûs⁴}
> 
> \stoptext


\def\stripnumber#1%
   {\cldcontext{lpeg.match(lpeg.stripper(lpeg.US("¹²³⁴⁵⁶⁷⁸⁹⁰")), [==[#1]==])}}

(US -> utf set, i.e. a set of whole UTF-8 characters rather than of single bytes)
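
To illustrate why the distinction matters, here is a minimal sketch in plain Lua 5.3+ (no ConTeXt helpers, no lpeg). It assumes, as far as I can tell from the behaviour, that a plain string handed to lpeg.stripper ends up as a set of single bytes. The names strip_bytes and strip_chars are made up for this example; they are not ConTeXt functions.

```lua
-- Illustrative sketch only: plain Lua 5.3+, hypothetical helper names.
local superscripts = "¹²³⁴⁵⁶⁷⁸⁹⁰"

-- Byte-level view: every single byte of the superscripts joins the set.
-- ³ is C2 B3 and ⁴ is E2 81 B4 in UTF-8, so B3 and B4 end up in it.
local byteset = {}
for i = 1, #superscripts do
    byteset[superscripts:byte(i)] = true
end

local function strip_bytes(str)
    -- "." matches one byte at a time; returning "" drops it,
    -- returning nothing keeps the original byte
    return (str:gsub(".", function(c)
        if byteset[c:byte()] then return "" end
    end))
end

-- Character-level view: only complete UTF-8 characters join the set.
local charset = {}
for _, codepoint in utf8.codes(superscripts) do
    charset[utf8.char(codepoint)] = true
end

local function strip_chars(str)
    -- utf8.charpattern matches one whole UTF-8 character at a time
    return (str:gsub(utf8.charpattern, function(c)
        if charset[c] then return "" end
    end))
end

-- ó is C3 B3: the byte-level version removes B3 and leaves a stray C3
print(strip_bytes("zoológico⁰") == "zool\195gico")  --> true
print(strip_chars("zoológico⁰"))                     --> zoológico
```

The other accented letters in the example (á, é, í, ú, â, ê, û, î) have second bytes that do not occur in any of the superscripts, which would explain why only ó and ô break. Inside ConTeXt the lpeg.US call above takes care of this.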

or define the pattern once on the Lua side:

\startluacode
     local s = lpeg.stripper(lpeg.US("¹²³⁴⁵⁶⁷⁸⁹⁰"))
     function document.StripNumber(str)
         context(lpeg.match(s, str))
     end
\stopluacode

\def\stripnumber#1{\ctxlua{document.StripNumber([==[#1]==])}}

or you can go fancy:

\startluacode
     local p_strip = lpeg.stripper(lpeg.US("¹²³⁴⁵⁶⁷⁸⁹⁰"))

     interfaces.implement {
         name      = "StripNumber",
         public    = true,
         arguments = "string",
         actions   = function(str)
             context(lpeg.match(p_strip, str))
         end
     }
\stopluacode

\StripNumber{zoológico⁰}

There are more efficient variants, but I guess this is good enough.

Hans

-----------------------------------------------------------------
                                           Hans Hagen | PRAGMA ADE
               Ridderstraat 27 | 8061 GH Hasselt | The Netherlands
        tel: 038 477 53 69 | www.pragma-ade.nl | www.pragma-pod.nl
-----------------------------------------------------------------