ntg-context - mailing list for ConTeXt users
From: Marcus Vinicius Mesquita via ntg-context <ntg-context@ntg.nl>
To: Hans Hagen <j.hagen@xs4all.nl>
Cc: Marcus Vinicius Mesquita <marcusvinicius.mesquita@gmail.com>,
	mailing list for ConTeXt users <ntg-context@ntg.nl>
Subject: Re: lpeg pattern in function
Date: Sat, 7 Aug 2021 06:40:46 -0300
Message-ID: <CAK9ODgRPjZQ3wWtnW+2+2EoGRDCJT0aXB7zuFYQT1=-W7QUuOQ@mail.gmail.com>
In-Reply-To: <1f23a352-59e5-00d6-4610-c0982f1a4e62@xs4all.nl>


Thank you, Hans. Your solutions are very nice indeed.

Marcus Vinicius

On Sat, Aug 7, 2021 at 5:37 AM Hans Hagen <j.hagen@xs4all.nl> wrote:

> On 8/6/2021 10:58 PM, Marcus Vinicius Mesquita via ntg-context wrote:
> > Dear list,
> > in the mwe below, the expected result is ok for most entries but fails
> > when the word contains the letters ó or ô.
> > We get zoolco instead of zoológico, and termtro instead of termômetro.
> > What am I doing wrong?
> >
> > mwe:
> >
> > \def\stripnumber#1%
> >          {\cldcontext{lpeg.match(lpeg.stripper("[¹²³⁴⁵⁶⁷⁸⁹⁰]"),
> > [==[#1]==])}}
> >
> > \starttext
> >
> > \stripnumber{árbitro⁶}
> > \stripnumber{ébano¹}
> > \stripnumber{ícone⁸}
> > \stripnumber{zoológico⁰}
> > \stripnumber{eletroacústico⁹}
> > \stripnumber{trânsfuga⁷}
> > \stripnumber{farmacêutico¹}
> > \stripnumber{maître²}
> > \stripnumber{termômetro³}
> > \stripnumber{noûs⁴}
> >
> > \stoptext
>
>
> \def\stripnumber#1%
>    {\cldcontext{lpeg.match(lpeg.stripper(lpeg.US("¹²³⁴⁵⁶⁷⁸⁹⁰")),
> [==[#1]==])}}
>
> (US -> utf set)
>
> or
>
> \startluacode
>      local s = lpeg.stripper(lpeg.US("¹²³⁴⁵⁶⁷⁸⁹⁰"))
>      function document.StripNumber(str)
>          context(lpeg.match(s, str))
>      end
> \stopluacode
>
> \def\stripnumber#1{\ctxlua{document.StripNumber([==[#1]==])}}
>
> or you can go fancy:
>
> \startluacode
>      local p_strip = lpeg.stripper(lpeg.US("¹²³⁴⁵⁶⁷⁸⁹⁰"))
>
>      interfaces.implement {
>          name      = "StripNumber",
>          public    = true,
>          arguments = "string",
>          actions   = function(str)
>              context(lpeg.match(p_strip, str))
>          end
>      }
> \stopluacode
>
> \StripNumber{zoológico⁰}
>
> There are more efficient variants but I guess it's good enough.
>
> Hans
>
> -----------------------------------------------------------------
>                                            Hans Hagen | PRAGMA ADE
>                Ridderstraat 27 | 8061 GH Hasselt | The Netherlands
>         tel: 038 477 53 69 | www.pragma-ade.nl | www.pragma-pod.nl
> -----------------------------------------------------------------
>
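
For the archive, a note on why only ó and ô broke. The following is a minimal
sketch in plain Lua 5.3+ (no ConTeXt helpers), assuming that passing a plain
string to lpeg.stripper makes it behave like a byte set, which is what the
observed output suggests. The helper names (shared_bytes, strip_bytes,
strip_chars) are made up for this illustration and are not part of ConTeXt.
In UTF-8, ³ is C2 B3 and ⁴ is E2 81 B4, while ó is C3 B3 and ô is C3 B4, so a
byte-level set removes the second byte of ó and ô and leaves a dangling C3.

```lua
-- Assumes this source file is saved as UTF-8 and run with Lua 5.3 or later.
local superscripts = "¹²³⁴⁵⁶⁷⁸⁹⁰"

-- Byte view: every single byte occurring in the superscript string.
local byteset = {}
for i = 1, #superscripts do
    byteset[superscripts:byte(i)] = true
end

-- Character view: every whole UTF-8 character in the superscript string.
local charset = {}
for _, codepoint in utf8.codes(superscripts) do
    charset[utf8.char(codepoint)] = true
end

-- List (in hex) the bytes of `char` that also occur in the superscripts.
local function shared_bytes(char)
    local hits = {}
    for i = 1, #char do
        local b = char:byte(i)
        if byteset[b] then
            hits[#hits + 1] = string.format("%02X", b)
        end
    end
    return table.concat(hits, " ")
end

-- Byte-wise stripping: drops any byte found in the byte set (the faulty way).
local function strip_bytes(str)
    local kept = {}
    for i = 1, #str do
        local b = str:byte(i)
        if not byteset[b] then
            kept[#kept + 1] = string.char(b)
        end
    end
    return table.concat(kept)
end

-- Character-wise stripping: drops only whole superscript characters.
local function strip_chars(str)
    local kept = {}
    for _, codepoint in utf8.codes(str) do
        local c = utf8.char(codepoint)
        if not charset[c] then
            kept[#kept + 1] = c
        end
    end
    return table.concat(kept)
end

print(shared_bytes("ó"))           -- B3
print(shared_bytes("á"))           -- (empty line: no shared bytes)
print(strip_chars("zoológico⁰"))   -- zoológico
```

Here strip_bytes("zoológico⁰") returns a string that is no longer valid
UTF-8, which is presumably why the typeset result was mangled. Working on
whole characters, as lpeg.US does in Hans's versions, avoids the overlap.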


-- 
All things tire the body, except music, which tires neither the body
nor its limbs, for it is rest for the soul, springtime of the heart, distraction
for the afflicted, entertainment for the solitary, and provision for the traveler.

Kunnâsh al-Hâ'ik (Songbook of al-Hâ'ik)


Thread overview: 3+ messages
2021-08-06 20:58 Marcus Vinicius Mesquita via ntg-context
2021-08-07  8:37 ` Hans Hagen via ntg-context
2021-08-07  9:40   ` Marcus Vinicius Mesquita via ntg-context [this message]
