From: Hans Hagen via ntg-context <ntg-context@ntg.nl>
To: ntg-context@ntg.nl
Cc: Hans Hagen <j.hagen@freedom.nl>
Subject: [NTG-context] Re: Copy PDF text without hyphenated words
Date: Fri, 14 Feb 2025 08:48:06 +0100 [thread overview]
Message-ID: <5baea646-dde9-427d-986d-96cabc58aeb2@freedom.nl> (raw)
In-Reply-To: <4988738.31r3eYUQgx@falbala>
On 2/14/2025 2:19 AM, Gerion Entrup wrote:
> Hi,
>
> I recently learned that Typst seems to be able to produce PDFs where a hyphenated text can be copied without the hyphenation (so all words in the copied text are not hyphenated).
> I seem to recall that the PDF format has an extra mode for this, where the creation program can embed some text that should only appear when copied and replace the word parts that are visible on the page.
>
> ConTeXt, in it's default mode, seems not to embed this text. When copying hyphenated words, the hyphenated word parts appear as distinct words (even without the hyphen). Is there a way to tell ConTeXt to produce PDF where the text can be copied without hyphenated words?
This is a fuzzy area and has always depended on how pdf viewers see
things. The standard has some suggestions and oenm is to use soft
hyphens which is what we do (can be turned off). From your description
it looks like actual text is used and in this case, although one can
make that work, to me it is not a solution, it not only polutes the page
stream, it also can interferes with other features and increases overhead.
When a viewer sees aoft hyphen it is assumed that it looks for the next
part of the word. Afaik acrobat reader can handle both variants. The
other (open source) viewers that I use are a mixed bag (in areas like
these).
Hans
-----------------------------------------------------------------
Hans Hagen | PRAGMA ADE
Ridderstraat 27 | 8061 GH Hasselt | The Netherlands
tel: 038 477 53 69 | www.pragma-ade.nl | www.pragma-pod.nl
-----------------------------------------------------------------
___________________________________________________________________________________
If your question is of interest to others as well, please add an entry to the Wiki!
maillist : ntg-context@ntg.nl / https://mailman.ntg.nl/mailman3/lists/ntg-context.ntg.nl
webpage : https://www.pragma-ade.nl / https://context.aanhet.net (mirror)
archive : https://github.com/contextgarden/context
wiki : https://wiki.contextgarden.net
___________________________________________________________________________________
next prev parent reply other threads:[~2025-02-14 7:53 UTC|newest]
Thread overview: 6+ messages / expand[flat|nested] mbox.gz Atom feed top
2025-02-14 1:19 [NTG-context] " Gerion Entrup
2025-02-14 7:48 ` Hans Hagen via ntg-context [this message]
2025-02-16 3:04 ` [NTG-context] " Gerion Entrup
2025-02-16 7:33 ` Marco Patzer
2025-02-16 8:06 ` Mikael Sundqvist
2025-02-17 11:44 ` Ulrike Fischer
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=5baea646-dde9-427d-986d-96cabc58aeb2@freedom.nl \
--to=ntg-context@ntg.nl \
--cc=j.hagen@freedom.nl \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).