From: Luca
Newsgroups: gmane.text.pandoc
Subject: Pandoc filter for (Word) index markers
Date: Thu, 27 May 2021 10:37:12 +0100
Message-ID: <18dc38e8-0adc-f198-0aff-3a557815a892@openbookpublishers.com>

Hi!

I would like to write a (Python) filter to handle index markers from an input docx document. However, I cannot find any such markers in pandoc's output. In this example, the word "test" is indexed in the Word file:

    $ pandoc -s -t native input.docx
    Pandoc (Meta {unMeta = fromList []})
    [Para [Str "This",Space,Str "is",Space,Str "a",Space,Str "test."]]

A (very) old post in this newsgroup suggests that this is not yet possible with pandoc (https://groups.google.com/g/pandoc-discuss/c/gnHmLmLCvUQ/m/PnCc1yEMAAAJ). Is this still the case, and are there any workarounds I could consider?

I am on pandoc 2.13.

Many thanks!!
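
For reference, here is a minimal sketch of one possible workaround that bypasses pandoc and reads the index markers straight from the docx file. It assumes that Word stores each index marker as an XE field code inside w:instrText elements of word/document.xml, bracketed by w:fldChar "begin" and "end" markers. The function names are hypothetical, nested fields are not handled, and only the main entry text (the first quoted string) is extracted, so subentry syntax and switches are left unparsed.

```python
import re
import zipfile
import xml.etree.ElementTree as ET

# WordprocessingML namespace used by word/document.xml
W_NS = "http://schemas.openxmlformats.org/wordprocessingml/2006/main"
INSTR_TEXT = f"{{{W_NS}}}instrText"
FLD_CHAR = f"{{{W_NS}}}fldChar"
FLD_CHAR_TYPE = f"{{{W_NS}}}fldCharType"

# An index marker field code looks like:  XE "entry text"
XE_PATTERN = re.compile(r'^\s*XE\s+"([^"]*)"')


def index_entries_from_xml(document_xml):
    """Return the quoted text of every XE field found in the XML, in order."""
    root = ET.fromstring(document_xml)
    entries = []
    buffer = None  # collects instrText pieces of the field being read
    for elem in root.iter():  # iterates in document order
        if elem.tag == FLD_CHAR:
            kind = elem.get(FLD_CHAR_TYPE)
            if kind == "begin":
                buffer = []
            elif kind in ("separate", "end") and buffer is not None:
                # Word may split one field code across several runs,
                # so the pieces are joined before matching.
                match = XE_PATTERN.match("".join(buffer))
                if match:
                    entries.append(match.group(1))
                buffer = None
        elif elem.tag == INSTR_TEXT and buffer is not None:
            buffer.append(elem.text or "")
    return entries


def index_entries_from_docx(path):
    """Open a .docx (a zip archive) and list its index marker entries."""
    with zipfile.ZipFile(path) as docx:
        return index_entries_from_xml(docx.read("word/document.xml"))
```

Calling index_entries_from_docx("input.docx") returns a plain list of entry strings. This only lists the markers; attaching them to positions in pandoc's AST would need further work.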