From mboxrd@z Thu Jan 1 00:00:00 1970 X-Msuck: nntp://news.gmane.io/gmane.text.pandoc/28830 Path: news.gmane.io!.POSTED.blaine.gmane.org!not-for-mail From: "'Jesse Rosenthal' via pandoc-discuss" Newsgroups: gmane.text.pandoc Subject: Re: docx parsing bug: nested fldChar fields are interpreted incorrectly Date: Thu, 15 Jul 2021 14:46:56 +0000 Message-ID: References: <2f5489af-f5a9-4ea4-9155-9f85c4808756n@googlegroups.com> <9bdb337d-fa68-4c66-8f5c-d4fa81547953n@googlegroups.com> ,<24273fbf-2ce9-4c26-886b-50d504cb7b05n@googlegroups.com> Reply-To: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable Injection-Info: ciao.gmane.io; posting-host="blaine.gmane.org:116.202.254.214"; logging-data="9589"; mail-complaints-to="usenet@ciao.gmane.io" To: pandoc-discuss Original-X-From: pandoc-discuss+bncBDF7DMU574PBB5MUYGDQMGQEAUTTACY-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Thu Jul 15 16:47:21 2021 Return-path: Envelope-to: gtp-pandoc-discuss@m.gmane-mx.org Original-Received: from mail-pj1-f59.google.com ([209.85.216.59]) by ciao.gmane.io with esmtps (TLS1.3:ECDHE_RSA_AES_128_GCM_SHA256:128) (Exim 4.92) (envelope-from ) id 1m42e0-0002Bw-AX for gtp-pandoc-discuss@m.gmane-mx.org; Thu, 15 Jul 2021 16:47:20 +0200 Original-Received: by mail-pj1-f59.google.com with SMTP id j24-20020a17090a8418b02901756faa6840sf3689982pjn.2 for ; Thu, 15 Jul 2021 07:47:20 -0700 (PDT) ARC-Seal: i=3; a=rsa-sha256; t=1626360439; cv=pass; d=google.com; s=arc-20160816; b=MLp7rMg5TwFp7yXKrxT42IlUocndEDOE3xUfL2HaPUj1NvAqGSlu+tjJ1CISaj17/S O2nqJsZJECTeLxHYKWumFOKkLrAu0GEPTDCsQbE4W6JQr0FLjLMpxHeT5LdrZ1glti9i 2DO/bMWkVoGcAwyW6RNGl/D+i+EeD2WjTHYgoKEWw6Wh5UKQQmExu+Bvs9TRwfvOsR9D jDQic/UEDqpLCq4r0Smm7HNP5pDaUVMebKcAHT6nu8jQzxQvh1k1+DZ2bfOK9Bvx5u0A E7hPIDa3M3UKhkEyXf05YjF3ElHlZarTMUnRnvnPQQs1auJn3DxxlKtYhf/Xst7eZ8+5 EjAQ== ARC-Message-Signature: i=3; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-unsubscribe:list-subscribe:list-archive:list-help:list-post :list-id:mailing-list:precedence:reply-to:mime-version :content-transfer-encoding:content-language:accept-language :in-reply-to:references:message-id:date:thread-index:thread-topic :subject:to:from:ironport-hdrordr:ironport-phdr:ironport-sdr :dkim-signature; bh=C2L0K5JMJOvhEq1xYLGBRzSyCPn9GoraFSd6Sfv4By4=; b=hE8y26bpG8WQVcYMRjzvCFJsUjdrStI+Xy880mzKErYKlfYGTuCcibAJ8cMPaeSbxW KsUnmyy6iQYVb7rC4D2riwCvuHNxGiiglEkHzA9oAvNUv12a9JDLbkzwbYXC8tljoQIp 5rRtb9BkAMJ3e2V8TB4D+3TJpRlGAXDynTUFqdJsoLPi1PxQdUe3oe/J4YBez9KGTrfv 6438bkQLshsqL5Xf3ZPJ3WVt9wTc0QVRcwPiMNc0MUoriiu9UrhCTlArNR3IaBFJyRkI C6LpMbKDh/unX7/PMAeEI7qdBZ2jiLGvpx5ZdyFaIRah2FVFz+YIV ARC-Authentication-Results: i=3; gmr-mx.google.com; dkim=pass header.i=@jhu.edu header.s=jhuiron header.b=MqjcUvD1; dkim=pass header.i=@livejohnshopkins.onmicrosoft.com header.s=selector2-livejohnshopkins-onmicrosoft-com header.b=FQ4HUaCY; arc=pass (i=1 spf=pass spfdomain=jhu.edu dkim=pass dkdomain=jhu.edu dmarc=pass fromdomain=jhu.edu); spf=pass (google.com: domain of prvs=82387b5ca=jrosenthal-4GNroTWusrE@public.gmane.org designates 162.129.199.180 as permitted sender) smtp.mailfrom="prvs=82387b5ca=jrosenthal-4GNroTWusrE@public.gmane.org"; dmarc=pass (p=QUARANTINE sp=QUARANTINE dis=NONE) header.from=jhu.edu DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=googlegroups.com; s=20161025; h=ironport-sdr:ironport-phdr:ironport-hdrordr:from:to:subject :thread-topic:thread-index:date:message-id:references:in-reply-to :accept-language:content-language:content-transfer-encoding :mime-version:x-original-sender:x-original-authentication-results :reply-to:precedence:mailing-list:list-id:list-post:list-help :list-archive:list-subscribe:list-unsubscribe; bh=C2L0K5JMJOvhEq1xYLGBRzSyCPn9GoraFSd6Sfv4By4=; b=mKkgFMMU3fLLCtVXwC0MGlbR/K4cWXMptrf3ZgLpn4ZC4YB4S9dHeBgWNZ9Jtp1lQX D0hIZ2rqIys5JmE4hDV/TrNmL2OCFsXJjOq3HtuNUA6BHihlIdCTFD9FnZ+Zj9b9fVj8 tjaKnvSGrUFkkJTlGV6sPpo69MXTV1RQP1t+fGaIdE9fTN423njXz+yTgxu+2bwJtgNX BiTN0SchW36vvhKJT4/Lhps0GNcQ/rV3B/yGnQXa614dw9HuD9BKk/HMT2uN0EvEZnQi MkQW6yZC+0WmL X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:ironport-sdr:ironport-phdr:ironport-hdrordr:from :to:subject:thread-topic:thread-index:date:message-id:references :in-reply-to:accept-language:content-language :content-transfer-encoding:mime-version:x-original-sender :x-original-authentication-results:reply-to:precedence:mailing-list :list-id:x-spam-checked-in-group:list-post:list-help:list-archive :list-subscribe:list-unsubscribe; bh=C2L0K5JMJOvhEq1xYLGBRzSyCPn9GoraFSd6Sfv4By4=; b=cxsZUB2B6YLYmw6RBUYxon7WFlVZrxrT1ZvP99K7lGWE9VxvzBYsE1Zzih4YLxCI9h mtUSsScNovZ5y/4wxKEcihBYUsDVX0RKKHH5Vip9SgbqooYb1pDrZxgjyPMGoMyX4Zvn VoBQMvqTR3fLcA+p0QqtCLYvm7GECreESWwrbY21SpTe6N9NShJWyetSVMpn90IfF7XP w15XradCPLYjSp8EtkIERd05pLBtG1qSfCJM X-Gm-Message-State: AOAM533GBIuuOi8kT3G0DhC8PHY9nK0O6B0R8j2vxH9/OCu+4QjizGe2 u6IDgZsS8deAwQY0YVOxVnE= X-Google-Smtp-Source: ABdhPJyrAcj4K3+yby/mi0cUmYPnRiVkaQMPc8FrU5d9vn+qLwL9t3bkEdOdT7tn9DTkt+gCNFsQvA== X-Received: by 2002:a17:902:7589:b029:12b:45b0:7376 with SMTP id j9-20020a1709027589b029012b45b07376mr3761139pll.62.1626360439047; Thu, 15 Jul 2021 07:47:19 -0700 (PDT) X-BeenThere: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Original-Received: by 2002:a05:6a00:1347:: with SMTP id k7ls655049pfu.10.gmail; Thu, 15 Jul 2021 07:47:17 -0700 (PDT) X-Received: by 2002:a65:6a09:: with SMTP id m9mr4964569pgu.149.1626360437182; Thu, 15 Jul 2021 07:47:17 -0700 (PDT) ARC-Seal: i=2; a=rsa-sha256; t=1626360437; cv=pass; d=google.com; s=arc-20160816; b=CaoPoHvMKeNCt84PQTq2Gz6/lYUbi8sOyvDzK1JeuM4+7VtE1tnxuBWZ6x6tASdzav 14QGrgJ6/A5fXMaTWEHLuzsxVkqDud8ytx3gzm5gOJrCjdrZdvb5pKHEL4s70W4KFSA4 9JUBaYerCTs7MyT70C8fID44lNTy4hhR9LzRyk60fDERbfzXgaFd2FBHUJzrTMhO+CLh NKLY+NlxD1KS3/qjGCH6hP1APPuatl/QysUDb3mMHNZT+sv/2FgqmUYNqFyerv+bQ5W1 lr0JIMVHJ383z+adrLwvXDQXMMjqGsiTuI2GxT5KR5oOKSUjxKf9VmFxTsRc9/k87iRu tEHA== ARC-Message-Signature: i=2; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=mime-version:content-transfer-encoding:content-language :accept-language:in-reply-to:references:message-id:date:thread-index :thread-topic:subject:to:from:dkim-signature:ironport-hdrordr :ironport-phdr:ironport-sdr:dkim-signature; bh=cJSDBCHeCQRly1eaMOkypOVc/mBfh3Xf/hxkcWEr9K4=; b=o8r1gRoBMOfSztOoVLPC5yX58/5AkbRPoOwC7ferzPyshvJBL5tRhWKI0SufSMdhnz PLHs2AVchSPtkSiUbWayw8ntCApmav7lFpkPe2GVX5Q5qnBXqhHhtGDBskr7Hb0QjVbt LqzyUsJmOfolA9icAAoy7oCLk/hyrlEhPxBDrUbQINRKfh2ccZnQ9hLOcfa24bUjnLw2 aqGlZA1Gi2Cal6rn62C/obi4rwqddmMAp/IKNRSwh/zLjT3XXCGIhdqaCdt/zpAVII7M uLPBV/JFOAzaHFheHeRvTv4yxbVXe3+PvhceJOYbw4UZmm/nMGojZDdwuQssTUBC0CTi mpLg== ARC-Authentication-Results: i=2; gmr-mx.google.com; dkim=pass header.i=@jhu.edu header.s=jhuiron header.b=MqjcUvD1; dkim=pass header.i=@livejohnshopkins.onmicrosoft.com header.s=selector2-livejohnshopkins-onmicrosoft-com header.b=FQ4HUaCY; arc=pass (i=1 spf=pass spfdomain=jhu.edu dkim=pass dkdomain=jhu.edu dmarc=pass fromdomain=jhu.edu); spf=pass (google.com: domain of prvs=82387b5ca=jrosenthal-4GNroTWusrE@public.gmane.org designates 162.129.199.180 as permitted sender) smtp.mailfrom="prvs=82387b5ca=jrosenthal-4GNroTWusrE@public.gmane.org"; dmarc=pass (p=QUARANTINE sp=QUARANTINE dis=NONE) header.from=jhu.edu Original-Received: from IronEB8.johnshopkins.edu (ironeb8.johnshopkins.edu. [162.129.199.180]) by gmr-mx.google.com with ESMTPS id h16si543543pjv.0.2021.07.15.07.47.16 for (version=TLS1_2 cipher=ECDHE-ECDSA-AES128-GCM-SHA256 bits=128/128); Thu, 15 Jul 2021 07:47:17 -0700 (PDT) Received-SPF: pass (google.com: domain of prvs=82387b5ca=jrosenthal-4GNroTWusrE@public.gmane.org designates 162.129.199.180 as permitted sender) client-ip=162.129.199.180; IronPort-SDR: v/OHNCKiVAEAyXaBKKXRlzyw95pXiDTOzZDX50vpxv0nizBpTcWOwgpayrVK/Zwha+PGbua6F1 qf7HuIgHQ5i/pF8oi9W5MwYIwKrwiA89jOeL96Q/MPRGspFkJAwMlqqw7vS6F/kI4svRCcFCVN PXwK6pWOLRITcCGCSlSXrxBPRvfPJTFlTaf3rdgIbQHQQImFe6Q1Cd/8EKGwbi9A3JZlPRRmbl BFetcphIEO4n4o7kUAb7SQTeqLz/bRw3issqNYQqvgKGdQ9PFJaQBfMEcfW5XhcZfP1E8/8KwE xfQ= X-IPAS-Result: =?us-ascii?q?A2HtEwASSvBg/3qErQpaHAEBATwBAQQEAQECAQEHAQEVg?= =?us-ascii?q?VuBPAIBElEZAWRYaoRBkWIDilWNCoJOgUKBEQMYFh0JAgkBAQEBAQEBAQEIA?= =?us-ascii?q?RICEQUPCAQBAQMEhEkCAgKCfCY4EwIEAQEBAQMCAwEBBwEBAQUBAQEBAQEFA?= =?us-ascii?q?wEBAQKBAIUvOQ2COAUCAwIkC2NNAwM1AQEBAQEBAQEBAQEBAQEBAQEBAQEBA?= =?us-ascii?q?QEBAQEBAQEBAQEBAQEBBQIMAVEBKT0BAQEBAxIoBgEBDCwPAgEIEQQBARYJC?= =?us-ascii?q?QchEAEUCQgCBAgHBAEHEQICBAGCUIJVAy8BDpsAAXkBEi4Cih94gTSBAYIHA?= =?us-ascii?q?QEGBASBNgGEEA0LQQEJDYFaAwYJAYEuAgEBAYJ4g3+GcUOBSUSBWIJiPoIgK?= =?us-ascii?q?xcCAoEXEgESASECPYMOggwigleBAjU4ECACWRYHLRISSTYPkQQ6HKkDejtbg?= =?us-ascii?q?y6KNI4hhXoSn3aGZZ58gWGBW4MykBgmhGcCBAIEBQIQBoFyMiwwcE0lToJpU?= =?us-ascii?q?BkOkhCKXkIxAg8nAgYKAQEDCSGJe4FtAQE?= IronPort-PHdr: A9a23:AzIQ/xPkrCtgyf/9fwsl6nbqCBdPi9zP1u491JMrhvp0f7i5+Ny6Z QqDv60r1geCAN6Tq6odzbaP4ua4AS1IyK3CmUhKSIZLWR4BhJdetC0bK+nBN3fGKuX3ZTcxB sVIWQwt1Xi6NU9IBJS2PAWK8TW94jEIBxrwKxd+KPjrFY7OlcS30P2594HObwlSizexf69+I A+ooQnNucUbgYVvIbs1xhfVv3dEYetbyX5sKF6Jgxrw+sK894N//ipNvP4s69ROWrjgcaQiS rxYAjUmM2Qr68DuqBLOUwiB6GYCX2sPihZHDBTL4x/8Xpfqryv1rfF91zWAPc33Vr87RzKv5 Lp2RRDyiScHMzk58HzLisF1kalWrg6tqwB5zoXJZ46bKed+frjFfdwEWWVBQshfVytEAoymb osPCfcKMeJXooT5olcDqwa1CwuxC+P10jJGhmH407Am0+ovEg/G0g4vEM4Bvnvbt9j1KL4SX fypwKXU0TnPcfVb0ir95ojSdRAhpOmBU7ZufsrK0kkgCR7OgEmXqYz+ITyV0PgNs2iB4OpkS OmijGwnqxt2oji02MgsjJXJhoIPxVDA7ih5woY1JcCiREFlfNGkDYJduieHPIR5Xs0sWXtnu DomyrIYo567ejAHxZUmyhPQd/GKbZWF7gziWeuRPTp0mHJodbC8ihqu8Uas1/HwW8u23VtLr ydLncTAuHAD2hLT6cWKTuVx81uv1DuJygvd5OZEIUUumqraLZ4s2rEwlpsPsUTDAy/5g1/6g 7ORdkUh4uSo5OPnYrLppp+YKYB4kB3xMqMrmsCnBuQ4LxQOX2yc+eS7173v51H2QLJPjvAwj 6LXs4jaKNw IronPort-HdrOrdr: A9a23:uejCsK28IaPYE2eHrjpZaQqjBSFyeYIsimQD101hICG9Lfb2qy n+ppgmPEHP5Qr5AEtQ4+xoS5PwPE80kqQFrbX5XI3SETUO3VHHEGgM1/qF/9SNIVycygcZ79 YaT0EcMqy+MbEZt7eC3ODQKb9Jq7PnkJxAx92ut0uFJTsaMp2IhD0JbzpzZ3cGIzWucqBJcK Z0iPA3xQaISDAyVICWF3MFV+/Mq5ngj5T9eyMLABYh9U2nkS6owKSSKWnX4j4uFxd0hZsy+2 nMlAL0oo+5teug9xPa32jPq7xLhdrazMdZDsDksLlVFtyssHfpWG1SYczBgNkHmpDr1L/sqq iJn/4UBbUx15oWRBDznfKi4Xin7N9k0Q6f9bbRuwqdnSW+fkNjNyMJv/MmTjLJr0Unp91yy6 RNwiaQsIdWFwrJmGDn68HPTAwCrDv8nZMOq59ls5Vka/ppVFaRl/1swGpFVJMbWC7q4oEuF+ djSMna+fZNaFufK3TUpHNmztCgVmk6Wk7ueDlIhuWFlzxN2HxpxUoRw8IS2n8G6ZImUpFBo+ DJKL5hmr1CRtIfKah9GOACS82qDXGle2OFDItTGyWSKEgjAQOGl3fa2sRE2AiaQu188HJpou WxbLpxjx9AR37T X-IronPort-Anti-Spam-Filtered: true X-IronPort-AV: E=Sophos;i="5.84,242,1620705600"; d="scan'208";a="345543064" X-Amp-Result: SKIPPED(no attachment in message) X-Amp-File-Uploaded: False X-MessageIPDirection: Outgoing Original-Received: from esgmtwex16-22.win.ad.jhu.edu ([10.173.132.122]) by IronEB8.johnshopkins.edu with ESMTP/TLS/AES256-GCM-SHA384; 15 Jul 2021 10:46:58 -0400 Original-Received: from ESGMTWEX16-21.win.ad.jhu.edu (10.173.132.121) by ESGMTWEX16-22.win.ad.jhu.edu (10.173.132.122) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_128_GCM_SHA256) id 15.1.2242.10; Thu, 15 Jul 2021 10:46:58 -0400 Original-Received: from NAM10-MW2-obe.outbound.protection.outlook.com (128.220.36.247) by ESGMTWEX16-21.win.ad.jhu.edu (10.173.132.121) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_128_GCM_SHA256) id 15.1.2242.10 via Frontend Transport; Thu, 15 Jul 2021 10:46:58 -0400 ARC-Seal: i=1; a=rsa-sha256; s=arcselector9901; d=microsoft.com; cv=none; b=PyGgQ3pC3Ffoq5iAJDtGvdaOhFVx1E2UQBo9kbVuRmYEZOwPwkKHD60iyQ9QBwOWaGd1LuHH6D/bz4apKoZQAYmGbXWS7Axda8oUfLiAkgtGDaRguQmdLkWDfYrrg6Az29dfiQr2leJQpbcVzfKZI4G3G52ZG2sj8zfvZtoOFwIASZGKiY1AqpFlzihEEnNzYWKJwvzQ60lU4oEvJVC5FgIS6AEYnSP3WnC0WEGEtf4EWn2QZ1ADJJi3kkZpaFRNa6KqH0LO10691vTqslh1cljwOgbJQ3fiKAF7vNVg1CxPnxGgimby6zxq+y0pAopRojvGOH29rGZzIeu1FWGg/g== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=microsoft.com; s=arcselector9901; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-SenderADCheck; bh=cJSDBCHeCQRly1eaMOkypOVc/mBfh3Xf/hxkcWEr9K4=; b=Q+d7Z2eRhcv1pQIUM8nzxEthjTT3rPJkWJlnDgzTR1WG6NC3bPsib3vDVlswV8vW8uuNbOGD4RNMMw6lj1UPTasnoatqEITydO7W2/1Q6wbo/gPtBAK5s/xLppUgj0wivkuY81JpC6zvc+K6Yn15sPlr0+oGi9efDGy/682vWj2kdEoQtOvTQWicTxTBgqj/1+2YQtD1JM4d7+BKn9RyuarrT9mBTeJd5HggLyHFLh1GGeaQc/gDmK2QE76BY/U6VBT5JQh03mxKbEVevCccS/yMCHljSqEPCEPTjbnRzTmWoghrVlLXdNWm2RJIqEVgkJIahINDGf/Weq6gIEmTog== ARC-Authentication-Results: i=1; mx.microsoft.com 1; spf=pass smtp.mailfrom=jhu.edu; dmarc=pass action=none header.from=jhu.edu; dkim=pass header.d=jhu.edu; arc=none Original-Received: from BL3PR01MB7100.prod.exchangelabs.com (2603:10b6:208:33b::12) by BL0PR01MB4193.prod.exchangelabs.com (2603:10b6:208:27::23) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.20.4308.26; Thu, 15 Jul 2021 14:46:56 +0000 Original-Received: from BL3PR01MB7100.prod.exchangelabs.com ([fe80::25ea:d559:3afc:e9e1]) by BL3PR01MB7100.prod.exchangelabs.com ([fe80::25ea:d559:3afc:e9e1%3]) with mapi id 15.20.4308.027; Thu, 15 Jul 2021 14:46:56 +0000 Thread-Topic: docx parsing bug: nested fldChar fields are interpreted incorrectly Thread-Index: AQHXeYfzXOAwmGiNTUe7VHoqgBZUyqtEHQvV In-Reply-To: <24273fbf-2ce9-4c26-886b-50d504cb7b05n-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org> Accept-Language: en-US Content-Language: en-US x-ms-publictraffictype: Email x-ms-office365-filtering-correlation-id: 83930cbb-bf56-4978-2a3e-08d9479f64bd x-ms-traffictypediagnostic: BL0PR01MB4193: x-microsoft-antispam-prvs: x-ms-oob-tlc-oobclassifiers: OLM:8882; x-ms-exchange-senderadcheck: 1 x-microsoft-antispam: BCL:0; x-microsoft-antispam-message-info: FBa7/0gsp8Tgo60i/Ac12pWfoJyL+4I6YA5OzyEpewuTqtONF5OfnVpPGW+plkA4Ike9n3Uboh0b3jeP2R4KXYSYltwzszbd1nlbdkZdjJl2d8fGfHKjCDM0MvQn2/QueFWSIxt+iVYcmjgE3xegvoQSO/Tmvk01o1T8gdpVUKpNsYW7B0jMRnhurOihb9kXn6m47IdRW5+x1YgULo6yzJbd7JTPfL4azagajLZEu7/kqNNb467YPl/Ufef8IKmkVoaAMBpgpawSOQoNw2cc8HHzNU5URPZz2BmsWylGShtXiDtQOWp1Tvr1VS90BLdMDr59jckEgbWlLepZacJKCcKw3GJQk+i9qpSQbaqFmP+eluF5aYEfUNtsD4+onkBCJAfQeKgyDBMLlKYKUule3Cr6s43V2jtAF1eMhMr0cfq7FaCW0Cy4abBkEFNHWeR1TY++NukR9ArGcKQaxMOexrk/MB8bh1WvtAssFK5ITcYLWBcOz5hxggOodWEwDj63Zm3oCLaGQYgzHqUDFn+S4wx+h5Gi5umd1t8EGc9gPdz4HNwEGq/UVOBCC02Juo2GfVL8uoWx816HyNMOKpA6UtKCGrr51ri/QTygBy9KVPOhKU4eqf+iVz9AITFx9fegZyd5sg6QkeztGxKEHORKP0js2SCco5an2FsMm/cmtr7G4pba5iAZJyzLxVPwPThXrYqENTst1foZEHf8CX7YuBJd6Vko89cQs0kDNtNM/dgBO2DSE/L6gc97sheaB TrdvDJV/pWDDtul7JsUjUGC8jdgjp21svKE1sNxBOAW3SZSRkpnea3R8yGkuCx8IyuW x-forefront-antispam-report: CIP:255.255.255.255;CTRY:;LANG:en;SCL:1;SRV:;IPV:NLI;SFV:NSPM;H:BL3PR01MB7100.prod.exchangelabs.com;PTR:;CAT:NONE;SFS:(4636009)(346002)(136003)(39860400002)(366004)(396003)(376002)(7696005)(55016002)(66446008)(316002)(66556008)(76116006)(966005)(53546011)(786003)(8936002)(6506007)(66476007)(66946007)(64756008)(71200400001)(186003)(2906002)(6916009)(86362001)(5660300002)(52536014)(478600001)(33656002)(83380400001)(38100700002)(9686003)(122000001)(75432002)(8676002)(38070700004);DIR:OUT;SFP:1102; x-ms-exchange-antispam-messagedata-chunkcount: 1 x-ms-exchange-antispam-messagedata-0: =?us-ascii?Q?nQEh1sNFJ2tYWyhjsjPhS8Nrq/vkv+EAePc3PgTG9/G4jCK1uyxYiLdnwZIz?= =?us-ascii?Q?y+OrmW1vGVGohg1XSrL6ADpZyQC+7GnYxsWFcSBuHqXeyhMZRAHxVGsocDRp?= =?us-ascii?Q?SjnrMBwJN6eU7sH7RW/dZ9jb9FUVrQZ4/MO09l3ulaGhFDR9MF072kb7Fbe1?= =?us-ascii?Q?YQAQuLJnjryX0OgJCDDJgL2JC+zKLST0rMj6pU4w2VO1ncdG5w7xXAg0NZOc?= =?us-ascii?Q?ZFAq9TWn1fdNYF3knQgH704nhfMbgbi1ctLQx37dG+J+8HIQIfzkmzpqK1Hv?= =?us-ascii?Q?GN2mStf1e0pdYgAW3THzRALnZsPvLRzWtUkER4jq+DBBwk+bnqLvjppqc+Cx?= =?us-ascii?Q?dQwBWOaqGskJVSVq/eZKQDvePmCWLTc7TR1eExWodvOA6oTvd/Ww5LkA6nZq?= =?us-ascii?Q?4e0wMOboC1pNulTTGX9J1ogzUrWw5NYlH0V/SAnkc4JRIQlL5mHjEVLKVZOp?= =?us-ascii?Q?2mRsam6KWy8aF/o0AVw44XmSlWTlmlh2yjMqma0S3Jrcz0FUKog5HAEKA+VU?= =?us-ascii?Q?xfnEKzIS35IDvcbTEbmTHYZ4g2iz9TnkWKI+4j7Gn5c7DFxqQ13f8AhDLruu?= =?us-ascii?Q?77+ii9BHcqGhZHygqwDUYsBTevqKegNgRfgXWbdljBWcD1lLs1qavPe3RcKJ?= =?us-ascii?Q?VY x-ms-exchange-transport-forked: True X-MS-Exchange-CrossTenant-AuthAs: Internal X-MS-Exchange-CrossTenant-AuthSource: BL3PR01MB7100.prod.exchangelabs.com X-MS-Exchange-CrossTenant-Network-Message-Id: 83930cbb-bf56-4978-2a3e-08d9479f64bd X-MS-Exchange-CrossTenant-originalarrivaltime: 15 Jul 2021 14:46:56.4031 (UTC) X-MS-Exchange-CrossTenant-fromentityheader: Hosted X-MS-Exchange-CrossTenant-id: 9fa4f438-b1e6-473b-803f-86f8aedf0dec X-MS-Exchange-CrossTenant-mailboxtype: HOSTED X-MS-Exchange-CrossTenant-userprincipalname: Fr7jnfyLvK0LXco7oYcQ3HLMmgBh713t5C2Uk18YrosEsfolSnhQEdSqoYJnk1/8 X-MS-Exchange-Transport-CrossTenantHeadersStamped: BL0PR01MB4193 X-OriginatorOrg: jhu.edu X-Original-Sender: jrosenthal-4GNroTWusrE@public.gmane.org X-Original-Authentication-Results: gmr-mx.google.com; dkim=pass header.i=@jhu.edu header.s=jhuiron header.b=MqjcUvD1; dkim=pass header.i=@livejohnshopkins.onmicrosoft.com header.s=selector2-livejohnshopkins-onmicrosoft-com header.b=FQ4HUaCY; arc=pass (i=1 spf=pass spfdomain=jhu.edu dkim=pass dkdomain=jhu.edu dmarc=pass fromdomain=jhu.edu); spf=pass (google.com: domain of prvs=82387b5ca=jrosenthal-4GNroTWusrE@public.gmane.org designates 162.129.199.180 as permitted sender) smtp.mailfrom="prvs=82387b5ca=jrosenthal-4GNroTWusrE@public.gmane.org"; dmarc=pass (p=QUARANTINE sp=QUARANTINE dis=NONE) header.from=jhu.edu X-Original-From: Jesse Rosenthal Precedence: list Mailing-list: list pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org; contact pandoc-discuss+owners-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org List-ID: X-Google-Group-Id: 1007024079513 List-Post: , List-Help: , List-Archive: , List-Unsubscribe: , Xref: news.gmane.io gmane.text.pandoc:28830 Archived-At: Hi Milan, Thanks for the heads up. Honestly just summer craziness: visiting family fo= r the first time in almost two years, shuttling the kids around. Life stuff= . I'll take a look at it ASAP. Best, Jesse ________________________________________ From: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org on = behalf of Milan Bracke Sent: Thursday, July 15, 2021 10:43 AM To: pandoc-discuss Subject: Re: docx parsing bug: nested fldChar fields are interpreted incorr= ectly Hi all, I've had this pull request open for more than 3 weeks now: https://github.c= om/jgm/pandoc/pull/7401 Is there a reason it's not getting any reaction? I'd be happy to improve or= explain it. If I've done something wrong, I'd like to know, so I can fix it. Best, Milan On Thursday, June 17, 2021 at 8:42:48 AM UTC+2 Milan Bracke wrote: Hi Jesse, Thanks for the feedback. I'll ping you when making the PR. Most of my code = seems to work so far, but I still have some trouble with the fact that the fields now need to contain ParPart= s instead of Runs. It's harder to match all the cases and treat them correctly. I'll try some more and let yo= u know how it goes. Best, Milan On Wednesday, June 16, 2021 at 4:21:05 PM UTC+2 Jesse Rosenthal wrote: Hi Milan, I wrote the original fldChar code (and that comment) and I figured it would= have to evolve as further requirements became necessary. If nesting is a r= equirement, a stack instead of a toggle seems appropriate. As far as crossing paragraphs goes -- your approach seems right (and simila= r to how we've dealt with similar issues like comments crossing paragraphs = in docx parsing). I'd be happy to take a look and offer comments/feedback on your code. Just = make sure to ping me (@jkr) on your PRs. Best, Jesse ________________________________________ From: pandoc-...-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org on behalf o= f Milan Bracke Sent: Wednesday, June 16, 2021 5:33 AM To: pandoc-discuss Subject: Re: docx parsing bug: nested fldChar fields are interpreted incorr= ectly I can't fix this without at least some feedback. It's a complex issue and t= he fix will take some time, so I need to at least know that my proposed sol= ution seems good and would be accepted if implemented correctly. On Tuesday, June 15, 2021 at 8:38:30 AM UTC+2 Milan Bracke wrote: I've encountered a new problem. A fldChar field can span multiple paragraph= s, but it doesn't have to start at the beginning of the first one. Because of this, a field across multiple paragraphs will merge those paragr= aphs. There is no way to represent this exactly in the pandoc model I think. So m= y current solution is to have different fields with the same field info in the different paragraphs. This can at least make the hyperlink fiel= ds work and I think it will work for the other fields we might add in the future as well (I've checked the list). What do you think about this ? On Monday, June 14, 2021 at 9:17:13 AM UTC+2 Milan Bracke wrote: For those who don't know fldChar fields, this comment from the docx parse c= ode (parse.hs, starting on line 825) explains it: fldChar fields work by first having a in a run, then a run with , then a run, then the content runs, and finally a run. For example (omissions and my comments in brackets): [...] [...] HYPERLINK [hyperlink url] [...] [...] Foundations of Analysis, 2nd Edition [...] The current way of parsing fldChar fields doesn't take into account that th= ey can be nested. So the end of the nested flcChar field will be interprete= d as the end of the surrounding one. This could for example lead to a hyper= link that ends too soon. See attached example for a docx that demonstrates = this. I propose to fix this by turning the fldChar state into a stack, so that a = field can be started and ended inside other fields. I will include this in = my pull request for PAGEREF fields that I announced here a while ago, since= they are related. -- You received this message because you are subscribed to the Google Groups "= pandoc-discuss" group. To unsubscribe from this group and stop receiving emails from it, send an e= mail to pandoc-discus...-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org. To view this discussion on the web visit https://groups.google.com/d/msgid/= pandoc-discuss/9bdb337d-fa68-4c66-8f5c-d4fa81547953n%40googlegroups.com>. -- You received this message because you are subscribed to the Google Groups "= pandoc-discuss" group. To unsubscribe from this group and stop receiving emails from it, send an e= mail to pandoc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org. To view this discussion on the web visit https://groups.google.com/d/msgid/= pandoc-discuss/24273fbf-2ce9-4c26-886b-50d504cb7b05n%40googlegroups.com. --=20 You received this message because you are subscribed to the Google Groups "= pandoc-discuss" group. To unsubscribe from this group and stop receiving emails from it, send an e= mail to pandoc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org To view this discussion on the web visit https://groups.google.com/d/msgid/= pandoc-discuss/BL3PR01MB7100D5D9E5DC4898EC9E1221C9129%40BL3PR01MB7100.prod.= exchangelabs.com.