From: Jesse Rosenthal
Newsgroups: gmane.text.pandoc
Subject: Re: docx parsing bug: nested fldChar fields are interpreted incorrectly
Date: Wed, 16 Jun 2021 14:21:00 +0000
References: <2f5489af-f5a9-4ea4-9155-9f85c4808756n@googlegroups.com>, <9bdb337d-fa68-4c66-8f5c-d4fa81547953n@googlegroups.com>
To: pandoc-discuss

Hi Milan,

I wrote the original fldChar code (and that comment), and I figured it would have to evolve as further requirements became necessary. If nesting is a requirement, a stack instead of a toggle seems appropriate.

As far as crossing paragraphs goes -- your approach seems right (and similar to how we've dealt with comparable issues, such as comments crossing paragraphs, in docx parsing).

I'd be happy to take a look and offer comments/feedback on your code. Just make sure to ping me (@jkr) on your PRs.

Best,
Jesse

________________________________________
From: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org on behalf of Milan Bracke
Sent: Wednesday, June 16, 2021 5:33 AM
To: pandoc-discuss
Subject: Re: docx parsing bug: nested fldChar fields are interpreted incorrectly

I can't fix this without at least some feedback. It's a complex issue and the fix will take some time, so I need to at least know that my proposed solution seems good and would be accepted if implemented correctly.

On Tuesday, June 15, 2021 at 8:38:30 AM UTC+2 Milan Bracke wrote:

I've encountered a new problem. A fldChar field can span multiple paragraphs, but it doesn't have to start at the beginning of the first one. Because of this, a field across multiple paragraphs will merge those paragraphs.

There is no way to represent this exactly in the pandoc model, I think. So my current solution is to have different fields with the same field info in the different paragraphs. This can at least make the hyperlink fields work, and I think it will work for the other fields we might add in the future as well (I've checked the list).

What do you think about this?

On Monday, June 14, 2021 at 9:17:13 AM UTC+2 Milan Bracke wrote:

For those who don't know fldChar fields, this comment from the docx parse code (parse.hs, starting on line 825) explains it:

fldChar fields work by first having a <w:fldChar fldCharType="begin"> in a run, then a run with <w:instrText>, then a <w:fldChar fldCharType="separate"> run, then the content runs, and finally a <w:fldChar fldCharType="end"> run. For example (omissions and my comments in brackets):

    <w:r>
      [...]
      <w:fldChar w:fldCharType="begin"/>
    </w:r>
    <w:r>
      [...]
      <w:instrText> HYPERLINK [hyperlink url] </w:instrText>
    </w:r>
    <w:r>
      [...]
      <w:fldChar w:fldCharType="separate"/>
    </w:r>
    <w:r>
      [...]
      <w:t>Foundations of Analysis, 2nd Edition</w:t>
    </w:r>
    <w:r>
      [...]
      <w:fldChar w:fldCharType="end"/>
    </w:r>

The current way of parsing fldChar fields doesn't take into account that they can be nested. So the end of the nested fldChar field will be interpreted as the end of the surrounding one. This could, for example, lead to a hyperlink that ends too soon. See the attached example for a docx that demonstrates this.

I propose to fix this by turning the fldChar state into a stack, so that a field can be started and ended inside other fields. I will include this in my pull request for PAGEREF fields that I announced here a while ago, since they are related.

--
You received this message because you are subscribed to the Google Groups "pandoc-discuss" group.
To unsubscribe from this group and stop receiving emails from it, send an email to pandoc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org.
To view this discussion on the web visit https://groups.google.com/d/msgid/pandoc-discuss/9bdb337d-fa68-4c66-8f5c-d4fa81547953n%40googlegroups.com.
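
To make the "stack instead of a toggle" idea from this thread concrete, here is a rough sketch. It is written in Python purely for illustration and is not pandoc's Haskell code or the eventual PR; the token kinds ("begin", "instr", "separate", "end", "text") and the names Field and parse_runs are made up for this example, and it assumes the runs of a paragraph have already been flattened into such tokens. Text that appears before the "separate" marker is simply dropped, and fields left unclosed at the end of the input are ignored.

```python
from dataclasses import dataclass, field
from typing import List, Union


@dataclass
class Field:
    """Hypothetical parsed field: instruction text plus (possibly nested) content."""
    instr: str = ""
    content: List[Union[str, "Field"]] = field(default_factory=list)
    in_result: bool = False  # becomes True once "separate" has been seen


def parse_runs(tokens):
    """Turn a flat list of (kind, value) tokens into nested Field objects."""
    top = []    # items that sit outside any field
    stack = []  # currently open fields, innermost last

    def emit(item):
        # Attach to the innermost open field, or to the top level if none is open.
        if stack:
            stack[-1].content.append(item)
        else:
            top.append(item)

    for kind, value in tokens:
        if kind == "begin":
            stack.append(Field())
        elif kind == "instr" and stack and not stack[-1].in_result:
            stack[-1].instr += value
        elif kind == "separate" and stack:
            stack[-1].in_result = True
        elif kind == "end" and stack:
            # Only the innermost field is closed; outer fields stay open.
            closed = stack.pop()
            emit(closed)
        elif kind == "text":
            if not stack or stack[-1].in_result:
                emit(value)
    return top


# A HYPERLINK field whose displayed content contains a nested PAGEREF field.
tokens = [
    ("text", "See "),
    ("begin", None),
    ("instr", ' HYPERLINK "http://example.com" '),
    ("separate", None),
    ("text", "page "),
    ("begin", None),
    ("instr", " PAGEREF ref1 "),
    ("separate", None),
    ("text", "12"),
    ("end", None),
    ("text", " of the book"),
    ("end", None),
    ("text", "."),
]
result = parse_runs(tokens)
```

With a single toggle, the first "end" token would close the hyperlink right after "12"; with the stack, it only closes the PAGEREF field, so " of the book" still lands inside the hyperlink.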