From mboxrd@z Thu Jan 1 00:00:00 1970 X-Msuck: nntp://news.gmane.io/gmane.text.pandoc/31615 Path: news.gmane.io!.POSTED.blaine.gmane.org!not-for-mail From: John MacFarlane Newsgroups: gmane.text.pandoc Subject: Re: docx with images in tables to markdown and back Date: Fri, 21 Oct 2022 09:16:25 -0700 Message-ID: References: <02f65a26-e99a-4cfb-9ef7-899e7f40f899n@googlegroups.com> Reply-To: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Mime-Version: 1.0 (Mac OS X Mail 16.0 \(3696.120.41.1.1\)) Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable Injection-Info: ciao.gmane.io; posting-host="blaine.gmane.org:116.202.254.214"; logging-data="18001"; mail-complaints-to="usenet@ciao.gmane.io" To: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Original-X-From: pandoc-discuss+bncBDW7ZIEHTIIBBYMLZONAMGQEGNK6FOA-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Fri Oct 21 18:16:38 2022 Return-path: Envelope-to: gtp-pandoc-discuss@m.gmane-mx.org Original-Received: from mail-pl1-f186.google.com ([209.85.214.186]) by ciao.gmane.io with esmtps (TLS1.3:ECDHE_RSA_AES_128_GCM_SHA256:128) (Exim 4.92) (envelope-from ) id 1oluhJ-0004S9-VQ for gtp-pandoc-discuss@m.gmane-mx.org; Fri, 21 Oct 2022 18:16:37 +0200 Original-Received: by mail-pl1-f186.google.com with SMTP id q12-20020a170902dacc00b00184ba4faf1csf1937694plx.23 for ; Fri, 21 Oct 2022 09:16:37 -0700 (PDT) ARC-Seal: i=2; a=rsa-sha256; t=1666368996; cv=pass; d=google.com; s=arc-20160816; b=OEvLLoPmpMRfJjfJr07tKaWG0YyBu73WSO3EgpmD196N7V6I98LkBqFkkwpsZJsNOn VV7OiuXAYFzHgB/WNVMF3EGuDZ8BIVZTlWyfI8WZEwKuCwqFc703Es2Hfb/Fno6BZEYj 1uzsQ30gDpDsx5mUeQKM+yk9Vl4qXsipR5SvnwjRictxp0+0ZLEfD8TsSoMeV+To2QmV ZW4+8xL8W51HR2LmK/nuZQOfhN4GJPf/lINt5K/11yJiOE8Yu09T8zZ8Rn5xw0gSWOKY 9R/SCOaU8yry1NKUtcRTyWrFC+Y0qOFculfDiO4AN58IGLYUS0M/t58j7jxsnUOUcHgP 0XhQ== ARC-Message-Signature: i=2; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-unsubscribe:list-subscribe:list-archive:list-help:list-post :list-id:mailing-list:precedence:reply-to:message-id:in-reply-to:to :references:date:subject:mime-version:content-transfer-encoding:from :sender:dkim-signature:dkim-signature; bh=lgPAWA5cswh2zNMS26zhrMZ7MKOmXAMqW054s0/q3Kg=; b=sWyRS43KkSuBQxsVY5oet/xpwwiVWCKYS5hfBBBDXxnO5IAN72B8K3eejIE6ttWnlb Hi34l3T3+53HifpdUxGOHkljdK193bQCP6rDONBLM1bJSdSfJRtMCHRXBjzkhOrSztvq WvzDDKjLM+uUlfDW75lzWu4XHMc/BqyEZw7feeZV8QpNeufcvdUu5FHU2CnsBizbJ/Qu UaCz3/vwedxX8Lyz9cnG978LVvF6mqw8tMZ9WG5fShzarLIR6wY5bGYuvp91ya+C8t5z Y3VLQzEg0XCBYzHYMJGtf6boqGGFsqS+ZsYubKs5Fcffpz/YRpSlMWlLhG5xtnrJOxjF ISFQ== ARC-Authentication-Results: i=2; gmr-mx.google.com; dkim=pass header.i=@gmail.com header.s=20210112 header.b=Bkx73JRi; spf=pass (google.com: domain of fiddlosopher-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org designates 2607:f8b0:4864:20::1029 as permitted sender) smtp.mailfrom=fiddlosopher-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=googlegroups.com; s=20210112; h=list-unsubscribe:list-subscribe:list-archive:list-help:list-post :list-id:mailing-list:precedence:reply-to :x-original-authentication-results:x-original-sender:message-id :in-reply-to:to:references:date:subject:mime-version :content-transfer-encoding:from:sender:from:to:cc:subject:date :message-id:reply-to; bh=lgPAWA5cswh2zNMS26zhrMZ7MKOmXAMqW054s0/q3Kg=; b=IDauPR4GCW6kicVb/UWfaxns0j441l5+ypMN1k8GG6W5VyDhohdFcYx3rPuvQCbCSW Q1azh/sIzZYPRj2V6k1dgNEQahjVALXoPT8kVTnKHnvoUayBlBU6TAP2OyoZEfzSx3if D7NtOMDv6t0pS0L8gXHd8xJWSzVccNipDJy+IRXFF0jlrg1kGZnu2hWwBYNHqIdVrPdK 4WOPBe/6BS59q5KENg/KcMeBozHBb7C5qze0KAg4Lt7CyD2lYdlJrU8ypNAF5oiI9+CN 5xNxqjsjT25/BCrXry35bm3CYJsuacfQEtCJa7WIVl4oxFeU74oJweC8TWpOhc18p DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20210112; h=list-unsubscribe:list-subscribe:list-archive:list-help:list-post :list-id:mailing-list:precedence:reply-to :x-original-authentication-results:x-original-sender:message-id :in-reply-to:to:references:date:subject:mime-version :content-transfer-encoding:from:from:to:cc:subject:date:message-id :reply-to; bh=lgPAWA5cswh2zNMS26zhrMZ7MKOmXAMqW054s0/q3Kg=; b=V8QfpsV+iDqicerg97x7AMF6HqFWZdFF2f3QxtGHwAMUKvJVSE5+R9j2U7aU3DnAgd qs70cii24nFTdS9AN8+KLDN6HXgd/18yOj4+Jon48HYjoQKD0SfZgsLV4rQMC3DzTtla PLH4hVwhTCQ3ohtgNrSmg4ldHLVZiZfi5z/u5K3CO3ABWQkW91H86XCfpzdbd907OAmf YzcUpUTl3p6iNokfFy/UkxtTGi/bp0mnCgm/Ryucx4Z2EhtrtoEMSs79eczWGlJZb17D eJhEB5buWZzUj8cvsZKwFeATu4aR7j8LeAnxEg3Zjan/gjmvqs/2OD0e80r2+4gAm0bX G X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; h=list-unsubscribe:list-subscribe:list-archive:list-help:list-post :x-spam-checked-in-group:list-id:mailing-list:precedence:reply-to :x-original-authentication-results:x-original-sender:message-id :in-reply-to:to:references:date:subject:mime-version :content-transfer-encoding:from:x-gm-message-state:sender:from:to:cc :subject:date:message-id:reply-to; bh=lgPAWA5cswh2zNMS26zhrMZ7MKOmXAMqW054s0/q3Kg=; b=aHlU0WdshqYgZ0YyNxqyaqZybxcnFqtBaGDVCY2C2eEHRU7z/vxIQDdzSXvTcxTMSK tYwx2cCtOvZkKNG6nMO6IBI8J6IkbVW1jQupNAjNircjISsU94G73UL4pNTBiQaqQ0BO ngzGIclh877fCLGGL893jUE865JgjIPzUF05QJSO2H8oPOwI92oobWE6ATLVDjHX9RKn B+S5x7r20ZKwgv/aeslytNqdfMxKDu7E9WLpRKrGdOvN5M5XGB03TVvPW8oKMRGle4ic psFN1R39jBW2T6b2Seur Original-Sender: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org X-Gm-Message-State: ACrzQf2ZjT+WZKwFOaGevSUqlxeUVT+leNjj5Tc51QGgwhdTu/vS9lZb BXutnKXGLtUfLRMipihxTmM= X-Google-Smtp-Source: AMsMyM7rZGtRKIB8ZTFVNlMZpL33T3v84QKFOiGvAG8W8iKT63vwQsiAtaYosPIDUAY7EgcPafwnhw== X-Received: by 2002:a63:5d65:0:b0:43c:c924:d58a with SMTP id o37-20020a635d65000000b0043cc924d58amr16270139pgm.348.1666368996676; Fri, 21 Oct 2022 09:16:36 -0700 (PDT) X-BeenThere: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Original-Received: by 2002:a17:903:2311:b0:17a:8ee:1234 with SMTP id d17-20020a170903231100b0017a08ee1234ls2279084plh.8.-pod-prod-gmail; Fri, 21 Oct 2022 09:16:33 -0700 (PDT) X-Received: by 2002:a17:902:a584:b0:186:6040:87f9 with SMTP id az4-20020a170902a58400b00186604087f9mr10516664plb.36.1666368993304; Fri, 21 Oct 2022 09:16:33 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1666368993; cv=none; d=google.com; s=arc-20160816; b=CPBvRNYaRmusTx9YjPkXpXyr7fyFWJRUJmsJGblVaJYhhpA1liFLLqNdGKCSeOuu8z QPxra5gNOewJ6jBZFiAhsybcFXbHrUZt04JPPOWE7wZ50/X8Ezjxl9OMj7ArhebfJuI7 pQOFChTsxcQJb1keDd0RY+jLym2H5uBfaIdyDP7a2LHsPrgQ+6Kgf5YCWTTQ13/9QOmM CEQzX3xnL8t8+yAeQaFoGGoSn0xuwP+KRBhuv6AYD5yQxliNsEnq05fVVB5IVBQ2Rzss QJ6J+7uZGtYkmiQXnb8e/oFyRWVHnfGtHgjFvzFG3KdbCOHVeyg9p05u2/ohtx34+5j8 XmaQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=message-id:in-reply-to:to:references:date:subject:mime-version :content-transfer-encoding:from:dkim-signature; bh=liVFxZtCLF/G8zboM0SWht1YOWswm23shoOyhcb4hwI=; b=TEbBpp4wGTL/j7CUFxLv12ZKLMTJ28XFBHea28JYqNVMrYN7mM2a/z7FXG+3hedl+V E/QYRz0I75AkGR98u+iiKasMHO+E3TacUBQ3wm+K3k+I5l+rsONAs36Hh3Iih73wooQO tki2JdysizwaMXg/2dxYUagQ6WWsQuvk2UYGX/nUDmHISemvIr2RqEYIySC4u2hEpr8B fCuRT+e8CY0Es+QAy6I8Q8qy/oGA8CzCsURdL4J1juaeAGzv7hhaNZ4b+DCYLjnnYXXc fBOhNe4QnQ3NkhVUOBw2FZTSXBaURfkrd53wJ2KUA48+lLQKlFg5gfeYyDoWtf4M0MyC Cc/Q== ARC-Authentication-Results: i=1; gmr-mx.google.com; dkim=pass header.i=@gmail.com header.s=20210112 header.b=Bkx73JRi; spf=pass (google.com: domain of fiddlosopher-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org designates 2607:f8b0:4864:20::1029 as permitted sender) smtp.mailfrom=fiddlosopher-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Original-Received: from mail-pj1-x1029.google.com (mail-pj1-x1029.google.com. [2607:f8b0:4864:20::1029]) by gmr-mx.google.com with ESMTPS id c5-20020a17090ab28500b0021217a5fa15si157554pjr.3.2022.10.21.09.16.33 for (version=TLS1_3 cipher=TLS_AES_128_GCM_SHA256 bits=128/128); Fri, 21 Oct 2022 09:16:33 -0700 (PDT) Received-SPF: pass (google.com: domain of fiddlosopher-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org designates 2607:f8b0:4864:20::1029 as permitted sender) client-ip=2607:f8b0:4864:20::1029; Original-Received: by mail-pj1-x1029.google.com with SMTP id x1-20020a17090ab00100b001fda21bbc90so7087920pjq.3 for ; Fri, 21 Oct 2022 09:16:33 -0700 (PDT) X-Received: by 2002:a17:902:c241:b0:182:a32f:4db7 with SMTP id 1-20020a170902c24100b00182a32f4db7mr19994786plg.131.1666368987749; Fri, 21 Oct 2022 09:16:27 -0700 (PDT) Original-Received: from smtpclient.apple ([2601:644:400:7c40:fd6d:e541:74ba:6aa8]) by smtp.gmail.com with ESMTPSA id p3-20020a170902780300b001811a197797sm14821114pll.194.2022.10.21.09.16.26 for (version=TLS1_2 cipher=ECDHE-ECDSA-AES128-GCM-SHA256 bits=128/128); Fri, 21 Oct 2022 09:16:26 -0700 (PDT) In-Reply-To: <02f65a26-e99a-4cfb-9ef7-899e7f40f899n-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org> X-Mailer: Apple Mail (2.3696.120.41.1.1) X-Original-Sender: fiddlosopher-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org X-Original-Authentication-Results: gmr-mx.google.com; dkim=pass header.i=@gmail.com header.s=20210112 header.b=Bkx73JRi; spf=pass (google.com: domain of fiddlosopher-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org designates 2607:f8b0:4864:20::1029 as permitted sender) smtp.mailfrom=fiddlosopher-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Precedence: list Mailing-list: list pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org; contact pandoc-discuss+owners-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org List-ID: X-Google-Group-Id: 1007024079513 List-Post: , List-Help: , List-Archive: , List-Unsubscribe: , Xref: news.gmane.io gmane.text.pandoc:31615 Archived-At: I'd recommend trying with `--reference-links`, which might reduce the width= enough for them to fit on one line. Another option is to force pipe tables to be used. `-t markdown-grid_tables-multiline_tables-simple_tables` or simply `-t gfm` > On Oct 21, 2022, at 5:30 AM, Jan St=C3=BChler wr= ote: >=20 > Hello group. >=20 > I use=20 > ``` > pandoc -f docx -t markdown --extract-media "Lab 1-2.docx-dir" -o file.md = file.docx > ``` > to convert a word document to markdown. The word document has (many) imag= es which are sitting in table cells. One of the results is: > ``` > 1 +=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D= =3D=3D=3D=3D=3D=3D=3D=3D=3D+=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D= =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D+ > 2 | Click on the Link:\ | ![](./Lab 1-2.docx-dir/media/ima = | > 3 | *[Click here to start the | ge7.png){width=3D"3.64396106736657= 9in" | > 4 | Local Service | height=3D"1.9991174540682415in"} = | > ``` > (Line numbers from `vim`). >=20 > Observe the line break between `ima` and `ge7.png`. >=20 > To convert this markdown to word, I use > ``` > pandoc -f markdown -t docx -o file-new.docx file.md > ``` > which results in this error message: > ``` > [WARNING] Could not fetch resource ./Lab%201-2.docx-dir/media/ima%20ge7.p= ng: replacing image with description > ``` > Observe the `%20` between `ima` and `ge7.png`. >=20 > Is there something I can do so that pandoc can put the images into the wo= rd document? >=20 > Thanks alot. >=20 > --=20 > You received this message because you are subscribed to the Google Groups= "pandoc-discuss" group. > To unsubscribe from this group and stop receiving emails from it, send an= email to pandoc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org > To view this discussion on the web visit https://groups.google.com/d/msgi= d/pandoc-discuss/02f65a26-e99a-4cfb-9ef7-899e7f40f899n%40googlegroups.com. --=20 You received this message because you are subscribed to the Google Groups "= pandoc-discuss" group. To unsubscribe from this group and stop receiving emails from it, send an e= mail to pandoc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org To view this discussion on the web visit https://groups.google.com/d/msgid/= pandoc-discuss/C16FAB55-4036-460E-A3CA-5C755EB2F207%40gmail.com.