From mboxrd@z Thu Jan 1 00:00:00 1970 X-Msuck: nntp://news.gmane.io/gmane.text.pandoc/31629 Path: news.gmane.io!.POSTED.blaine.gmane.org!not-for-mail From: =?UTF-8?Q?Jan_St=C3=BChler?= Newsgroups: gmane.text.pandoc Subject: Re: docx with images in tables to markdown and back Date: Sat, 22 Oct 2022 11:56:34 -0700 (PDT) Message-ID: References: <02f65a26-e99a-4cfb-9ef7-899e7f40f899n@googlegroups.com> Reply-To: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="----=_Part_3688_1351405304.1666464994019" Injection-Info: ciao.gmane.io; posting-host="blaine.gmane.org:116.202.254.214"; logging-data="15936"; mail-complaints-to="usenet@ciao.gmane.io" To: pandoc-discuss Original-X-From: pandoc-discuss+bncBDXJPMHQZ4FRBY7Z2CNAMGQE5FKQ7LY-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Sat Oct 22 20:56:38 2022 Return-path: Envelope-to: gtp-pandoc-discuss@m.gmane-mx.org Original-Received: from mail-ot1-f62.google.com ([209.85.210.62]) by ciao.gmane.io with esmtps (TLS1.3:ECDHE_RSA_AES_128_GCM_SHA256:128) (Exim 4.92) (envelope-from ) id 1omJfi-0003zz-Nn for gtp-pandoc-discuss@m.gmane-mx.org; Sat, 22 Oct 2022 20:56:38 +0200 Original-Received: by mail-ot1-f62.google.com with SMTP id l5-20020a9d7345000000b00661c76ded95sf3507088otk.15 for ; Sat, 22 Oct 2022 11:56:38 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=googlegroups.com; s=20210112; h=list-unsubscribe:list-subscribe:list-archive:list-help:list-post :list-id:mailing-list:precedence:reply-to:x-original-sender :mime-version:subject:references:in-reply-to:message-id:to:from:date :sender:from:to:cc:subject:date:message-id:reply-to; bh=smLIApb9JjZAbotRYLdB15MAWreyMUCjddyfY+escc0=; b=PRf48KW/XF1I5QJBA185fDKNvhrOvFM4Mesr6g3KDTEqCY+fJv4MMNDWyDtZOUD1T1 1Sh8fKXD4cdB6Pk8zYKTwhg0CRfDGjwbE2M1BCkR9WKWGZgwmmj1Mz5GOWf3QDdIcS52 UiodmRAJOiSWnPnyOolFapBna5yAkByV7dVFiMq0cFobMUAGdxKp24RBvvQC4bcashMa BYYHiNObIW0CKqQDSmhQ8Cn7yq9gqNWS2TejHADJpclGb05GfR5Kxa0v8M6fztU0Y4jh /WCGlKOJ5Cp2AjS8taxtuH/r3gdCHXvAPb2dAHgLFZ7V+uxbD2b7R6zojeajXuuc1I+p jO4g== DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20210112; h=list-unsubscribe:list-subscribe:list-archive:list-help:list-post :list-id:mailing-list:precedence:reply-to:x-original-sender :mime-version:subject:references:in-reply-to:message-id:to:from:date :from:to:cc:subject:date:message-id:reply-to; bh=smLIApb9JjZAbotRYLdB15MAWreyMUCjddyfY+escc0=; b=RhPi/hBAGmu/JgP+0NksLuL55nHCeeE9oWx5ty8VOv9ByCJOi6D9ysELE8gFpj1zr/ L+TUGQHGVMd0L3l4mvuh9MH/G0hgH9hqDgcR7m3MGa5uI6NNfygjsPlKoAu0ocu9Um68 8sSDxTZxKHYNlcqArJQWj+VfKy6Cv4J/2C7+t3JpBE7XiufcDPnGCI3xZ4PWceJoo3RJ 0/RIAnulhNzKrsRF5Y5QU8FgXmZJ18/xkId96Kamd7pZ8KC8vdgUi941EYydIxQBsQ3N 1yk2RK21x7TGMKwPWqpelBJYtVoM68Eoik1qR8WcXPjq4Nv+mJh2pby/rkFygXW50Wrf 5/kA== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; h=list-unsubscribe:list-subscribe:list-archive:list-help:list-post :x-spam-checked-in-group:list-id:mailing-list:precedence:reply-to :x-original-sender:mime-version:subject:references:in-reply-to :message-id:to:from:date:x-gm-message-state:sender:from:to:cc :subject:date:message-id:reply-to; bh=smLIApb9JjZAbotRYLdB15MAWreyMUCjddyfY+escc0=; b=7BVyuquJnIJ+DRg5BVSRl0L/wSoTyfJ1lsyFPBaXVifj5hDtAQQ+kxZBLwTqMp7w3J sHjNKDKH22hQARTMboZlnihs7+tMM+MtnAHtck/1oTAAkMw3l6zcn5AIETnyiTMIGbLR LqMNKAoclXlFs1h6PVJ5ahg/8pYstCXGErSIvMrEkIYSEQuO+3hZu5q56dO3Ve5eKckY rwQnOYcUInktYNAKoDGWqDnRDtNX/OsLO60WGkN7AQ6so1Bdx88pBgWQ9eu1w6PLnS1r U0HHIjLb7AzotVbp2MfbBuz9oxVwuEQk7A3faknQM+JCsQzjzE9cpJ9bVg14bn+yxC0i dqwA== Original-Sender: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org X-Gm-Message-State: ACrzQf2RR4Q8TRVoArSYJ1QbIl+j5PTYFyzb38l1KJi3D1Nh1N4j0U/4 4NInPIV5zU4oypmUOEiX138= X-Google-Smtp-Source: AMsMyM6xyxxEogy1hEr2/h3uwP6EwNC2KnZaeobAXfnEuDi6Mj9sXCD6et2fT0fWjnnkvSp8fWkA5A== X-Received: by 2002:a05:6830:3493:b0:661:e687:1907 with SMTP id c19-20020a056830349300b00661e6871907mr12864860otu.344.1666464997511; Sat, 22 Oct 2022 11:56:37 -0700 (PDT) X-BeenThere: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Original-Received: by 2002:a05:6871:415:b0:13b:74d7:e0f with SMTP id d21-20020a056871041500b0013b74d70e0fls265175oag.7.-pod-prod-gmail; Sat, 22 Oct 2022 11:56:35 -0700 (PDT) X-Received: by 2002:a05:6870:8888:b0:13a:df25:b078 with SMTP id m8-20020a056870888800b0013adf25b078mr11094715oam.189.1666464995023; Sat, 22 Oct 2022 11:56:35 -0700 (PDT) In-Reply-To: X-Original-Sender: jan.stuehler-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org Precedence: list Mailing-list: list pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org; contact pandoc-discuss+owners-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org List-ID: X-Google-Group-Id: 1007024079513 List-Post: , List-Help: , List-Archive: , List-Unsubscribe: , Xref: news.gmane.io gmane.text.pandoc:31629 Archived-At: ------=_Part_3688_1351405304.1666464994019 Content-Type: multipart/alternative; boundary="----=_Part_3689_2012526371.1666464994019" ------=_Part_3689_2012526371.1666464994019 Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable Interestingly, -t gfm brings me HTML tables. But --reference-links helped= =20 me very much. Thanks alot for that, must have skipped that in the=20 documentation. fiddlosopher schrieb am Freitag, 21. Oktober 2022 um 18:16:36 UTC+2: > I'd recommend trying with `--reference-links`, which might reduce the=20 > width enough for them to fit on one line. > > Another option is to force pipe tables to be used. > > `-t markdown-grid_tables-multiline_tables-simple_tables` > > or simply > > `-t gfm` > > > > On Oct 21, 2022, at 5:30 AM, Jan St=C3=BChler wro= te: > >=20 > > Hello group. > >=20 > > I use=20 > > ``` > > pandoc -f docx -t markdown --extract-media "Lab 1-2.docx-dir" -o file.m= d=20 > file.docx > > ``` > > to convert a word document to markdown. The word document has (many)=20 > images which are sitting in table cells. One of the results is: > > ``` > > 1=20 > +=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D= =3D=3D=3D=3D=3D=3D=3D+=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D= =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D+ > > 2 | Click on the Link:\ | ![](./Lab 1-2.docx-dir/media/ima | > > 3 | *[Click here to start the | ge7.png){width=3D"3.643961067366579in" = | > > 4 | Local Service | height=3D"1.9991174540682415in"} | > > ``` > > (Line numbers from `vim`). > >=20 > > Observe the line break between `ima` and `ge7.png`. > >=20 > > To convert this markdown to word, I use > > ``` > > pandoc -f markdown -t docx -o file-new.docx file.md > > ``` > > which results in this error message: > > ``` > > [WARNING] Could not fetch resource=20 > ./Lab%201-2.docx-dir/media/ima%20ge7.png: replacing image with descriptio= n > > ``` > > Observe the `%20` between `ima` and `ge7.png`. > >=20 > > Is there something I can do so that pandoc can put the images into the= =20 > word document? > >=20 > > Thanks alot. > >=20 > > --=20 > > You received this message because you are subscribed to the Google=20 > Groups "pandoc-discuss" group. > > To unsubscribe from this group and stop receiving emails from it, send= =20 > an email to pandoc-discus...-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org > > To view this discussion on the web visit=20 > https://groups.google.com/d/msgid/pandoc-discuss/02f65a26-e99a-4cfb-9ef7-= 899e7f40f899n%40googlegroups.com > . > > --=20 You received this message because you are subscribed to the Google Groups "= pandoc-discuss" group. To unsubscribe from this group and stop receiving emails from it, send an e= mail to pandoc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org To view this discussion on the web visit https://groups.google.com/d/msgid/= pandoc-discuss/efc647f9-eab5-48a4-85b9-cc4dc63114b6n%40googlegroups.com. ------=_Part_3689_2012526371.1666464994019 Content-Type: text/html; charset="UTF-8" Content-Transfer-Encoding: quoted-printable Interestingly, -t gfm brings me HTML tables. But --reference-links helped m= e very much. Thanks alot for that, must have skipped that in the documentat= ion.



fiddlosopher schrieb am Freitag, 21. Oktober 2022 um 18:1= 6:36 UTC+2:
I= 'd recommend trying with `--reference-links`, which might reduce the wi= dth enough for them to fit on one line.

Another option is to force pipe tables to be used.

`-t markdown-grid_tables-multiline_tables-simple_tables`

or simply

`-t gfm`


> On Oct 21, 2022, at 5:30 AM, Jan St=C3=BChler <jan.st...-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org> wrote:
>=20
> Hello group.
>=20
> I use=20
> ```
> pandoc -f docx -t markdown --extract-media "Lab 1-2.docx-dir&= quot; -o file.md file.docx
> ```
> to convert a word document to markdown. The word document has (man= y) images which are sitting in table cells. One of the results is:
> ```
> 1 +=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D= =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D+=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D= =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D= +
> 2 | Click on the Link:\ | ![](./Lab 1-2.docx-dir/medi= a/ima |
> 3 | *[Click here to start the | ge7.png){width=3D"3.64= 3961067366579in" |
> 4 | Local Service | height=3D"1.9991174540= 682415in"} |
> ```
> (Line numbers from `vim`).
>=20
> Observe the line break between `ima` and `ge7.png`.
>=20
> To convert this markdown to word, I use
> ```
> pandoc -f markdown -t docx -o file-new.docx file.md
> ```
> which results in this error message:
> ```
> [WARNING] Could not fetch resource ./Lab%201-2.docx-dir/media/ima%= 20ge7.png: replacing image with description
> ```
> Observe the `%20` between `ima` and `ge7.png`.
>=20
> Is there something I can do so that pandoc can put the images into= the word document?
>=20
> Thanks alot.
>=20
> --=20
> You received this message because you are subscribed to the Google= Groups "pandoc-discuss" group.
> To unsubscribe from this group and stop receiving emails from it, = send an email to pandoc-discus..= .@googlegroups.com.
> To view this discussion on the web visit https://groups.google.com/d/msgid/pandoc-discuss/02f65a26-e= 99a-4cfb-9ef7-899e7f40f899n%40googlegroups.com.

--
You received this message because you are subscribed to the Google Groups &= quot;pandoc-discuss" group.
To unsubscribe from this group and stop receiving emails from it, send an e= mail to pand= oc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org.
To view this discussion on the web visit https://groups.google.com/d= /msgid/pandoc-discuss/efc647f9-eab5-48a4-85b9-cc4dc63114b6n%40googlegroups.= com.
------=_Part_3689_2012526371.1666464994019-- ------=_Part_3688_1351405304.1666464994019--