From mboxrd@z Thu Jan 1 00:00:00 1970 X-Msuck: nntp://news.gmane.io/gmane.text.pandoc/26075 Path: news.gmane.io!.POSTED.blaine.gmane.org!not-for-mail From: Lionel Dyck Newsgroups: gmane.text.pandoc Subject: Convert DOCX to Multiple MD files based on heading level and Retain TOC? Date: Mon, 7 Sep 2020 06:00:14 -0700 (PDT) Message-ID: <5cd30106-fcc5-419e-9093-bc3a00849872n@googlegroups.com> Reply-To: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="----=_Part_264_1614108987.1599483614414" Injection-Info: ciao.gmane.io; posting-host="blaine.gmane.org:116.202.254.214"; logging-data="22109"; mail-complaints-to="usenet@ciao.gmane.io" To: pandoc-discuss Original-X-From: pandoc-discuss+bncBCK43XNIWAHRBX653D5AKGQEXMEGHTI-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Mon Sep 07 15:00:19 2020 Return-path: Envelope-to: gtp-pandoc-discuss@m.gmane-mx.org Original-Received: from mail-oi1-f187.google.com ([209.85.167.187]) by ciao.gmane.io with esmtps (TLS1.3:ECDHE_RSA_AES_128_GCM_SHA256:128) (Exim 4.92) (envelope-from ) id 1kFGks-0005e0-SU for gtp-pandoc-discuss@m.gmane-mx.org; Mon, 07 Sep 2020 15:00:18 +0200 Original-Received: by mail-oi1-f187.google.com with SMTP id w200sf4279595oie.0 for ; Mon, 07 Sep 2020 06:00:18 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=googlegroups.com; s=20161025; h=sender:date:from:to:message-id:subject:mime-version :x-original-sender:reply-to:precedence:mailing-list:list-id :list-post:list-help:list-archive:list-subscribe:list-unsubscribe; bh=zZN4ZKj5jwILmC/bx/NQEKcZhpw9JvIc0XVLua76ucI=; b=ibg1mfRY4lq7Y2vQREvOacghURTQsPUtGtG+3K5eF+0WePKefsyNCDkdaSBQJMi86a qbPtzS+4jk3Mg1Q8W2wzD71rwdPnw0jyIlaQjC0vMNs7l+vZjneUP+cCR+jYK3J9Yfn4 rxm/aYaZKb4Oyb6o8JBbXoDA/QBwcwaF0G7OiZnxsRFO8SpXkkfhTPe1uKZv7Dr8YgJn TsJLTUucbFVwxYlgGBP6UqnXeNee3sNBLMCciA3z3jzLfJGjPKP+ttWz3NO+S429e5Vg 5uZpHevqJUSNlTlyqgqN67cQShp2Jy6opI6Jsc2T8HQThOVooF7BBn5TFi8weVntwNxg v7tQ== DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=date:from:to:message-id:subject:mime-version:x-original-sender :reply-to:precedence:mailing-list:list-id:list-post:list-help :list-archive:list-subscribe:list-unsubscribe; bh=zZN4ZKj5jwILmC/bx/NQEKcZhpw9JvIc0XVLua76ucI=; b=ltk3RLqYw/biapHLGpH99HYCTGTUvroQLJ7Cqr/U5W9z33WBzv+DySklPGO001pzOx +rBosObpkWVGzAr7wdi9WX/7Z+cJgugvND19axNv5dVxkwsG9W+a91lbGts1FiCCpVO9 PmV/G6lJGp6LUxQsYQlD+zVe9FCvRsd43uEWS01k4tDsgMMOHMlLo/5P7/DMudg1IQzd jkeqv+0xQd9/ZlGJJKirMATBhLAS1i2GfRh8DRcz22DXBGzn2xpo1Lq2s1K6aBD/CBU+ MI2Elq0FenLiUlrIGu8Xw7cPGayKaCl8xgeNHnIjkAzcYDddOmjaYe6thwtda6zFrry4 6Shg== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=sender:x-gm-message-state:date:from:to:message-id:subject :mime-version:x-original-sender:reply-to:precedence:mailing-list :list-id:x-spam-checked-in-group:list-post:list-help:list-archive :list-subscribe:list-unsubscribe; bh=zZN4ZKj5jwILmC/bx/NQEKcZhpw9JvIc0XVLua76ucI=; b=H0v65ER4QOFCzM7xYxDj8iEudpUtGU+QSRS6cBpnisSPAi7KZblFkpJqeJQENNfM0u +hH44bG7csLRpJ69PHdINi6RIkCCXivkL0aLO5juNNk7QZHkOxv7D09NaRxGnQTyii2d fWNchjynv1m7cntCMPrdKJS8u7QL5ENSZ4E8Vi7jYaa4xYINGFkmBBvfXO45lHyDhxN1 GyvlLdFD3M3CCJ8BNGyjYnQjXEQhUOAnWwnWM/xjvHNWF/2N+hHNb8aYs7Z/qDpekRda zvY9isQGa4u7LJ26Y6K9TY+zVGXfmQKqsoCl6L427xFZj45+x5fuHHhgEqL7N+jYEMSr dL6g== Original-Sender: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org X-Gm-Message-State: AOAM531AEkY2axfoUmxeSr8k43X1QofC8Fl50TcsYwwBrTVLIPgkVzvN ufIPp/vfkblpIVjR462mO8E= X-Google-Smtp-Source: ABdhPJxafYpMrL7IrFPzRpoy7zSecQ3Oisbqnv+oARa9PsSdvs+tK/ym/4cq2CETiIw9sPRroEURoQ== X-Received: by 2002:a4a:b443:: with SMTP id h3mr14861173ooo.45.1599483617908; Mon, 07 Sep 2020 06:00:17 -0700 (PDT) X-BeenThere: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Original-Received: by 2002:a05:6830:4d2:: with SMTP id s18ls3142589otd.0.gmail; Mon, 07 Sep 2020 06:00:15 -0700 (PDT) X-Received: by 2002:a9d:908:: with SMTP id 8mr3908611otp.356.1599483615100; Mon, 07 Sep 2020 06:00:15 -0700 (PDT) X-Original-Sender: lbdyck-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org Precedence: list Mailing-list: list pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org; contact pandoc-discuss+owners-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org List-ID: X-Google-Group-Id: 1007024079513 List-Post: , List-Help: , List-Archive: , List-Unsubscribe: , Xref: news.gmane.io gmane.text.pandoc:26075 Archived-At: ------=_Part_264_1614108987.1599483614414 Content-Type: multipart/alternative; boundary="----=_Part_265_948733878.1599483614414" ------=_Part_265_948733878.1599483614414 Content-Type: text/plain; charset="UTF-8" I'm using this command: pandoc -t gfm --extract-media . -o file.md file.docx And it works great *but* the table of contents has lost the link into the document and internal hyperlinks are also lost. I would also like to take this large DOCX and split it into smaller sections based on the heading level. So: 1. Is there a way to split the docx into multiple md files based on heading level? 2. Is there a way to retain the table of contents links into the body? 3. Is there a way to retain the internal links? Thanks very much. -- You received this message because you are subscribed to the Google Groups "pandoc-discuss" group. To unsubscribe from this group and stop receiving emails from it, send an email to pandoc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org To view this discussion on the web visit https://groups.google.com/d/msgid/pandoc-discuss/5cd30106-fcc5-419e-9093-bc3a00849872n%40googlegroups.com. ------=_Part_265_948733878.1599483614414 Content-Type: text/html; charset="UTF-8" Content-Transfer-Encoding: quoted-printable I'm using this command:  pandoc -t gfm --extract-media . -o file.md file.docx

And= it works great but the table of contents has lost the link int= o the document and internal hyperlinks are also lost.

<= div>I would also like to take this large DOCX and split it into smaller sec= tions based on the heading level.

So:
1. Is there a way to split the docx into multiple md files bas= ed on heading level?
2. Is there a way to retain the table of con= tents links into the body?
3. Is there a way to retain the intern= al links?

Thanks very much.

--
You received this message because you are subscribed to the Google Groups &= quot;pandoc-discuss" group.
To unsubscribe from this group and stop receiving emails from it, send an e= mail to pand= oc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org.
To view this discussion on the web visit https://groups.google.com/d= /msgid/pandoc-discuss/5cd30106-fcc5-419e-9093-bc3a00849872n%40googlegroups.= com.
------=_Part_265_948733878.1599483614414-- ------=_Part_264_1614108987.1599483614414--