From: rama prakasha
Newsgroups: gmane.text.pandoc
Subject: Re: docx to markdown conversion; retain only bold and italics.
Date: Tue, 17 Sep 2019 12:15:54 -0700 (PDT)
Message-ID: <5a985960-990c-4a18-8c52-972c37bb542b@googlegroups.com>
References: <4e4c8527-4998-4f9e-9e36-92d26d8849d1@googlegroups.com>
To: pandoc-discuss

I should have searched the earlier posts first. Anyhow, this solved my problem:

    -t markdown-header_attributes-link_attributes-native_divs-native_spans-styles-bracketed_spans-raw_html

Thank you for your solution, John MacFarlane. That is cool.

On Monday, September 16, 2019 at 10:47:51 PM UTC+5:30, rama prakasha wrote:
>
> I recently converted a lot of PDFs in Google Docs, which by default adds a
> lot of markup to the document. I want to retain only bold and italics and
> remove all other formatting while converting to markdown. I tried this:
>
>     -t markdown-header_attributes-link_attributes-native_divs-native_spans-styles
>
> but the resulting markdown still contains markup like this:
> ***[*Tomorrow 12 *]{dir="ltr"}*** I needed only **Tomorrow 12**.
> How can we achieve this? Please help.
>
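For anyone who wants to script the conversion from the reply above, here is a minimal sketch of how the long `-t` value could be assembled from a list of disabled extensions. The extension names are copied from the message; the helper names (`build_target_format`, `build_command`) and the file names `input.docx` and `output.md` are hypothetical. The script only prints the command and does not run pandoc. The extension list has not been checked against a current pandoc release, so treat it as a starting point.

```python
import shlex

# Extensions disabled in the message above (copied verbatim from the thread).
DISABLED_EXTENSIONS = (
    "header_attributes",
    "link_attributes",
    "native_divs",
    "native_spans",
    "styles",
    "bracketed_spans",
    "raw_html",
)


def build_target_format(base="markdown", disabled=DISABLED_EXTENSIONS):
    """Return a pandoc format string of the form 'base-ext1-ext2-...'."""
    return base + "".join(f"-{ext}" for ext in disabled)


def build_command(src, dest):
    """Assemble the pandoc argument list (src and dest are hypothetical paths)."""
    return ["pandoc", src, "-f", "docx", "-t", build_target_format(), "-o", dest]


if __name__ == "__main__":
    # Print a shell-quoted command line instead of executing it.
    print(shlex.join(build_command("input.docx", "output.md")))
```

Keeping the extensions in a list makes it easier to add or drop one (for example `bracketed_spans`, which was the piece missing from the first attempt) without retyping the whole format string.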