From: BP Jonsson
Newsgroups: gmane.text.pandoc
To: pandoc-discuss
Subject: A way to convert PDF to Markdown or other (Solution!)
Date: Tue, 12 Sep 2017 15:14:15 +0200

This may be old news to some, but I can't remember having seen it, so
I'm posting it for the record.

I just discovered that you can convert a PDF to Markdown (or any other
format Pandoc supports) by uploading it to Google Drive, opening it in
Google Docs, downloading it from there as DOCX, and then converting the
DOCX to Markdown with Pandoc. The result is quite good!

The steps:

0. Log in to Google Drive in a web browser.
1. Select the menu [My Drive⏷] → [Upload files…] in the top bar.
2. At least on my system, a file dialog opens. Browse to the PDF file,
   select it, and click [Open].
3. The file appears in the "Quick access" field just below the top bar.
4. Right-click the file thumbnail and choose [Open with] → [Google
   Docs]. You should now find yourself in the Google Docs document view.
5. In the [File] menu, choose [Download as] → [Microsoft Word (.docx)].
6. Save the DOCX file to disk and convert it with Pandoc the same as
   you would any DOCX file.

I hope this is of use to someone!

/bpj
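
P.S. For anyone who wants to script the last step, here is a minimal
sketch of the Pandoc conversion in Python. It assumes the pandoc
executable is on your PATH. The function names and the file names
(paper.docx, paper.md, media) are hypothetical and only there for
illustration; the flags used (-f, -t, -o, --extract-media) are regular
Pandoc options. The Google Drive part is still done by hand in the
browser.

```python
import shutil
import subprocess


def build_pandoc_command(docx_path, out_path, to_format="markdown",
                         media_dir=None):
    """Return the pandoc argument list for converting a DOCX file.

    Hypothetical helper: it only assembles the command, so it can be
    inspected without pandoc being installed.
    """
    cmd = ["pandoc", "-f", "docx", "-t", to_format, "-o", str(out_path)]
    if media_dir is not None:
        # Ask pandoc to save images embedded in the DOCX to this directory.
        cmd.append("--extract-media=" + str(media_dir))
    cmd.append(str(docx_path))
    return cmd


def convert_docx(docx_path, out_path, **kwargs):
    """Run pandoc on the downloaded DOCX file and return the output path."""
    if shutil.which("pandoc") is None:
        raise RuntimeError("pandoc not found on PATH")
    cmd = build_pandoc_command(docx_path, out_path, **kwargs)
    # check=True raises CalledProcessError if pandoc exits with an error.
    subprocess.run(cmd, check=True)
    return str(out_path)
```

Calling convert_docx("paper.docx", "paper.md") is then the same as
running "pandoc -f docx -t markdown -o paper.md paper.docx" in a shell.
Pass to_format to target any other output format Pandoc supports.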