From mboxrd@z Thu Jan 1 00:00:00 1970 X-Msuck: nntp://news.gmane.io/gmane.text.pandoc/11860 Path: news.gmane.org!not-for-mail From: Phillip Smith Newsgroups: gmane.text.pandoc Subject: Re: Curious: ODT reader Date: Tue, 27 Jan 2015 12:00:20 -0800 (PST) Message-ID: References: <4fef1220-23ec-441c-9e42-41ef29d6f1ea@googlegroups.com> <20150126224239.GA30710@pupunha> <7EE5FAC3-481F-468F-AFE1-E898FC1E5387@gmail.com> <87ppa0rxyd.fsf@jhu.edu> Reply-To: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org NNTP-Posting-Host: plane.gmane.org Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="----=_Part_2394_1277091802.1422388820633" X-Trace: ger.gmane.org 1422388824 18517 80.91.229.3 (27 Jan 2015 20:00:24 GMT) X-Complaints-To: usenet@ger.gmane.org NNTP-Posting-Date: Tue, 27 Jan 2015 20:00:24 +0000 (UTC) Cc: phillipadsmith-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org To: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Original-X-From: pandoc-discuss+bncBDA6VFNJTMJBBVO4T6TAKGQEHZTXLII-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Tue Jan 27 21:00:24 2015 Return-path: Envelope-to: gtp-pandoc-discuss@m.gmane.org Original-Received: from mail-ob0-f187.google.com ([209.85.214.187]) by plane.gmane.org with esmtp (Exim 4.69) (envelope-from ) id 1YGCJC-0008Oj-UC for gtp-pandoc-discuss@m.gmane.org; Tue, 27 Jan 2015 21:00:23 +0100 Original-Received: by mail-ob0-f187.google.com with SMTP id gq1sf3050140obb.4 for ; Tue, 27 Jan 2015 12:00:22 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=googlegroups.com; s=20120806; h=date:from:to:cc:message-id:in-reply-to:references:subject :mime-version:content-type:x-original-sender:reply-to:precedence :mailing-list:list-id:list-post:list-help:list-archive:sender :list-subscribe:list-unsubscribe; bh=QmMPXMOeSRs2cOIXQLoYs+OkJQ7YTGiQmNnZwx4Kj28=; b=pl5v/S0XEcX3gzu3F5Di4XFfpefdpNKUmyq1iEVTUgXPLlefPvM7IrxiDitzFp5R4s Q/FIL9Oz2i5JhTlUQTeSjofXzqMx1+Et/6l9C+ygsNqNWLpK/hM+yzEvQBWFqYpqd5Nk kw9VVKKPV5UUDslwX5XHJMAA94DjXe2yRLMR7jA2R5qViUeQ/7mEjV3fs2UBO4M+7Zze Y/YVQp9xvG6gpCjdvYRsUN4zNjLSahSjxx2zbeaiYH5mOz0Gh6gw0R8XPN3WbupoWZ1R evqPm8yJEvvMhGH6W5NsIt3uD5LL4LHQIrwPCy3gx7WAyvBAWHqj3JmgUY/kNhotv2ie iHbw== DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=date:from:to:cc:message-id:in-reply-to:references:subject :mime-version:content-type:x-original-sender:reply-to:precedence :mailing-list:list-id:list-post:list-help:list-archive:sender :list-subscribe:list-unsubscribe; bh=QmMPXMOeSRs2cOIXQLoYs+OkJQ7YTGiQmNnZwx4Kj28=; b=ItVCQltN6wZ/Agtbnp6L3ixZ8wcvb5G0NhnwjEeTbzO1wQuVV3HZf3dJ3xgHvm1+AO LDMjgi0QuE7bUZ2t0WbuaJpzB/koJqEvZ9EyqOLzbwtCxS6+zAbFpds28Do5e9e3j4+U tNUAgY4PFafKFbZwl7J+Oie7RUeVx09AQzTB3WwE+e1cApkKQ1wlgPCbQdrGHzcDGIsJ Cpdr8h+qWDX1T7CuYZJOSk05m3o+RVRXhO437x5PqF59PPMTTRiS6eqwJRkvpPe3dpqe k5to8jcV6TUTN46yOTJF++HcJYwvJHRTNHAi+cEf2nZYj0f/bqOuuCs77MG7FpPMmrWV BEBw== X-Received: by 10.140.81.199 with SMTP id f65mr44797qgd.12.1422388822212; Tue, 27 Jan 2015 12:00:22 -0800 (PST) X-BeenThere: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Original-Received: by 10.140.85.177 with SMTP id n46ls259163qgd.36.gmail; Tue, 27 Jan 2015 12:00:20 -0800 (PST) X-Received: by 10.140.19.193 with SMTP id 59mr2282qgh.23.1422388820929; Tue, 27 Jan 2015 12:00:20 -0800 (PST) In-Reply-To: <87ppa0rxyd.fsf-4GNroTWusrE@public.gmane.org> X-Original-Sender: phillipadsmith-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org Precedence: list Mailing-list: list pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org; contact pandoc-discuss+owners-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org List-ID: X-Google-Group-Id: 1007024079513 List-Post: , List-Help: , List-Archive: , List-Unsubscribe: , Xref: news.gmane.org gmane.text.pandoc:11860 Archived-At: ------=_Part_2394_1277091802.1422388820633 Content-Type: multipart/alternative; boundary="----=_Part_2395_2130348096.1422388820633" ------=_Part_2395_2130348096.1422388820633 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable My responses seem to be getting delayed or grey listed for several hours,= =20 so I'm just going to re-post directly from the Google Groups interface here= =20 (apologies if you receive a duplicate tomorrow): On Tuesday, January 27, 2015 at 8:27:07 AM UTC-8, Jesse Rosenthal wrote: > > Hi Phillip,=20 > > I'm actually the author of the docx reader. I wrote it because I wanted= =20 > to use it. I'd guess that's the case for most of the readers in pandoc.= =20 > > Which is to say that I don't think there's any technical limitation to=20 > constructing such a reader. It would likely follow the same basic=20 > pattern as the docx reader: unzip, parse, convert. It would in fact=20 > probably be a bit easier than the docx reader because lists etc seem=20 > much more sensible in ODT due to proper nesting. I just don't think=20 > anyone has had sufficient desrie (or the combination of desire and=20 > haskell chops) to make it happen.=20 > > By the way, before I wrote the docx reader, I prototyped it in python,=20 > and wrote a python script that output pandoc json. If you don't want to= =20 > work in haskell you could look around for an ODT reader in another=20 > language and go from there.=20 > > Best,=20 > Jesse=20 > Hi Jesse, That's very helpful context to have. I have been looking at the other libraries here,=20 http://www.opendocumentformat.org/developers/ =E2=80=A6 and I'd be curious= to see=20 an example of the "pandoc JSON" if you could point me to one (and/or your= =20 python prototype).=20 It does look like an interesting problem to solve, haskell aside. ;)=20 However, the team I'm working with at the moment is quite small, and we=20 are heavily committed (like everyone!). I see that the idea of a "bounty" has been raised before on this list. Our= =20 project is well funded and this would be a worthwhile investment. Before I go searching for someone to write this reader out in the wild, is= =20 there anyone on this list that would be interested in taking it on? If so,= =20 please drop me a note off-list. Phillip.=20 =20 > Phillip Smith > writes:=20 > > > On 2015-01-26, at 2:42 PM, Raniere Silva > wrote:=20 > >=20 > >> Hi Phillip,=20 > >>=20 > >>> I'm completely ignorant here, having just starting using pandoc, but= =20 > -- if=20 > >>> it's not a waste of time to explain it to me -- I'd be curious to=20 > >>> understand how an odt reader could be added.=20 > >>=20 > >> At src/Text/Pandoc/Readers/ you will find the source code of all=20 > readers.=20 > >> If you take a look at src/Text/Pandoc/Readers/Docx{/,.hs} you will see= =20 > the=20 > >> source code of the DOCX reader. You can use it as a start point for th= e=20 > ODT=20 > >> reader.=20 > >=20 > > Hi Raniere,=20 > >=20 > > Many thanks for your response. I did have a look at the DOCX reader=20 > earlier.=20 > >=20 > > Let me perhaps re-phrase my question: What have been the barriers that= =20 > have prevented an odt reader from being added before?=20 > >=20 > > I'm curious why so many readers are available, but not odt? Are there= =20 > obstacles that are well-known and hard to overcome?=20 > >=20 > > Many thanks in advance,=20 > >=20 > > Phillip.=20 > >=20 > > --=20 > > You received this message because you are subscribed to the Google=20 > Groups "pandoc-discuss" group.=20 > > To unsubscribe from this group and stop receiving emails from it, send= =20 > an email to pandoc-discus...-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org .=20 > > To post to this group, send email to pandoc-...-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org=20 > .=20 > > To view this discussion on the web visit=20 > https://groups.google.com/d/msgid/pandoc-discuss/7EE5FAC3-481F-468F-AFE1-= E898FC1E5387%40gmail.com.=20 > > > For more options, visit https://groups.google.com/d/optout.=20 > --=20 You received this message because you are subscribed to the Google Groups "= pandoc-discuss" group. To unsubscribe from this group and stop receiving emails from it, send an e= mail to pandoc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org To post to this group, send email to pandoc-discuss-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org To view this discussion on the web visit https://groups.google.com/d/msgid/= pandoc-discuss/e8873929-e613-43f6-98f9-a760a6e33772%40googlegroups.com. For more options, visit https://groups.google.com/d/optout. ------=_Part_2395_2130348096.1422388820633 Content-Type: text/html; charset=UTF-8 Content-Transfer-Encoding: quoted-printable
My responses seem to be getting delayed or grey listed for= several hours, so I'm just going to re-post directly from the Google Group= s interface here (apologies if you receive a duplicate tomorrow):

On= Tuesday, January 27, 2015 at 8:27:07 AM UTC-8, Jesse Rosenthal wrote:Hi Phillip,

I'm actually the author of the docx reader. I wrote it because I wanted
to use it. I'd guess that's the case for most of the readers in pandoc.

Which is to say that I don't think there's any technical limitation to
constructing such a reader. It would likely follow the same basic
pattern as the docx reader: unzip, parse, convert. It would in fact
probably be a bit easier than the docx reader because lists etc seem
much more sensible in ODT due to proper nesting. I just don't think
anyone has had sufficient desrie (or the combination of desire and
haskell chops) to make it happen.

By the way, before I wrote the docx reader, I prototyped it in python,
and wrote a python script that output pandoc json. If you don't want to
work in haskell you could look around for an ODT reader in another
language and go from there.

Best,
Jesse

Hi Jesse,

That's very helpful context to have.

I have been looking at the other libraries here, http://www.opendocumentform= at.org/developers/  =E2=80=A6 and I'd be curious to see an example= of the "pandoc JSON" if you could point me to one (and/or your python prot= otype). 

It does look like an interesting problem to solve, haskell = aside. ;)  However, the team I'm working with at the moment is quite s= mall, and we are heavily committed (like everyone!).

I see that the idea of a "bounty" has been raised before on= this list. Our project is well funded and this would be a worthwhile inves= tment.

Before I go searching for someone to write this reader out = in the wild, is there anyone on this list that would be interested in takin= g it on? If so, please drop me a note off-list.

Phillip. 
 
Philli= p Smith <phillip..= .@gmail.com> writes:

> On 2015-01-26, at 2:42 PM, Raniere Silva <ran...-Xhq86aZylhRTIXuwt5Zssw@public.gmane.org> wrote:
>
>> Hi Phillip,
>>=20
>>> I'm completely ignorant here, having just starting using p= andoc, but -- if=20
>>> it's not a waste of time to explain it to me -- I'd be cur= ious to=20
>>> understand how an odt reader could be added.
>>=20
>> At src/Text/Pandoc/Readers/ you will find the source code of a= ll readers.
>> If you take a look at src/Text/Pandoc/Readers/Docx{/,.hs}= you will see the
>> source code of the DOCX reader. You can use it as a start poin= t for the ODT
>> reader.
>
> Hi Raniere,
>
> Many thanks for your response. I did have a look at the DOCX reade= r earlier.
>
> Let me perhaps re-phrase my question: What have been the barriers = that have prevented an odt reader from being added before?
>
> I'm curious why so many readers are available, but not odt? Are th= ere obstacles that are well-known and hard to overcome?
>
> Many thanks in advance,
>
> Phillip.
>
> --=20
> You received this message because you are subscribed to the Google= Groups "pandoc-discuss" group.
> To unsubscribe from this group and stop receiving emails from it, = send an email to pand= oc-discus...@googlegroups.com.
> To post to this group, send email to pandoc-...@googlegroups.com.
> To view this discussion on the web visit https://groups.google.com/d/msgid/pandoc-discus= s/7EE5FAC3-481F-468F-AFE1-E898FC1E5387%40gmail.com.
> For more options, visit https://groups.google.com/d/= optout.

--
You received this message because you are subscribed to the Google Groups &= quot;pandoc-discuss" group.
To unsubscribe from this group and stop receiving emails from it, send an e= mail to pand= oc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org.
To post to this group, send email to pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org.
To view this discussion on the web visit https://groups.google.com/d/= msgid/pandoc-discuss/e8873929-e613-43f6-98f9-a760a6e33772%40googlegroups.co= m.
For more options, visit http= s://groups.google.com/d/optout.
------=_Part_2395_2130348096.1422388820633-- ------=_Part_2394_1277091802.1422388820633--