From mboxrd@z Thu Jan 1 00:00:00 1970 X-Msuck: nntp://news.gmane.io/gmane.text.pandoc/8054 Path: news.gmane.org!not-for-mail From: Nick Bart Newsgroups: gmane.text.pandoc Subject: Re: Decoupling citeproc and highlighting-kate from pandoc Date: Mon, 4 Nov 2013 00:19:43 -0800 (PST) Message-ID: References: <78a76dee-7c31-4323-8d66-21de3212d267@googlegroups.com> <7b646c3e-be17-44f0-b56a-d49519e63b6c@googlegroups.com> <20131028160326.GA22220@Johns-MacBook-Pro.local> <5c690247-2054-4198-87b3-92521dd489bb@googlegroups.com> <20131029044704.GD23351@Johns-MacBook-Pro.local> <25f97fa9-4ec7-4414-a13a-1d69e84e5fde@googlegroups.com> <663baeef-dcc3-4fc4-af80-f36e0a8a509c@googlegroups.com> <20131030064201.GB35821@Johns-MacBook-Pro.local> <55a6616f-948d-49eb-bbd5-5de5ef48344a@googlegroups.com> <20131030163434.GB40216@Johns-MacBook-Pro.local> <20131030175702.GA49946@Johns-MacBook-Pro.local> Reply-To: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org NNTP-Posting-Host: plane.gmane.org Mime-Version: 1.0 Content-Type: multipart/alternative; boundary="----=_Part_2022_30148799.1383553183859" X-Trace: ger.gmane.org 1383553183 13035 80.91.229.3 (4 Nov 2013 08:19:43 GMT) X-Complaints-To: usenet@ger.gmane.org NNTP-Posting-Date: Mon, 4 Nov 2013 08:19:43 +0000 (UTC) To: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Original-X-From: pandoc-discuss+bncBDIPX6WQ4INBBIFR3WJQKGQEAASUCNQ-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Mon Nov 04 09:19:49 2013 Return-path: Envelope-to: gtp-pandoc-discuss@m.gmane.org Original-Received: from mail-pa0-f55.google.com ([209.85.220.55]) by plane.gmane.org with esmtp (Exim 4.69) (envelope-from ) id 1VdFNy-00042G-5C for gtp-pandoc-discuss@m.gmane.org; Mon, 04 Nov 2013 09:19:46 +0100 Original-Received: by mail-pa0-f55.google.com with SMTP id fb1sf1460209pad.10 for ; Mon, 04 Nov 2013 00:19:45 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=googlegroups.com; s=20120806; h=date:from:to:message-id:in-reply-to:references:subject:mime-version :x-original-sender:reply-to:precedence:mailing-list:list-id :list-post:list-help:list-archive:sender:list-subscribe :list-unsubscribe:content-type; bh=0lyUCpEkeY/BtMy7usvxmuzR1srHfGO48ojEdXJ0qF8=; b=lkLb8+0a0blOxv3egCppE+sn/3m34dD5JXvZ+I+ShN4vGznwG6+UGgwAGSPUDi0fys BD72E504tp2YzavGsAFeg/5zHxOSs9FV3nIG8YOsY8aBee3bfq5Hx8ybpThqv/Bci2D4 vEqxxYXTWCqP3R9I0Zmln8M4B7MB0p3ImGyJUI/FVIRXK8s+YXYuAKKpcLJngzPuAS7Q D4CnB2O4LNOFcezHFJ0I9TiylIPGrQHspgmiw1l2ZSclboVrc2e3qnRYEL4W1SJIGF5g 2szVC3xPY4LdjbIcfFdFjjWP+J39tX2D5j5OgQuUyOjivNshRsSX0Ato2YYDI9EWUH2w jn4w== DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=date:from:to:message-id:in-reply-to:references:subject:mime-version :x-original-sender:reply-to:precedence:mailing-list:list-id :list-post:list-help:list-archive:sender:list-subscribe :list-unsubscribe:content-type; bh=0lyUCpEkeY/BtMy7usvxmuzR1srHfGO48ojEdXJ0qF8=; b=ojFmVFEMxR3tZP01pfuASyTLlPD7edR4/FKkowhuWz9N0DaYugSgBqoqN2/qKpeRwY XTnKuVMXjFO1BDFuzk1mf9ECDlZz6yak4U/ZNKJfS8bTJpfUm9s2bI0Iv7DYNcpZDc1o nK6bnSClVzyOV3xv1sVEvG0qyGPivZQHrXaGR03dM5Q16FdweVJx+/yeAbO39GSXd4EA Ket0frT4rS5Qw75W2vvC4drnA7u51mbPOICg6J2a/L0WwZMEjoSdllA7Vv1xpLiiAnjn h9pkGaTIyMnQht9/Q7/+JckQj0Cy4G2Isb2OJH00NTszsPz+vm5rrUahbdffrh1DSY9q tBJQ== X-Received: by 10.182.44.163 with SMTP id f3mr2845obm.34.1383553184995; Mon, 04 Nov 2013 00:19:44 -0800 (PST) X-BeenThere: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Original-Received: by 10.182.109.170 with SMTP id ht10ls1326418obb.7.gmail; Mon, 04 Nov 2013 00:19:44 -0800 (PST) X-Received: by 10.182.138.66 with SMTP id qo2mr118907obb.8.1383553184625; Mon, 04 Nov 2013 00:19:44 -0800 (PST) In-Reply-To: <20131030175702.GA49946-9Rnp8PDaXcadBw3G0RLmbRFnWt+6NQIA@public.gmane.org> X-Original-Sender: nickbart1980-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org Precedence: list Mailing-list: list pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org; contact pandoc-discuss+owners-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org List-ID: X-Google-Group-Id: 1007024079513 List-Post: , List-Help: , List-Archive: Original-Sender: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org List-Subscribe: , List-Unsubscribe: , Xref: news.gmane.org gmane.text.pandoc:8054 Archived-At: ------=_Part_2022_30148799.1383553183859 Content-Type: text/plain; charset=ISO-8859-1 I see that a lot of work is being put into pandoc-citeproc, and most likely not everything is finished yet. Just wanted to mention that biblio2yaml from the latest version seems to choke on escaped chars and double underscores in URLs, and removes single underscores from URLs (try biblio2yaml pandoc-citeproc/tests/biblio2yaml/markey.biblatex). On Wednesday, 30 October 2013 17:57:02 UTC, fiddlosopher wrote: > > +++ John MacFarlane [Oct 30 13 09:34 ]: > > +++ Nick Bart [Oct 30 13 01:13 ]: > > > Great. The output from biblio2yaml looks good, except that I think > there > > > should be only one backslash inside verbatim spans such as > > > `{\textnormal{}}` (see example below). > > > > That's proper YAML escaping for a single backslash. > > > > > The next step would be to let citeproc format the > elements > > > appropriately, and to fix the protection of braced lowercase strings: > > > Currently, neither \textnormal{{new. ser.}} nor {\textnormal{new. > ser.}} > > > works as I would expect it (for both, see example below). > > > > citeproc doesn't know about this yet. In fact, this aspect of things > > is a terrible mess right now. Text.CSL.Input.Bibtex is parsing > > everything into a pandoc structure, then writing this to a string > > (which goes in the Reference data structure); Text.CSL.Eval.Output > > and Text.CSL.Output.Pandoc then parse this string back into a pandoc > > structure. Currently this isn't very reliable, and it's needlessly > > complex. Hence my desire to cut out the intermediate string > > representation, and change everything so that the Reference > > and Cite structures have Pandoc fields instead of strings. Unfortunately > > this is a pretty big change and will take quite a lot of work. Andrea > > said he was going to work on it; I don't know if he has made progress. > > I've done a bit of preliminary cleaning in Text.CSL.Output.Pandoc. > > I think the project could be done in several steps: > > 1. FormattedOutput already contains FPan to hold a Pandoc [Inline]. > So the first step could be to change the routines that produce > FormattedOutput so that they only use FPan. > > 2. The Plain output module would then use writePlain from pandoc. > > 3. At this point, FormattedOutput could be removed and replaced with > pandoc [Inline] (or it could be made a synonym). > > 4. Output could probably be simplified at this point. E.g. I think > we could get rid of OUrl and just use OPan. > > 5. Now the big change: use [Inline] instead of String in Reference > and Affix. This would require lots of other changes, but the compiler > would tell us where they were needed. A lot of code could be > removed, here, because we wouldn't be reparsing strings for formatting. > > -- You received this message because you are subscribed to the Google Groups "pandoc-discuss" group. To unsubscribe from this group and stop receiving emails from it, send an email to pandoc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org To post to this group, send email to pandoc-discuss-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org To view this discussion on the web visit https://groups.google.com/d/msgid/pandoc-discuss/a0649979-54f8-4647-99ca-c98bec0d69cc%40googlegroups.com. For more options, visit https://groups.google.com/groups/opt_out. ------=_Part_2022_30148799.1383553183859 Content-Type: text/html; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable
I see that a lot of work is being put into pandoc-citeproc= , and most likely not everything is finished yet. Just wanted to mention th= at biblio2yaml from the latest version seems to choke on escaped chars and = double underscores in URLs, and removes single underscores from URLs (try b= iblio2yaml pandoc-citeproc/tests/biblio2yaml/markey.biblatex).

On We= dnesday, 30 October 2013 17:57:02 UTC, fiddlosopher wrote:
+++ John MacFarlane [Oct 30 13 09:34 ]:
> +++ Nick Bart [Oct 30 13 01:13 ]:
> > Great. The output from biblio2yaml looks good, except that I = think there=20
> > should be only one backslash inside verbatim spans such as=20
> > `{\textnormal{}}` (see example below).
>=20
> That's proper YAML escaping for a single backslash.
>=20
> > The next step would be to let citeproc format the <span>= ;</span> elements=20
> > appropriately, and to fix the protection of braced lowercase = strings:=20
> > Currently, neither \textnormal{{new. ser.}} nor {\textnormal{= new. ser.}}=20
> > works as I would expect it (for both, see example below).
>=20
> citeproc doesn't know about this yet.  In fact, this aspect o= f things
> is a terrible mess right now.  Text.CSL.Input.Bibtex is parsi= ng
> everything into a pandoc structure, then writing this to a string
> (which goes in the Reference data structure); Text.CSL.Eval.Output
> and Text.CSL.Output.Pandoc then parse this string back into a pand= oc
> structure.  Currently this isn't very reliable, and it's need= lessly
> complex.  Hence my desire to cut out the intermediate string
> representation, and change everything so that the Reference
> and Cite structures have Pandoc fields instead of strings. Unfortu= nately
> this is a pretty big change and will take quite a lot of work. &nb= sp;Andrea
> said he was going to work on it; I don't know if he has made progr= ess.

I've done a bit of preliminary cleaning in Text.CSL.Output.Pandoc.

I think the project could be done in several steps:

1.  FormattedOutput already contains FPan to hold a Pandoc [Inline= ].
So the first step could be to change the routines that produce
FormattedOutput so that they only use FPan.

2.  The Plain output module would then use writePlain from pandoc.

3.  At this point, FormattedOutput could be removed and replaced w= ith
pandoc [Inline] (or it could be made a synonym).

4.  Output could probably be simplified at this point.  E.g. = I think
we could get rid of OUrl and just use OPan.

5.  Now the big change:  use [Inline] instead of String in Re= ference
and Affix.  This would require lots of other changes, but the comp= iler
would tell us where they were needed.  A lot of code could be
removed, here, because we wouldn't be reparsing strings for formatting.

--
You received this message because you are subscribed to the Google Groups &= quot;pandoc-discuss" group.
To unsubscribe from this group and stop receiving emails from it, send an e= mail to pandoc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org
To post to this group, send email to pandoc-discuss-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org
To view this discussion on the web visit https://groups.google.com/d/msgid/pandoc-discuss/a0649979-54f8-4647= -99ca-c98bec0d69cc%40googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.
------=_Part_2022_30148799.1383553183859--