From mboxrd@z Thu Jan 1 00:00:00 1970 X-Msuck: nntp://news.gmane.io/gmane.text.pandoc/12094 Path: news.gmane.org!not-for-mail From: John MacFarlane Newsgroups: gmane.text.pandoc Subject: Re: Google Summer of Code 2015 Date: Sat, 21 Feb 2015 15:27:39 -0800 Message-ID: <20150221232739.GA42324@localhost.hsd1.ca.comcast.net> References: <20150221231559.GG42178@localhost.hsd1.ca.comcast.net> Reply-To: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org NNTP-Posting-Host: plane.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=UTF-8; format=flowed X-Trace: ger.gmane.org 1424561284 31029 80.91.229.3 (21 Feb 2015 23:28:04 GMT) X-Complaints-To: usenet@ger.gmane.org NNTP-Posting-Date: Sat, 21 Feb 2015 23:28:04 +0000 (UTC) To: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Original-X-From: pandoc-discuss+bncBCJZJHG45QDBB6NIUSTQKGQEWIHRXQA-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Sun Feb 22 00:27:55 2015 Return-path: Envelope-to: gtp-pandoc-discuss@m.gmane.org Original-Received: from mail-pa0-f62.google.com ([209.85.220.62]) by plane.gmane.org with esmtp (Exim 4.69) (envelope-from ) id 1YPJSk-00021A-UJ for gtp-pandoc-discuss@m.gmane.org; Sun, 22 Feb 2015 00:27:55 +0100 Original-Received: by padfa1 with SMTP id fa1sf3752031pad.6 for ; Sat, 21 Feb 2015 15:27:54 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=googlegroups.com; s=20120806; h=from:date:to:subject:message-id:references:mime-version :content-type:content-disposition:in-reply-to:user-agent :x-original-sender:x-original-authentication-results:reply-to :precedence:mailing-list:list-id:list-post:list-help:list-archive :sender:list-subscribe:list-unsubscribe; bh=MSseTyr28eh+etYnvAha9i4NWuTbUH6OrVCDFp+Y3Ko=; b=qGpXD+YPhV2uArlcAnFKvypYozsLEqW1xXaoErP8gs+epJDTRKcUvpxVUUcpPzabAR KRNuRWXcvS6X6wbV7DdYKpn9acsreZVcR+xeguLyG9sC56ALJexKMwlH8ot27sO8Xjln LrCU/o5Mzh7ce27TJ1OD8vacXNz4fUcrWpcCX4XE2i3tXxeNbe+oge7P0OWTBWbYL8Fq znvHbCfFhAoN8zT60iRn0k+2FuldaXOA8wmDlbwIncOi+VdE4fR0ajZi3oIrYOYctCDP bUBXN17p9jqlcHMbOa24cg8A0WaRFmCRVhvT18nChtul0BRjc8Idwm5gK2IblYQ1kYdL XOkg== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20130820; h=x-gm-message-state:from:date:to:subject:message-id:references :mime-version:content-type:content-disposition:in-reply-to :user-agent:x-original-sender:x-original-authentication-results :reply-to:precedence:mailing-list:list-id:list-post:list-help :list-archive:sender:list-subscribe:list-unsubscribe; bh=MSseTyr28eh+etYnvAha9i4NWuTbUH6OrVCDFp+Y3Ko=; b=Ibd6WF1r5r6Wmvn/yvZpe+MP35tyIJOsaCjlAHODXkKEB2aOtiWP3iyGbfkd5dP/XQ OkbiOhVqtaWypRkDdxJlmVpg/dEcxtba9BOs58cEakKhbJFXQ/UnJvz7vaA5RwWYvMpc h/E5I6TOBAcEurD6fs2m4tEiaCZDFi0Ia46AT4T58LqlkJPfgTcz8T+PEn1TqgDpbXQO R6fPiVhb3m9Tcxiz+bwDW+DPbYPx7Xbfm+Oejf9DaE/iQ4Fh5xW+v+daUp3OLpfj8sTX aoQNLcsVqgSoF3ARkulcrTJu2rnw14QuQYrJX78wwsRDUzVc8HRw91LvuWOJ1mn/M0eM yL X-Received: by 10.182.107.228 with SMTP id hf4mr24452obb.6.1424561274044; Sat, 21 Feb 2015 15:27:54 -0800 (PST) X-BeenThere: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Original-Received: by 10.182.227.166 with SMTP id sb6ls476849obc.3.gmail; Sat, 21 Feb 2015 15:27:53 -0800 (PST) X-Received: by 10.182.94.204 with SMTP id de12mr5159947obb.13.1424561273574; Sat, 21 Feb 2015 15:27:53 -0800 (PST) Original-Received: from mail-pd0-f172.google.com (mail-pd0-f172.google.com. [209.85.192.172]) by gmr-mx.google.com with ESMTPS id ks4si3723790pbc.1.2015.02.21.15.27.53 for (version=TLSv1.2 cipher=ECDHE-RSA-AES128-GCM-SHA256 bits=128/128); Sat, 21 Feb 2015 15:27:53 -0800 (PST) Received-SPF: pass (google.com: domain of jgm-TVLZxgkOlNX2fBVCVOL8/A@public.gmane.org designates 209.85.192.172 as permitted sender) client-ip=209.85.192.172; Original-Received: by pdno5 with SMTP id o5so16247870pdn.8 for ; Sat, 21 Feb 2015 15:27:53 -0800 (PST) X-Gm-Message-State: ALoCoQmCarVu8INBy2JMTMiFhoXITA74qcs4W4XhGeDd7RTNe0IFdqqRPKosLEIXW6++xSnpBZIu X-Received: by 10.70.123.132 with SMTP id ma4mr7726891pdb.12.1424561273387; Sat, 21 Feb 2015 15:27:53 -0800 (PST) Original-Received: from johnmacfarlane.net (li55-134.members.linode.com. [74.82.3.134]) by mx.google.com with ESMTPSA id r7sm23538480pdo.90.2015.02.21.15.27.51 for (version=TLSv1.2 cipher=ECDHE-RSA-AES128-GCM-SHA256 bits=128/128); Sat, 21 Feb 2015 15:27:51 -0800 (PST) Original-Received: by johnmacfarlane.net (Postfix, from userid 1000) id 167D2A31F; Sat, 21 Feb 2015 18:27:40 -0500 (EST) Content-Disposition: inline In-Reply-To: <20150221231559.GG42178-bi+AKbBUZKbivNSvqvJHCtPlBySK3R6THiGdP5j34PU@public.gmane.org> X-PGP-Key: http://johnmacfarlane.net/jgm.asc User-Agent: Mutt/1.5.23 (2014-03-12) X-Original-Sender: jgm-TVLZxgkOlNX2fBVCVOL8/A@public.gmane.org X-Original-Authentication-Results: gmr-mx.google.com; spf=pass (google.com: domain of jgm-TVLZxgkOlNX2fBVCVOL8/A@public.gmane.org designates 209.85.192.172 as permitted sender) smtp.mail=jgm-TVLZxgkOlNX2fBVCVOL8/A@public.gmane.org Precedence: list Mailing-list: list pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org; contact pandoc-discuss+owners-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org List-ID: X-Google-Group-Id: 1007024079513 List-Post: , List-Help: , List-Archive: , List-Unsubscribe: , Xref: news.gmane.org gmane.text.pandoc:12094 Archived-At: +++ John MacFarlane [Feb 21 15 15:15 ]: >+++ Matthew Pickering [Feb 20 15 21:45 ]: >>Dear list, >> >>I have created a ticket on the Haskell.org GSoC idea list for one of the >>ideas which John suggested[1] would be suitable. It is important that that >>projects are fairly freestanding (not requiring intimate knowledge of the >>architecture) and are selected based on utility to the Haskell community as >>a whole. >> >>If anyone has any comments, feel free to edit the proposal or reply here >>and I will make the changes. >> >>https://ghc.haskell.org/trac/summer-of-code/ticket/1660#ticket > >It's a good proposal. Some further motivation: pandoc's current >Markdown parser is not very efficient. It even goes exponential on some >inputs, which is not good for web use. > >I've already developed algorithms for parsing CommonMark efficiently, >without backtracking. They are so much more efficient than what pandoc >currently does that even the JavaScript implementation of commonmark is >3-4 times faster than pandoc, and the C implementation is 30-40 times >faster. > >So I'd hope for a 10X speedup with a rewrite. Further thoughts on this. The best GSOC projects benefit the whole community (infrastructure). So, it might make more sense to write a standalone CommonMark parser library with a liberal (BSD) license, that could be used in other Haskell projects (e.g., potentially, in some future version of Haddock). If this is made extensible, pandoc could simply use this library. But the library would also be available for other purposes. Pandoc itself won't ever be relicensed. But I think it would be a good idea, if contributors will agree, to dual license pandoc-types BSD3/GPL. (There are only a few commits by people other than me, so this is feasible.) This would allow authors of more liberally licensed libraries to include a nice "handle" or interface with pandoc. For example, a BSD licensed CommonMark parsing library could provide an option to parse to a Pandoc structure, which could then be consumed by any of the pandoc writers.