From: John MacFarlane
Newsgroups: gmane.text.pandoc
Subject: new citeproc performance notes
Date: Sat, 12 Sep 2020 16:06:12 -0700

I tried out the new citeproc-enabled pandoc (from the citeproc branch), comparing it against pandoc 2.10.1 + pandoc-citeproc.

My test project was a document of about 30,000 words, using a 132K BibTeX bibliography. In addition to pandoc-citeproc, I used a Lua filter to count words.

For pandoc 2.10.1 + pandoc-citeproc, the build (to HTML) took 2.22s. With the new citeproc-enabled pandoc, it took 1.03s. By comparison, without --citeproc (skipping citation processing) it took 0.84s.

So the overhead for citation processing is now pretty minimal, and people should see a big speedup.
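
For anyone who wants the arithmetic behind those timings spelled out, here is a minimal sketch. The three numbers are the ones reported above; the variable names and the `summarize` helper are just illustrative and not part of pandoc. It also assumes the 0.84s run without --citeproc is a fair baseline for the 1.03s run, since both used the new pandoc.

```python
# Timings reported in the message, in seconds.
old_with_citeproc = 2.22     # pandoc 2.10.1 + pandoc-citeproc filter
new_with_citeproc = 1.03     # citeproc-enabled pandoc, with --citeproc
new_without_citeproc = 0.84  # citeproc-enabled pandoc, citations skipped


def summarize(old, new, baseline):
    """Return (speedup factor, citation overhead in seconds,
    overhead as a fraction of the new build time)."""
    speedup = old / new          # how many times faster the new build is
    overhead = new - baseline    # extra time spent on citation processing
    share = overhead / new       # portion of the new build that is citations
    return speedup, overhead, share


speedup, overhead, share = summarize(
    old_with_citeproc, new_with_citeproc, new_without_citeproc
)

print(f"speedup: {speedup:.2f}x")
print(f"citation overhead: {overhead:.2f}s ({share:.0%} of the build)")
```

On these figures the new build comes out a bit over twice as fast, with citation processing adding about 0.19s to a 1.03s build. These are single measurements on one document, so treat the ratios as rough.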