From mboxrd@z Thu Jan 1 00:00:00 1970 X-Msuck: nntp://news.gmane.io/gmane.text.pandoc/25937 Path: news.gmane.io!.POSTED.blaine.gmane.org!not-for-mail From: Joseph Reagle Newsgroups: gmane.text.pandoc Subject: Re: WIP: better citation processing Date: Thu, 20 Aug 2020 08:29:44 -0400 Message-ID: References: Reply-To: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable Injection-Info: ciao.gmane.io; posting-host="blaine.gmane.org:116.202.254.214"; logging-data="18781"; mail-complaints-to="usenet@ciao.gmane.io" User-Agent: Mozilla/5.0 (Macintosh; Intel Mac OS X 10.15; rv:68.0) Gecko/20100101 Thunderbird/68.11.0 To: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Original-X-From: pandoc-discuss+bncBD65ZAVVYEKRBPGZ7H4QKGQEKCIPJMA-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Thu Aug 20 14:29:53 2020 Return-path: Envelope-to: gtp-pandoc-discuss@m.gmane-mx.org Original-Received: from mail-pf1-f190.google.com ([209.85.210.190]) by ciao.gmane.io with esmtps (TLS1.3:ECDHE_RSA_AES_128_GCM_SHA256:128) (Exim 4.92) (envelope-from ) id 1k8jhY-0004js-VP for gtp-pandoc-discuss@m.gmane-mx.org; Thu, 20 Aug 2020 14:29:52 +0200 Original-Received: by mail-pf1-f190.google.com with SMTP id 4sf1152901pfd.23 for ; Thu, 20 Aug 2020 05:29:52 -0700 (PDT) ARC-Seal: i=2; a=rsa-sha256; t=1597926591; cv=pass; d=google.com; s=arc-20160816; b=dE/jZ74/zEWqjK5G2sru6vkpvUTDU4zMhNhfwpm4T85DG6zZ6+gOnVjOG5Giztra0m eWc7oVSWP+D+eWYe2dIjcPapVtJ0dUn1IaJcgzd4610D6yCXK6OXLkAx7ZmjhUbXE7Mo 0C4sKjj99pkNhKRAPClxI4xQ/6Zs5qADY8nWKm3RhpbykeshysxH3+MO36sKrBuxW7Lk tBoF/kZHYAfNStc/stXJsJ0izey5mZ43d6K6KgP+TaQZvU1lo8rSuYmxKQr7GcEsDT4l XiUnYfxukIKWfvJ/9ub7ooV0tPaw5oHi0fi1dLc8XZTfwBJ+U9H+MBQ4Fqs9w6jRAbB3 vKlw== ARC-Message-Signature: i=2; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-unsubscribe:list-subscribe:list-archive:list-help:list-post :list-id:mailing-list:precedence:reply-to:content-transfer-encoding :content-language:in-reply-to:mime-version:user-agent:date :message-id:from:references:to:subject:sender:dkim-signature; bh=hE6pjb5UNVzET6Q5VGEzjmqCxz8nTUS90VWD48DY69U=; b=AiiKspP5DxxyWE9ZmAKLGhTgD4mjdGAzYJ+s34481NM1Kbnt4KUQopOSPch8Vast/B TMvFIPylLYppTrAMx5FW7zHLHXZFRXmmPiU1ch7zycYpbtoWa4N3gthYh2Fn4CoKUQ7z OIJeKrbToCkM7G3L2k84BXob3uYDm3EEWzPyqCTTdMfhUv6qbCostat498BvPil99xrM uyXF8YdiORc+mzPM1fu2OcmJxySkRNNeJKlhQ62HUOSacMHAn3lDf8P7SLyb2eEDOm9l TVbz9CGQXKYm4+5dzpKZ2+uCSQcnIeFBixQM6BIvrWzj+s2foZSC5Pchm5GkWLtLFngI FVLw== ARC-Authentication-Results: i=2; gmr-mx.google.com; dkim=pass header.i=@reagle.org header.s=default header.b=ekAHdU+u; spf=pass (google.com: domain of joseph.2011-T1oY19WcHSwdnm+yROfE0A@public.gmane.org designates 23.83.209.22 as permitted sender) smtp.mailfrom=joseph.2011-T1oY19WcHSwdnm+yROfE0A@public.gmane.org DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=googlegroups.com; s=20161025; h=sender:subject:to:references:from:message-id:date:user-agent :mime-version:in-reply-to:content-language:content-transfer-encoding :x-original-sender:x-original-authentication-results:reply-to :precedence:mailing-list:list-id:list-post:list-help:list-archive :list-subscribe:list-unsubscribe; bh=hE6pjb5UNVzET6Q5VGEzjmqCxz8nTUS90VWD48DY69U=; b=kxQktxRAI47HuF5h9GfaiUzMTjUcIJOeJ0d8y/mMI8tTwPw5oinBsR+o1EPYWXvPtQ BUFbXGdOFvOjfSnUicIrvplOtx+Kd/FW9HHFmJefQALuZM0WJ1ROMbAtEysc+ShiW6Rz r0CgSkTwdw3XvmR0D8NiOJHo/6qjfG4bX5QRUdY/EfbSy5lYLiuVCo4OFJGQr6LfnYqz QPSauNS+TCmVOpcgIwL2LVtYGkvg419yEXb6SbH+mNYhzn1Ie8EJCBUDyoGDsv/Z/zOi gIM3XiXQ1Hjv4RJZLyShgKInijW2eF/ChVJ6ZaR+ag84SzKUaTdFTcIwaptz0hOx+05a 9tFA== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=sender:x-gm-message-state:subject:to:references:from:message-id :date:user-agent:mime-version:in-reply-to:content-language :content-transfer-encoding:x-original-sender :x-original-authentication-results:reply-to:precedence:mailing-list :list-id:x-spam-checked-in-group:list-post:list-help:list-archive :list-subscribe:list-unsubscribe; bh=hE6pjb5UNVzET6Q5VGEzjmqCxz8nTUS90VWD48DY69U=; b=CsFuE6EDhtsyjQuxDEszISelEweVcQ5KW0py0xhv2LNlYs7pPIOSauJ1JPl6BJgMR1 z+/oB3lzox2q89uj/QpzdGY78ac6HJvvbOMPtXEg1QmSreeotU5X6hX8Nho8Pk5y84Xm /cl2yJ1+JnYSsDaBKuSpwXFHv6OSh9e9LSdUoolm2mX06wVHRUFLTLEh+Al+ry5UPTL3 ZC0FSL+EWI8w/MCzJCk6K8FXyVn7zYv8yqe3FxREuCJ/PX56at8nPTju/xEkmtjywNIA XFner7gT++dNQyfEwl44ngJyMH+airKV/Mns Original-Sender: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org X-Gm-Message-State: AOAM532UeDxWWcXP4ldW0a9fPQWMDfBYFRvfctlzb26Df2SmUlF4d8+B b1pa+Ct9HsH7ANxFP7ohQHo= X-Google-Smtp-Source: ABdhPJyq9Kf3YIMw2s5Cho+a7AhlgpFlVpDEwAZZax9gjkmXpBNbdLVs/1/xD8EqaI/q0bnLR/efaw== X-Received: by 2002:a62:1997:: with SMTP id 145mr2115374pfz.201.1597926591778; Thu, 20 Aug 2020 05:29:51 -0700 (PDT) X-BeenThere: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Original-Received: by 2002:a17:90b:1988:: with SMTP id mv8ls835111pjb.0.gmail; Thu, 20 Aug 2020 05:29:48 -0700 (PDT) X-Received: by 2002:a17:902:b594:: with SMTP id a20mr2378258pls.339.1597926588500; Thu, 20 Aug 2020 05:29:48 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1597926588; cv=none; d=google.com; s=arc-20160816; b=KR03nAHIMNxoqnHtKc8h+DDCiuzBePUxka0IsFVT5zbxszrT12yerkk/BsqLfkU1qN sstGu4j7XDWk7aQ99Vlopg7NfV8l+dd8AtD0MWf8d7BZXphJ+ym3gXHh2afL3kMPv9hy VH/HzHybUsCoXbKnHm0SU8JPH+eX8y5AqerYKkND64mNrSrDjZ+1lCQsYDZbnrdiAW2P p4iE7xX2ASTNobNAhfBLIEK+v5T5It21irBR/KqjaWkWfplejfuPzUfbbCZcCLPK/Urm YhiY79abIKOutFJDCmXl1hwHhgxqF6okoNZTQDqfIKYa/luHzL7602AH1ZZWf7CNFEL0 Opug== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=content-transfer-encoding:content-language:in-reply-to:mime-version :user-agent:date:message-id:from:references:to:subject :dkim-signature; bh=FjGpsc5WvqHQ/ytXW+SLD5gVgTNvNEvLG2ZXwPzZXwI=; b=gkUi25Is6SEIIiAAXzngmIFGKLy3bXTbK3KisuDOQ8LmYejHjy1r0pn1QIdn92nY0+ 0a4DipKlnmSgQdh/AXi3la3t/Fqkm0R8AgM4bLzdcUSLgBy9I4E4rN4LXidPxAsrwdfs Bl6SdsEoT8ligYavaEOXFxUnl/ITNeFmmLTf+mfefORI/HEDi576tJ5C09UtuMgXydA7 dRJHOsBLT6p71vIteHSZubIzr5XwV0fk695EFcraEjF2ZIJtwN/oWcRrfd29AFw3yrBw EElMU/xQyRFLNeSAr6Zz7FsvfngHqgOlJqBOjhyjV0V/km0C3aXtWBwoPOS2NpBFMWqb zUEg== ARC-Authentication-Results: i=1; gmr-mx.google.com; dkim=pass header.i=@reagle.org header.s=default header.b=ekAHdU+u; spf=pass (google.com: domain of joseph.2011-T1oY19WcHSwdnm+yROfE0A@public.gmane.org designates 23.83.209.22 as permitted sender) smtp.mailfrom=joseph.2011-T1oY19WcHSwdnm+yROfE0A@public.gmane.org Original-Received: from bonobo.birch.relay.mailchannels.net (bonobo.birch.relay.mailchannels.net. [23.83.209.22]) by gmr-mx.google.com with ESMTPS id n2si130807pfo.5.2020.08.20.05.29.47 for (version=TLS1_2 cipher=ECDHE-ECDSA-AES128-GCM-SHA256 bits=128/128); Thu, 20 Aug 2020 05:29:47 -0700 (PDT) Received-SPF: pass (google.com: domain of joseph.2011-T1oY19WcHSwdnm+yROfE0A@public.gmane.org designates 23.83.209.22 as permitted sender) client-ip=23.83.209.22; X-Sender-Id: a2hosting|x-authuser|joseph-T1oY19WcHSwdnm+yROfE0A@public.gmane.org Original-Received: from relay.mailchannels.net (localhost [127.0.0.1]) by relay.mailchannels.net (Postfix) with ESMTP id 0BFDF3617D8 for ; Thu, 20 Aug 2020 12:29:47 +0000 (UTC) Original-Received: from az1-ss21.a2hosting.com (100-96-5-145.trex.outbound.svc.cluster.local [100.96.5.145]) (Authenticated sender: a2hosting) by relay.mailchannels.net (Postfix) with ESMTPA id 6C6F1361637 for ; Thu, 20 Aug 2020 12:29:46 +0000 (UTC) X-Sender-Id: a2hosting|x-authuser|joseph-T1oY19WcHSwdnm+yROfE0A@public.gmane.org Original-Received: from az1-ss21.a2hosting.com (az1-ss21.a2hosting.com [68.66.224.43]) (using TLSv1.2 with cipher DHE-RSA-AES256-GCM-SHA384) by 0.0.0.0:2500 (trex/5.18.8); Thu, 20 Aug 2020 12:29:46 +0000 X-MC-Relay: Neutral X-MailChannels-SenderId: a2hosting|x-authuser|joseph-T1oY19WcHSwdnm+yROfE0A@public.gmane.org X-MailChannels-Auth-Id: a2hosting X-Absorbed-Spill: 41f100d253b503b3_1597926586749_1582443202 X-MC-Loop-Signature: 1597926586749:1621619825 X-MC-Ingress-Time: 1597926586749 Original-Received: from c-73-149-23-48.hsd1.ma.comcast.net ([73.149.23.48]:56849 helo=[192.168.0.50]) by az1-ss21.a2hosting.com with esmtpsa (TLS1.2) tls TLS_ECDHE_RSA_WITH_AES_128_GCM_SHA256 (Exim 4.93) (envelope-from ) id 1k8jhR-00057J-GW for pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org; Thu, 20 Aug 2020 05:29:45 -0700 In-Reply-To: Content-Language: en-US X-AuthUser: joseph-T1oY19WcHSwdnm+yROfE0A@public.gmane.org X-Original-Sender: joseph.2011-T1oY19WcHSwdnm+yROfE0A@public.gmane.org X-Original-Authentication-Results: gmr-mx.google.com; dkim=pass header.i=@reagle.org header.s=default header.b=ekAHdU+u; spf=pass (google.com: domain of joseph.2011-T1oY19WcHSwdnm+yROfE0A@public.gmane.org designates 23.83.209.22 as permitted sender) smtp.mailfrom=joseph.2011-T1oY19WcHSwdnm+yROfE0A@public.gmane.org Precedence: list Mailing-list: list pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org; contact pandoc-discuss+owners-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org List-ID: X-Google-Group-Id: 1007024079513 List-Post: , List-Help: , List-Archive: , List-Unsubscribe: , Xref: news.gmane.io gmane.text.pandoc:25937 Archived-At: On 8/16/20 6:59 PM, John MacFarlane wrote: > I've used this library to create a new filter, > new-pandoc-citeproc, which now passes most of the pandoc-citeproc > test suite but runs around 6 times faster. Awesome! I've topped 10k items in my YAML file. Even though you kindly sped= up some citeproc bottlenecks for me in the past, I continue to "subset" th= e 10k file into document-specific YAML files when building a document. That= is, it's faster for me to regex my document for citations and pull their e= ntries out of the 10k YAML file before handing it off to pandoc than just u= sing pandoc itself. I'd love to drop all of this! (Though, to be fair, this is fast because my = YAML is super-contstrained and need not be properly parsed, only "chunked."= ) --=20 You received this message because you are subscribed to the Google Groups "= pandoc-discuss" group. To unsubscribe from this group and stop receiving emails from it, send an e= mail to pandoc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org To view this discussion on the web visit https://groups.google.com/d/msgid/= pandoc-discuss/e94c6f9d-e6d8-099a-4bf2-7aed30476a6c%40reagle.org.