From mboxrd@z Thu Jan 1 00:00:00 1970 X-Msuck: nntp://news.gmane.io/gmane.text.pandoc/25152 Path: news.gmane.io!.POSTED.ciao.gmane.io!not-for-mail From: John MacFarlane Newsgroups: gmane.text.pandoc Subject: Re: Getting Citations in Wikipedia page to convert over to HTML, Docx, LaTeX. Date: Thu, 07 May 2020 14:41:37 -0700 Message-ID: References: <52683ae4-6dc6-45cd-8e2f-66b1226d6b08@googlegroups.com> Reply-To: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable Injection-Info: ciao.gmane.io; posting-host="ciao.gmane.io:159.69.161.202"; logging-data="100466"; mail-complaints-to="usenet@ciao.gmane.io" To: John McCorkle , pandoc-discuss Original-X-From: pandoc-discuss+bncBCJZJHG45QDBBHUB2L2QKGQESIQ6H3A-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Thu May 07 23:41:54 2020 Return-path: Envelope-to: gtp-pandoc-discuss@m.gmane-mx.org Original-Received: from mail-qv1-f63.google.com ([209.85.219.63]) by ciao.gmane.io with esmtps (TLS1.3:ECDHE_RSA_AES_128_GCM_SHA256:128) (Exim 4.92) (envelope-from ) id 1jWoHB-000Q1M-Qa for gtp-pandoc-discuss@m.gmane-mx.org; Thu, 07 May 2020 23:41:53 +0200 Original-Received: by mail-qv1-f63.google.com with SMTP id o18sf7268449qvu.8 for ; Thu, 07 May 2020 14:41:53 -0700 (PDT) ARC-Seal: i=2; a=rsa-sha256; t=1588887713; cv=pass; d=google.com; s=arc-20160816; b=uGMTfOupGVNLF6iX1/gAs8pb8rlnT4xJoyfLJqGgQRSn/QIL5eMHelU8ve5h4woXFg lxyBBikz2XFkBBpmf5uVsVHX6dBeowASrtQhf4UWWzQ+2H7kCfKAkQzXV0s1/pIJiDpn ws1GMLrmiWTKvjnRHP4a92rJtvHBRB4D5jE9MICYGGdUzTb4fpFwm38op/ZeVGPGPW8Z RjX6jq8hZADlbyWOWbzZk0zvGc/y/DNnjPxJ0lOoDB5S1U5gF7ybKz+fp12vEsgCATWz QgiQVgDlOtWBf87IOyThoKX4Fsirf/sxRcv+Ko2CWbvDzgXw4d/pQsWibiurgWXQ21Gf olLQ== ARC-Message-Signature: i=2; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-unsubscribe:list-subscribe:list-archive:list-help:list-post :list-id:mailing-list:precedence:reply-to:content-transfer-encoding :mime-version:message-id:date:references:in-reply-to:subject:to:from :sender:dkim-signature; bh=fdlIzygIyKTtgRsLlnk6NaHD1EsrUsPaha7VptQ7FEw=; b=OcjRZuyIb5eHYrnkmxLFNAsJQbXpquu73uZxkLeV7GwlYbkBJNrpCG3wdwEBdQ8anf oc08vridedu1IYiE9soCQ4fFugGpl1cMUEMEkFr3oXFonzeieESOm8aY9prUEBZVoHkt lAmnvpBq0yRLEPqsmSlaeKp7bNLnn6XQbzmNgckDS/T3CFtaaQo0znKSCB7oLfS3dUzA BSEzI5VrkGBcBcPXYYltvvPwCGPuI6/Wv1NRFrzAcCMinaN1lIU1A1ZQ9pIarR4ezlQ7 Bsedi1jP4oaw/wjV/X3GQf1zMzvsjmaFr5R+Ov5cKGxtp0uQo5yC4D5y+Gf17olVh4z+ Xong== ARC-Authentication-Results: i=2; gmr-mx.google.com; dkim=pass header.i=@berkeley-edu.20150623.gappssmtp.com header.s=20150623 header.b=IYaSerhP; spf=pass (google.com: domain of jgm-TVLZxgkOlNX2fBVCVOL8/A@public.gmane.org designates 2607:f8b0:4864:20::636 as permitted sender) smtp.mailfrom=jgm-TVLZxgkOlNX2fBVCVOL8/A@public.gmane.org DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=googlegroups.com; s=20161025; h=sender:from:to:subject:in-reply-to:references:date:message-id :mime-version:content-transfer-encoding:x-original-sender :x-original-authentication-results:reply-to:precedence:mailing-list :list-id:list-post:list-help:list-archive:list-subscribe :list-unsubscribe; bh=fdlIzygIyKTtgRsLlnk6NaHD1EsrUsPaha7VptQ7FEw=; b=QyMNI6CZzFGVrYEk7Wi8nv8a7Z+I92UgnJfyaypp6EkV+BhQIQoOIwTAlWcNhYRLTy A/6Myl9Vm33u6tVhko09OjCU5vjV37Yz5DeA4oMF1gshGSDbzpSjxJCZ0+qHNLfF1Li1 AbYqggYd/Jo+ODK9k28T0XkCwFgtAja68NEAHjeGSvNoR+Nhqqi5m9MQ9mulQvFkOnGd yhMFTzLW95OmPbd9WVn8JAu62qGG45oIZ8kAMduXvSdqVyqtYs4fqORqlHYa0kgVoBpe F66Te+VjL1/3xnTqcW784oq4FfH+loY472Gt293ctWj+soGaksma+RsMqWfOzu5L3zo2 ZVEA== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=sender:x-gm-message-state:from:to:subject:in-reply-to:references :date:message-id:mime-version:content-transfer-encoding :x-original-sender:x-original-authentication-results:reply-to :precedence:mailing-list:list-id:x-spam-checked-in-group:list-post :list-help:list-archive:list-subscribe:list-unsubscribe; bh=fdlIzygIyKTtgRsLlnk6NaHD1EsrUsPaha7VptQ7FEw=; b=PUfnXN1bLtOJelxTzMdrEPGq5C7VmKw8tRaWM0KGHWCaO9Kci2Iw/3DYsY599+/Kz/ ffWXPpL/U1cyN2UeJ9dN8EXNJX+q6FTTW8eToBFpMO8Btj6vCcKSQKb4u8F6b5APw8iC OmajRHydRHsivRCpoYWPRscVHg+pr13k6jMFuMVKuGLmJmIySomy7OFF7hJEsXfL+yeZ jg96VzLsD/yDdYlZJlwSRbiTJUi+qUeVU2vsJqq55AMMkW3fQgC01nkVJtS+rzOskvrK oB5PuzTkrUPgBk2ByH64Juewk7okHK6UPrBNrkSku0+MwDFiE44cUGSERkWTrk5hL5Ox Original-Sender: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org X-Gm-Message-State: AGi0PuYcWUeGc/CZjJEOjk1nFsKhfB07SBWYQcS3xmVboh17lqrOA9CQ KuOjnLZvWS/PuIOcsx0FZlw= X-Google-Smtp-Source: APiQypJG6qocKZiqAz02o5ko5z/QdCkm+MYYj9xc72tnP4Xw1sdrTZjDRAHzhy8pJamo6OTGJmAmyA== X-Received: by 2002:a37:a348:: with SMTP id m69mr17323112qke.31.1588887712881; Thu, 07 May 2020 14:41:52 -0700 (PDT) X-BeenThere: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Original-Received: by 2002:aed:3fd5:: with SMTP id w21ls5177753qth.7.gmail; Thu, 07 May 2020 14:41:50 -0700 (PDT) X-Received: by 2002:ac8:32a4:: with SMTP id z33mr17425342qta.363.1588887710116; Thu, 07 May 2020 14:41:50 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1588887710; cv=none; d=google.com; s=arc-20160816; b=RpgIXt2BUl2ozE5BvJUp5XhRQXWfXkkrMUnnP0ACy04AZtqevHszMQA2+YyO8K82pi 0gadqp/HDnSfvY4TXcKz5gH/Myr3wjd0h/tl28WFY9ye3S9doP6fKZsfA80OTMWr20EL kmR5QGymi7PF3bcyKmd+Sj0el3UWaddtqKYT9+QrlJHngoASJeNjltriqWVLAc8QhyR/ vO49MYKF/PcEkV4h1GaE9Q2hAf9QtA0TZqOyvz2vDKMvQlaNCfm88P4qh/6ktd0zmQdC W503qSBwXbbQ993ZVJV6y6ayDydk4+HGTAu44Tud3x57O5qC+giMn/aNRcQU2xz8urrG O26w== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=content-transfer-encoding:mime-version:message-id:date:references :in-reply-to:subject:to:from:dkim-signature; bh=NleySg50eLmESU1UynKHNznv8OnOHpK5igwaPjogNx0=; b=MaNGf6eY6VljQQZe48WvKomMB9+CFFNnyimgrSAZCdWHkSRUmJGExgPOrQPrcuUkur deNutl3NqgE7R22IqIIwfRc8+VF1GXe+OFMO3wO4K1onS2CHYuEi0flh5biFBoFIu1DX CjQMDN5xloo2NGPU7vqvJID4V93KazCTEiCN9EaQUq1UVHg6JIRf7+nCDYQO8AR+8EZI 4kCR1t+/8RgpEPo1oIiKuSvQ6UjjvWXgMZ62L/Fl/r+Yq2fKJAPt4DRtl87SBxX9GA8U MI7kER64EV/SWwwoEjeEuCGz1h7dl5tkPU7OPc16NLh1uQCiaa0v5a9GcZfQs2LDc0pE fkZQ== ARC-Authentication-Results: i=1; gmr-mx.google.com; dkim=pass header.i=@berkeley-edu.20150623.gappssmtp.com header.s=20150623 header.b=IYaSerhP; spf=pass (google.com: domain of jgm-TVLZxgkOlNX2fBVCVOL8/A@public.gmane.org designates 2607:f8b0:4864:20::636 as permitted sender) smtp.mailfrom=jgm-TVLZxgkOlNX2fBVCVOL8/A@public.gmane.org Original-Received: from mail-pl1-x636.google.com (mail-pl1-x636.google.com. [2607:f8b0:4864:20::636]) by gmr-mx.google.com with ESMTPS id f3si516720qkh.5.2020.05.07.14.41.50 for (version=TLS1_3 cipher=TLS_AES_128_GCM_SHA256 bits=128/128); Thu, 07 May 2020 14:41:50 -0700 (PDT) Received-SPF: pass (google.com: domain of jgm-TVLZxgkOlNX2fBVCVOL8/A@public.gmane.org designates 2607:f8b0:4864:20::636 as permitted sender) client-ip=2607:f8b0:4864:20::636; Original-Received: by mail-pl1-x636.google.com with SMTP id f15so2615074plr.3 for ; Thu, 07 May 2020 14:41:50 -0700 (PDT) X-Received: by 2002:a17:902:a511:: with SMTP id s17mr15465431plq.33.1588887709050; Thu, 07 May 2020 14:41:49 -0700 (PDT) Original-Received: from johnmacfarlane.net (li55-134.members.linode.com. [74.82.3.134]) by smtp.gmail.com with ESMTPSA id s10sm2815758pgq.71.2020.05.07.14.41.47 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Thu, 07 May 2020 14:41:48 -0700 (PDT) Original-Received: by johnmacfarlane.net (Postfix, from userid 1000) id 513CAA256; Thu, 7 May 2020 17:41:37 -0400 (EDT) In-Reply-To: <52683ae4-6dc6-45cd-8e2f-66b1226d6b08-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org> X-Original-Sender: jgm-TVLZxgkOlNX2fBVCVOL8/A@public.gmane.org X-Original-Authentication-Results: gmr-mx.google.com; dkim=pass header.i=@berkeley-edu.20150623.gappssmtp.com header.s=20150623 header.b=IYaSerhP; spf=pass (google.com: domain of jgm-TVLZxgkOlNX2fBVCVOL8/A@public.gmane.org designates 2607:f8b0:4864:20::636 as permitted sender) smtp.mailfrom=jgm-TVLZxgkOlNX2fBVCVOL8/A@public.gmane.org Precedence: list Mailing-list: list pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org; contact pandoc-discuss+owners-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org List-ID: X-Google-Group-Id: 1007024079513 List-Post: , List-Help: , List-Archive: , List-Unsubscribe: , Xref: news.gmane.io gmane.text.pandoc:25152 Archived-At: You might have better luck converting the HTML version of the wikipedia page. See https://groups.google.com/d/msg/pandoc-discuss/ptiLha5vJ2I/bPJvyLw0BAAJ John McCorkle writes: > I need to convert a Wikipedia page I wrote, to HTML and to either LaTex o= r=20 > Docx. > I go to the page here=20 > (https://en.wikipedia.org/wiki/User:JohnM7190/John%27s_Noise_Figure_Page)= ,=20 > click on the "edit source" tab, select and copy the source text, and then= =20 > click on the "read" tab so I don't risk actually editing anything. I past= e=20 > that text into Notepad++ and use several regular expression search/replac= e=20 > operations to eliminate the styles (since Pand= oc=20 > does not recognize them), but keeps the equation and the equation referen= ce=20 > number they contain plus fixes the {{EquationNote|x}} references to thos= e=20 > equations. That gets saved, UTF-8 encoded, as my source.wiki file. Pandoc= =20 > converts my source.wiki file to all three output formats pretty well exce= pt=20 > the citations don't come across. > > Can someone please tell me how to modify the citations in my source.wiki= =20 > file so the citations get converted properly (i.e. both first use of the= =20 > citation, and additional references to the same citation), and end up=20 > listed at the end of the article the same way they do on the Wikipedia pa= ge? > > For example, on first use, one of my citations is: > {{Cite=20 > book|url=3Dhttps://cds.cern.ch/record/105963|title=3DCommunication system= =20 > principles|last=3DPeebles|first=3DPeyton=20 > Z.|date=3D1976|publisher=3DAddison-Wesley|year=3D|isbn=3D|location=3DRead= ing,=20 > MA|pages=3D457}} > > and then other references to it are: > > > There are several types of references, like > > {{Cite journal|last=3DFriis|first=3DH. T.|date=3DJuly=20 > 1944|title=3DNoise Figures of Radio Receivers|url=3D|journal=3DProceeding= s of the=20 > IRE|volume=3D32|issue=3D7|pages=3D419=E2=80=93422|doi=3D10.1109/JRPROC.19= 44.232049|issn=3D0096-8390|via=3D}}[https://ieeexplore.ieee.org/abstract/do= cument/1695024] > > {{Cite=20 > web|url=3Dhttp://www.electropedia.org/iev/iev.nsf/display?openform&ievref= =3D702-08-57|title=3DIEC=20 > 60050 - International Electrotechnical Vocabulary - IEV number 702-08-57:= =20 > "spot noise factor (of a linear two-port device); spot noise figure (of a= =20 > linear two-port device)"|last=3D|first=3D|date=3DSeptember=20 > 2018|website=3D|url-status=3Dlive|archive-url=3D|archive-date=3D|accessda= te=3D2019-12-29}} > > {{Cite journal|last=3DFisk|first=3DJames R.|date=3DOct= =20 > 1975|title=3DReceiver Noise Figure Sensitivity and Dynamic Range - What T= he=20 > Numbers=20 > Mean|url=3Dhttp://www.electronicsandbooks.com/eab3/manual/Magazine/H/Ham%= 20Radio%20Magazine%20US/Ham%20Radio%20Magazine%201975/10%20October%201975.p= df|journal=3DHam=20 > Radio|volume=3D|pages=3D8-25, pg. 12|via=3D}} > > Then Wikimedia automatically numbers these and puts them all at the end o= f=20 > the article with the command: > {{Reflist}} > > Is there some format I could convert these citations to, e.g. using regul= ar=20 > expressions, so that Pandoc would convert them properly? And is there=20 > something I can use to replace the {{Reflist}} command? > > Thanks in advance for any help!=20 > > > --=20 > You received this message because you are subscribed to the Google Groups= "pandoc-discuss" group. > To unsubscribe from this group and stop receiving emails from it, send an= email to pandoc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org > To view this discussion on the web visit https://groups.google.com/d/msgi= d/pandoc-discuss/52683ae4-6dc6-45cd-8e2f-66b1226d6b08%40googlegroups.com. --=20 You received this message because you are subscribed to the Google Groups "= pandoc-discuss" group. To unsubscribe from this group and stop receiving emails from it, send an e= mail to pandoc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org To view this discussion on the web visit https://groups.google.com/d/msgid/= pandoc-discuss/m2eerv1kz2.fsf%40johnmacfarlane.net.