From mboxrd@z Thu Jan 1 00:00:00 1970 X-Msuck: nntp://news.gmane.io/gmane.text.pandoc/25151 Path: news.gmane.io!.POSTED.ciao.gmane.io!not-for-mail From: John McCorkle Newsgroups: gmane.text.pandoc Subject: Getting Citations in Wikipedia page to convert over to HTML, Docx, LaTeX. Date: Thu, 7 May 2020 12:34:53 -0700 (PDT) Message-ID: <52683ae4-6dc6-45cd-8e2f-66b1226d6b08@googlegroups.com> Reply-To: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="----=_Part_68_1401954266.1588880093605" Injection-Info: ciao.gmane.io; posting-host="ciao.gmane.io:159.69.161.202"; logging-data="75034"; mail-complaints-to="usenet@ciao.gmane.io" To: pandoc-discuss Original-X-From: pandoc-discuss+bncBD5JXB4TV4FRBXWF2H2QKGQENI62SZY-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Thu May 07 21:34:58 2020 Return-path: Envelope-to: gtp-pandoc-discuss@m.gmane-mx.org Original-Received: from mail-ot1-f61.google.com ([209.85.210.61]) by ciao.gmane.io with esmtps (TLS1.3:ECDHE_RSA_AES_128_GCM_SHA256:128) (Exim 4.92) (envelope-from ) id 1jWmIM-000JLk-Kv for gtp-pandoc-discuss@m.gmane-mx.org; Thu, 07 May 2020 21:34:58 +0200 Original-Received: by mail-ot1-f61.google.com with SMTP id 92sf3639749oty.13 for ; Thu, 07 May 2020 12:34:58 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=googlegroups.com; s=20161025; h=sender:date:from:to:message-id:subject:mime-version :x-original-sender:reply-to:precedence:mailing-list:list-id :list-post:list-help:list-archive:list-subscribe:list-unsubscribe; bh=6qxhJrqP1GreBjzzcXkFOnpNl6f9zH+3+g+m1rZhiEw=; b=SWjq0fApRImffmwEfrnCnQeck7jHhH+iN0fCQdwQMJVCxSyiAwpFm6SFPG5DEG++z1 czPrg6pzSjkXQCfIJXsJ4QUP74C6BxmlXqEo7ptD63T/4JB1cDHNertMTovtOiriNzfw N3NJJ2nsFbqE+mhXNknGS6TbGGfgqGyp8OGoevRNm+QrwjaiEj+ked0VIoakrD8BW0Gq Fc9aiIcYThwrkP4LworsW/S2EbWjvEeVoJ45cxuEJ1D3s/BW41MdXs7qTGA2Wr4PgYS+ T5FVf2grM2nW6FxVdIOW1V321mol/jEY1f5+KdpLzWw1aOVT8fdar/KqDzYadDPEdnlz WIaQ== DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=date:from:to:message-id:subject:mime-version:x-original-sender :reply-to:precedence:mailing-list:list-id:list-post:list-help :list-archive:list-subscribe:list-unsubscribe; bh=6qxhJrqP1GreBjzzcXkFOnpNl6f9zH+3+g+m1rZhiEw=; b=I0av/6vp9Ru5YhdQenkPSzYgwBSL7H+EdDC828kNLRZ+yT3xb781dpw0l12pF00Ik+ rOyWoB7TMLm5lMzCQ2sfcA33OKrNbSRcy3zNocrut9oUlmnhfP74sSK60sQ1uJw5jbee CHfPohHon/fChaHr0uJxYvpmko2EflX4SnLdPcrbkGmP5Fu6HuJTwDK6eUUKDYx/chtX Ik2pGdiIBulmjKua+o4ss3CaMdredOzDEEYRBlHuhJsHxGISy3z+OICq0oCK1kqjwwTY 2Li5OxPj5QmfL5Osw0U1GXHY33XDOS769x/m2anIWa4rcW4V+AgoH5AIR0laZP4mW1+J N2hw== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=sender:x-gm-message-state:date:from:to:message-id:subject :mime-version:x-original-sender:reply-to:precedence:mailing-list :list-id:x-spam-checked-in-group:list-post:list-help:list-archive :list-subscribe:list-unsubscribe; bh=6qxhJrqP1GreBjzzcXkFOnpNl6f9zH+3+g+m1rZhiEw=; b=dWeGG2lgvv2AVdBCHjZkK/3RaOMDvtnk6LxMXHloZ4iuKyXdJ4ohqskdRhPPsHmcxV aPOOxYb8V/gvuWLK9CAGwfNgogAn5eJPx3Jlq5tzEq/gAGU11tnsmOCf7tco3GTG8pDS YpJtjIwAeeTWnfXtUS5zF/WoMD9eqoRth5W1cFqYPfvjAZhzPNVZlwtQIRomplcAFDKz Dr67TQjkeTzZKT1jgJ1rp2x3IQ84/fEy4hH+KGVRi70wdCJrBMRzMzMX6EDT6M+x1VHS fUBMHztMBsbfCL0j9A32kX8hNE0+NXCqThEVD49I2FJQSEmPoERvaI2lHCchCoJayKzA z/7w== Original-Sender: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org X-Gm-Message-State: AGi0PuYmuCtC44NpOLR1al7JcjRpO8ZBQOpUay7+T7SIg6XDFgwdbMl1 1S9T7VV8GFyHQK+QboMVnQk= X-Google-Smtp-Source: APiQypJwsWPdW2UTysMlSkXWvjjxd61bGsRPTsBse6YNgEJOzfXAncyycuwHY/gtOWAIC0UUAAwKNw== X-Received: by 2002:a9d:6c48:: with SMTP id g8mr11187518otq.226.1588880096846; Thu, 07 May 2020 12:34:56 -0700 (PDT) X-BeenThere: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Original-Received: by 2002:a9d:4d0c:: with SMTP id n12ls1696471otf.7.gmail; Thu, 07 May 2020 12:34:54 -0700 (PDT) X-Received: by 2002:a9d:4e11:: with SMTP id p17mr12665549otf.35.1588880094270; Thu, 07 May 2020 12:34:54 -0700 (PDT) X-Original-Sender: JMCO67-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org Precedence: list Mailing-list: list pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org; contact pandoc-discuss+owners-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org List-ID: X-Google-Group-Id: 1007024079513 List-Post: , List-Help: , List-Archive: , List-Unsubscribe: , Xref: news.gmane.io gmane.text.pandoc:25151 Archived-At: ------=_Part_68_1401954266.1588880093605 Content-Type: multipart/alternative; boundary="----=_Part_69_1675696160.1588880093605" ------=_Part_69_1675696160.1588880093605 Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable I need to convert a Wikipedia page I wrote, to HTML and to either LaTex or= =20 Docx. I go to the page here=20 (https://en.wikipedia.org/wiki/User:JohnM7190/John%27s_Noise_Figure_Page),= =20 click on the "edit source" tab, select and copy the source text, and then= =20 click on the "read" tab so I don't risk actually editing anything. I paste= =20 that text into Notepad++ and use several regular expression search/replace= =20 operations to eliminate the styles (since Pandoc= =20 does not recognize them), but keeps the equation and the equation reference= =20 number they contain plus fixes the {{EquationNote|x}} references to those= =20 equations. That gets saved, UTF-8 encoded, as my source.wiki file. Pandoc= =20 converts my source.wiki file to all three output formats pretty well except= =20 the citations don't come across. Can someone please tell me how to modify the citations in my source.wiki=20 file so the citations get converted properly (i.e. both first use of the=20 citation, and additional references to the same citation), and end up=20 listed at the end of the article the same way they do on the Wikipedia page= ? For example, on first use, one of my citations is: {{Cite=20 book|url=3Dhttps://cds.cern.ch/record/105963|title=3DCommunication system= =20 principles|last=3DPeebles|first=3DPeyton=20 Z.|date=3D1976|publisher=3DAddison-Wesley|year=3D|isbn=3D|location=3DReadin= g,=20 MA|pages=3D457}} and then other references to it are: There are several types of references, like {{Cite journal|last=3DFriis|first=3DH. T.|date=3DJuly=20 1944|title=3DNoise Figures of Radio Receivers|url=3D|journal=3DProceedings = of the=20 IRE|volume=3D32|issue=3D7|pages=3D419=E2=80=93422|doi=3D10.1109/JRPROC.1944= .232049|issn=3D0096-8390|via=3D}}[https://ieeexplore.ieee.org/abstract/docu= ment/1695024] {{Cite=20 web|url=3Dhttp://www.electropedia.org/iev/iev.nsf/display?openform&ievref= =3D702-08-57|title=3DIEC=20 60050 - International Electrotechnical Vocabulary - IEV number 702-08-57:= =20 "spot noise factor (of a linear two-port device); spot noise figure (of a= =20 linear two-port device)"|last=3D|first=3D|date=3DSeptember=20 2018|website=3D|url-status=3Dlive|archive-url=3D|archive-date=3D|accessdate= =3D2019-12-29}} {{Cite journal|last=3DFisk|first=3DJames R.|date=3DOct= =20 1975|title=3DReceiver Noise Figure Sensitivity and Dynamic Range - What The= =20 Numbers=20 Mean|url=3Dhttp://www.electronicsandbooks.com/eab3/manual/Magazine/H/Ham%20= Radio%20Magazine%20US/Ham%20Radio%20Magazine%201975/10%20October%201975.pdf= |journal=3DHam=20 Radio|volume=3D|pages=3D8-25, pg. 12|via=3D}} Then Wikimedia automatically numbers these and puts them all at the end of= =20 the article with the command: {{Reflist}} Is there some format I could convert these citations to, e.g. using regular= =20 expressions, so that Pandoc would convert them properly? And is there=20 something I can use to replace the {{Reflist}} command? Thanks in advance for any help!=20 --=20 You received this message because you are subscribed to the Google Groups "= pandoc-discuss" group. To unsubscribe from this group and stop receiving emails from it, send an e= mail to pandoc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org To view this discussion on the web visit https://groups.google.com/d/msgid/= pandoc-discuss/52683ae4-6dc6-45cd-8e2f-66b1226d6b08%40googlegroups.com. ------=_Part_69_1675696160.1588880093605 Content-Type: text/html; charset="UTF-8" Content-Transfer-Encoding: quoted-printable
I need to convert a Wikipedia page I wrote, to HTML a= nd to either LaTex or Docx.
I go to the page here (https://en= .wikipedia.org/wiki/User:JohnM7190/John%27s_Noise_Figure_Page), click on th= e "edit source" tab, select and copy the source text, and then cl= ick on the "read" tab so I don't risk actually editing anythi= ng. I paste that text into Notepad++ and use several regular expression sea= rch/replace operations to eliminate the <NumBlk blah blah /NumBlk> st= yles (since Pandoc does not recognize them), but keeps the equation and the= equation reference number they contain plus fixes the=C2=A0{{EquationNote|= x}}=C2=A0 references to those equations. That gets saved, UTF-8 encoded, as= my source.wiki file. Pandoc converts my source.wiki file to all three outp= ut formats pretty well except the citations don't come across.

Can someone please tell me how to modify the citations in = my source.wiki file so the citations get converted properly (i.e. both firs= t use of the citation, and additional references to the same citation), and= end up listed at the end of the article the same way they do on the Wikipe= dia page?

For example, on first use, one of my cit= ations is:
<ref name=3D"Peebles457">{{Cite book|u= rl=3Dhttps://cds.cern.ch/record/105963|title=3DCommunication system princip= les|last=3DPeebles|first=3DPeyton Z.|date=3D1976|publisher=3DAddison-Wesley= |year=3D|isbn=3D|location=3DReading, MA|pages=3D457}}</ref>

and then other references to it are:
<ref name= =3D"Peebles457" />

There are several = types of references, like

<ref name=3D":2&= quot;>{{Cite journal|last=3DFriis|first=3DH. T.|date=3DJuly 1944|title= =3DNoise Figures of Radio Receivers|url=3D|journal=3DProceedings of the IRE= |volume=3D32|issue=3D7|pages=3D419=E2=80=93422|doi=3D10.1109/JRPROC.1944.23= 2049|issn=3D0096-8390|via=3D}}[https://ieeexplore.ieee.org/abstract/documen= t/1695024]</ref>

<ref name=3D"IE= C_Spot_NF">{{Cite web|url=3Dhttp://www.electropedia.org/iev/iev.nsf= /display?openform&ievref=3D702-08-57|title=3DIEC 60050 - International = Electrotechnical Vocabulary - IEV number 702-08-57: "spot noise factor= (of a linear two-port device); spot noise figure (of a linear two-port dev= ice)"|last=3D|first=3D|date=3DSeptember 2018|website=3D|url-status=3Dl= ive|archive-url=3D|archive-date=3D|accessdate=3D2019-12-29}}</ref>

<ref name=3D"Fisk">{{Cite journa= l|last=3DFisk|first=3DJames R.|date=3DOct 1975|title=3DReceiver Noise Figur= e Sensitivity and Dynamic Range - What The Numbers Mean|url=3Dhttp://www.el= ectronicsandbooks.com/eab3/manual/Magazine/H/Ham%20Radio%20Magazine%20US/Ha= m%20Radio%20Magazine%201975/10%20October%201975.pdf|journal=3DHam Radio|vol= ume=3D|pages=3D8-25, pg. 12|via=3D}}</ref>

Then Wikimedia automatically numbers these and puts them all at the end = of the article with the command:
{{Reflist}}

Is there some format I could convert these citations to, e.g. usin= g regular expressions, so that Pandoc would convert them properly? And is t= here something I can use to replace the {{Reflist}} command?

=
Thanks in advance for any help!=C2=A0

<= br>

--
You received this message because you are subscribed to the Google Groups &= quot;pandoc-discuss" group.
To unsubscribe from this group and stop receiving emails from it, send an e= mail to pand= oc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org.
To view this discussion on the web visit https://groups.google.com/d/= msgid/pandoc-discuss/52683ae4-6dc6-45cd-8e2f-66b1226d6b08%40googlegroups.co= m.
------=_Part_69_1675696160.1588880093605-- ------=_Part_68_1401954266.1588880093605--