* mandoc: Add a complete suite of tests covering ASCII characters in UTF-8
From: schwarze @ 2024-05-16 18:50 UTC
  To: source

Log Message:
-----------
Add a complete suite of tests covering ASCII characters in UTF-8 spelling.
Most of this goes to nogroff.in rather than ascii.in for now
because groff-1.23.0 is buggy as hell in this respect
and I'm too lazy to debug the horrific function make_glyph_node()
in the file groff/src/roff/troff/node.cpp right now.

Modified Files:
--------------
    mandoc/regress/char/unicode:
        ascii.in
        ascii.out_ascii
        ascii.out_html
        ascii.out_utf8
        nogroff.in
        nogroff.out_ascii
        nogroff.out_html
        nogroff.out_utf8

Revision Data
-------------
Index: nogroff.out_ascii
===================================================================
RCS file: /home/cvs/mandoc/mandoc/regress/char/unicode/nogroff.out_ascii,v
diff -Lregress/char/unicode/nogroff.out_ascii -Lregress/char/unicode/nogroff.out_ascii -u -p -r1.5 -r1.6
--- regress/char/unicode/nogroff.out_ascii
+++ regress/char/unicode/nogroff.out_ascii
@@ -37,6 +37,27 @@ D\bDE\bES\bSC\bCR\bRI\bIP\bPT\bTI\bIO\bON\bN
      <GS><GS>  <control> INFORMATION SEPARATOR THREE
      <RS><RS>  <control> INFORMATION SEPARATOR TWO
      <US><US>  <control> INFORMATION SEPARATOR INE
+     !!   EXCLAMATION MARK
+     %%   PERCENT SIGN
+     &&   AMPERSAND
+     ((   LEFT PARENTHESIS
+     ))   RIGHT PARENTHESIS
+     **   ASTERISK
+     ,,   COMMA
+     --   HYPHEN-MINUS
+     ..   FULL STOP
+     00   DIGIT ZERO
+     11   DIGIT ONE
+     99   DIGIT NINE
+     ::   COLON
+     ;;   SEMICOLON
+     <<   LESS-THAN SIGN
+     >>   GREATER-THAN SIGN
+     ??   QUESTION MARK
+     AA   LATIN CAPITAL LETTER A
+     ZZ   LATIN CAPITAL LETTER Z
+     aa   LATIN SMALL LETTER A
+     zz   LATIN SMALL LETTER Z
      <DEL><DEL>     <control> DELETE
      <80><80>  <control> 0x80
      <81><81>  <control> 0x81
@@ -98,4 +119,4 @@ D\bDE\bES\bSC\bCR\bRI\bIP\bPT\bTI\bIO\bON\bN
      <?>  <undefined>
      ENDTEST
 
-OpenBSD                          June 2, 2021          CHAR-UNICODE-NOGROFF(1)
+OpenBSD                          May 16, 2024          CHAR-UNICODE-NOGROFF(1)
Index: nogroff.in
===================================================================
RCS file: /home/cvs/mandoc/mandoc/regress/char/unicode/nogroff.in,v
diff -Lregress/char/unicode/nogroff.in -Lregress/char/unicode/nogroff.in -u -p -r1.4 -r1.5
--- regress/char/unicode/nogroff.in
+++ regress/char/unicode/nogroff.in
@@ -1,5 +1,5 @@
 .\" $OpenBSD: nogroff.in,v 1.6 2021/06/02 17:36:59 schwarze Exp $
-.TH CHAR-UNICODE-NOGROFF 1 "June 2, 2021"
+.TH CHAR-UNICODE-NOGROFF 1 "May 16, 2024"
 .SH NAME
 char-unicode-nogroff \- characters handled differently by groff
 .SH DESCRIPTION
@@ -37,6 +37,27 @@ BEGINTEST
 \[u001D]\N'29'	<control> INFORMATION SEPARATOR THREE
 \[u001E]\N'30'	<control> INFORMATION SEPARATOR TWO
 \[u001F]\N'31'	<control> INFORMATION SEPARATOR INE
+\[u0021]\N'33'	EXCLAMATION MARK
+\[u0025]\N'37'	PERCENT SIGN
+\[u0026]\N'38'	AMPERSAND
+\[u0028]\N'40'	LEFT PARENTHESIS
+\[u0029]\N'41'	RIGHT PARENTHESIS
+\[u002A]\N'42'	ASTERISK
+\[u002C]\N'44'	COMMA
+\[u002D]\N'45'	HYPHEN-MINUS
+\[u002E]\N'46'	FULL STOP
+\[u0030]\N'48'	DIGIT ZERO
+\[u0031]\N'49'	DIGIT ONE
+\[u0039]\N'57'	DIGIT NINE
+\[u003A]\N'58'	COLON
+\[u003B]\N'59'	SEMICOLON
+\[u003C]\N'60'	LESS-THAN SIGN
+\[u003E]\N'62'	GREATER-THAN SIGN
+\[u003F]\N'63'	QUESTION MARK
+\[u0041]\N'65'	LATIN CAPITAL LETTER A
+\[u005A]\N'90'	LATIN CAPITAL LETTER Z
+\[u0061]\N'97'	LATIN SMALL LETTER A
+\[u007A]\N'122'	LATIN SMALL LETTER Z
 \[u007F]\N'127'	<control> DELETE
 \[u0080]\N'128'	<control> 0x80
 \[u0081]\N'129'	<control> 0x81
Index: ascii.out_utf8
===================================================================
RCS file: /home/cvs/mandoc/mandoc/regress/char/unicode/ascii.out_utf8,v
diff -Lregress/char/unicode/ascii.out_utf8 -Lregress/char/unicode/ascii.out_utf8 -u -p -r1.4 -r1.5
--- regress/char/unicode/ascii.out_utf8
+++ regress/char/unicode/ascii.out_utf8
@@ -5,6 +5,7 @@ N\bNA\bAM\bME\bE
 
 D\bDE\bES\bSC\bCR\bRI\bIP\bPT\bTI\bIO\bON\bN
      BEGINTEST
+          SPACE
      """  QUOTATION MARK
      ###  NUMBER SIGN
      $$$  DOLLAR SIGN
@@ -27,4 +28,4 @@ D\bDE\bES\bSC\bCR\bRI\bIP\bPT\bTI\bIO\bON\bN
      ~~~~ TILDE
      ENDTEST
 
-OpenBSD                        October 27, 2014          CHAR-UNICODE-ASCII(1)
+OpenBSD                          May 16, 2024            CHAR-UNICODE-ASCII(1)
Index: ascii.in
===================================================================
RCS file: /home/cvs/mandoc/mandoc/regress/char/unicode/ascii.in,v
diff -Lregress/char/unicode/ascii.in -Lregress/char/unicode/ascii.in -u -p -r1.3 -r1.4
--- regress/char/unicode/ascii.in
+++ regress/char/unicode/ascii.in
@@ -1,10 +1,11 @@
 .\" $OpenBSD: ascii.in,v 1.4 2017/07/04 14:53:23 schwarze Exp $
-.TH CHAR-UNICODE-ASCII 1 "October 27, 2014"
+.TH CHAR-UNICODE-ASCII 1 "May 16, 2024"
 .SH NAME
 char-unicode-ascii \- Unicode characters in the ASCII range
 .SH DESCRIPTION
 .nf
 BEGINTEST
+\[u0020]\N'32' 	SPACE
 \[u0022]\N'34'\(dq	QUOTATION MARK
 \[u0023]\N'35'\(sh	NUMBER SIGN
 \[u0024]\N'36'\(Do	DOLLAR SIGN
Index: ascii.out_html
===================================================================
RCS file: /home/cvs/mandoc/mandoc/regress/char/unicode/ascii.out_html,v
diff -Lregress/char/unicode/ascii.out_html -Lregress/char/unicode/ascii.out_html -u -p -r1.3 -r1.4
--- regress/char/unicode/ascii.out_html
+++ regress/char/unicode/ascii.out_html
@@ -1,3 +1,4 @@
+   	SPACE
 &quot;&quot;&quot;	QUOTATION MARK
 ###	NUMBER SIGN
 $$$	DOLLAR SIGN
Index: ascii.out_ascii
===================================================================
RCS file: /home/cvs/mandoc/mandoc/regress/char/unicode/ascii.out_ascii,v
diff -Lregress/char/unicode/ascii.out_ascii -Lregress/char/unicode/ascii.out_ascii -u -p -r1.4 -r1.5
--- regress/char/unicode/ascii.out_ascii
+++ regress/char/unicode/ascii.out_ascii
@@ -5,6 +5,7 @@ N\bNA\bAM\bME\bE
 
 D\bDE\bES\bSC\bCR\bRI\bIP\bPT\bTI\bIO\bON\bN
      BEGINTEST
+          SPACE
      """  QUOTATION MARK
      ###  NUMBER SIGN
      $$$  DOLLAR SIGN
@@ -27,4 +28,4 @@ D\bDE\bES\bSC\bCR\bRI\bIP\bPT\bTI\bIO\bON\bN
      ~~~~ TILDE
      ENDTEST
 
-OpenBSD                        October 27, 2014          CHAR-UNICODE-ASCII(1)
+OpenBSD                          May 16, 2024            CHAR-UNICODE-ASCII(1)
Index: nogroff.out_html
===================================================================
RCS file: /home/cvs/mandoc/mandoc/regress/char/unicode/nogroff.out_html,v
diff -Lregress/char/unicode/nogroff.out_html -Lregress/char/unicode/nogroff.out_html -u -p -r1.5 -r1.6
--- regress/char/unicode/nogroff.out_html
+++ regress/char/unicode/nogroff.out_html
@@ -30,6 +30,27 @@
 &#xFFFD;&#xFFFD;	&lt;control&gt; INFORMATION SEPARATOR THREE
 &#xFFFD;&#xFFFD;	&lt;control&gt; INFORMATION SEPARATOR TWO
 &#xFFFD;&#xFFFD;	&lt;control&gt; INFORMATION SEPARATOR INE
+!!	EXCLAMATION MARK
+%%	PERCENT SIGN
+&amp;&amp;	AMPERSAND
+((	LEFT PARENTHESIS
+))	RIGHT PARENTHESIS
+**	ASTERISK
+,,	COMMA
+--	HYPHEN-MINUS
+..	FULL STOP
+00	DIGIT ZERO
+11	DIGIT ONE
+99	DIGIT NINE
+::	COLON
+;;	SEMICOLON
+&lt;&lt;	LESS-THAN SIGN
+&gt;&gt;	GREATER-THAN SIGN
+??	QUESTION MARK
+AA	LATIN CAPITAL LETTER A
+ZZ	LATIN CAPITAL LETTER Z
+aa	LATIN SMALL LETTER A
+zz	LATIN SMALL LETTER Z
 &#xFFFD;&#xFFFD;	&lt;control&gt; DELETE
 &#xFFFD;&#xFFFD;	&lt;control&gt; 0x80
 &#xFFFD;&#xFFFD;	&lt;control&gt; 0x81
Index: nogroff.out_utf8
===================================================================
RCS file: /home/cvs/mandoc/mandoc/regress/char/unicode/nogroff.out_utf8,v
diff -Lregress/char/unicode/nogroff.out_utf8 -Lregress/char/unicode/nogroff.out_utf8 -u -p -r1.6 -r1.7
--- regress/char/unicode/nogroff.out_utf8
+++ regress/char/unicode/nogroff.out_utf8
@@ -37,6 +37,27 @@ D\bDE\bES\bSC\bCR\bRI\bIP\bPT\bTI\bIO\bON\bN
      ��   <control> INFORMATION SEPARATOR THREE
      ��   <control> INFORMATION SEPARATOR TWO
      ��   <control> INFORMATION SEPARATOR INE
+     !!   EXCLAMATION MARK
+     %%   PERCENT SIGN
+     &&   AMPERSAND
+     ((   LEFT PARENTHESIS
+     ))   RIGHT PARENTHESIS
+     **   ASTERISK
+     ,,   COMMA
+     --   HYPHEN-MINUS
+     ..   FULL STOP
+     00   DIGIT ZERO
+     11   DIGIT ONE
+     99   DIGIT NINE
+     ::   COLON
+     ;;   SEMICOLON
+     <<   LESS-THAN SIGN
+     >>   GREATER-THAN SIGN
+     ??   QUESTION MARK
+     AA   LATIN CAPITAL LETTER A
+     ZZ   LATIN CAPITAL LETTER Z
+     aa   LATIN SMALL LETTER A
+     zz   LATIN SMALL LETTER Z
      ��   <control> DELETE
      ��   <control> 0x80
      ��   <control> 0x81
@@ -98,4 +119,4 @@ D\bDE\bES\bSC\bCR\bRI\bIP\bPT\bTI\bIO\bON\bN
      􏿿     <undefined>
      ENDTEST
 
-OpenBSD                          June 2, 2021          CHAR-UNICODE-NOGROFF(1)
+OpenBSD                          May 16, 2024          CHAR-UNICODE-NOGROFF(1)