source@mandoc.bsd.lv
* mdocml: Reject the escape sequences \[uD800] to \[uDFFF] in the parser.
@ 2015-10-13 23:31 schwarze
From: schwarze @ 2015-10-13 23:31 UTC (permalink / raw)
  To: source

Log Message:
-----------
Reject the escape sequences \[uD800] to \[uDFFF] in the parser.
These surrogates are not valid Unicode codepoints,
so treat them just like any other undefined character escapes:
Warn about them and do not produce output.
Issue noticed while talking to stsp@, semarie@, and bentley@.

Modified Files:
--------------
    mdocml:
        mandoc.c

Revision Data
-------------
Index: mandoc.c
===================================================================
RCS file: /home/cvs/mdocml/mdocml/mandoc.c,v
retrieving revision 1.95
retrieving revision 1.96
diff -Lmandoc.c -Lmandoc.c -u -p -r1.95 -r1.96
--- mandoc.c
+++ mandoc.c
@@ -333,6 +333,9 @@ mandoc_escape(const char **end, const ch
 			break;
 		if (*sz == 6 && (*start)[1] == '0')
 			break;
+		if (*sz == 5 && (*start)[1] == 'D' &&
+		    strchr("89ABCDEF", (*start)[2]) != NULL)
+			break;
 		if ((int)strspn(*start + 1, "0123456789ABCDEFabcdef")
 		    + 1 == *sz)
 			gly = ESCAPE_UNICODE;
--
 To unsubscribe send an email to source+unsubscribe@mdocml.bsd.lv
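
For illustration only, the condition added in the hunk above can be sketched as a standalone predicate. The function names is_surrogate_name() and is_surrogate_value() are hypothetical and do not exist in mandoc; "name" is assumed to point at the 'u' of an escape name such as "uD800", and "sz" is assumed to be the length of that name, mirroring *start and *sz in the diff. The second function is an alternative numeric formulation, not the committed code.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdlib.h>
#include <string.h>

/*
 * Hypothetical helper, not part of mandoc: textual check mirroring
 * the committed condition.  Matches uD800 to uDFFF, uppercase only.
 */
static bool
is_surrogate_name(const char *name, size_t sz)
{
	assert(name != NULL);
	if (sz != 5 || name[0] != 'u' || name[1] != 'D')
		return false;
	/* strchr() also matches the terminating NUL, so exclude it. */
	return name[2] != '\0' && strchr("89ABCDEF", name[2]) != NULL;
}

/*
 * Hypothetical numeric variant: decode the hex digits and compare
 * the value against the surrogate range U+D800..U+DFFF.  Unlike the
 * textual check, this also matches lowercase hex digits.
 */
static bool
is_surrogate_value(const char *name, size_t sz)
{
	char buf[7];		/* up to six hex digits plus NUL */
	unsigned long cp;

	assert(name != NULL);
	if (sz < 5 || sz > 7 || name[0] != 'u')
		return false;
	memcpy(buf, name + 1, sz - 1);
	buf[sz - 1] = '\0';
	/* Require hex digits only, so strtoul() sees no sign or prefix. */
	if (strspn(buf, "0123456789ABCDEFabcdef") != sz - 1)
		return false;
	cp = strtoul(buf, NULL, 16);
	return cp >= 0xD800 && cp <= 0xDFFF;
}
```

The textual form needs no conversion and slots in next to the neighbouring length checks. The numeric form is easier to read as a range test, at the cost of a copy and a strtoul() call.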
