source@mandoc.bsd.lv
 help / color / mirror / Atom feed
From: schwarze@mandoc.bsd.lv
To: source@mandoc.bsd.lv
Subject: docbook2mdoc: Parse internal subset declarations in <!DOCTYPE> tags as
Date: Sun, 14 Apr 2019 11:27:04 -0500 (EST)	[thread overview]
Message-ID: <e3fdd1c316d0cdbe@fantadrom.bsd.lv> (raw)

Log Message:
-----------
Parse internal subset declarations in <!DOCTYPE> tags as XML;
in particular, they can contain XML comments.
Similar to parts of parse.c rev. 1.23, but simpler.

Modified Files:
--------------
    docbook2mdoc:
        statistics.c

Revision Data
-------------
Index: statistics.c
===================================================================
RCS file: /home/cvs/mdocml/docbook2mdoc/statistics.c,v
retrieving revision 1.25
retrieving revision 1.26
diff -Lstatistics.c -Lstatistics.c -u -p -r1.25 -r1.26
--- statistics.c
+++ statistics.c
@@ -211,10 +211,10 @@ parse_file(int fd, char *fname)
 	size_t		 rlen;  /* Number of bytes in b[]. */
 	size_t		 poff;  /* Parse offset in b[]. */
 	size_t		 pend;  /* Offset of the end of the current word. */
-	int		 in_tag, in_arg, in_quotes, elem_end;
+	int		 in_tag, in_arg, in_quotes, in_doctype, elem_end;
 
 	rlen = 0;
-	in_tag = in_arg = in_quotes = 0;
+	in_tag = in_arg = in_quotes = in_doctype = 0;
 	while ((rsz = read(fd, b + rlen, sizeof(b) - rlen - 1)) >= 0) {
 		if ((rlen += rsz) == 0)
 			break;
@@ -251,6 +251,11 @@ parse_file(int fd, char *fname)
 				if (elem_end)
 					stack_pop(NULL);
 			} else if (in_tag) {
+				if (in_doctype && b[pend] == '[') {
+					in_tag = in_doctype = 0;
+					pend++;
+					continue;
+				}
 				if (advance(b, rlen, &pend, " =>") && rsz > 0)
 					break;
 				elem_end = 0;
@@ -301,6 +306,9 @@ parse_file(int fd, char *fname)
 				if (b[++poff] == '/') {
 					elem_end = 1;
 					poff++;
+				} else if (strcasecmp(b + poff,
+				    "!DOCTYPE") == 0) {
+					in_doctype = 1;
 				} else if (b[poff] != '!' && b[poff] != '?') {
 					table_add(stacki > 0 ?
 					    stack[stacki - 1] : "ROOT",
--
 To unsubscribe send an email to source+unsubscribe@mandoc.bsd.lv

                 reply	other threads:[~2019-04-14 16:27 UTC|newest]

Thread overview: [no followups] expand[flat|nested]  mbox.gz  Atom feed

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=e3fdd1c316d0cdbe@fantadrom.bsd.lv \
    --to=schwarze@mandoc.bsd.lv \
    --cc=source@mandoc.bsd.lv \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).