From mboxrd@z Thu Jan 1 00:00:00 1970 X-Msuck: nntp://news.gmane.io/gmane.comp.tex.context/8871 Path: main.gmane.org!not-for-mail From: Tobias Burnus Newsgroups: gmane.comp.tex.context Subject: XML and empty line (DocBook) Date: Mon, 29 Jul 2002 21:58:35 +0200 (CEST) Sender: owner-ntg-context@let.uu.nl Message-ID: NNTP-Posting-Host: coloc-standby.netfonds.no Mime-Version: 1.0 Content-Type: TEXT/PLAIN; charset=ISO-8859-1 Content-Transfer-Encoding: 8bit X-Trace: main.gmane.org 1035399242 31109 80.91.224.250 (23 Oct 2002 18:54:02 GMT) X-Complaints-To: usenet@main.gmane.org NNTP-Posting-Date: Wed, 23 Oct 2002 18:54:02 +0000 (UTC) Original-To: NTG-ConTeXt Xref: main.gmane.org gmane.comp.tex.context:8871 X-Report-Spam: http://spam.gmane.org/gmane.comp.tex.context:8871 Hi, First: unfortunally ConTeXt 2002.7.26 doesn't work with the DocBook anymore which makes testing a bit harder (2002.7.12 does work). Using 2002.7.12 I found the problem that Apache <filename>mod_rewrite</filename> magic causes the problem with the empty line ( = \par ) any idea how to prevent this problem (except by editing the XML source)? ! Paragraph ended before \XMLDBdotitle was complete. \par Addionally I get frequently a ']¿' at the beginning of my documents. This causes also strange results: But why should you use Bugzilla? Since the empty line is regarded as parapraph :-( Tobias From mboxrd@z Thu Jan 1 00:00:00 1970 X-Msuck: nntp://news.gmane.io/gmane.comp.tex.context/8873 Path: main.gmane.org!not-for-mail From: Hans Hagen Newsgroups: gmane.comp.tex.context Subject: Re: XML and empty line (DocBook) Date: Tue, 30 Jul 2002 11:26:24 +0200 Sender: owner-ntg-context@let.uu.nl Message-ID: <5.1.0.14.1.20020730112538.0226e9c0@server-1> References: NNTP-Posting-Host: coloc-standby.netfonds.no Mime-Version: 1.0 Content-Type: text/plain; charset="us-ascii"; format=flowed X-Trace: main.gmane.org 1035399244 31132 80.91.224.250 (23 Oct 2002 18:54:04 GMT) X-Complaints-To: usenet@main.gmane.org NNTP-Posting-Date: Wed, 23 Oct 2002 18:54:04 +0000 (UTC) Cc: ntg-context@ntg.nl Original-To: Tobias Burnus In-Reply-To: Xref: main.gmane.org gmane.comp.tex.context:8873 X-Report-Spam: http://spam.gmane.org/gmane.comp.tex.context:8873 At 09:58 PM 7/29/2002 +0200, you wrote: >Hi, > >First: unfortunally ConTeXt 2002.7.26 doesn't work with the DocBook >anymore which makes testing a bit harder (2002.7.12 does work). The current docubook style redefines a few low level macros, this has to be adapted (i added some low level support macros needed by the authors) Hans ------------------------------------------------------------------------- Hans Hagen | PRAGMA ADE | pragma@wxs.nl Ridderstraat 27 | 8061 GH Hasselt | The Netherlands tel: +31 (0)38 477 53 69 | fax: +31 (0)38 477 53 74 | www.pragma-ade.com ------------------------------------------------------------------------- information: http://www.pragma-ade.com/roadmap.pdf documentation: http://www.pragma-ade.com/showcase.pdf ------------------------------------------------------------------------- From mboxrd@z Thu Jan 1 00:00:00 1970 X-Msuck: nntp://news.gmane.io/gmane.comp.tex.context/8896 Path: main.gmane.org!not-for-mail From: Simon Pepping Newsgroups: gmane.comp.tex.context Subject: Re: XML and empty line (DocBook) Date: Wed, 31 Jul 2002 21:43:00 +0200 Sender: owner-ntg-context@let.uu.nl Message-ID: <20020731214300.A13643@scaprea> References: NNTP-Posting-Host: coloc-standby.netfonds.no Mime-Version: 1.0 Content-Type: text/plain; charset=iso-8859-1 Content-Transfer-Encoding: 8bit X-Trace: main.gmane.org 1035399264 31297 80.91.224.250 (23 Oct 2002 18:54:24 GMT) X-Complaints-To: usenet@main.gmane.org NNTP-Posting-Date: Wed, 23 Oct 2002 18:54:24 +0000 (UTC) Original-To: NTG-ConTeXt In-Reply-To: ; from tobias.burnus@physik.fu-berlin.de on Mon, Jul 29, 2002 at 09:58:35PM +0200 Xref: main.gmane.org gmane.comp.tex.context:8896 X-Report-Spam: http://spam.gmane.org/gmane.comp.tex.context:8896 On Mon, Jul 29, 2002 at 09:58:35PM +0200, Tobias Burnus wrote: > Hi, > > Using 2002.7.12 I found the problem that > > Apache > <filename>mod_rewrite</filename> > > magic > > causes the problem with the empty line ( = \par ) any idea how to prevent > this problem (except by editing the XML source)? > ! Paragraph ended before \XMLDBdotitle was complete. > > \par Even in XML mode two blank lines generate a \par. I cannot solve this; perhaps Hans knows a way out. Both this and your previous problem (and Michael's answer to it) show that TeX has no knowledge of ignorable white space. It cannot, because it does not know the DTD. (Ignorable white space is all white space in elements that do not have mixed content.) > Addionally I get frequently a ']¿' at the beginning of my documents. I believe this is another parsing problem with the internal DTD set. (AFAIK you should get '¿]' from '>]' in the document. Regards, Simon -- Simon Pepping email: spepping@scaprea.hobby.nl From mboxrd@z Thu Jan 1 00:00:00 1970 X-Msuck: nntp://news.gmane.io/gmane.comp.tex.context/8897 Path: main.gmane.org!not-for-mail From: Michael Wiedmann Newsgroups: gmane.comp.tex.context Subject: Re: XML and empty line (DocBook) Date: Wed, 31 Jul 2002 23:21:38 +0200 Sender: owner-ntg-context@let.uu.nl Message-ID: <20020731212138.GA6530@miwie.in-berlin.de> References: <20020731214300.A13643@scaprea> NNTP-Posting-Host: coloc-standby.netfonds.no Mime-Version: 1.0 Content-Type: text/plain; charset=iso-8859-1 Content-Transfer-Encoding: 8bit X-Trace: main.gmane.org 1035399264 31298 80.91.224.250 (23 Oct 2002 18:54:24 GMT) X-Complaints-To: usenet@main.gmane.org NNTP-Posting-Date: Wed, 23 Oct 2002 18:54:24 +0000 (UTC) Original-To: NTG-ConTeXt In-Reply-To: <20020731214300.A13643@scaprea> Xref: main.gmane.org gmane.comp.tex.context:8897 X-Report-Spam: http://spam.gmane.org/gmane.comp.tex.context:8897 * Simon Pepping [020731 21:43]: > On Mon, Jul 29, 2002 at 09:58:35PM +0200, Tobias Burnus wrote: ... > > Addionally I get frequently a ']¿' at the beginning of my documents. > > I believe this is another parsing problem with the internal DTD > set. (AFAIK you should get '¿]' from '>]' in the document. I observed this only on the first page (additional page before the title page) of a DocBook 'article', and not for a 'book'. In this case this has nothing to do with an internal subset. Michael -- mw@miwie.in-berlin.de http://www.miwie.org mw@miwie.org From mboxrd@z Thu Jan 1 00:00:00 1970 X-Msuck: nntp://news.gmane.io/gmane.comp.tex.context/8899 Path: main.gmane.org!not-for-mail From: Hans Hagen Newsgroups: gmane.comp.tex.context Subject: Re: XML and empty line (DocBook) Date: Thu, 01 Aug 2002 09:14:38 +0200 Sender: owner-ntg-context@let.uu.nl Message-ID: <5.1.0.14.1.20020801091348.0214fd80@server-1> References: <20020731214300.A13643@scaprea> <20020731214300.A13643@scaprea> NNTP-Posting-Host: coloc-standby.netfonds.no Mime-Version: 1.0 Content-Type: text/plain; charset="iso-8859-1"; format=flowed Content-Transfer-Encoding: 8bit X-Trace: main.gmane.org 1035399266 31311 80.91.224.250 (23 Oct 2002 18:54:26 GMT) X-Complaints-To: usenet@main.gmane.org NNTP-Posting-Date: Wed, 23 Oct 2002 18:54:26 +0000 (UTC) Cc: NTG-ConTeXt Original-To: Michael Wiedmann In-Reply-To: <20020731212138.GA6530@miwie.in-berlin.de> Xref: main.gmane.org gmane.comp.tex.context:8899 X-Report-Spam: http://spam.gmane.org/gmane.comp.tex.context:8899 At 11:21 PM 7/31/2002 +0200, Michael Wiedmann wrote: >* Simon Pepping [020731 21:43]: > > On Mon, Jul 29, 2002 at 09:58:35PM +0200, Tobias Burnus wrote: >... > > > Addionally I get frequently a ']¿' at the beginning of my documents. > > > > I believe this is another parsing problem with the internal DTD > > set. (AFAIK you should get '¿]' from '>]' in the document. > >I observed this only on the first page (additional page before >the title page) of a DocBook 'article', and not for a 'book'. >In this case this has nothing to do with an internal subset. if you make me small test files to play with, i will have a look Btw, i fixed the skipped first ENTITY problem, Hans ------------------------------------------------------------------------- Hans Hagen | PRAGMA ADE | pragma@wxs.nl Ridderstraat 27 | 8061 GH Hasselt | The Netherlands tel: +31 (0)38 477 53 69 | fax: +31 (0)38 477 53 74 | www.pragma-ade.com ------------------------------------------------------------------------- information: http://www.pragma-ade.com/roadmap.pdf documentation: http://www.pragma-ade.com/showcase.pdf ------------------------------------------------------------------------- From mboxrd@z Thu Jan 1 00:00:00 1970 X-Msuck: nntp://news.gmane.io/gmane.comp.tex.context/8903 Path: main.gmane.org!not-for-mail From: Simon Pepping Newsgroups: gmane.comp.tex.context Subject: Re: XML and empty line (DocBook) Date: Thu, 1 Aug 2002 21:07:52 +0200 Sender: owner-ntg-context@let.uu.nl Message-ID: <20020801210752.A631@scaprea> References: <20020731214300.A13643@scaprea> <20020731214300.A13643@scaprea> <20020731212138.GA6530@miwie.in-berlin.de> <5.1.0.14.1.20020801091348.0214fd80@server-1> NNTP-Posting-Host: coloc-standby.netfonds.no Mime-Version: 1.0 Content-Type: text/plain; charset=iso-8859-1 Content-Transfer-Encoding: 8bit X-Trace: main.gmane.org 1035399269 31337 80.91.224.250 (23 Oct 2002 18:54:29 GMT) X-Complaints-To: usenet@main.gmane.org NNTP-Posting-Date: Wed, 23 Oct 2002 18:54:29 +0000 (UTC) Original-To: NTG-ConTeXt In-Reply-To: <5.1.0.14.1.20020801091348.0214fd80@server-1>; from pragma@wxs.nl on Thu, Aug 01, 2002 at 09:14:38AM +0200 Xref: main.gmane.org gmane.comp.tex.context:8903 X-Report-Spam: http://spam.gmane.org/gmane.comp.tex.context:8903 On Thu, Aug 01, 2002 at 09:14:38AM +0200, Hans Hagen wrote: > At 11:21 PM 7/31/2002 +0200, Michael Wiedmann wrote: > >* Simon Pepping [020731 21:43]: > > > On Mon, Jul 29, 2002 at 09:58:35PM +0200, Tobias Burnus wrote: > >... > > > > Addionally I get frequently a ']¿' at the beginning of my documents. > > > > > > I believe this is another parsing problem with the internal DTD > > > set. (AFAIK you should get '¿]' from '>]' in the document. > > > >I observed this only on the first page (additional page before > >the title page) of a DocBook 'article', and not for a 'book'. > >In this case this has nothing to do with an internal subset. > > if you make me small test files to play with, i will have a look ]> Title Title of chapter A paragraph &TEX; > Btw, i fixed the skipped first ENTITY problem, Good. Simon -- Simon Pepping email: spepping@scaprea.hobby.nl From mboxrd@z Thu Jan 1 00:00:00 1970 X-Msuck: nntp://news.gmane.io/gmane.comp.tex.context/8908 Path: main.gmane.org!not-for-mail From: Simon Pepping Newsgroups: gmane.comp.tex.context Subject: Re: XML and empty line (DocBook) Date: Sat, 3 Aug 2002 17:22:13 +0200 Sender: owner-ntg-context@let.uu.nl Message-ID: <20020803172213.A751@scaprea> References: <20020731214300.A13643@scaprea> NNTP-Posting-Host: coloc-standby.netfonds.no Mime-Version: 1.0 Content-Type: text/plain; charset=iso-8859-1 Content-Transfer-Encoding: 8bit X-Trace: main.gmane.org 1035399274 31366 80.91.224.250 (23 Oct 2002 18:54:34 GMT) X-Complaints-To: usenet@main.gmane.org NNTP-Posting-Date: Wed, 23 Oct 2002 18:54:34 +0000 (UTC) Original-To: NTG-ConTeXt In-Reply-To: <20020731214300.A13643@scaprea>; from spepping@scaprea.hobby.nl on Wed, Jul 31, 2002 at 09:43:00PM +0200 Xref: main.gmane.org gmane.comp.tex.context:8908 X-Report-Spam: http://spam.gmane.org/gmane.comp.tex.context:8908 On Wed, Jul 31, 2002 at 09:43:00PM +0200, Simon Pepping wrote: > On Mon, Jul 29, 2002 at 09:58:35PM +0200, Tobias Burnus wrote: > > Hi, > > > > Using 2002.7.12 I found the problem that > > > > Apache > > <filename>mod_rewrite</filename> > > > > magic > > > > causes the problem with the empty line ( = \par ) any idea how to prevent > > this problem (except by editing the XML source)? > > ! Paragraph ended before \XMLDBdotitle was complete. > > > > \par > > Even in XML mode two blank lines generate a \par. I cannot solve this; > perhaps Hans knows a way out. > > Both this and your previous problem (and Michael's answer to it) show > that TeX has no knowledge of ignorable white space. It cannot, because > it does not know the DTD. (Ignorable white space is all white space in > elements that do not have mixed content.) > > > Addionally I get frequently a ']¿' at the beginning of my documents. > > I believe this is another parsing problem with the internal DTD > set. (AFAIK you should get '¿]' from '>]' in the document. Perhaps it is better not to require that an XML parser in TeX can do all these features right. It must be possible to rewrite the XML file as a 'normalized' file and submit that to the TeX parser. For example, it is possible to write a ContentHandler for a validating SAX parser that removes ignorable white space. Perhaps the same is possible with an XSLT script, but I am not sure if any XSLT processor does a validating parse. Such a procedure would get rid of ignorable white space, and it would resolve entities, thus making the work of a TeX parser much easier. Regards, Simon -- Simon Pepping email: spepping@scaprea.hobby.nl From mboxrd@z Thu Jan 1 00:00:00 1970 X-Msuck: nntp://news.gmane.io/gmane.comp.tex.context/8919 Path: main.gmane.org!not-for-mail From: Hans Hagen Newsgroups: gmane.comp.tex.context Subject: Re: XML and empty line (DocBook) Date: Sun, 04 Aug 2002 23:59:06 +0200 Sender: owner-ntg-context@let.uu.nl Message-ID: <5.1.0.14.1.20020804235501.02dd8a28@remote-1> References: <20020731214300.A13643@scaprea> <20020731214300.A13643@scaprea> NNTP-Posting-Host: coloc-standby.netfonds.no Mime-Version: 1.0 Content-Type: text/plain; charset="us-ascii"; format=flowed X-Trace: main.gmane.org 1035399283 31448 80.91.224.250 (23 Oct 2002 18:54:43 GMT) X-Complaints-To: usenet@main.gmane.org NNTP-Posting-Date: Wed, 23 Oct 2002 18:54:43 +0000 (UTC) Cc: NTG-ConTeXt Original-To: Simon Pepping In-Reply-To: <20020803172213.A751@scaprea> Xref: main.gmane.org gmane.comp.tex.context:8919 X-Report-Spam: http://spam.gmane.org/gmane.comp.tex.context:8919 At 05:22 PM 8/3/2002 +0200, Simon Pepping wrote: >Perhaps it is better not to require that an XML parser in TeX can do >all these features right. It must be possible to rewrite the XML file >as a 'normalized' file and submit that to the TeX parser. For example, >it is possible to write a ContentHandler for a validating SAX parser >that removes ignorable white space. Perhaps the same is possible with >an XSLT script, but I am not sure if any XSLT processor does a >validating parse. Such a procedure would get rid of ignorable white >space, and it would resolve entities, thus making the work of a TeX >parser much easier. indeed, in some cases preprocessing is handy, for instance, i sometimes convert the 'verbatim cdata' things into code like: ... thereby not only gaining much more control over typography, but also getting cleaner source code. I will provide some more cleanup, and esp when we have to deal with language specific typesetting, it makes sense to convert all non chars into entities (: => : and alike, because this permits language dependent spacing). Some of our current project sdemands this kind of control, so you can expect some tools Hans ------------------------------------------------------------------------- Hans Hagen | PRAGMA ADE | pragma@wxs.nl Ridderstraat 27 | 8061 GH Hasselt | The Netherlands tel: +31 (0)38 477 53 69 | fax: +31 (0)38 477 53 74 | www.pragma-ade.com ------------------------------------------------------------------------- information: http://www.pragma-ade.com/roadmap.pdf documentation: http://www.pragma-ade.com/showcase.pdf -------------------------------------------------------------------------