ntg-context - mailing list for ConTeXt users
 help / color / mirror / Atom feed
* HTML to ConTeXt
@ 2007-10-25 14:50 Aditya Mahajan
  2007-10-25 20:17 ` Idris Samawi Hamid
  0 siblings, 1 reply; 19+ messages in thread
From: Aditya Mahajan @ 2007-10-25 14:50 UTC (permalink / raw)
  To: mailing list for ConTeXt users

This is interesting. A website that converts html to context 
(actually uses markdown behind the scenes).

http://johnmacfarlane.net/pandoc/html2x.html

This is how the context wiki looks like.

http://johnmacfarlane.net/cgi-bin/html2x.pl?url=http%3A%2F%2Fwiki.contextgarden.net%2FMain_Page&format=context

The program is written in haskell and is also available for download. 
You can use it to convert markdown to context. I had been looking for 
this for a while, when multiple formats are needed. Write in markdown 
and generate html or context. I do not completely like the context 
output it generates (for example 
http://johnmacfarlane.net/pandoc/README gets converted to 
http://johnmacfarlane.net/pandoc/example11.tex.html)

Aditya
___________________________________________________________________________________
If your question is of interest to others as well, please add an entry to the Wiki!

maillist : ntg-context@ntg.nl / http://www.ntg.nl/mailman/listinfo/ntg-context
webpage  : http://www.pragma-ade.nl / http://tex.aanhet.net
archive  : https://foundry.supelec.fr/projects/contextrev/
wiki     : http://contextgarden.net
___________________________________________________________________________________


^ permalink raw reply	[flat|nested] 19+ messages in thread

* Re: HTML to ConTeXt
  2007-10-25 14:50 HTML to ConTeXt Aditya Mahajan
@ 2007-10-25 20:17 ` Idris Samawi Hamid
  2007-10-26  4:22   ` Aditya Mahajan
  0 siblings, 1 reply; 19+ messages in thread
From: Idris Samawi Hamid @ 2007-10-25 20:17 UTC (permalink / raw)
  To: mailing list for ConTeXt users

Hi Aditya,

On Thu, 25 Oct 2007 08:50:03 -0600, Aditya Mahajan <adityam@umich.edu>  
wrote:

> This is interesting. A website that converts html to context
> (actually uses markdown behind the scenes).
>
> http://johnmacfarlane.net/pandoc/html2x.html
>
> This is how the context wiki looks like.
>
> http://johnmacfarlane.net/cgi-bin/html2x.pl?url=http%3A%2F%2Fwiki.contextgarden.net%2FMain_Page&format=context
>
> The program is written in haskell and is also available for download.
> You can use it to convert markdown to context. I had been looking for
> this for a while, when multiple formats are needed. Write in markdown
> and generate html or context. I do not completely like the context
> output it generates (for example
> http://johnmacfarlane.net/pandoc/README gets converted to
> http://johnmacfarlane.net/pandoc/example11.tex.html)

This looks very promising. Perhaps some of us can help the developers to  
improve the ConTeXt support.

Thank you very much for sharing this!

Best wishes
Idris

-- 
Professor Idris Samawi Hamid, Editor-in-Chief
International Journal of Shi`i Studies
Department of Philosophy
Colorado State University
Fort Collins, CO 80523

--
Using Opera's revolutionary e-mail client: http://www.opera.com/mail/
___________________________________________________________________________________
If your question is of interest to others as well, please add an entry to the Wiki!

maillist : ntg-context@ntg.nl / http://www.ntg.nl/mailman/listinfo/ntg-context
webpage  : http://www.pragma-ade.nl / http://tex.aanhet.net
archive  : https://foundry.supelec.fr/projects/contextrev/
wiki     : http://contextgarden.net
___________________________________________________________________________________


^ permalink raw reply	[flat|nested] 19+ messages in thread

* Re: HTML to ConTeXt
  2007-10-25 20:17 ` Idris Samawi Hamid
@ 2007-10-26  4:22   ` Aditya Mahajan
  2007-10-26 11:37     ` Doc to ConTeXt [was Re: HTML to ConTeXt] Idris Samawi Hamid
  0 siblings, 1 reply; 19+ messages in thread
From: Aditya Mahajan @ 2007-10-26  4:22 UTC (permalink / raw)
  To: mailing list for ConTeXt users

On Thu, 25 Oct 2007, Idris Samawi Hamid wrote:

> Hi Aditya,
>
> On Thu, 25 Oct 2007 08:50:03 -0600, Aditya Mahajan <adityam@umich.edu>
> wrote:
>
>> This is interesting. A website that converts html to context
>> (actually uses markdown behind the scenes).
>>
>> http://johnmacfarlane.net/pandoc/html2x.html
>>
>> This is how the context wiki looks like.
>>
>> http://johnmacfarlane.net/cgi-bin/html2x.pl?url=http%3A%2F%2Fwiki.contextgarden.net%2FMain_Page&format=context
>>
>> The program is written in haskell and is also available for download.
>> You can use it to convert markdown to context. I had been looking for
>> this for a while, when multiple formats are needed. Write in markdown
>> and generate html or context. I do not completely like the context
>> output it generates (for example
>> http://johnmacfarlane.net/pandoc/README gets converted to
>> http://johnmacfarlane.net/pandoc/example11.tex.html)
>
> This looks very promising. Perhaps some of us can help the developers to
> improve the ConTeXt support.
>
I will explore pandoc in more detail in the future. I am more 
interested in it from the point of view of understanding Haskell 
parsers, but improving the context output will definitely not hurt.

Aditya
___________________________________________________________________________________
If your question is of interest to others as well, please add an entry to the Wiki!

maillist : ntg-context@ntg.nl / http://www.ntg.nl/mailman/listinfo/ntg-context
webpage  : http://www.pragma-ade.nl / http://tex.aanhet.net
archive  : https://foundry.supelec.fr/projects/contextrev/
wiki     : http://contextgarden.net
___________________________________________________________________________________


^ permalink raw reply	[flat|nested] 19+ messages in thread

* Doc to ConTeXt [was Re:  HTML to ConTeXt]
  2007-10-26  4:22   ` Aditya Mahajan
@ 2007-10-26 11:37     ` Idris Samawi Hamid
  2007-11-10  1:30       ` Andrea Valle
  0 siblings, 1 reply; 19+ messages in thread
From: Idris Samawi Hamid @ 2007-10-26 11:37 UTC (permalink / raw)
  To: mailing list for ConTeXt users

On Thu, 25 Oct 2007 22:22:46 -0600, Aditya Mahajan <adityam@umich.edu>  
wrote:

>> This looks very promising. Perhaps some of us can help the developers to
>> improve the ConTeXt support.
>>
> I will explore pandoc in more detail in the future. I am more
> interested in it from the point of view of understanding Haskell
> parsers, but improving the context output will definitely not hurt.

Ah, you're missing a big point in your discovery ;-)

As I told Andrea: For relatively simple documents (like the kind we use in  
academic journals) it seems we can now

1) convert doc to odt using OOo
2) convert odt to markdown using

http://wiki.services.openoffice.org/wiki/Odt2txt.py

3) use the pandoc utility to convert markdown to ConTeXt.

As for the pandoc list, we may be able to influence the final ConTeXt  
output by making suggestions, reporting bugs etc.

If we can convert Odt2txt.py to lua maybe this workflow can be partly  
integrated into ConTeXt itself someday.*****

The pandoc developer seems interested in improving ConTeXt support (see my  
forwarded mail) so this is a good opportunity for all those who need a  
decent doc=>context workflow.

Best wishes
Idris

*****Or maybe we can just port Odt2txt.py to give direct ConTeXt output  
and forget the markdown layer entirely. Any ideas on how hard that would  
be?

-- 
Professor Idris Samawi Hamid, Editor-in-Chief
International Journal of Shi`i Studies
Department of Philosophy
Colorado State University
Fort Collins, CO 80523

--
Using Opera's revolutionary e-mail client: http://www.opera.com/mail/
___________________________________________________________________________________
If your question is of interest to others as well, please add an entry to the Wiki!

maillist : ntg-context@ntg.nl / http://www.ntg.nl/mailman/listinfo/ntg-context
webpage  : http://www.pragma-ade.nl / http://tex.aanhet.net
archive  : https://foundry.supelec.fr/projects/contextrev/
wiki     : http://contextgarden.net
___________________________________________________________________________________


^ permalink raw reply	[flat|nested] 19+ messages in thread

* Re: Doc to ConTeXt [was Re:  HTML to ConTeXt]
  2007-10-26 11:37     ` Doc to ConTeXt [was Re: HTML to ConTeXt] Idris Samawi Hamid
@ 2007-11-10  1:30       ` Andrea Valle
  2007-11-10  3:14         ` Idris Samawi Hamid
                           ` (3 more replies)
  0 siblings, 4 replies; 19+ messages in thread
From: Andrea Valle @ 2007-11-10  1:30 UTC (permalink / raw)
  To: mailing list for ConTeXt users


[-- Attachment #1.1: Type: text/plain, Size: 3245 bytes --]

Hi to all (Idris, in particular, as we are always dealing with the  
same problems... ),

I just want to share some thoughts about the ol' damn' problem of  
converting to ConTeXt from Word et al.

> As I told Andrea: For relatively simple documents (like the kind we  
> use in
> academic journals) it seems we can now
>
> 1) convert doc to odt using OOo
> 2) convert odt to markdown using

As suggest by Idris, I subscribed to the pandoc list, but I have to  
say that the activity is not exactly like the one on ConTeXt list...
So the actual support for ConTeXt conversion is not convincing. More,  
it's always better to put the hands on your machine...

My problem is to convert a series of academic journals in ConTeXt.  
They come form the Humanities so little structure (basically, mainly  
body and footnotes).
Far from me the idea of automatically doing all the stuff, I'd like  
to be faster and more accurate in conversion.
(No particular interest in figures, they are few, not so much in  
references: they tends to be typographically inconsistent if done
in a WYSISYG environment, so difficult to parse).
More, as the journal has already being published we need to work with  
final pdfs.

After wasting my time with an awful pdf to html converter by  
Acrobat,  I discovered this, you may all know:
http://pdftohtml.sourceforge.net/

The html  conversion is very very good in resulting rendering and  
also in sources, but after some tweakings I got interested in the xml  
conversion it allows.
The xml format  substantially encodes the infos related to page,  
typically each line is an element. Plus, there are bold and italics  
marked easily as <b> and <i>
I'm still struggling to understand something really operative of XML  
processing in ConTeXt, so  I switched back to Python.
I used an incremental sax parser with some replacement.
This is today's draft.
Original:
http://www.semiotiche.it/andrea/membrana/02%20imp.pdf

Recomposed (no setup at all, only \enableregime[utf]):
http://www.semiotiche.it/andrea/membrana/02imp.pdf

pdf --> pdftoxml --> xml --> python script --> tex --> pdf

I recovered par, bold, em, footnotes,  stripping dashes and  
reassembling the text with footnote references. Not bad as a first step.

I guess that you xml gurus could probably do much easier and cleaner.
So, I mean -just for my very specific needs, I con probably  take  
word sources, convert to pdf and then finally reach ConTeXt as  
discussed.

Just some ideas to share with the list

Best

-a-




--------------------------------------------------
Andrea Valle
--------------------------------------------------
CIRMA - DAMS
Università degli Studi di Torino
--> http://www.cirma.unito.it/andrea/
--> andrea.valle@unito.it
--------------------------------------------------


I did this interview where I just mentioned that I read Foucault. Who  
doesn't in university, right? I was in this strip club giving this  
guy a lap dance and all he wanted to do was to discuss Foucault with  
me. Well, I can stand naked and do my little dance, or I can discuss  
Foucault, but not at the same time; too much information.
(Annabel Chong)





[-- Attachment #1.2: Type: text/html, Size: 9248 bytes --]

[-- Attachment #2: Type: text/plain, Size: 487 bytes --]

___________________________________________________________________________________
If your question is of interest to others as well, please add an entry to the Wiki!

maillist : ntg-context@ntg.nl / http://www.ntg.nl/mailman/listinfo/ntg-context
webpage  : http://www.pragma-ade.nl / http://tex.aanhet.net
archive  : https://foundry.supelec.fr/projects/contextrev/
wiki     : http://contextgarden.net
___________________________________________________________________________________

^ permalink raw reply	[flat|nested] 19+ messages in thread

* Re: Doc to ConTeXt [was Re:  HTML to ConTeXt]
  2007-11-10  1:30       ` Andrea Valle
@ 2007-11-10  3:14         ` Idris Samawi Hamid
  2007-11-10 11:25           ` Andrea Valle
  2007-11-10  3:33         ` Idris Samawi Hamid
                           ` (2 subsequent siblings)
  3 siblings, 1 reply; 19+ messages in thread
From: Idris Samawi Hamid @ 2007-11-10  3:14 UTC (permalink / raw)
  To: mailing list for ConTeXt users

Hi Andrea,

On Fri, 09 Nov 2007 18:30:36 -0700, Andrea Valle <valle@di.unito.it> wrote:

> Hi to all (Idris, in particular, as we are always dealing with the
> same problems... ),
>
> I just want to share some thoughts about the ol' damn' problem of
> converting to ConTeXt from Word et al.
>
>> As I told Andrea: For relatively simple documents (like the kind we
>> use in
>> academic journals) it seems we can now
>>
>> 1) convert doc to odt using OOo
>> 2) convert odt to markdown using

http://wiki.services.openoffice.org/wiki/Odt2txt.py

3) use the pandoc utility to convert markdown to ConTeXt.

[you left this out]

> As suggest by Idris, I subscribed to the pandoc list, but I have to
> say that the activity is not exactly like the one on ConTeXt list...
> So the actual support for ConTeXt conversion is not convincing. More,
> it's always better to put the hands on your machine...

Did you try the markdown-to-ConTeXt conversion? The  
doc-odt-markdown-context workflow seems pretty useful as is. See also

http://code.google.com/p/pandoc/wiki/ConTeXtImprovements

I'm working on something else related to this issue that I hope to say  
more about in the coming weeks ;-)

Best wishes
Idris
-- 
Professor Idris Samawi Hamid, Editor-in-Chief
International Journal of Shi`i Studies
Department of Philosophy
Colorado State University
Fort Collins, CO 80523

--
Using Opera's revolutionary e-mail client: http://www.opera.com/mail/
___________________________________________________________________________________
If your question is of interest to others as well, please add an entry to the Wiki!

maillist : ntg-context@ntg.nl / http://www.ntg.nl/mailman/listinfo/ntg-context
webpage  : http://www.pragma-ade.nl / http://tex.aanhet.net
archive  : https://foundry.supelec.fr/projects/contextrev/
wiki     : http://contextgarden.net
___________________________________________________________________________________


^ permalink raw reply	[flat|nested] 19+ messages in thread

* Re: Doc to ConTeXt [was Re:  HTML to ConTeXt]
  2007-11-10  1:30       ` Andrea Valle
  2007-11-10  3:14         ` Idris Samawi Hamid
@ 2007-11-10  3:33         ` Idris Samawi Hamid
  2007-11-10 11:59           ` Andrea Valle
  2007-11-10  5:44         ` Saji Njarackalazhikam Hameed
       [not found]         ` <6faad9f00711100331h547664c6l97d2c3b82c16d8dd@mail.gmail.com>
  3 siblings, 1 reply; 19+ messages in thread
From: Idris Samawi Hamid @ 2007-11-10  3:33 UTC (permalink / raw)
  To: mailing list for ConTeXt users

On Fri, 09 Nov 2007 18:30:36 -0700, Andrea Valle <valle@di.unito.it> wrote:

> After wasting my time with an awful pdf to html converter by
> Acrobat,  I discovered this, you may all know:
> http://pdftohtml.sourceforge.net/

Looks impressive...

> The html  conversion is very very good in resulting rendering and
> also in sources, but after some tweakings I got interested in the xml
> conversion it allows.
> The xml format  substantially encodes the infos related to page,
> typically each line is an element. Plus, there are bold and italics
> marked easily as <b> and <i>
> I'm still struggling to understand something really operative of XML
> processing in ConTeXt, so  I switched back to Python.
> I used an incremental sax parser with some replacement.
> This is today's draft.
> Original:
> http://www.semiotiche.it/andrea/membrana/02%20imp.pdf
>
> Recomposed (no setup at all, only \enableregime[utf]):
> http://www.semiotiche.it/andrea/membrana/02imp.pdf

Looks VERY impressive... Tell me, how did you set up the cropmarks etc.?

> pdf --> pdftoxml --> xml --> python script --> tex --> pdf
>
> I recovered par, bold, em, footnotes,  stripping dashes and
> reassembling the text with footnote references. Not bad as a first step.

Did you also try pdftohtml --> html --> context?

> I guess that you xml gurus could probably do much easier and cleaner.
> So, I mean -just for my very specific needs, I con probably  take
> word sources, convert to pdf and then finally reach ConTeXt as
> discussed.

Again, very nice stuff!

Best wishes
Idris

-- 
Professor Idris Samawi Hamid, Editor-in-Chief
International Journal of Shi`i Studies
Department of Philosophy
Colorado State University
Fort Collins, CO 80523

--
Using Opera's revolutionary e-mail client: http://www.opera.com/mail/
___________________________________________________________________________________
If your question is of interest to others as well, please add an entry to the Wiki!

maillist : ntg-context@ntg.nl / http://www.ntg.nl/mailman/listinfo/ntg-context
webpage  : http://www.pragma-ade.nl / http://tex.aanhet.net
archive  : https://foundry.supelec.fr/projects/contextrev/
wiki     : http://contextgarden.net
___________________________________________________________________________________


^ permalink raw reply	[flat|nested] 19+ messages in thread

* Re: Doc to ConTeXt [was Re:  HTML to ConTeXt]
  2007-11-10  1:30       ` Andrea Valle
  2007-11-10  3:14         ` Idris Samawi Hamid
  2007-11-10  3:33         ` Idris Samawi Hamid
@ 2007-11-10  5:44         ` Saji Njarackalazhikam Hameed
  2007-11-10 13:10           ` Andrea Valle
       [not found]         ` <6faad9f00711100331h547664c6l97d2c3b82c16d8dd@mail.gmail.com>
  3 siblings, 1 reply; 19+ messages in thread
From: Saji Njarackalazhikam Hameed @ 2007-11-10  5:44 UTC (permalink / raw)
  To: mailing list for ConTeXt users

Hi Andrea,

I face a similar issue while organizing large-scale documents
prepared by members of my group (many folks are not conversant
with TeX here and write documents with WORD). My solution was to take their
input through a wiki and convert the HTML to context markup
using filters written with ruby (also see 
http://wiki.contextgarden.net/HTML_and_ConTeXt). Converting
HTML syntax to ConTeXt syntax is very do-able. 

If it is of any use, I attach the ruby filters I use for
my purpose. BTW, I use a ruby library called "hpricot" to ease
some of these conversions.

saji
...

def scrape_the_page(pagePath,oFile,hFile) 
items_to_remove = [
  "#menus",        #menus notice
  "div.markedup",
  "div.navigation",
  "head",          #table of contents 
  "hr"
  ]

doc=Hpricot(open(pagePath))
# this may not be applicable to your case
# this removes some unnecessary markup from the Wiki pages

@article = (doc/"#container").each do |content|
  #remove unnecessary content and edit links
  items_to_remove.each { |x| (content/x).remove }
end 

# Write HTML content to file
hFile.write @article.inner_html

# How to replace various syntactic elements using Hpricot
# replace p/b element with \bf
(@article/"p/*/b").each do |pb|
  pb.swap("{\\bf #{pb.inner_html}}")
end

# replace p/b element with \bf
(@article/"p/b").each do |pb|
  pb.swap("{\\bf #{pb.inner_html}}")
end

# replace strong element with \bf
(@article/"strong").each do |ps|
  ps.swap("{\\bf #{ps.inner_html}}")
end

# replace h1 element with section
(@article/"h1").each do |h1|
  h1.swap("\\section{#{h1.inner_html}}")
end

# replace h2 element with subsection
(@article/"h2").each do |h2|
  h2.swap("\\subsection{#{h2.inner_html}}")
end

# replace h3 element with subsection
(@article/"h3").each do |h3|
  h3.swap("\\subsubsection{#{h3.inner_html}}")
end

# replace h4 element with subsection
(@article/"h4").each do |h4|
  h4.swap("\\subsubsubsection{#{h4.inner_html}}")
end

# replace h5 element with subsection
(@article/"h5").each do |h5|
  h5.swap("\\subsubsubsubsection{#{h5.inner_html}}")
end

# replace <pre><code> by equivalent command in context
(@article/"pre").each do |pre|
  pre.swap("\\startcode \n #{pre.at("code").inner_html} \n
  \\stopcode")
end

# when we encounter a reference to a figure inside the html
# we replace it with a ConTeXt reference

(@article/"a").each do |a|
  a.swap("\\in[#{a.inner_html}]")
end


# remove 'alt' attribute inside <img> element
# replace <p><img> by equivalent command in context
(@article/"p/img").each do |img|

  img_attrs=img.attributes['alt'].split(",")

  # separate the file name from the extension
  # have to take of file names that have a "." embedded in them
  img_src=img.attributes['src'].reverse.sub(/\w+\./,"").reverse
  # puts img_src
  # see if position of figure is indicated
  img_pos="force"
  img_attrs.each do |arr| 
    img_pos=arr.gsub("position=","") if arr.match("position=")
  end
  img_attrs.delete("position=#{img_pos}") unless img_pos=="force" 

  # see if the array img_attrs contains an referral key word
  if img_attrs.first.match(/\w+[=]\w+/)
    img_id=" "
  else
    img_id=img_attrs.first
    img_attrs.delete_at(0)
  end

  if img_pos=="force"
    if img.attributes['title']
      img.swap("
      \\placefigure\n 
      [#{img_pos}][#{img_id}] \n 
      {#{img.attributes['title']}} \n 
      {\\externalfigure[#{img_src}][#{img_attrs.join(",")}]}  \n
              ")
    else
      img.swap("
      \\placefigure\n 
      [#{img_pos}] \n
      {none} \n
      {\\externalfigure[#{img_src}][#{img_attrs.join(",")}]} 
              ")
    end
  else
    if img.attributes['title']
      img.swap("
      \\placefigure\n 
      [#{img_pos}][#{img_id}] \n 
      {#{img.attributes['title']}} \n 
      {\\externalfigure[#{img_src}][#{img_attrs.join(",")}]}  \n
              ")
    else
      img.swap("
      \\placefigure\n 
      [#{img_pos}] \n
      {none} \n
      {\\externalfigure[#{img_src}][#{img_attrs.join(",")}]}
       \n 
              ")
    end
  end

end # end of converting inside (@article/"p/img")

 
   
# why not search for table and if we find caption, keep it ; if not add an empty

# Styling options: Here I catch the div element called Col2 and
# format the tex document in 2 columns


# Tables : placing them
# replace <p><img> by equivalent command in context
(@article/"table").each do |tab|
  if tab.at("caption")
  tab.swap("
  \\placetable[split]{#{tab.at("caption").inner_html}}\n
  {\\bTABLE \n
  #{tab.inner_html}
  \\eTABLE} 
             ")
  else
  tab.swap("
   \\placetable[split]{}\n
   {\\bTABLE \n
  #{tab.inner_html}
  \\eTABLE} \n 
            ")
  end
end

# Tables: remove the caption
(@article/"caption").each do |cap|
  cap.swap("\n")
end



# Now we transfer the syntactically altered html to a string Object
# and manipulate that object further

newdoc=@article.inner_html

# remove empty space in the beginning
newdoc.gsub!(/^\s+/,"")

# remove all elements we don't need.
newdoc.gsub!(/^<div.*/,"")
newdoc.gsub!(/^<\/div.*/,"")
newdoc.gsub!(/^<form.*/,"")
newdoc.gsub!(/^<\/form.*/,"")
newdoc.gsub!(/<p>/,"\n")
newdoc.gsub!(/<\/p>/,"\n")
newdoc.gsub!(/<\u>/,"")
newdoc.gsub!(/<\/u>/,"")
newdoc.gsub!(/<ul>/,"\\startitemize[1]")
newdoc.gsub!(/<\/ul>/,"\\stopitemize")
newdoc.gsub!(/<ol>/,"\\startitemize[n]")
newdoc.gsub!(/<\/ol>/,"\\stopitemize")
newdoc.gsub!(/<li>/,"\\item ")
newdoc.gsub!(/<\/li>/,"\n")
newdoc.gsub!("_","\\_")
newdoc.gsub!(/<table>/,"\\bTABLE \n")
newdoc.gsub!(/<\/table>/,"\\eTABLE \n")
newdoc.gsub!(/<tr>/,"\\bTR ")
newdoc.gsub!(/<\/tr>/,"\\eTR ")
newdoc.gsub!(/<td>/,"\\bTD ")
newdoc.gsub!(/<\/td>/,"\\eTD ")
newdoc.gsub!(/<th>/,"\\bTH ")
newdoc.gsub!(/<\/th>/,"\\eTH ")
newdoc.gsub!(/<center>/,"")
newdoc.gsub!(/<\/center>/,"")
newdoc.gsub!(/<em>/,"{\\em ")
newdoc.gsub!(/<\/em>/,"}")
newdoc.gsub!("^","")
newdoc.gsub!("\%","\\%")
newdoc.gsub!("&amp;","&")
newdoc.gsub!("&",'\\\&')
newdoc.gsub!("$",'\\$')
newdoc.gsub!(/<tbody>/,"\\bTABLEbody \n")
newdoc.gsub!(/<\/tbody>/,"\\eTABLEbody \n")

# Context does not mind "_" in figures and does not recognize \_,
# so i have to catch these and replace \_ with _

# First catch
filter=/\/AnnRep07\/Figures\/(\w+\/)*(\w+\\_)*/

if newdoc[filter]
newdoc.gsub!(filter) { |fString| 
fString.gsub("\\_","_") 
}
end

# Second catch
filter2=/\/AnnRep07\/Figures\/(\w+\/)*\w+[-.]\w+\\_\w+/

if newdoc[filter2]
newdoc.gsub!(filter2) { |fString| 
fString.gsub("\\_","_") }
end

# Third catch; remove \_ inside []
filter3=/\[\w+\\_\w+\]/

if newdoc[filter3]
newdoc.gsub!(filter3) { |fString| 
puts fString
fString.gsub("\\_","_") }
end


# remove the comment tag, which we used to embed context commands
newdoc.gsub!("<!--","")
newdoc.gsub!("-->","")

# add full path to the images
newdoc.gsub!("\/AnnRep07\/Figures\/","~\/AnnRep07\/Figures\/")

newdoc.gsub!(/<\w+\s*\/>/,"")

#puts newdoc
# open file for output
#outfil="#{oFile}.tex"
#`rm #{outfil}`

#fil=File.new(outfil,"a")
#puts "Writing #{oFile}"
oFile.write newdoc

end
# imgProps={}
  #       img_attrs.each do |arr| 
  #       imgProps['width']=arr.gsub("width=","") if arr.match("width=")
  #       imgProps['position']=arr.gsub("position=","") if arr.match("position=")
  #       end


* Andrea Valle <valle@di.unito.it> [2007-11-10 02:30:36 +0100]:

> Hi to all (Idris, in particular, as we are always dealing with the same 
> problems... ),
>
> I just want to share some thoughts about the ol' damn' problem of 
> converting to ConTeXt from Word et al.
>
>> As I told Andrea: For relatively simple documents (like the kind we use in
>> academic journals) it seems we can now
>>
>> 1) convert doc to odt using OOo
>> 2) convert odt to markdown using
>
> As suggest by Idris, I subscribed to the pandoc list, but I have to say 
> that the activity is not exactly like the one on ConTeXt list...
> So the actual support for ConTeXt conversion is not convincing. More, it's 
> always better to put the hands on your machine...
>
> My problem is to convert a series of academic journals in ConTeXt. They 
> come form the Humanities so little structure (basically, mainly body and 
> footnotes).
> Far from me the idea of automatically doing all the stuff, I'd like to be 
> faster and more accurate in conversion.
> (No particular interest in figures, they are few, not so much in 
> references: they tends to be typographically inconsistent if done
> in a WYSISYG environment, so difficult to parse).
> More, as the journal has already being published we need to work with final 
> pdfs.
>
> After wasting my time with an awful pdf to html converter by Acrobat,  I 
> discovered this, you may all know:
> http://pdftohtml.sourceforge.net/
>
> The html  conversion is very very good in resulting rendering and also in 
> sources, but after some tweakings I got interested in the xml conversion it 
> allows.
> The xml format  substantially encodes the infos related to page, typically 
> each line is an element. Plus, there are bold and italics marked easily as 
> <b> and <i>
> I'm still struggling to understand something really operative of XML 
> processing in ConTeXt, so  I switched back to Python.
> I used an incremental sax parser with some replacement.
> This is today's draft.
> Original:
> http://www.semiotiche.it/andrea/membrana/02%20imp.pdf
>
> Recomposed (no setup at all, only \enableregime[utf]):
> http://www.semiotiche.it/andrea/membrana/02imp.pdf
>
> pdf --> pdftoxml --> xml --> python script --> tex --> pdf
>
> I recovered par, bold, em, footnotes,  stripping dashes and reassembling 
> the text with footnote references. Not bad as a first step.
>
> I guess that you xml gurus could probably do much easier and cleaner.
> So, I mean -just for my very specific needs, I con probably  take word 
> sources, convert to pdf and then finally reach ConTeXt as discussed.
>
> Just some ideas to share with the list
>
> Best
>
> -a-
>
>
>
>
> --------------------------------------------------
> Andrea Valle
> --------------------------------------------------
> CIRMA - DAMS
> Università degli Studi di Torino
> --> http://www.cirma.unito.it/andrea/
> --> andrea.valle@unito.it
> --------------------------------------------------
>
>
> I did this interview where I just mentioned that I read Foucault. Who 
> doesn't in university, right? I was in this strip club giving this guy a 
> lap dance and all he wanted to do was to discuss Foucault with me. Well, I 
> can stand naked and do my little dance, or I can discuss Foucault, but not 
> at the same time; too much information.
> (Annabel Chong)
>
>
>
>

> ___________________________________________________________________________________
> If your question is of interest to others as well, please add an entry to the Wiki!
> 
> maillist : ntg-context@ntg.nl / http://www.ntg.nl/mailman/listinfo/ntg-context
> webpage  : http://www.pragma-ade.nl / http://tex.aanhet.net
> archive  : https://foundry.supelec.fr/projects/contextrev/
> wiki     : http://contextgarden.net
> ___________________________________________________________________________________


-- 
Saji N. Hameed

APEC Climate Center          				+82 51 668 7470
National Pension Corporation Busan Building 12F         
Yeonsan 2-dong, Yeonje-gu, BUSAN 611705			saji@apcc21.net
KOREA
___________________________________________________________________________________
If your question is of interest to others as well, please add an entry to the Wiki!

maillist : ntg-context@ntg.nl / http://www.ntg.nl/mailman/listinfo/ntg-context
webpage  : http://www.pragma-ade.nl / http://tex.aanhet.net
archive  : https://foundry.supelec.fr/projects/contextrev/
wiki     : http://contextgarden.net
___________________________________________________________________________________


^ permalink raw reply	[flat|nested] 19+ messages in thread

* Re: Doc to ConTeXt [was Re:  HTML to ConTeXt]
  2007-11-10  3:14         ` Idris Samawi Hamid
@ 2007-11-10 11:25           ` Andrea Valle
  2007-11-10 12:09             ` Andrea Valle
  0 siblings, 1 reply; 19+ messages in thread
From: Andrea Valle @ 2007-11-10 11:25 UTC (permalink / raw)
  To: mailing list for ConTeXt users


[-- Attachment #1.1: Type: text/plain, Size: 1507 bytes --]

Hi Idris

On 10 Nov 2007, at 04:14, Idris Samawi Hamid wrote:
>> [you left this out]
Sorry, it was just to refer to the discussion

> The
> doc-odt-markdown-context workflow seems pretty useful as is. See also

I will try it more in depth. My main problem for now is to work from  
pdfs.
Because they're past issues. Once I have new contributions, I will be  
there for sure :).
I'm also curious to see if mine could be a more general approach to  
(word-->)pdf-->context conversion.
Just started on it.

> http://code.google.com/p/pandoc/wiki/ConTeXtImprovements
>

Oh, yes, quite useful. But has anyone replied to this on pandoc list?
I thought none.


> I'm working on something else related to this issue that I hope to say
> more about in the coming weeks ;-)

Looking forward to see the news :)

Best

-a-


--------------------------------------------------
Andrea Valle
--------------------------------------------------
CIRMA - DAMS
Università degli Studi di Torino
--> http://www.cirma.unito.it/andrea/
--> andrea.valle@unito.it
--------------------------------------------------


I did this interview where I just mentioned that I read Foucault. Who  
doesn't in university, right? I was in this strip club giving this  
guy a lap dance and all he wanted to do was to discuss Foucault with  
me. Well, I can stand naked and do my little dance, or I can discuss  
Foucault, but not at the same time; too much information.
(Annabel Chong)





[-- Attachment #1.2: Type: text/html, Size: 7367 bytes --]

[-- Attachment #2: Type: text/plain, Size: 487 bytes --]

___________________________________________________________________________________
If your question is of interest to others as well, please add an entry to the Wiki!

maillist : ntg-context@ntg.nl / http://www.ntg.nl/mailman/listinfo/ntg-context
webpage  : http://www.pragma-ade.nl / http://tex.aanhet.net
archive  : https://foundry.supelec.fr/projects/contextrev/
wiki     : http://contextgarden.net
___________________________________________________________________________________

^ permalink raw reply	[flat|nested] 19+ messages in thread

* Re: Doc to ConTeXt [was Re:  HTML to ConTeXt]
  2007-11-10  3:33         ` Idris Samawi Hamid
@ 2007-11-10 11:59           ` Andrea Valle
  2007-11-10 14:07             ` Idris Samawi Hamid
  0 siblings, 1 reply; 19+ messages in thread
From: Andrea Valle @ 2007-11-10 11:59 UTC (permalink / raw)
  To: mailing list for ConTeXt users


[-- Attachment #1.1: Type: text/plain, Size: 1695 bytes --]

>> http://pdftohtml.sourceforge.net/
> Looks impressive...

I think so


> Looks VERY impressive... Tell me, how did you set up the cropmarks  
> etc.?

Mmh, maybe you're are referring to the original source? The output  
one is bare bone (what I need)
In any case, I upload the wrong reconstructed pdf (too late at night...)

The reconstructed pdf (pdf -->xml-->context-->pdf) is this one, where  
footnotes are handled correctly (an important point for me).
http://www.semiotiche.it/andrea/membrana/text.pdf
I rendered it with XeConTeXt. As noted by Mojca, there are some  
problems with double apices
In relation to footnote 1, this is what is coded in source :
"Sign and Reality"

> Did you also try pdftohtml --> html --> context?

No. You are suggesting via pandoc? Good point.
The exported html is very clean.
In general it seems that the idea is not to generate information  
related to document structure (as this should be inferred from pdf)
favoring appearance description.

Best

-a-




--------------------------------------------------
Andrea Valle
--------------------------------------------------
CIRMA - DAMS
Università degli Studi di Torino
--> http://www.cirma.unito.it/andrea/
--> andrea.valle@unito.it
--------------------------------------------------


I did this interview where I just mentioned that I read Foucault. Who  
doesn't in university, right? I was in this strip club giving this  
guy a lap dance and all he wanted to do was to discuss Foucault with  
me. Well, I can stand naked and do my little dance, or I can discuss  
Foucault, but not at the same time; too much information.
(Annabel Chong)





[-- Attachment #1.2: Type: text/html, Size: 7252 bytes --]

[-- Attachment #2: Type: text/plain, Size: 487 bytes --]

___________________________________________________________________________________
If your question is of interest to others as well, please add an entry to the Wiki!

maillist : ntg-context@ntg.nl / http://www.ntg.nl/mailman/listinfo/ntg-context
webpage  : http://www.pragma-ade.nl / http://tex.aanhet.net
archive  : https://foundry.supelec.fr/projects/contextrev/
wiki     : http://contextgarden.net
___________________________________________________________________________________

^ permalink raw reply	[flat|nested] 19+ messages in thread

* Re: Doc to ConTeXt [was Re:  HTML to ConTeXt]
  2007-11-10 11:25           ` Andrea Valle
@ 2007-11-10 12:09             ` Andrea Valle
  0 siblings, 0 replies; 19+ messages in thread
From: Andrea Valle @ 2007-11-10 12:09 UTC (permalink / raw)
  To: mailing list for ConTeXt users


[-- Attachment #1.1: Type: text/plain, Size: 815 bytes --]

 > In any case, I upload the wrong reconstructed pdf (too late at  
night...)

(Sorry, I'm always making too many typos: "I uploaded", indeed)

-a-

--------------------------------------------------
Andrea Valle
--------------------------------------------------
CIRMA - DAMS
Università degli Studi di Torino
--> http://www.cirma.unito.it/andrea/
--> andrea.valle@unito.it
--------------------------------------------------


I did this interview where I just mentioned that I read Foucault. Who  
doesn't in university, right? I was in this strip club giving this  
guy a lap dance and all he wanted to do was to discuss Foucault with  
me. Well, I can stand naked and do my little dance, or I can discuss  
Foucault, but not at the same time; too much information.
(Annabel Chong)





[-- Attachment #1.2: Type: text/html, Size: 8551 bytes --]

[-- Attachment #2: Type: text/plain, Size: 487 bytes --]

___________________________________________________________________________________
If your question is of interest to others as well, please add an entry to the Wiki!

maillist : ntg-context@ntg.nl / http://www.ntg.nl/mailman/listinfo/ntg-context
webpage  : http://www.pragma-ade.nl / http://tex.aanhet.net
archive  : https://foundry.supelec.fr/projects/contextrev/
wiki     : http://contextgarden.net
___________________________________________________________________________________

^ permalink raw reply	[flat|nested] 19+ messages in thread

* Re: Doc to ConTeXt [was Re: HTML to ConTeXt]
       [not found]         ` <6faad9f00711100331h547664c6l97d2c3b82c16d8dd@mail.gmail.com>
@ 2007-11-10 12:30           ` Andrea Valle
  0 siblings, 0 replies; 19+ messages in thread
From: Andrea Valle @ 2007-11-10 12:30 UTC (permalink / raw)
  To: mailing list for ConTeXt users; +Cc: Mojca Miklavec


[-- Attachment #1.1: Type: text/plain, Size: 911 bytes --]


On 10 Nov 2007, at 12:31, Mojca Miklavec wrote:

> Only as a note: all the opening quotation marks are wrong!

Ok, Thanks Mojca, my fault.
it was a problem of encoding, now I set pdftohml to utf8 and  
everything's fine


Best

-a-



--------------------------------------------------
Andrea Valle
--------------------------------------------------
CIRMA - DAMS
Università degli Studi di Torino
--> http://www.cirma.unito.it/andrea/
--> andrea.valle@unito.it
--------------------------------------------------


I did this interview where I just mentioned that I read Foucault. Who  
doesn't in university, right? I was in this strip club giving this  
guy a lap dance and all he wanted to do was to discuss Foucault with  
me. Well, I can stand naked and do my little dance, or I can discuss  
Foucault, but not at the same time; too much information.
(Annabel Chong)





[-- Attachment #1.2: Type: text/html, Size: 5706 bytes --]

[-- Attachment #2: Type: text/plain, Size: 487 bytes --]

___________________________________________________________________________________
If your question is of interest to others as well, please add an entry to the Wiki!

maillist : ntg-context@ntg.nl / http://www.ntg.nl/mailman/listinfo/ntg-context
webpage  : http://www.pragma-ade.nl / http://tex.aanhet.net
archive  : https://foundry.supelec.fr/projects/contextrev/
wiki     : http://contextgarden.net
___________________________________________________________________________________

^ permalink raw reply	[flat|nested] 19+ messages in thread

* Re: Doc to ConTeXt [was Re:  HTML to ConTeXt]
  2007-11-10  5:44         ` Saji Njarackalazhikam Hameed
@ 2007-11-10 13:10           ` Andrea Valle
  0 siblings, 0 replies; 19+ messages in thread
From: Andrea Valle @ 2007-11-10 13:10 UTC (permalink / raw)
  To: mailing list for ConTeXt users


[-- Attachment #1.1: Type: text/plain, Size: 946 bytes --]

Hi Saji,

Thanks, I've already looked at it.
I will surely take into account your idea, as I'd like to convert to  
context my wiki pages (with wikka wiki).
At the end, the problem is html to context.
Powerful library indeed, as far as I can understand Ruby.

Best

-a-



--------------------------------------------------
Andrea Valle
--------------------------------------------------
CIRMA - DAMS
Università degli Studi di Torino
--> http://www.cirma.unito.it/andrea/
--> andrea.valle@unito.it
--------------------------------------------------


I did this interview where I just mentioned that I read Foucault. Who  
doesn't in university, right? I was in this strip club giving this  
guy a lap dance and all he wanted to do was to discuss Foucault with  
me. Well, I can stand naked and do my little dance, or I can discuss  
Foucault, but not at the same time; too much information.
(Annabel Chong)





[-- Attachment #1.2: Type: text/html, Size: 6482 bytes --]

[-- Attachment #2: Type: text/plain, Size: 487 bytes --]

___________________________________________________________________________________
If your question is of interest to others as well, please add an entry to the Wiki!

maillist : ntg-context@ntg.nl / http://www.ntg.nl/mailman/listinfo/ntg-context
webpage  : http://www.pragma-ade.nl / http://tex.aanhet.net
archive  : https://foundry.supelec.fr/projects/contextrev/
wiki     : http://contextgarden.net
___________________________________________________________________________________

^ permalink raw reply	[flat|nested] 19+ messages in thread

* Re: Doc to ConTeXt [was Re:  HTML to ConTeXt]
  2007-11-10 11:59           ` Andrea Valle
@ 2007-11-10 14:07             ` Idris Samawi Hamid
  2007-11-10 14:11               ` Andrea Valle
  0 siblings, 1 reply; 19+ messages in thread
From: Idris Samawi Hamid @ 2007-11-10 14:07 UTC (permalink / raw)
  To: mailing list for ConTeXt users

On Sat, 10 Nov 2007 04:59:18 -0700, Andrea Valle <valle@di.unito.it> wrote:

>>> http://pdftohtml.sourceforge.net/
>> Looks impressive...
>
> I think so
>
>
>> Looks VERY impressive... Tell me, how did you set up the cropmarks
>> etc.?
>
> Mmh, maybe you're are referring to the original source? The output
> one is bare bone (what I need)
> In any case, I upload the wrong reconstructed pdf (too late at night...)

Ah! I had a feeling that was too good to be true ;-)

Best wishes
Idris

-- 
Professor Idris Samawi Hamid, Editor-in-Chief
International Journal of Shi`i Studies
Department of Philosophy
Colorado State University
Fort Collins, CO 80523

--
Using Opera's revolutionary e-mail client: http://www.opera.com/mail/
___________________________________________________________________________________
If your question is of interest to others as well, please add an entry to the Wiki!

maillist : ntg-context@ntg.nl / http://www.ntg.nl/mailman/listinfo/ntg-context
webpage  : http://www.pragma-ade.nl / http://tex.aanhet.net
archive  : https://foundry.supelec.fr/projects/contextrev/
wiki     : http://contextgarden.net
___________________________________________________________________________________


^ permalink raw reply	[flat|nested] 19+ messages in thread

* Re: Doc to ConTeXt [was Re:  HTML to ConTeXt]
  2007-11-10 14:07             ` Idris Samawi Hamid
@ 2007-11-10 14:11               ` Andrea Valle
  2007-11-10 19:08                 ` Hans Hagen
  0 siblings, 1 reply; 19+ messages in thread
From: Andrea Valle @ 2007-11-10 14:11 UTC (permalink / raw)
  To: mailing list for ConTeXt users


[-- Attachment #1.1: Type: text/plain, Size: 1625 bytes --]

Yes,

>>> Tell me, how did you set up the cropmarks
>>> etc.?

but cropmarks are the easy parts using layers...
:)

Best

-a-



> Best wishes
> Idris
>
> --  
> Professor Idris Samawi Hamid, Editor-in-Chief
> International Journal of Shi`i Studies
> Department of Philosophy
> Colorado State University
> Fort Collins, CO 80523
>
> --
> Using Opera's revolutionary e-mail client: http://www.opera.com/mail/
> ______________________________________________________________________ 
> _____________
> If your question is of interest to others as well, please add an  
> entry to the Wiki!
>
> maillist : ntg-context@ntg.nl / http://www.ntg.nl/mailman/listinfo/ 
> ntg-context
> webpage  : http://www.pragma-ade.nl / http://tex.aanhet.net
> archive  : https://foundry.supelec.fr/projects/contextrev/
> wiki     : http://contextgarden.net
> ______________________________________________________________________ 
> _____________

--------------------------------------------------
Andrea Valle
--------------------------------------------------
CIRMA - DAMS
Università degli Studi di Torino
--> http://www.cirma.unito.it/andrea/
--> andrea.valle@unito.it
--------------------------------------------------


I did this interview where I just mentioned that I read Foucault. Who  
doesn't in university, right? I was in this strip club giving this  
guy a lap dance and all he wanted to do was to discuss Foucault with  
me. Well, I can stand naked and do my little dance, or I can discuss  
Foucault, but not at the same time; too much information.
(Annabel Chong)





[-- Attachment #1.2: Type: text/html, Size: 8912 bytes --]

[-- Attachment #2: Type: text/plain, Size: 487 bytes --]

___________________________________________________________________________________
If your question is of interest to others as well, please add an entry to the Wiki!

maillist : ntg-context@ntg.nl / http://www.ntg.nl/mailman/listinfo/ntg-context
webpage  : http://www.pragma-ade.nl / http://tex.aanhet.net
archive  : https://foundry.supelec.fr/projects/contextrev/
wiki     : http://contextgarden.net
___________________________________________________________________________________

^ permalink raw reply	[flat|nested] 19+ messages in thread

* Re: Doc to ConTeXt [was Re:  HTML to ConTeXt]
  2007-11-10 14:11               ` Andrea Valle
@ 2007-11-10 19:08                 ` Hans Hagen
  0 siblings, 0 replies; 19+ messages in thread
From: Hans Hagen @ 2007-11-10 19:08 UTC (permalink / raw)
  To: mailing list for ConTeXt users

Andrea Valle wrote:
> Yes,
> 
>>>> Tell me, how did you set up the cropmarks
>>>> etc.?
> 
> but cropmarks are the easy parts using layers...
> :)

\setuplayout[marking=on|color]


-----------------------------------------------------------------
                                           Hans Hagen | PRAGMA ADE
               Ridderstraat 27 | 8061 GH Hasselt | The Netherlands
      tel: 038 477 53 69 | fax: 038 477 53 74 | www.pragma-ade.com
                                              | www.pragma-pod.nl
-----------------------------------------------------------------
___________________________________________________________________________________
If your question is of interest to others as well, please add an entry to the Wiki!

maillist : ntg-context@ntg.nl / http://www.ntg.nl/mailman/listinfo/ntg-context
webpage  : http://www.pragma-ade.nl / http://tex.aanhet.net
archive  : https://foundry.supelec.fr/projects/contextrev/
wiki     : http://contextgarden.net
___________________________________________________________________________________


^ permalink raw reply	[flat|nested] 19+ messages in thread

* Re: html to context
  2005-01-15 15:12 html to context ishamid
  2005-01-15 15:23 ` Adam Lindsay
@ 2005-01-15 15:24 ` Patrick Gundlach
  1 sibling, 0 replies; 19+ messages in thread
From: Patrick Gundlach @ 2005-01-15 15:24 UTC (permalink / raw)


Hi Idris,

> Have any of you guys a script or something to convert simple html to ConTeXt? 
> There are apparently a couple of html2latex converters out there but is there 
> anything for ConTeXt?

I came across this a couple of days ago:

http://kronavita.de/chris/data/md2context.xsl 

didn't try it, looks rather simple (and special). Perhaps better than
nothing(?).

Patrick
-- 
ConTeXt wiki: http://contextgarden.net

^ permalink raw reply	[flat|nested] 19+ messages in thread

* Re: html to context
  2005-01-15 15:12 html to context ishamid
@ 2005-01-15 15:23 ` Adam Lindsay
  2005-01-15 15:24 ` Patrick Gundlach
  1 sibling, 0 replies; 19+ messages in thread
From: Adam Lindsay @ 2005-01-15 15:23 UTC (permalink / raw)


ishamid said this at Sat, 15 Jan 2005 08:12:34 -0700:

>Have any of you guys a script or something to convert simple html to
ConTeXt? 
>There are apparently a couple of html2latex converters out there but is
there 
>anything for ConTeXt?

Huh, that's funny. I was just thinking about that in the context of this
post, as pointed out by Bruce d'Arcus:
<http://kronavita.de/chris/blog/archive/2005/01/converting-bluecloth-to-
pdf.html>

So it sounds like that XSLT would be one place to start.
I have to think, at least with XHTML-basic <http://www.w3.org/TR/xhtml-
basic/>, that going in and renaming a few of the elements in x-contml
would get you a lot of the way there, as well.

I have to think that <http://tidy.sourceforge.net/> is a worthwhile pre-
processor to have in one's arsenal, and I pointed out <http://www.re.be/
css2xslfo/> to the foXet list just the other day. Technically speaking,
if that converts to XSL-FO as well as it appears, ConTeXt could well be
the back end to it.

But to answer your question, no I don't have anything per se. I'm just
thinking about it. (And, because of PhD obligations, that's all I'm going
to do about it, for the time being!)
-- 
=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=
 Adam T. Lindsay, Computing Dept.     atl@comp.lancs.ac.uk
 Lancaster University, InfoLab21        +44(0)1524/510.514
 Lancaster, LA1 4WA, UK             Fax:+44(0)1524/510.492
-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-

^ permalink raw reply	[flat|nested] 19+ messages in thread

* html to context
@ 2005-01-15 15:12 ishamid
  2005-01-15 15:23 ` Adam Lindsay
  2005-01-15 15:24 ` Patrick Gundlach
  0 siblings, 2 replies; 19+ messages in thread
From: ishamid @ 2005-01-15 15:12 UTC (permalink / raw)


Hi gang,

Have any of you guys a script or something to convert simple html to ConTeXt? 
There are apparently a couple of html2latex converters out there but is there 
anything for ConTeXt?

Best
Idris

Professor Idris Samawi Hamid
Department of Philosophy
Colorado State University
Fort Collins, CO 80523

^ permalink raw reply	[flat|nested] 19+ messages in thread

end of thread, other threads:[~2007-11-10 19:08 UTC | newest]

Thread overview: 19+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2007-10-25 14:50 HTML to ConTeXt Aditya Mahajan
2007-10-25 20:17 ` Idris Samawi Hamid
2007-10-26  4:22   ` Aditya Mahajan
2007-10-26 11:37     ` Doc to ConTeXt [was Re: HTML to ConTeXt] Idris Samawi Hamid
2007-11-10  1:30       ` Andrea Valle
2007-11-10  3:14         ` Idris Samawi Hamid
2007-11-10 11:25           ` Andrea Valle
2007-11-10 12:09             ` Andrea Valle
2007-11-10  3:33         ` Idris Samawi Hamid
2007-11-10 11:59           ` Andrea Valle
2007-11-10 14:07             ` Idris Samawi Hamid
2007-11-10 14:11               ` Andrea Valle
2007-11-10 19:08                 ` Hans Hagen
2007-11-10  5:44         ` Saji Njarackalazhikam Hameed
2007-11-10 13:10           ` Andrea Valle
     [not found]         ` <6faad9f00711100331h547664c6l97d2c3b82c16d8dd@mail.gmail.com>
2007-11-10 12:30           ` Andrea Valle
  -- strict thread matches above, loose matches on Subject: below --
2005-01-15 15:12 html to context ishamid
2005-01-15 15:23 ` Adam Lindsay
2005-01-15 15:24 ` Patrick Gundlach

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).