First, HTML is very likely NOT to be well-formed, and hence not XML. XSLT 2.0 has some new features that enable it to handle non-XML input, but I think you would be better off getting a suitable version of HTMLTidy for your platform. Its raison d'etre (excuse the lack of a circumflex there) is to take untidy HTML and output well-formed XHTML, which is XML.
Now that you have XML, you should transform it to XSL-FO with an XSLT stylesheet. From there, get a copy of FOP, a program that takes in XSL-FO and outputs PDF.
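To give a rough idea of what the XML-to-XSL-FO step involves, here is a minimal sketch of a stylesheet, held in a Python string so it can be checked for well-formedness before being written to disk. The page size, the margins, the helper names and the decision to handle only h1 and p are my own assumptions for illustration. A real stylesheet needs a template for every element you care about, because text that is not wrapped in an fo:block will be rejected by the formatter.

```python
import xml.etree.ElementTree as ET

# Illustrative XHTML -> XSL-FO stylesheet. It only handles <h1> and <p>;
# everything else falls through to XSLT's built-in templates.
XHTML2FO = """<xsl:stylesheet version="1.0"
    xmlns:xsl="http://www.w3.org/1999/XSL/Transform"
    xmlns:fo="http://www.w3.org/1999/XSL/Format"
    xmlns:h="http://www.w3.org/1999/xhtml"
    exclude-result-prefixes="h">

  <xsl:template match="/">
    <fo:root>
      <fo:layout-master-set>
        <fo:simple-page-master master-name="page"
            page-width="210mm" page-height="297mm" margin="20mm">
          <fo:region-body/>
        </fo:simple-page-master>
      </fo:layout-master-set>
      <fo:page-sequence master-reference="page">
        <fo:flow flow-name="xsl-region-body">
          <xsl:apply-templates select="h:html/h:body"/>
        </fo:flow>
      </fo:page-sequence>
    </fo:root>
  </xsl:template>

  <xsl:template match="h:h1">
    <fo:block font-size="18pt" font-weight="bold" space-after="6pt">
      <xsl:apply-templates/>
    </fo:block>
  </xsl:template>

  <xsl:template match="h:p">
    <fo:block space-after="6pt"><xsl:apply-templates/></fo:block>
  </xsl:template>
</xsl:stylesheet>
"""


def check_well_formed(xml_text):
    """Parse xml_text and return the root tag; raises ET.ParseError if malformed."""
    return ET.fromstring(xml_text).tag


def write_stylesheet(path):
    """Write the sketch stylesheet to path (the file name is up to you)."""
    check_well_formed(XHTML2FO)
    with open(path, "w", encoding="utf-8") as f:
        f.write(XHTML2FO)
```

The h: prefix is bound to the XHTML namespace because Tidy's XHTML output puts its elements in that namespace; without the prefix the match patterns would select nothing.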
HTML -> HTMLTidy -> XHTML -> XSLT -> XSL-FO -> FOP -> PDF
tidy.sourceforge.net - binaries and C source code available
http://www.w3.org/TR/xsl/ - The spec on xsl-fo
xmlgraphics.apache.org/fop/ - binaries and Java source code available
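If it helps, the whole chain can be driven from a small script. This is only a sketch: it assumes the tidy and fop launchers are on your PATH, and the function names, the temporary file name and the stylesheet path are placeholders of mine. It relies on FOP's -xml/-xsl options, which make FOP run the XSLT transform itself before rendering the result.

```python
import subprocess


def tidy_cmd(html_in, xhtml_out):
    # -asxhtml: emit well-formed XHTML.
    # -numeric: write numeric entities, so the XSLT step does not need
    #           the XHTML DTD to resolve names such as &nbsp;.
    return ["tidy", "-quiet", "-asxhtml", "-numeric", "-o", xhtml_out, html_in]


def fop_cmd(xhtml_in, xsl, pdf_out):
    # With -xml and -xsl, FOP applies the stylesheet and renders the
    # resulting XSL-FO to PDF in one go.
    return ["fop", "-xml", xhtml_in, "-xsl", xsl, "-pdf", pdf_out]


def html_to_pdf(html_in, xsl, pdf_out, xhtml_tmp="tidied.xhtml"):
    # Tidy exits with 1 when it only issued warnings and 2 on errors,
    # so check=True would be too strict for the first step.
    result = subprocess.run(tidy_cmd(html_in, xhtml_tmp))
    if result.returncode > 1:
        raise RuntimeError("tidy could not repair " + html_in)
    subprocess.run(fop_cmd(xhtml_tmp, xsl, pdf_out), check=True)
```

For example, html_to_pdf("page.html", "your-stylesheet.xsl", "page.pdf") would tidy page.html and hand the result to FOP, where the three file names are whatever you are using.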
--
Charles Knell
cknell@xxxxxxxxxx - email
-----Original Message-----
From: Bokluci <bokluci@xxxxxxxxxxxxxx>
Sent: Thu, 13 Apr 2006 12:01:03 +0200
To: <xsl-list@xxxxxxxxxxxxxxxxxxxxxx>
Subject: pdf
I need to create dynamic PDF files.
Is there already an XSL converter for HTML->PDF, or a good tutorial on how to
write your own?
Thanks.