Every time I see a utility like this, I think *maybe* I could switch to producin...

imurray · on April 16, 2012

I take some of it back. Getting the latest version of wkhtmltopdf and telling it to wait (probably longer than necessary) to process javascript, works pretty well.

    wkhtmltopdf --javascript-delay 10000 --no-stop-slow-scripts 'http://bit-player.org/deck.js/limits-to-growth-Harvard-2012-03-30/ltg-talk.html#Lotka-Volterra' slide.pdf

It's a bit slow, and a bit too hacky for me. But this tool does the best job of those I've seen.

And I've just received an email pointing me to: http://search.cpan.org/perldoc?deckjs2pdf https://github.com/potyl/perl-App-deckjs2pdf that will specifically deal with Deck.JS slides.

kelvin0 · on April 16, 2012

Well, I am looking for some feedback on a project that converts XML to PDF. Give it a try: https://github.com/kelvin0/PyXML2PDF

imurray · on April 16, 2012

I am looking for a command-line utility that could do:

    webpage2pdf 'http://bit-player.org/deck.js/limits-to-growth-Harvard-2012-03-30/ltg-talk.html#Lotka-Volterra' slide.pdf

and actually work (create a sensible PDF representation of what I can see in a browser). So my feedback wouldn't be useful, as my use case is out of scope for your project: "PyXML2PDF is NOT compatible with any XHTML/HTML/CSS. It uses a small set of tags to quickly allow generation of PDFs."

potyl · on April 16, 2012

It seems that what you want is deckjs2pdf, get it from CPAN [1] or GitHub [2]

[1] http://search.cpan.org/perldoc?deckjs2pdf

[2] https://github.com/potyl/perl-App-deckjs2pdf

pbhjpbhj · on April 16, 2012

Would it be sufficient to create PNGs of the web pages and extract the text of the webpage to place in the background of a PDF file (for search, screenreading)?

imurray · on April 16, 2012

Not for me. Personally I'll stick to ways of making decent PDFs that don't go via HTML.