boo2pdf

About

This is the home of boo2pdf, an IBM BookManager to PDF conversion app & web service. I’m currently experimenting with the HTML to PDF backends and would like feedback with book files I haven’t tried. Once the code is cleaned up, I will dump it on my site.  You can find the web service at http://ps-2.kev009.com:8081/boo2pdf/

Motivation

I have a large collection of old IBM machines and documentation. I want this documentation indexed by my own search facilities and Google for easy retrieval. PDF is widely read while BookManager requires proprietary software and no search engines I know of parse it.

This will probably be useful to Mainframers as well.

Known Limitations

  • Currently, internal hyperlinks and headings are not parsed, indexed, or otherwise handled.
  • The Linux SoftCopy Reader does not convert some of the older embedded image formats. Possible formats are: GIF, PNG. JPG, MET, GDF, WMF. I’m guessing it is one of the later that does not have a Linux filter. You will know an image did not convert by red text indicating such in your PDF. I’ve seen this in a few .boo files from the early to mid ’90s.

Technical Details

I am using the JAR files from IBM SoftCopy Reader for Linux. I’ve decompiled these and written my own main class and and a wrapper script to take care of setting the LD_LIBRARY_PATH, Java classpath, and other such glue code. I use SoftCopy Reader’s API to output HTML and images from the BookManager files. I then pass this to htmldoc for PDF conversion.

Code

boo2pdf Gitweb

Share this article:
  • Reddit
  • HackerNews
  • Slashdot
  • Facebook
  • StumbleUpon
  • Google Bookmarks
  • FSDaily
  • Twitter
  • Identi.ca
  • Digg
  • del.icio.us
  • Print
  • email
  • PDF

Leave a Reply

Your email address will not be published. Required fields are marked *

*

You may use these HTML tags and attributes: <a href="" title=""> <abbr title=""> <acronym title=""> <b> <blockquote cite=""> <cite> <code> <del datetime=""> <em> <i> <q cite=""> <strike> <strong>