Fork me on GitHub

Need your expertise with text pages.  Bottom

  • Hello,

    Hope everyone is well.

    We have a community website currently up and running with PN 0.75. We have many small booklets (say 100 or so) that have been published over the years, and are stored as either PDF or MS word format.

    We want them to be online and:
    1) all the text is searchable
    2) can be read online
    3) can be printed
    4) can be downloaded

    Now, I've tried both htmlpages and Nukewrapper. Nukewrapper doesn't seem to allow searching through the pages it wraps, which is a problem
    htmlpages doesn't create a block based on the theme for the text it incorporates, which results in the page generated looking weird.

    Are there any other options? Or am I doing this all wrong? Let me know what you think.

    Thank you.
  • You won't be able to incorporate Word files, the best you can do is export as HTML (and Word is notoriously bad at HTML; it bloats the code terribly). PDF is a digital format, and needs the Acrobat Reader plugin to read online. There are no practical solutions to allow either to be viewed natively in a browser and searched. On Unix/Linux servers there is a technical solution for PDF, but it's a bit involved. Any document that can be viewed in a browser can also be saved, but apart from that download links in maybe something like Pagesetter may be good.
    No, NukeWrapper doesn't currently include a Search plugin, it is not finished, there are no wrappers that has a Search plugin for static pages on the server. One of the things I have to get back to soon. Although I did see a post by ColdRolledSteel (Craig Saunders) making a basic one for Unix systems. htmlpages stores basic pages in the database (which also limits size), so it is searchable. I don't know enough about Pagesetter to tell if it can help in that.
  • Your best bet is going to be to go with something like Pagesetter. You will be able to make the entire document searchable, archivable etc... the only catch is that the contents of the documents you currently have will need to be either "cut and pasted" into the WYSIWYG editor included with Pagesetter (has a MS word code kill option so it doesn't get all screwey) or to export all the documents to XML and then import them through Pagesetter.

    All in all it's going to take some initial investment of time to grandfather you existiong docs, but once it's up and runnign you should be rather satisfied with the results.

    Good luck
  • remmingtonshowdown

    Pagesetter (has a MS word code kill option so it doesn't get all screwey)

    Cool feature. Do you cut from the original Word doc, or from the HTML file saved from the original Word doc?
  • You know I've never actually tried it as I write all my articles on the site... some of my users have though-I should ask... Based on my experience and reading on Jorn's site I believe you can just copy the body of a .doc and them paste it into HTMLarea and voila!

    I believe in the forums on Jorn's site there are some links to the code and plugins if anyone wanted to impliment them into another module etc...

    --
    Burnham Racing
    Illinois Cycling Association
  • I may have to give that a try. I have a client that insists on using Word for making articles and converting to HTML, and it's a pain in the you-know-where. Thanks for the info!
  • Wendell,

    Keep in mind that if your client uses Firefox, he (or more likely you) will have to make it so firefox will allow the paste function. There are instructions all over the Mozilla forums...

    What's even cooler is that HTMLarea is customizable so you can add and remove buttons and features as you like...

This list is based on users active over the last 60 minutes.