Mike Borozdin's Blog

A blog about programming, web and IT in general

Why I hate PDF as an E-book Format or PDF vs CHM

PDF (Portable Document Format) is a common choice for e-books, big manuals comprising of several hundred pages and other documents. At a first glance, it seems to be a reasonable choice. PDF is a really cross-platform format, the most popular PDF viewer – Adobe Acrobat – is freely available in all popular operating systems – Windows, Mac OS and Linux. But what is wrong with it?

Well, the first of all, it tends to be slow, of course, it might depend on an application you are viewing your document with, but still opening large files in Adobe Acrobat doesn’t seem to be an easy walk. Of course, it’s not the only reason why I think this format is a poor choice for e-books.

There is a significant disadvantage of PDF – its searching capabilities. Frankly speaking, it’s not format’s fault, but a glitch of the current PDF viewers. The problem with searching through PDF is its linearity, I mean, when finding some word or a phrase it will show you every occurrence of that word or phrase from the top of the document to its bottom. Sure, you can open the “Full Reader Search” window, but still it will just represent a list of matched occurrences.  So, it’s not very convenient to search through a 500 pages document for a word that you might find in its bottom or in the middle of the document.

image

Nevertheless, there is a document word that doesn’t have the above mentioned disadvantages. I suppose everyone must be familiar with it. It’s CHM (Microsoft HTML Help). In fact, the format is just a collection of simple HTML pages. But unlike PDF, CHM is quite fast. It supports chapters, yes, PDF does also can contain chapters in the document, but they are no so convenient. In addition, when you open a PDF document, all its contents get loaded at once, while CHM only loads the pages and the chapters you are reading.

And of course the main advantage of CHM over PDF is its search capabilities. When you search through a CHM documents you will get a list of the relevant chapters and pages ordered by their relevancy.  The CHM search engine is also morphological aware.

image

 

In a conclusion I’m going to say that actually it’s quite obvious that PDF is such common as an e-book format because it is the format that is used for submitting books to a printing office and therefore they don’t have to prepare any other kind of documents to publish an e-book, they can just sell PDFs, while creating a CHM file involves a greater amount of work. As a result such e-books don’t have any significant advantages over printed ones. Sure, they don’t take any space on a bookshelf, they are cheaper than paper books and they don’t require cutting wood to create paper, but still they are not very convenient for reading and finding some information.

Anyway, there are some places on the Net where it is possible to find CHM e-books.


Tags:
Posted by Mike Borozdin on Thursday, July 30, 2009 12:17 PM GMT
  Shout it Kick it!  
Permalink | Comments (13) | Post RSSRSS comment feed

Comments

Chris Hope New Zealand

Friday, July 31, 2009 12:05 AM GMT

But doesn't CHM only work on Windows? Sounds like a pretty big disadvantage to the format to me.

Mike Borozdin Russia

Friday, July 31, 2009 1:26 AM GMT

Hi Chris,

Yes, by default CHM works only on Windows. I mean you don't need to have anything installed to view it on Windows. However there are some free CHM viewers available for other platforms as well.

raveman Poland

Friday, July 31, 2009 3:34 AM GMT

i think you cant print chm (or at least not all at once) and i like viewing pdf better since i can only use (page-down or down arrow), but with chm u have to use mouse.

Sanil India

Friday, July 31, 2009 5:56 AM GMT

But chm is always easy to read, compared to pdf. Try searching something in pdf, I bet it is even worser then reading a real book.

Chocolim Paraguay

Friday, July 31, 2009 7:03 AM GMT

I think you are using a bad reader, yes the Adobe reader is a bad one. Try Foxit.
It will be all things faster.
PDF is better than CHM, just put the PDF on full screen to read it. A CHM with tables and images is horrible

Antonio Garcia Marin Spain

Friday, July 31, 2009 7:05 AM GMT

CHM work in Linux too. You have Gnochm or Kchm. I like too this file format.

Mike Borozdin Russia

Friday, July 31, 2009 7:31 AM GMT

@Sanil,

Exactly! PDF is much harder to read than a real paper book.

@Chocolim,

Yes, I tried Foxit. It doesn't provide the search functionality that already presents in CHM.

gorlok Argentina

Friday, July 31, 2009 7:44 AM GMT

Are you saying this seriously? Really?

I can't read a chm ebook. I read many tech ebooks in my job. I love the 'continous' mode for reading. And I use Ubuntu mainly (and Windows XP/Vista sometimes).

I hate the navigation model of all CHM readers. It's brainless for me. When I read a pdf ebook is like a real paper book. It's natural. And I can use only keyboard (no-mouse/pad) with all the pdf viewers.

Besides, when I print a fragment of a pdf ebook, I get what I see. Try this in chm!

CHM doesn't work. CHM is like a help system. It doesn't work por reading. Just my opinion.

Mike Borozdin Russia

Friday, July 31, 2009 7:58 AM GMT

@gorlok,

I agree that reading a PDF file is like reading a real book, but the problem (for me and for other people who agree with me) that it is still hard to read. I mean, the computer is just a different type of medium, it's not a paper book. And while I can't look for a specific word in a real book without having to search through all the pages, I still can't do the same with PDF. At the same time as I have already mentioned before, the search functionality of CHM is fantastic!

Sanil India

Friday, July 31, 2009 8:50 AM GMT

The only thing in which PDF is better than CHM is that it's easiness to author and print. In rest of areas pdf pretty much lacks the feature of CHM.

Even most CHM readers boosts its search capabilities by prepare an indexing well in advance, so results are instant and already grouped, and not like PDF, where one has to go sequentialy. (that is independent of format, but yet a feature)

@gorlok: Yeah! I totally agree, CHM is a particularly suitable for a help system, but again much of educational content resemble grouping similiar to a help system. Still, PDF is suitable for content such as literature, letters, and stories which are continous in nature. So, you don't have to flip back and forward.

The thing is that much is dependent on the author, a CHM can be authored as a continous document, like PDF or arranged into hierarchial categories. Still you can print individual chapters or the whole book, in case you need it.

Sanil India

Friday, July 31, 2009 8:58 AM GMT

One thing to note is that a PDF format accomodate same number of pages in smaller size files, as compared to CHM, Which is an advantage for e-book providers, not so advantageous for a reader though.

Alexander Kartashov Russia

Saturday, August 08, 2009 7:40 AM GMT

It’s a point of designation. Yes, PDF is only convenient for printing. CHM is very useful for technical documentation. You can easily locate necessary place without flipping paper pages or scrolling a screen. You can also use copy/paste without any limitations. Try it with PDF – it’s awful. Furthermore, if you use on-screen OCR (like Kleptomania), you can’t use PDF at all due its font smoothing.

kevin Republic of the Philippines

Saturday, August 22, 2009 6:51 AM GMT

it's a matter of personal preference, really. for me, i liked neither. the main reason why i like PDFs is that i can view them on my symbian smartphone.

another reason is that PDF readers usually allow users to scroll through the text with the hand tool, which, for me, is extremely enjoyable. again, a matter of personal preference Smile still, as pointed out by some people, PDF stuff are hard to read. i'd patronize PDFs for print jobs. ^^

CHMs? i've had a few experiences with them, but at least they were good. i remember downloading two ebooks of the same title. one in CHM, and one in PDF. guess what, reading the CHM one, even if it was less "eye-candyish", was more comfortable to read.

i know this is out of topic, but i heard adobe products get more bloated every new release. im not sure though. the only thing i can be sure of is that flash player 10 has memory leak issues on vista. sorry if i got to the flash player thingy. pdfs connect with adobe acrobat, adobe acrobat connects with adobe, and adobe connects with flash Smile hehe.

Add comment




biuquote
Loading