TIFF is a file format commonly used for storing digital versions of paper documents because it is a standard format for most scanners and software applications. However, the advent of Portable Document Format (PDF) has added new dimensions and powerful capabilities to electronic documents because PDF is more extensible than other image-based formats.
PDF (Portable Document Format) is a universal file format for document exchange that preserves all the fonts, formatting, colours, and graphics of any source document (whether it’s on paper or from the Web or other electronic sources). Preservation is faithful regardless of the application and platform used to create or view the material. PDF files can be shared, viewed, navigated, and printed on a broad range of operating systems by anyone using free Adobe Acrobat Reader™ or other software.
With scanning software, volumes of legacy paper documents may be converted to PDF so you can search, annotate, publish, and archive all of your information in a digital environment.
However there are different types of PDF for use when scanning paper-based documents:
PDF Image Only
PDF Image Only is the simplest scanning for documents that don’t require searchable text
PDF Image Only takes a bitmapped image of a document (like a TIF file) and applies a PDF wrapper to that raster image. Because PDF Image Only files do not contain OCR text, their content is not searchable. But the file can be integrated with other Adobe PDF documents and read by anyone on any platform with Adobe Acrobat Reader software. In addition, you can add keywords to the file, so you can search for the document later.
PDF Image Only is ideal for
transactional documents, such as invoices and forms. For example, you can use
Image Only to scan invoices into an imaging archive. Digital versions of
invoices must be absolutely faithful to the originals, yet they are rarely
retrieved once they have been entered into the system. When an invoice does need
to be retrieved, it can easily be found with an index search for the invoice
number or customer name.
PDF Searchable Image is a PDF Image Only document with the addition of a text layer beneath the image. This approach retains the look of the original page while enabling text searchability.
A document created in PDF Searchable Image offers the best of both worlds—an exact replica of the original document that is also fully searchable. PDF Searchable Image files contain two layers: a bitmapped (image) layer and a hidden text layer. The bitmapped layer maintains the visual representation of the original document. The text layer contains the Optical Character Recognition (OCR) version so you can search for any word on any page. PDF Searchable Image comes in two variants: Exact and Compact. These two are similar in many ways, but they have a few key differences.
The OCR process required to create
PDF Searchable Image typically provides text accuracy of 97 to 99 percent. One
to three wrong characters for every 100 may seem like a lot errors. But this is
not a problem for those applications that this approach is designed for. Since
the user sees a scanned image representation of the original paper page, OCR
errors will not be visible to the eye. The errors are only an issue when
searching or copying text, which accesses the hidden text layer.
Alliance BatchScan can scan into PDF Searchable format
* PDF Searchable Images * PDF Scanned Images * Scanning PDF Searchable Images * PDF Searchable File * PDF Searchable OCR *