A Hybrid Systems Approach to Preservation of Printed
Materials
Comparing Micrographics and Digital Technology: This paper will
focus on questions about the use of micrographics and digital
imaging technologies for preservation of printed materials. It will
not address any of the issues involved in the preservation of sound,
motion pictures, video, art, or color images. The author is aware
that other document preservation issues exist; however, it was felt
that these two technologies were of most interest to the
preservation community at this time. Topics to be covered
include:
- What are the advantages and disadvantages of each technology?
- What are the trade-offs involved in selecting one technology
over the other?
- What are the benefits of a hybrid approach?
- In a hybrid system, should the page be captured first to film
and converted to digital, or vice versa; or can it be done
simultaneously?
- What options are available for converting from film to digital
and back?
- What are the cost factors; how does one maximize image quality
while minimizing cost?
- What role should ASCII* text and
OCR (optical character recognition) play?
- How can the required resolution be determined, and what are the
resolution issues with each technology?
- What standards should concern the practitioner?
Areas of Analysis: There are three primary areas of analysis in
comparing digital electronic image systems to film-based systems for
preservation: document capture, storage, and access. In capture, the
analyst will be concerned with the capture mechanism, resolution,
quality of the captured image, acquisition speed, system cost,
operating cost, and indexing requirements. In storage, the concerns
are media permanence, media refresh requirements, technology
obsolescence, drive cost, media cost, interchangeability of media,
reliability, performance and access tradeoffs. Finally, with regard
to access, the designer must examine retrieval capability (both
searching and browsing), retrieval speed, transmission and
distribution capability, and retrieval quality. Micrographics and
imaging technologies can complement each other and best address
these concerns together in the well-designed preservation
system.
This paper will survey micrographic and digital technologies in
light of the issues and concerns defined above. The objective is to
arrive at short and long-term recommendations for developing
document preservation systems based on these technologies.
Executive Summary: Based on a review of the technology,
our findings are:
- Design objectives are extremely important: The preservation
systems designer must identify the objectives of the preservation
system in detail. For example, if practitioners desire to preserve a
faithful reproduction of the document, do they want the page as it
currently exists complete with its discoloration due to age and
water stains, or do they desire a cleaned up page, similar to what
was originally published? Obviously, an image can only be cleaned up
by using electronic technology, so system requirements have a
definite impact on the technology that must be used.
Other important system design criteria include the volume of the
workload, quality required, methods for storing and accessing the
documents, frequency of access, urgency of access, response-time
requirements, condition of the documents, and page sizes*.
- A micrographics-based preservation system is a generally
acceptable solution here and now for most printed materials. It is a
mature technology with widespread familiarity and a large installed
base. High-quality film created and stored according to standards
will last up to 500 years.
- Centralized master vaults already exist where over 3 million
rolls of film masters are stored in secure, climate-controlled
conditions for only about $1.00 per reel per year.
- Microfilm's major weakness is its inadequate access and
distribution characteristics.
- Although microforms are currently a relatively inexpensive
preservation medium for printed materials, costs for this type of
solution will increase at five to ten percent per year due to the
increasing cost of labor.
- Micrographics cannot be considered an acceptable solution for
all preservation needs; for example, it is not ideal for preserving
high-quality greyscale images, color images (e.g., artworks), sound
recordings or full motion video. In these areas, digital
technologies are the only reasonable alternative.
- It can be twenty times more expensive to store 9 X 5 inch
archival resolution page images on optical disc than on 35mm film.
- For digital preservation systems, productivity increases will be
brought on by technology advances, and these advances are expected
to accelerate rapidly over the next several years.
- There are no forms of digital storage currently on the market
that would be considered archival according to the traditional
definition.
- Write-once optical disc could be considered permanent* but not archival. The reason is not the
longevity of the media--it's the fact that the technology becomes
obsolete. Even if the media were to last 50 years, chances are there
wouldn't be a drive available to play it.
- Perhaps when referring to digital storage media, "archival"
needs to be redefined as the ability to recreate an exact copy from
the original medium before it degrades or the technology necessary
to read it becomes obsolete.
- Assuming that refreshing of media (recopying) would be cost
justified by the increase in capacity and/or reduction of cost of
the new media, a key question preservationists must answer is, "Is a
solution acceptable which requires the media to be recopied onto
more advanced media every "N" years in order to keep up with
advancing technologies?" If so, who would be in charge of assuring
that the conversion was carried out on schedule? This whole topic
could be the subject of a new paper.
- A digital image based preservation system is the most promising
future solution for printed materials. It is a rapidly changing
technology in quality, speed, and economics. Its major weaknesses
are that the technology is fairly new, has high data-storage
requirements, and lacks proven archival storage capability.
- Digital imaging technology will increase in functionality and
decrease in cost for the foreseeable future. Many experts believe
that an all-digital system will provide the most economical future
preservation solution. In fact, if one were to do a five year
present value analysis of a micrographics based versus a digital
image based preservation system today, factoring in the costs of
access and distribution, the digital system would most likely prove
to be the least expensive alternative.
- Access to the preserved materials is a key benefit of the
digital image preservation system. Access can be through a separate
database of indexes, abstracts and indexes, full-text search on the
ASCII portion of compound documents, or by browsing through the
database item by item.
- With digital technology it will no longer be necessary for the
researcher to travel to where the preserved materials are physically
located; access to historic collections throughout the country can
be as close as the nearest computer or printer.
- For digital preservation systems, productivity increase will be
brought on by technology advances, and these advances are expected
to accelerate rapidly over the next several years.
- There are no forms of digital storage currently on the market
that would be considered archival according to the traditional
definition.
- Write-once optical disc could be considered permanent3 but not
archival. The reason is not the longevity of the media--it's the
fact that the technology becomes obsolete. Even if the media were to
last 50 years, chances are there wouldn't be a drive available to
play it.
- Perhaps when referring to digital storage media, "archival"
needs to be redefined as the ability to recreate an exact copy from
the original medium before it degrades or the technology necessary
to read it becomes obsolete.
- Assuming that refreshing of media (recopying) would be cost
justified by the increase in capacity and/or reduction of cost of
the new media, a key question preservationists must answer is, "Is a
solution acceptable which requires the media to be recopied onto
more advanced media every "N" years in order to keep up with
advancing technologies?" If so, who would be in charge of assuring
that the conversion was carried out on schedule? This whole topic
could be the subject of a new paper.
- A digital image based preservation system is the most promising
future solution for printed materials. It is a rapidly changing
technology in quality, speed, and economics. Its major weaknesses
are that the technology is fairly new, has high data-storage
requirements, and lacks proven archival storage capability.
- Digital imaging technology will increase in functionality and
decrease in cost for the foreseeable future. Many experts believe
that an all-digital system will provide the most economical future
preservation solution. In fact, if one were to do a five year
present value analysis of a micrographics based versus a digital
image based preservation system today, factoring in the costs of
access and distribution, the digital system would most likely prove
to be the least expensive alternative.
- Access to the preserved materials is a key benefit of the
digital image preservation system. Access can be through a separate
database of indexes, abstracts and indexes, full-text search on the
ASCII portion of compound documents, or by browsing through the
database item by item.
- With digital technology it will no longer be necessary for the
researcher to travel to where the preserved materials are physically
located; access to historic collections throughout the country can
be as close as the nearest computer or printer.
- Efficient access to the preserved collections has the potential
of allowing the institution to self-fund some of the preservation
costs through revenues generated from the improved access to the
archival collection.
- An inexpensive solution to preservation has been explored in a
pioneering project of Cornell University. They have used digital
scanning at 600 dots per inch (dpi) binary, to create high-quality
copies on acid-free paper. The idea is to create a permanent, not
archival, paper copy that can go back on the shelf-- preservation
reformatting.
- A hybrid system, one that combines both film and digital
imaging, could well offer the best overall design for current
preservation needs. Micrographics provide a relatively inexpensive,
high-quality archival storage medium. Digital imaging contributes
access, distribution, and transmission strengths. It should be noted
that in the near future, most national service bureaus will have the
capability to transfer from one technology to the other, so the
practitioner need not design the full hybrid capability into the
local system.
- A hybrid system can be implemented with today's technology by
filming first and scanning some or all of the film to enhance access
to the preserved collection. We will designate this as the
"film-first archival preservation system."
- The latest possibility for implementing a hybrid system is
through filming and scanning simultaneously. New belt-fed
combination duplex scanner/filmer image capture devices were
introduced at the 1992 AIIM show by Bell & Howell and Kodak.
These devices could be used on non-brittle documents. As far as
processing goes, this type of system suffers from some of the same
limitations as the film-first system which will be discussed later.
- The "scan-first archival preservation system" is rapidly
becoming an acceptable alternative for the preservation system
designer. By scanning first, each page can be decomposed into
separate areas of text, line art, and halftones. Each of these will
be electronically processed independently to maximize overall page
quality. By scanning in greyscale and enhancing the digital data
prior to creating film, it will be possible to create higher quality
film than can currently be created using light/lens methodology.
- Scanning first will also allow more intelligent retrieval aids
in bar code format or blip marks to be recorded onto the film so
that retrieval can be automated.
- Digital imaging allows end-users to obtain higher quality
printed copies than micrographics. Each copy will be a
first-generation copy. As with music on a compact disc, there is no
degradation during usage. Because of the aforementioned, the
scan-first archival preservation system will be more cost-effective
to build and operate than any other type of preservation system once
all the technology is available.
- Resolution is the key design parameter for a digital image
preservation system (see Appendix A). We've
defined various levels of resolution referred to in this paper as
follows:
- "Archival resolution" is defined as the resolution necessary to
capture a faithful replica of the original document, regardless of
cost.
- "Optimal archival resolution" is the lowest resolution that will
completely satisfy the archival image objectives defined for the
system.
- "Adequate access resolution," on the order of 300 dpi binary, is
defined as the resolution sufficient to capture about 99.9 percent
of the information content of the page.
- Microfilm is "resolution-indifferent". Each frame of film can
store high-quality images with equivalent digital resolution of
about 800 to 1,000 dpi with about 8 - 12 levels of greyscale.
- Digital imaging is "resolution dependent": the higher the
resolution requirements, the higher the cost and complexity of the
system.
- The above suggests a second question pertaining to resolution
that must be answered if we are to accurately evaluate our
alternatives. It is "should film standards, which primarily measure
the high contrast components of a reproduction, be used to measure
digital reproducibility?" Do we want to have perfect print or a
high-quality copy of the entire original including halftones.
Recommendation: Currently, practitioners choosing microfilm for a
preservation solution can feel confident that their printed
materials will be adequately preserved and that even in the next
century or beyond the technology will be available to transfer this
material to other media if desired. This is true because of its
accepted archival nature, and the fact that one only needs a lens
and light to read it. Optical storage can be considered for
preservation on a selective basis provided there is a plan to recopy
the media prior to any substantial degradation. For the longer term,
practitioners should immediately begin planning for, and designing,
the hybrid archival preservation system of the future. The
continuous and accelerating improvements in electronic imaging and
optical disc technology will be the key to solving preservation
problems.
[Table of Contents] [Next]