Cornell University Library Digital Collections
Technical Details
The materials in the CHLA collection were scanned from brittle paper originals. Scanning has been outsourced to service vendors at various times from 1994 to the present. Early in the project, images were captured at 600 DPI as bitonal TIFF files compressed with CCITT Group 4. Simple document structuring took place either at the point of conversion or during on-campus processing; it consisted primarily of linking image numbers to pagination and tagging self-referencing portions of the text. Further conversion of CHLA materials included optical character recognition (OCR) of the page images. For serials, extensive low-level indexing was added after conversion to improve access to the articles contained in each issue.
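As a rough illustration of what linking image numbers to pagination can involve, the Python sketch below maps sequential scan numbers to printed page labels. It is not CHLA's actual tooling: the names PageRecord, build_page_map, and image_for_page are hypothetical, and the sketch assumes that unnumbered front matter precedes a run of consecutively numbered pages.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class PageRecord:
    """One scanned image and the page label printed on it (hypothetical structure)."""
    image_number: int          # 1-based position of the scan within the volume
    page_label: Optional[str]  # printed page number, or None for unnumbered pages


def build_page_map(image_count: int, first_numbered_image: int,
                   first_page: int = 1) -> List[PageRecord]:
    """Assign page labels, assuming numbering is consecutive once it starts."""
    records = []
    for image_number in range(1, image_count + 1):
        if image_number < first_numbered_image:
            label = None  # front matter: title page, contents, and so on
        else:
            label = str(first_page + image_number - first_numbered_image)
        records.append(PageRecord(image_number, label))
    return records


def image_for_page(records: List[PageRecord], page_label: str) -> Optional[int]:
    """Return the image number that carries the given printed page label."""
    for record in records:
        if record.page_label == page_label:
            return record.image_number
    return None


# A five-image volume whose printed numbering begins on the third scan.
page_map = build_page_map(image_count=5, first_numbered_image=3)
print(image_for_page(page_map, "2"))
```

A table of this kind lets a reader ask for a printed page and be served the matching image. Real volumes often have plates, inserts, or repeated numbering that would need per-image overrides rather than a single offset.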
CHLA originally operated on the Dienst system, then migrated to “middleware” developed by the University of Michigan, then to the DLXS platform, and most recently to its present platform, Samvera, an open-source repository.