UFDC Home  

Imaging Specifications

Digital Library of the Caribbean
Permanent Link: http://ufdc.ufl.edu/AA00016215/00001

Material Information

Title: Imaging Specifications
Series Title: dLOC Advanced Topics Training Institute
Physical Description: Training slides
Language: English
Creator: Sullivan, Mark V.
Publisher: University of Florida Libraries / Digital Library of the Caribbean
Place of Publication: Gainesville, Florida
Publication Date: 2013

Subjects

Subjects / Keywords: Digitization
Training
SobekCM
dLOC Training Presentation

Notes

Acquisition: Resource was uploaded and editing during training for dLOC partners in Gainesville, FL on 7/29/2013. Submitted by Mark Sullivan.
General Note: Slides for dLOC partner training

Record Information

Source Institution: University of Florida
Holding Location: University of Florida
Rights Management:
The author dedicated the work to the Commons by waiving all of his or her rights to the work worldwide under copyright law and all related or neighboring legal rights he or she had in the work, to the extent allowable by law.
System ID: AA00016215:00002

Permanent Link: http://ufdc.ufl.edu/AA00016215/00001

Material Information

Title: Imaging Specifications
Series Title: dLOC Advanced Topics Training Institute
Physical Description: Training slides
Language: English
Creator: Sullivan, Mark V.
Publisher: University of Florida Libraries / Digital Library of the Caribbean
Place of Publication: Gainesville, Florida
Publication Date: 2013

Subjects

Subjects / Keywords: Digitization
Training
SobekCM
dLOC Training Presentation

Notes

Acquisition: Resource was uploaded and editing during training for dLOC partners in Gainesville, FL on 7/29/2013. Submitted by Mark Sullivan.
General Note: Slides for dLOC partner training

Record Information

Source Institution: University of Florida
Holding Location: University of Florida
Rights Management:
The author dedicated the work to the Commons by waiving all of his or her rights to the work worldwide under copyright law and all related or neighboring legal rights he or she had in the work, to the extent allowable by law.
System ID: AA00016215:00002


This item is only available as the following downloads:


Full Text

PAGE 1

Mark Sullivan Digital Library of the Caribbean

PAGE 2

Imaging Imaging Theory & Specifications Recommended Equipment and Software 2 dLOC Training (7/29/2013) Gainesville, FL Mark Sullivan

PAGE 3

3 dLOC Training (7/29/2013) Gainesville, FL Mark Sullivan

PAGE 4

Imaging Theory & Best Practices Bit Depth & Color Space Resolution File Types Image Compression OCR Sample Directories Questions 4 dLOC Training (7/29/2013) Gainesville, FL Mark Sullivan

PAGE 5

Bit Depth & Color Space Bi Greyscales 8 bit ( 256 shades of gray ) 16 bit (65536 shades of gray ) RGB ( usually 24 bit ) CMYK ( usually 32 bit ) 5 dLOC Training (7/29/2013) Gainesville, FL Mark Sullivan

PAGE 6

Bit Depth & Color Space Image: Nevit Dilmen found at Wikimedia commons 6 dLOC Training (7/29/2013) Gainesville, FL Mark Sullivan

PAGE 7

Bit Depth & Color Space Color Fidelity Meaningful color should be retained 7 Bi tonal 8 bit Greyscale 24 bit Color dLOC Training (7/29/2013) Gainesville, FL Mark Sullivan

PAGE 8

Bit (Almost) never scan 1 bit Completely grey items should (usually) be scanned 8 bit greyscale. Items with meaningful color should be scanned 24 bit RGB Trade offs between quality and file size 8 dLOC Training (7/29/2013) Gainesville, FL Mark Sullivan

PAGE 9

Text Optical Character Recognition 9 dLOC Training (7/29/2013) Gainesville, FL Mark Sullivan

PAGE 10

Resolution Resolution of an image expressed in pixels PPI pixels per inch DPI dots per inch 10 dLOC Training (7/29/2013) Gainesville, FL Mark Sullivan

PAGE 11

Resolution : Recommended R ESOLUTION U SE F OR 300 pixels per inch ( ppi ) Printed text with normal sized fonts Oversized documents and maps Manuscripts with legible script 600 pixels per inch ( ppi ) Photographs and select graphic arts Printed text with very small fonts Manuscripts with difficult scripts 11 dLOC Training (7/29/2013) Gainesville, FL Mark Sullivan

PAGE 12

Resolution : Rationale 1 Newspaper graphics printed at 80 dpi Magazine graphics printed at 120 dpi High end graphics printed at 300 dpi Scanning at 300 dpi is sufficient 12 dLOC Training (7/29/2013) Gainesville, FL Mark Sullivan

PAGE 13

Resolution : Rationale 2 Text Optical Character Recognition 13 dLOC Training (7/29/2013) Gainesville, FL Mark Sullivan

PAGE 14

Resolution : Rationale 3 Photographs Use 600 dpi Continuous tone images Unexpected use capture all details 14 dLOC Training (7/29/2013) Gainesville, FL Mark Sullivan

PAGE 15

File Types Save archival masters as TIFF Internet delivery as JPEGs or JPEG2000s 15 dLOC Training (7/29/2013) Gainesville, FL Mark Sullivan

PAGE 16

Image Compression Save archival TIFFs as non compressed Lossy 16 dLOC Training (7/29/2013) Gainesville, FL Mark Sullivan

PAGE 17

OCR Optical Character Recognition Creation of plain text from an image file Just as important is the positional information! Text highlighting Text analysis 17 dLOC Training (7/29/2013) Gainesville, FL Mark Sullivan

PAGE 18

OCR : ALTO XML LOC XML schema / standard Contains position (and style) of each word, with possible variants Can be embedded within a METS file Used by NDNP 18 dLOC Training (7/29/2013) Gainesville, FL Mark Sullivan

PAGE 19

OCR : ALTO XML 19 dLOC Training (7/29/2013) Gainesville, FL Mark Sullivan

PAGE 20

File Types (sample directory 1) 00001.tif (archival master TIFFs) 00001.jpg (standard page view) 00001.jp2 ( zoomable page view) 00001thm.jpg (thumbnail) 00001.txt ( text) GOOD! 20 dLOC Training (7/29/2013) Gainesville, FL Mark Sullivan

PAGE 21

File Types (sample directory 2) 00001_archive.tif (archival master TIFFs) 00001_processed.tif (processed TIFF) 00001.jpg (standard page view) 00001.jp2 ( zoomable page view) 00001thm.jpg (thumbnail) GOOD! 21 dLOC Training (7/29/2013) Gainesville, FL Mark Sullivan

PAGE 22

File Types (sample directory 3) 00001.tif (archival master TIFFs) 00002.tif (archival master TIFFs) 00003.tif (archival master TIFFs) 00004.tif (archival master TIFFs) Book.pdf (presentation PDF) FINE! 22 dLOC Training (7/29/2013) Gainesville, FL Mark Sullivan

PAGE 23

File Types (sample directory 4) Book.pdf (presentation PDF) BAD! Do not scan directly to PDF, or any other presentation file type 23 dLOC Training (7/29/2013) Gainesville, FL Mark Sullivan

PAGE 24

Review of Topics Bit Depth & Color Space Resolution File Types Image Compression OCR Sample Directories 24 dLOC Training (7/29/2013) Gainesville, FL Mark Sullivan

PAGE 25

dLOC Training (7/29/2013) Gainesville, FL Mark Sullivan 25

PAGE 26

26 dLOC Training (7/29/2013) Gainesville, FL Mark Sullivan

PAGE 27

Scanning Equipment Flatbed scanners Sheet feed scanners Book scanners Map scanners Microfilm 27 dLOC Training (7/29/2013) Gainesville, FL Mark Sullivan

PAGE 28

Flatbed Scanners Microtek ScanMaker 9800XL Epson Expression 10000XL 28 dLOC Training (7/29/2013) Gainesville, FL Mark Sullivan

PAGE 29

Sheet feed Scanners Panasonic KV S2046C 29 dLOC Training (7/29/2013) Gainesville, FL Mark Sullivan

PAGE 30

Book Scanners i2S CopiBook ( 24 bit color ) Konica Minolta PS7000 with grayscale up grade 30 dLOC Training (7/29/2013) Gainesville, FL Mark Sullivan

PAGE 31

Oversized Document Scanners Camera back, vacuum table, etc.. Betterlight Super 8K HS 31 dLOC Training (7/29/2013) Gainesville, FL Mark Sullivan

PAGE 32

Microfilm Scanners 32 dLOC Training (7/29/2013) Gainesville, FL Mark Sullivan

PAGE 33

dLOC Training (7/29/2013) Gainesville, FL Mark Sullivan 33



PAGE 1

Mark Sullivan Digital Library of the Caribbean

PAGE 2

Imaging Imaging Theory & Specifications Recommended Equipment and Software 2 dLOC Training (7/29/2013) Gainesville, FL Mark Sullivan

PAGE 3

3 dLOC Training (7/29/2013) Gainesville, FL Mark Sullivan

PAGE 4

Imaging Theory & Best Practices Bit Depth & Color Space Resolution File Types Image Compression OCR Questions 4 dLOC Training (7/29/2013) Gainesville, FL Mark Sullivan

PAGE 5

Bit Depth & Color Space Bi Greyscales 8 bit ( 256 shades of gray ) 16 bit (65536 shades of gray ) RGB ( usually 24 bit ) CMYK ( usually 32 bit ) 5 dLOC Training (7/29/2013) Gainesville, FL Mark Sullivan

PAGE 6

Bit Depth & Color Space Image: Nevit Dilmen found at Wikimedia commons 6 dLOC Training (7/29/2013) Gainesville, FL Mark Sullivan

PAGE 7

Bit Depth & Color Space Color Fidelity Meaningful color should be retained 7 Bi tonal 8 bit Greyscale 24 bit Color dLOC Training (7/29/2013) Gainesville, FL Mark Sullivan

PAGE 8

Bit (Almost) never scan 1 bit Completely grey items should (usually) be scanned 8 bit greyscale. Items with meaningful color should be scanned 24 bit RGB Trade offs between quality and file size 8 dLOC Training (7/29/2013) Gainesville, FL Mark Sullivan

PAGE 9

Text Optical Character Recognition 9 dLOC Training (7/29/2013) Gainesville, FL Mark Sullivan

PAGE 10

Resolution Resolution of an image expressed in pixels PPI pixels per inch DPI dots per inch 10 dLOC Training (7/29/2013) Gainesville, FL Mark Sullivan

PAGE 11

Resolution : Recommended R ESOLUTION U SE F OR 300 pixels per inch ( ppi ) Printed text with normal sized fonts Oversized documents and maps Manuscripts with legible script 600 pixels per inch ( ppi ) Photographs and select graphic arts Printed text with very small fonts Manuscripts with difficult scripts 11 dLOC Training (7/29/2013) Gainesville, FL Mark Sullivan

PAGE 12

Resolution : Rationale 1 Newspaper graphics printed at 80 dpi Magazine graphics printed at 120 dpi High end graphics printed at 300 dpi Scanning at 300 dpi is sufficient 12 dLOC Training (7/29/2013) Gainesville, FL Mark Sullivan

PAGE 13

Resolution : Rationale 2 Text Optical Character Recognition 13 dLOC Training (7/29/2013) Gainesville, FL Mark Sullivan

PAGE 14

Resolution : Rationale 3 Photographs Use 600 dpi Continuous tone images Unexpected use capture all details 14 dLOC Training (7/29/2013) Gainesville, FL Mark Sullivan

PAGE 15

File Types Save archival masters as TIFF Internet delivery as JPEGs or JPEG2000s 15 dLOC Training (7/29/2013) Gainesville, FL Mark Sullivan

PAGE 16

Image Compression Save archival TIFFs as non compressed Lossy 16 dLOC Training (7/29/2013) Gainesville, FL Mark Sullivan

PAGE 17

OCR Optical Character Recognition Creation of plain text from an image file Just as important is the positional information! Text highlighting Text analysis 17 dLOC Training (7/29/2013) Gainesville, FL Mark Sullivan

PAGE 18

OCR : ALTO XML LOC XML schema / standard Contains position (and style) of each word, with possible variants Can be embedded within a METS file Used by NDNP 18 dLOC Training (7/29/2013) Gainesville, FL Mark Sullivan

PAGE 19

OCR : ALTO XML 19 dLOC Training (7/29/2013) Gainesville, FL Mark Sullivan

PAGE 20

Review of Topics Bit Depth & Color Space Resolution File Types Image Compression OCR 20 dLOC Training (7/29/2013) Gainesville, FL Mark Sullivan

PAGE 21

dLOC Training (7/29/2013) Gainesville, FL Mark Sullivan 21

PAGE 22

22 dLOC Training (7/29/2013) Gainesville, FL Mark Sullivan

PAGE 23

Scanning Equipment Flatbed scanners Sheet feed scanners Book scanners Map scanners Microfilm 23 dLOC Training (7/29/2013) Gainesville, FL Mark Sullivan

PAGE 24

Flatbed Scanners Microtek ScanMaker 9800XL Epson Expression 10000XL 24 dLOC Training (7/29/2013) Gainesville, FL Mark Sullivan

PAGE 25

Sheet feed Scanners Panasonic KV S2046C 25 dLOC Training (7/29/2013) Gainesville, FL Mark Sullivan

PAGE 26

Book Scanners i2S CopiBook ( 24 bit color ) Konica Minolta PS7000 with grayscale up grade 26 dLOC Training (7/29/2013) Gainesville, FL Mark Sullivan

PAGE 27

Oversized Document Scanners Camera back, vacuum table, etc.. Betterlight Super 8K HS 27 dLOC Training (7/29/2013) Gainesville, FL Mark Sullivan

PAGE 28

Microfilm Scanners 28 dLOC Training (7/29/2013) Gainesville, FL Mark Sullivan

PAGE 29

dLOC Training (7/29/2013) Gainesville, FL Mark Sullivan 29


  Home | About dLOC | Collections | Governance | Digitization | Outreach | Contact  
  Powered by SobekCM
Acceptable Use, Copyright, and Disclaimer Statement  
© All rights reserved