UF Recommended Data and File Formats for Electronic Theses and Dissertations (2012 and 2007 )

MISSING IMAGE

Material Information

Title:
UF Recommended Data and File Formats for Electronic Theses and Dissertations (2012 and 2007 )
Abbreviated Title:
University of Florida Recommended Data and File Formats for Electronic Theses and Dissertations (2012 and 2007 )
Physical Description:
Documentation
Language:
English
Creator:
Varied
Publisher:
George A. Smathers Libraries, University of Florida
Place of Publication:
Gainesville, FL
Publication Date:

Subjects

Subjects / Keywords:
Curator tools
Documentation
Competencies
SobekCM
Florida Digital Archive ( FDA )

Notes

Abstract:
At UF, each digital collection is under the purview of one or more Curators or Program Directors, with collaborative support from the Head of Digital Development & Web Services and/or Digital Humanities Librarian. For scholarly digital projects and archives, teaching faculty partner and collaborate with Curators/Program Directors following the processes for their collections. This documentation provides the 2012 and 2007 recommended formats for electronic theses and dissertations ( ETD or ETDs ) for inclusion in the Institutional Repository @ UF ( IR@UF ) and for long term digital preservation at UF and with the Florida Digital Archive ( FDA ), migration of file formats as needed of preservation, and optimal performance support for permanent access and preservation.
General Note:
Additional related materials are available from the ETD Committee website: http://www.library.ufl.edu/committees/etd/

Record Information

Source Institution:
University of Florida
Holding Location:
University of Florida
Rights Management:

The author dedicated the work to the Commons by waiving all of his or her rights to the work worldwide under copyright law and all related or neighboring legal rights he or she had in the work, to the extent allowable by law.
System ID:
AA00017119:00011


This item is only available as the following downloads:


Full Text

PAGE 1

Recommended Data Formats for Preservation Purposes in the Florida Digital Archive This table is intended to help Florida university administrators develop guidelines for preparing and submitting files to the Florida Digital Archive. It is not intended to suggest that these file formats are allowable formats for ETDs or any other official publication of any Florida university. (See the notes following this table.) Source: FCLA, Mar. 2012. Media High Confidence Level Medium Confidence Level Low Confidence Level Text Plain text (encoding: USASCII, UTF-8, UTF-16 with BOM) XML (includes XSD/XSL/ XHTML, etc.; with included or accessible schema and character encoding explicitly specified) PDF/A-1 (ISO 19005-1) (*.pdf) PDF/A-2 (ISO 19005-2:2011) (*.pdf) Cascading Style Sheets (*.css) DTD (*.dtd) Plain text (ISO 8859-1 encoding) PDF (*.pdf) (embedded fonts) Rich Text Format 1.x (*.rtf) HTML (include a DOCTYPE declaration) SGML (*.sgml) Open Office (*.sxw/*.odt) OOXML (ISO/IEC DIS 29500) (*.docx) EPUB (unencrypted) (*.epub) PDF (*.pdf) (encrypted) Microsoft Word (*.doc) WordPerfect (*.wpd) DVI (*.dvi) All other text formats not listed here Raster Image TIFF (uncompressed) JPEG2000 (lossless) (*.jp2) PNG (*.png) BMP (*.bmp) JPEG/JFIF (*.jpg) JPEG2000 (lossy) (*.jp2) TIFF (compressed) GIF (*.gif) Digital Negative DNG (*.dng) Google WebP (*.webp) MrSID (*.sid) TIFF (in Planar format) FlashPix (*.fpx) PhotoShop (*.psd) RAW JPEG 2000 Part 2 (*.jpf, *.jpx) All other raster image formats not listed here Vector Graphics SVG (no Java script binding) (*.svg) Computer Graphic Metafile (CGM, WebCGM) (*.cgm) Encapsulated Postscript (EPS) Macromedia Flash (*.swf) All other vector image formats not listed here Audio AIFF (PCM) (*.aif, *.aiff) WAV (PCM) (*.wav) AES3 (LPCM) (*.aes) SUN Audio (uncompressed) (*.au) Standard MIDI (*.mid, *.midi) Ogg Vorbis (*.ogg) Free Lossless Audio Codec (*.flac) Advance Audio Coding (*.mp4, *.m4a, *.aac) MP3 (MPEG-1/2, Layer 3) (*.mp3) AIFC (compressed) (*.aifc) NeXT SND (*.snd) RealNetworks 'Real Audio' (*.ra, *.rm, *.ram) Windows Media Audio (*.wma) Protected AAC (*.m4p) WAV (compressed) (*.wav) All other audio formats not listed here

PAGE 2

Media High Confidence Level Medium Confidence Level Low Confidence Level Video Motion JPEG 2000 (ISO/IEC 15444-4)*.mj2) AVI (uncompressed, motion JPEG) (*.avi) QuickTime Movie (uncompressed, motion JPEG) (*.mov) Ogg Theora (*.ogg) MPEG-1, MPEG-2 (*.mpg, *.mpeg, wrapped in AVI, MOV) MPEG-4 (H.263, H.264) (*.mp4, wrapped in AVI, MOV) AVI (others) (*.avi) QuickTime Movie (others) (*.mov) RealNetworks 'Real Video' (*.rv) Windows Media Video (*.wmv) Flash Video (*.flv) All other video formats not listed here Spreadsheet/ Database Comma Separated Values (*.csv) Delimited Text (*.txt) SQL DDL DBF (*.dbf) OpenOffice (*.sxc/*.ods) OOXML (ISO/IEC 29500) (*.xlsx) Excel (*.xls) All other spreadsheet/ database formats not listed here Virtual Reality X3D (*.x3d) VRML (*.wrl, *.vrml) U3D (Universal 3D file format) All other virtual reality formats not listed here Computer Programs Computer program source code (*.c, *.c++, *.java, *.js, *.jsp, *.php, *.pl, etc.) Compiled / Executable files (EXE, *.class, COM, DLL, BIN, DRV, OVL, SYS, PIF) Presentation OpenOffice (*.sxi/*.odp) OOXML (ISO/IEC DIS 29500) (*.pptx) PowerPoint (*.ppt) All other presentation formats not listed here Notes : We do not intend to provide full preservation support for formats listed under Low Confidence Level. UMI ProQuest lists file formats that they will accept for theses and dissertations on their website (http://www.proquest.com/assets/downloads/products/ UMI_PreparingYourManuscriptGuide.pdf ). Some of the formats UMI ProQuest accepts are listed in the Medium or Low confidence level in the above table. Because UMI ProQuest currently does not provide full preservation support, this should not represent a conflict for any universities in deciding on acceptable ETD file formats. No files with viruses will be accepted (refers especially to these file formats: DOC, XLS, MDB, PPT, ZIP, EXE). Please scan your files for viruses with up-to-date virus scanners before transmitting your files to the Florida Digital Archive. We do not intend to provide full preservation support for any files fully or partially encrypted. We do not intend to provide full preservation support for any files fully or partially passwordprotected. We do not intend to provide full preservation support for any files fully or partially compressed with a proprietary compression algorithm.

PAGE 3

We do not intend to provide full preservation support for any files fully or partially encoded with unembedded proprietary fonts. We do not intend to provide full preservation support for any files produced with Digital Right Management controls. It is permissible to archive files in the unencrypted ZIP (*.zip) format for transmission to the Florida Digital Archive. Upon the receipt at FCLA, those ZIP files will be restored to their original formats and archived accordingly. As a general rule, use platform-independent, vendor-independent, non-proprietary, stable, open and well-supported formats.

PAGE 4

University of Florida acceptable ETD formats The Acceptable ETD formats below are defined based on those media for which a preservation stra tegy exists. The aim is to make all ETDs ava ilable into the future as the technology changes and software becomes obsolete. Media formats that are not listed in this table are ones that cannot presently be preserved at an acceptable le vel. Many of these can easily be converted to one of the acceptable formats. If you create a document using Word or LaTex it should be exported as a PDF. Please co nsult with the CIRCA ETD unit ( etd@grove.ufl.edu ) for assistance or if you have any questions. The Acceptable ETD Formats are reviewed and updated regularly by the Graduate School, the Library, the ETD training staff in Academic Technology, and the Florida Center for Library Automation. All files should be scanned with up-to-date virus software before submission. Files with viruses will not be ac cepted. Media Acceptable Formats Text PDF or PDF/A Plain text (*.txt US-ASCII, Latin-1 or Unicode) Cascading style sheets (*.css) XML, XHTML, XSD, XSL (with included schema and character encoding explicitly spec ified) HTML SGML DTD Computer program source code (*.c, *. c++, *.java, *.js, *jsp, *.jhp, *.php, *.pl, etc.) Image TIFF (bitonal Group4 compression; grayscale uncompressed; RGB uncompressed) PNG JPEG/JFIF BMP JPEG2000 (jp2) without embedded metadata, color profiles, or ROI. Save metadata, profiles an d ROI as te xt files together with JP2 images Vector Graphics SVG CGM Web CGM Audio AIFF (uncompressed) (*aif, *.aiff) WAV (PCM only) (*.wav) MIDI Ogg Vorbis (OGG) (uncompressed) FLAC Video MPEG-2 MPEG-4 AVI (MJPEG) MOV (MJEPG-A with uncompressed header) Spreadsheet/Data base CSV (Comma Separated Variable) Text Tab Delimited Test SQL DBF (*.dbf) OpenOffice (*sxc) Computer programs See Text formats Virtual Reality X3D VRML Presentation OpenOffice (*.sxi) Last revised March 8th, 2007