File Use Vocabulary
The File Use Vocabulary is a list of subclasses of pcdm:File that describe the role a pcdm:File plays within a pcdm:Object. These classes, combined with technical metadata (mime type, image resolution, etc.), should be used to determine which file to use in a given context.
File Use Values
Original File: the original file uploaded by the user
Thumbnail Image: low resolution placeholder image
Extracted Text: text extracted from documents/OCR
Preservation Master File: best quality file in a format appropriate for long-term preservation
Intermediate File: high quality representation of the Object, appropriate for generating derivatives or other additional processing
Service File: a format generated for serving to users, such as the PDF generated from a Word/LaTeX source file, MP3 generated from a WAV file, JPEG generated from a TIFF, etc.
Transcript: text representation that can be a substitute or complement for accessibility purposes, such as a transcript, subtitles, or closed captions
Examples
Object Type | Original File | Preservation Master File | Thumbnail Image | Extracted Text | Transcript | Service File |
|---|---|---|---|---|---|---|
Audio | Logic source file | WAV | JPEG of album cover, promotional poster, etc. |
| transcript | MP3 |
Document | Word Doc | JPEG of first page | text dump |
| ||
Image | Photoshop source file, uncropped/uncorrected TIFF | TIFF image | low-res JPEG | OCR text |
| med-res JPEG |
Video | Premiere source file | full-quality MOV | JPEG of title frame | text-to-speech output | subtitles | 720p MPEG4 |
Using Multiple Types
Multiple types may be appropriate for a single file, such as an image originally created as a medium-resolution JPEG. In that case, you can assign both Original File (because it's the original creation format) and Service File (because it's appropriate for serving to end users).