This page lists features and requirements for robust management, discovery, display, and sharing of digitized newspaper content within a DAMS.


PCDM Mapping for Welsh Newspapers (NLW)

Admin (Ingestion, Metadata management, and Reporting)

Feature | Who wants it? | Notes/links/etc.
Split PDF into page-level images | BPL, UU, Alberta, Penn State, IUPUI |
Ingest from CONTENTdm export | Yale, IUPUI |
Bulk ingest of NDNP-compliant data | UU, Maryland, Penn State | https://www.loc.gov/ndnp/ ; specification doc: https://www.loc.gov/ndnp/guidelines/NDNPTechSpecs_Overview.pdf
Manage content according to alternative orderings/groupings (e.g. microfilm reel, or book for bound sets) | Maryland | This would support cases where issue-level metadata is not (yet) available
Bulk ingest of "generic" METS/ALTO data (*including METS/ALTO with article segmentation) | Alberta, Princeton, UU, Cornell, Penn State (*Boston College) | Supporting tweaking of params to address minor differences
Bulk ingest of Olive data | Alberta | Anyone?
Bulk ingest of PDFs | Penn State, Alberta |
Generate a report for collections like item counts and usage statistics | IUPUI, Alberta |
Admin dashboard | IUPUI, UU, Alberta |
Google Analytics Support | UU, Alberta, IUPUI | Or Piwik
Metadata Management Tools / Edit metadata records | UU, BPL, Alberta, IUPUI (bulk record updating) | Individual record editing and bulk record updating; modify binary stream object (rotate images, etc.)
Modify images | | Rotate, regenerate derivatives, etc.
Edit image sequence | | Remove images; reorder images; insert images

User

Feature | Who wants it? | Notes/links/etc.
Search within issue OCR | BPL, Yale, Princeton, Maryland, Alberta, UU, IUPUI, Cornell, Penn State |
Highlight search terms on page image | BPL, Yale, Maryland, Alberta, Princeton, UU, IUPUI, Cornell, Penn State | With ability to toggle these on/off
Highlight article bounds | Maryland, UU, Cornell, Penn State | Requires OCR in ALTO XML format
Download "clipped" article for newspapers with article segmentation | Boston College | Requires METS/ALTO with article segmentation and following "continuations" across pages
Download article as PDF | Maryland, UU, Cornell, IUPUI | If issue bounds are available, offer the option to download just the pages containing a particular article
Download issue as PDF | BPL, Yale, Princeton, Maryland, UU, IUPUI, Cornell | (from Yale) Ideally the PDF would have a cover page that includes rights information and metadata.
Download item text | BPL | Would allow the user to download plain text for an item (article, page, issue).
If animations are used for page turning, ability to turn them off | Yale, Maryland, UU |
Search using multiple translated languages on a single page | Yale, Penn State | In the event that the OCR/coordinates for a page have been translated into multiple languages.
Advanced search - ability to search just the text captions for images | Yale | With the understanding that it is possible to code the content in a way that makes this work (PCDM?)
Advanced search - ability to search text exclusively for advertisements (or exclude this text from a search including faceting) | Yale, UU | With the understanding that it is possible to code the content in a way that makes this work (PCDM?)
Filter search by article type (article, obituary, advertisement, etc.) where provided in the data | Alberta, UU |
Gallery display mode | UU |
Advanced search based on regions (city, county, state, country) | Penn State |
Calendar browse | Penn State, IUPUI, Boston College | http://chroniclingamerica.loc.gov/lccn/sn83045433/issues/
Download visualized snippet | Penn State |
Zoom & Pan with ability to serve image files and PDF files | Penn State |
Front page views | Penn State | http://chroniclingamerica.loc.gov/lccn/sn83045433/issues/first_pages/ (strangely used a lot in user stats)
OCR correction functions | Penn State, Boston College, IUPUI, UU |

...
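
To illustrate why the highlighting features above lean on ALTO-style OCR: each recognized word carries its own coordinates, so a search hit can be turned into a rectangle drawn over the page image. The following is only a sketch. It assumes word-level String elements whose HPOS/VPOS/WIDTH/HEIGHT attributes are expressed in pixels (real ALTO files may use other measurement units), and the sample XML and the function name find_term_boxes are invented for illustration.

```python
import xml.etree.ElementTree as ET

# Tiny hand-written ALTO-like sample (hypothetical data, not from a real newspaper).
SAMPLE_ALTO = """<alto xmlns="http://www.loc.gov/standards/alto/ns-v2#">
  <Layout>
    <Page ID="P1" WIDTH="2000" HEIGHT="3000">
      <PrintSpace>
        <TextBlock ID="TB1">
          <TextLine>
            <String CONTENT="Flood" HPOS="100" VPOS="200" WIDTH="150" HEIGHT="40"/>
            <String CONTENT="damages" HPOS="260" VPOS="200" WIDTH="210" HEIGHT="40"/>
            <String CONTENT="bridge" HPOS="480" VPOS="200" WIDTH="170" HEIGHT="40"/>
          </TextLine>
        </TextBlock>
      </PrintSpace>
    </Page>
  </Layout>
</alto>"""


def find_term_boxes(alto_xml, term):
    """Return (x, y, width, height) for every OCR word equal to `term`, ignoring case."""
    root = ET.fromstring(alto_xml)
    boxes = []
    for element in root.iter():
        # Compare local names only, so any ALTO namespace version is accepted.
        if element.tag.rsplit("}", 1)[-1] != "String":
            continue
        if element.get("CONTENT", "").lower() == term.lower():
            boxes.append(tuple(
                int(float(element.get(name)))
                for name in ("HPOS", "VPOS", "WIDTH", "HEIGHT")
            ))
    return boxes


if __name__ == "__main__":
    # Each box could be handed to a viewer as a toggleable highlight overlay.
    print(find_term_boxes(SAMPLE_ALTO, "Bridge"))
```

Keeping the boxes as plain data (rather than burning them into the image) is what would let a viewer toggle highlights on and off.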

Feature | Who wants it? | Notes/links/etc.
IIIF Presentation API for issue-level objects | BPL, Yale, Princeton, Alberta, Cornell, Penn State |
IIIF manifests for newspapers should function in off-the-shelf IIIF viewers (Mirador etc.) without modification or enhancement | Alberta, UU |
Ability to integrate with existing IIIF services (i.e. make support for IIIF and other external services pluggable) | Maryland, Alberta, Penn State, IUPUI | For example, if an institution has a Loris server and a manifest generation app already, this app should support integration of those services
Expose PURLs for issues and pages | Maryland, Alberta, IUPUI |
Expose PURLs and citation metadata (human and machine readable) for articles where article-level metadata is available | Alberta |
Support redirects from legacy URLs for issues, pages, articles | Alberta, IUPUI |
API to get just the text | Penn State, UU, Princeton | http://chroniclingamerica.loc.gov/about/api/#bulk-data
Open Annotations for OCR text, ideally linked in IIIF Presentation API | Penn State |
Open Search | Penn State | http://chroniclingamerica.loc.gov/about/api/#search
Linked Data options | Penn State | http://chroniclingamerica.loc.gov/about/api/#linked-data
CORS and JSONP support | Penn State | http://chroniclingamerica.loc.gov/about/api/#cors_jsonp

...
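
As a rough illustration of the "IIIF Presentation API for issue-level objects" item, an issue manifest could be assembled along the following lines. This is a sketch under several assumptions: Presentation API 2.x, one canvas per page, and an existing IIIF Image API service (for example Loris) for each page image. The example.org URLs, the identifiers, and the function name build_issue_manifest are hypothetical.

```python
import json


def build_issue_manifest(base_url, issue_id, label, pages):
    """Build a IIIF Presentation 2.x manifest (as a dict) for one newspaper issue.

    `pages` is a list of dicts with keys: image_service, width, height.
    """
    canvases = []
    for number, page in enumerate(pages, start=1):
        canvas_id = f"{base_url}/{issue_id}/canvas/p{number}"
        canvases.append({
            "@id": canvas_id,
            "@type": "sc:Canvas",
            "label": f"Page {number}",
            "width": page["width"],
            "height": page["height"],
            "images": [{
                "@type": "oa:Annotation",
                "motivation": "sc:painting",
                "on": canvas_id,  # the annotation paints the image onto this canvas
                "resource": {
                    "@id": f"{page['image_service']}/full/full/0/default.jpg",
                    "@type": "dctypes:Image",
                    "format": "image/jpeg",
                    "width": page["width"],
                    "height": page["height"],
                    "service": {
                        "@context": "http://iiif.io/api/image/2/context.json",
                        "@id": page["image_service"],
                        "profile": "http://iiif.io/api/image/2/level1.json",
                    },
                },
            }],
        })
    return {
        "@context": "http://iiif.io/api/presentation/2/context.json",
        "@id": f"{base_url}/{issue_id}/manifest",
        "@type": "sc:Manifest",
        "label": label,
        "sequences": [{"@type": "sc:Sequence", "canvases": canvases}],
    }


if __name__ == "__main__":
    # Hypothetical two-page issue served from a hypothetical image server.
    example_pages = [
        {"image_service": "https://images.example.org/iiif/issue1-p1", "width": 2000, "height": 3000},
        {"image_service": "https://images.example.org/iiif/issue1-p2", "width": 2000, "height": 3000},
    ]
    manifest = build_issue_manifest(
        "https://dams.example.org/iiif", "issue1", "Example Gazette, 1 Jan 1900", example_pages
    )
    print(json.dumps(manifest, indent=2))
```

Because the image service URL is passed in rather than generated, the same function could sit in front of whatever image server an institution already runs, which is the pluggability asked for in the table above.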