Features & Requirements
This page enumerates features and requirements for robust management, discovery, display, and sharing of digitized newspaper content within a DAMS.
PCDM Mapping for Welsh Newspapers (NLW)
Admin (Ingestion, Metadata management, and Reporting)
Feature | Who wants it? | Notes/links/etc. |
---|---|---|
Split PDF into page-level images | BPL, UU, Alberta, Penn State, IUPUI | |
Ingest from CONTENTdm export | Yale, IUPUI | |
Bulk ingest of NDNP-compliant data | UU, Maryland, Penn State | specification doc https://www.loc.gov/ndnp/guidelines/NDNPTechSpecs_Overview.pdf |
Manage content according to alternative orderings/groupings (e.g. microfilm reel, or book for bound sets) | Maryland | This would support cases where issue-level metadata is not (yet) available |
Bulk ingest of "generic" METS/ALTO data *including METS/ALTO with article segmentation | Alberta; Princeton, UU, Cornell, Penn State *Boston College | Supporting tweaking of params to address minor differences |
Bulk ingest of Olive data | Alberta | Anyone? |
Bulk ingest of PDFs | Penn State; Alberta | |
Generate a report for collections like item counts and usage statistics | IUPUI; Alberta | |
Admin dashboard | IUPUI, UU; Alberta | |
Google Analytics Support | UU; Alberta; IUPUI | Or Piwik |
Edit metadata records | UU, BPL, Alberta, IUPUI | Individual record editing; and bulk record updating |
Modify images | | Rotate, regenerate derivatives, etc. |
Edit image sequence | | |
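The "generic" METS/ALTO ingest row above notes that minor differences between data sets need to be tolerated. One such difference is the ALTO namespace version. The following is a minimal sketch, not a proposed implementation, of reading word text and coordinates from an ALTO file regardless of namespace. The function names and the sample XML are hypothetical; the `String` element and its `CONTENT`, `HPOS`, `VPOS`, `WIDTH`, and `HEIGHT` attributes come from the ALTO schema.

```python
import xml.etree.ElementTree as ET


def local_name(tag: str) -> str:
    """Strip any '{namespace}' prefix so files in different ALTO versions parse alike."""
    return tag.rsplit("}", 1)[-1]


def extract_words(alto_xml: str) -> list:
    """Return one dict per ALTO <String> element with its text and bounding box."""
    root = ET.fromstring(alto_xml)
    words = []
    for elem in root.iter():
        if local_name(elem.tag) != "String":
            continue  # skip layout containers, spaces, hyphenation marks
        words.append({
            "text": elem.get("CONTENT", ""),
            "x": float(elem.get("HPOS", 0)),
            "y": float(elem.get("VPOS", 0)),
            "w": float(elem.get("WIDTH", 0)),
            "h": float(elem.get("HEIGHT", 0)),
        })
    return words


# Hypothetical sample; real files carry far more structure and metadata.
SAMPLE = """<alto xmlns="http://www.loc.gov/standards/alto/ns-v3#">
  <Layout><Page><PrintSpace><TextBlock><TextLine>
    <String CONTENT="Daily" HPOS="100" VPOS="50" WIDTH="80" HEIGHT="20"/>
    <SP/>
    <String CONTENT="News" HPOS="190" VPOS="50" WIDTH="70" HEIGHT="20"/>
  </TextLine></TextBlock></PrintSpace></Page></Layout>
</alto>"""

if __name__ == "__main__":
    for word in extract_words(SAMPLE):
        print(word)
```

Matching on the local element name is one way to make the namespace a tweakable detail rather than a hard requirement. The sketch does not handle the ALTO measurement unit, which would also need to be read before coordinates are used for display.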
User
Feature | Who wants it? | Notes/links/etc. |
---|---|---|
Search within issue OCR | BPL, Yale, Princeton, Maryland, Alberta, UU, IUPUI, Cornell, Penn State | |
Highlight search terms on page image | BPL, Yale, Maryland, Alberta; Princeton, UU, IUPUI, Cornell, Penn State | With ability to toggle these on/off |
Highlight article bounds | Maryland, UU, Cornell, Penn State | Requires OCR in ALTO XML format |
Download "clipped" article for newspapers with article segmentation | Boston College | Requires METS/ALTO with article segmentation and following "continuations" across pages |
Download article as PDF | Maryland, UU, Cornell, IUPUI | If issue bounds are available, offer the option to download just the pages containing a particular article |
Download issue as PDF | BPL, Yale, Princeton, Maryland, UU, IUPUI, Cornell | (from Yale) Ideally, the PDF would have a cover page that includes rights information and metadata. |
Download item text | BPL | Would allow user to download plain text for item (article, page, issue). |
If animations are used for page turning, ability to turn them off. | Yale, Maryland, UU | |
Search using multiple translated languages on a single page | Yale, Penn State | In the event that the OCR/coordinates for a page has been translated into multiple languages. |
Advanced search - ability to search just the text captions for images | Yale | with understanding that it is possible to code the content in a way that makes this work (PCDM?) |
Advanced search - ability to search text exclusively for advertisements (or exclude this text from a search including faceting) | Yale, UU | with understanding that it is possible to code the content in a way that makes this work (PCDM?) |
Filter search by article type (article, obituary, advertisement etc.) where provided in the data | Alberta, UU | |
Gallery display mode | UU | |
Advanced search based on regions (city, county, state, country) | Penn State | |
Calendar browse | Penn State, IUPUI, Boston College | http://chroniclingamerica.loc.gov/lccn/sn83045433/issues/ |
Download visualized snippet | Penn State | |
Zoom & Pan with ability to serve image files and PDF files | Penn State | |
Front page views | Penn State | http://chroniclingamerica.loc.gov/lccn/sn83045433/issues/first_pages/ (strangely used a lot in user stats) |
OCR correction functions | Penn State, Boston College, IUPUI, UU | |
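Highlighting search terms on the page image depends on word-level coordinates from the OCR. As a rough sketch only, the function below picks the word boxes that match a query and scales them to the size of the displayed image. The `WordBox` type, the function name, and the assumption that coordinates are in pixels of the source image are all illustrative, not part of any agreed design.

```python
from dataclasses import dataclass


@dataclass
class WordBox:
    """One OCR word with its bounding box in source-image pixels (assumed unit)."""
    text: str
    x: float
    y: float
    w: float
    h: float


def highlight_boxes(words, query, scale=1.0):
    """Return scaled (x, y, w, h) rectangles for words matching any query term."""
    terms = {term.casefold() for term in query.split()}
    boxes = []
    for word in words:
        # Ignore surrounding punctuation so "News," still matches "news".
        normalized = word.text.strip(".,;:!?\"'()").casefold()
        if normalized in terms:
            boxes.append((word.x * scale, word.y * scale,
                          word.w * scale, word.h * scale))
    return boxes


if __name__ == "__main__":
    page_words = [
        WordBox("Daily", 100, 50, 80, 20),
        WordBox("News,", 190, 50, 70, 20),
    ]
    # Displayed image is half the size of the scanned image.
    print(highlight_boxes(page_words, "news", scale=0.5))
```

Because the rectangles are computed separately from the image, a viewer could draw them as an overlay layer and show or hide that layer to satisfy the on/off toggle noted in the table.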
API
Feature | Who wants it? | Notes/links/etc. |
---|---|---|
IIIF Presentation API for issue-level objects | BPL, Yale, Princeton, Alberta, Cornell, Penn State | |
IIIF manifests for newspapers should function in off-the-shelf IIIF viewers (Mirador etc.) without modification or enhancement. | Alberta, UU | |
Ability to integrate with existing IIIF services (i.e. make support for IIIF and other external services pluggable) | Maryland, Alberta, Penn State, IUPUI | For example, if an institution has a Loris server and a manifest generation app already, this app should support integration of those services |
Expose PURLs for issues and pages | Maryland, Alberta, IUPUI | |
Expose PURLs and citation metadata (human and machine readable) for articles where article-level metadata is available | Alberta | |
Support redirects from legacy URLs for issues, pages, articles | Alberta, IUPUI | |
API to get just the text | Penn State, UU, Princeton | http://chroniclingamerica.loc.gov/about/api/#bulk-data |
Open Annotations for OCR text, ideally linked in IIIF Presentation API | Penn State | |
Open Search | Penn State | http://chroniclingamerica.loc.gov/about/api/#search |
Linked Data options | Penn State | http://chroniclingamerica.loc.gov/about/api/#linked-data |
CORS and JSONP support | Penn State | http://chroniclingamerica.loc.gov/about/api/#cors_jsonp |
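To make the issue-level IIIF requirement more concrete, here is a minimal sketch of a manifest for one issue, with one canvas per page, following the IIIF Presentation API 3.0 structure. All URLs, the function name, and the shape of the `pages` input are hypothetical. The sketch also assumes an image server that speaks IIIF Image API 3.0 at level 1; an Image API 2.x server would be declared differently in the `service` entry.

```python
import json


def build_issue_manifest(manifest_id, label, pages):
    """Build a Presentation 3.0 manifest dict.

    pages: list of dicts with 'image_service' (base URL of the page image on
    an IIIF image server), 'width', and 'height' keys (hypothetical input shape).
    """
    canvases = []
    for index, page in enumerate(pages, start=1):
        canvas_id = f"{manifest_id}/canvas/p{index}"
        canvases.append({
            "id": canvas_id,
            "type": "Canvas",
            "label": {"en": [f"Page {index}"]},
            "width": page["width"],
            "height": page["height"],
            "items": [{
                "id": f"{canvas_id}/page",
                "type": "AnnotationPage",
                "items": [{
                    "id": f"{canvas_id}/page/image",
                    "type": "Annotation",
                    "motivation": "painting",  # the image is the canvas content
                    "target": canvas_id,
                    "body": {
                        "id": f"{page['image_service']}/full/max/0/default.jpg",
                        "type": "Image",
                        "format": "image/jpeg",
                        "width": page["width"],
                        "height": page["height"],
                        # Assumed: Image API 3.0, compliance level 1.
                        "service": [{
                            "id": page["image_service"],
                            "type": "ImageService3",
                            "profile": "level1",
                        }],
                    },
                }],
            }],
        })
    return {
        "@context": "http://iiif.io/api/presentation/3/context.json",
        "id": manifest_id,
        "type": "Manifest",
        "label": {"en": [label]},
        "items": canvases,
    }


if __name__ == "__main__":
    manifest = build_issue_manifest(
        "https://example.org/iiif/issue-1/manifest",  # hypothetical URL
        "Example Gazette, issue 1",
        [{"image_service": "https://example.org/iiif/images/issue-1-p1",
          "width": 4000, "height": 6000}],
    )
    print(json.dumps(manifest, indent=2))
```

Keeping the image service URL as an input, rather than deriving it inside the function, is one way to leave the image server pluggable, as the integration row above asks.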
Non-Functional
Feature | Who wants it? | Notes/links/etc. |
---|---|---|
Page turning is high performing | Yale, Cornell, Boston College | Less than 1 second to navigate page to page |
Every web page should be bookmarkable | Yale, Cornell, Penn State | Pages should not depend on session variables to construct their contents. A user should be able to email a link and have the recipient see the same page as the sender. |
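The bookmarking requirement amounts to keeping all view state in the URL rather than in the session. The sketch below shows one possible way to do that; the parameter names (`issue`, `page`, `q`, `highlight`), the function names, and the example URL are assumptions made for illustration.

```python
from urllib.parse import parse_qs, urlencode, urlsplit


def build_view_url(base_url, issue_id, page, query="", highlight=True):
    """Encode everything needed to rebuild the page view in the query string."""
    params = {"issue": issue_id, "page": page}
    if query:
        params["q"] = query
        params["highlight"] = "1" if highlight else "0"
    return f"{base_url}?{urlencode(params)}"


def parse_view_url(url):
    """Recover the view state from a URL alone, with no session lookup."""
    params = parse_qs(urlsplit(url).query)
    return {
        "issue": params["issue"][0],
        "page": int(params["page"][0]),
        "query": params.get("q", [""])[0],
        "highlight": params.get("highlight", ["1"])[0] == "1",
    }


if __name__ == "__main__":
    url = build_view_url("https://example.org/view", "example-issue-1", 3,
                         query="city council")
    print(url)
    print(parse_view_url(url))
```

Since the receiver's server rebuilds the view from the link alone, the sender and the receiver see the same page, search terms, and highlight setting.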