...
Moderator: Eben English (Boston Public Library)
Notetaker: TBD Eben English
Attendees:
- your name here
Agenda
- Eben English (BPL)
- Brian McBride (Utah)
- Sean Upton (Utah)
- Kalee Sprague (Yale)
- Randall Floyd (Indiana)
Agenda
- Ongoing work
- IUPUI / Indiana
- Andy Smith at IUPUI has done quite a bit of work on newspaper ingest
- App based on CurationConcerns and Princeton's Plum?
- Migrating content from CONTENTdm
- Have YAML format for describing structural metadata used for ingest
- Crosswalking between METS structural metadata and YAML format
- Yale
- Has CONTENTdm stuff, wants to migrate
- Article-level segmentation
- Can provide samples
- Eben will send Kalee a link to Google Drive folder where BPL-Utah grant project is collecting samples
- IUPUI / Indiana
- PCDM model review: https://docs.google.com/document/d/1T_gKqkKoik7h9WweYB46S9NrwAXmJTB0g3hCrqH3Q6Q/edit?usp=sharing
- Grant documentation review:
- NewspaperWorks Design Overview: https://docs.google.com/document/d/1X6OLz9OfoyMyUBsCuLUMROe9EBqF6upDqPhPKFQxLAY/edit?usp=sharing
- NewspaperViews Design Overview: https://drive.google.com/open?id=1LorDyCVB9UW6exfA1y5fG2bQb6ASXLHa03nNgQYrtZA
- Ingest Scenarios:
PDFs - Ingest PDF which is a single issue (of single title) Ingest batch of PDFs which are issues (of single title)
- Page level Article level
- Image files are pages Image files are articles
- Ingest n master files (page image(s)) as single issue (of single title)
- Ingest batch of master files (page images) which are multiple issues (of single title)
- Ingest n master files (article images) which are a single article (of a single issue of a single title) Ingest batch of master files (article images) which are articles (of a single issue of a single title???)
- Proxy ordering for articles in issues
- Still unsure if this is needed.
- Files for articles
- Articles may have binary files representing the page image segment
- Generic vs. specific
- Question was raised about whether this model is too specific too newspaper content, might be daunting to some implementers
- Could potentially pursue a more generic 'paged media' model
- Do newspapers have any intrinsic features that are not shared with other common paged media objects such as books or magazines?
- Could call it 'Periodical' rather than 'Newspaper'?
- More thought needed.
- ArcLight may have some modeling that could be useful?
- Performance could be an issue
- Indiana ran into performance problems when objects had more than 200 pages or so.
- Valkyrie provides better performance.
NDNP data
Arbitrary files
Controlled genre list for articles
Other agenda items