Samvera Community Wiki
August 14, 2020
Samvera Partners Call
Friday, August 14th, 2020
11:30 am | Eastern Daylight Time (New York, GMT-05:00) | 1 hr
Meeting number: 737 192 431
Meeting password: h9cwrT45
Join from a video conferencing system or application
Join by phone
+1-617-627-6767 US Toll
Access code: 737 192 431
Code of Conduct
We want Samvera Community to be a fun, informative, engaging event for all our partners and participants. We've got a few strategies to help make this happen:
If you have any questions or concerns, please feel free to reach out to community helpers.
Date
Aug 14, 2020
Facilitator: @Richard Green
Note Taker: @Esmé Cowles
Attendees
@Richard Green
@Esmé Cowles
@Karen Cariani
@Mike Korcynski
@Nabeela Jaffer
@Margaret Mellinger
@Rosalyn Metz
@Brian McBride
@Daniel Kerchner
@Jon Dunn
@Harriett Green
@Stuart Kenny
@Rick Johnson
@John Weise
@Robin Lindley Ruggaber
@Maria Whitaker
Regrets: @Simeon Warner
Presentation:
Agenda and notes
Notetaker for next call (September 11th)
Nabeela volunteered
Any other items for the agenda?
No other items added, but feel free to add things as we go
Presentation from George Washington University on their ISIS files repository - Dan Kerchner, GWU
Background: experience with Samvera
Built GW ScholarSpace IR on Sufia 6/7 in 2015, rebuilt on Hyrax in 2017
Built a batch loader, now have 10K+ works (Hyrax 2.8.0)
ISIS files
ISIS was the governing entity in the area it controlled, so it carried out basic government functions
NY Times reporter embedded with Iraqi army that liberated Mosul, discovered huge trove of documents spanning the whole range of government activities, and got permission to digitize, curate, translate, and make them public
NY Times reached out to Program on Extremism at GWU, got grant from Mellon Foundation to support this work
The project involved staff in many roles, including project management, metadata, translation, etc. and some development time.
Part of the goal was figuring out how to work with such sensitive documents in a repository in a responsible way
Repository content and workflow
15K pages, ~1500 files, initial batch was 62 documents
Ingesting both original documents, plus analysts' reports
Workflow: digitize, sort/review, translate, describe, redact, convert to accessible PDFs, then ingest
Accessibility is key both for the website and for the files that are ingested
Technology evaluation
Evaluated Hyrax, Drupal, WordPress, and Mukurtu
Chose Hyrax because: out-of-the-box feature set, accessibility, design, preservation, security
Other factors: existing (positive) skills/experience, pretty low effort to get the features they needed, community support
Had to decide between Hyrax 2.x or 3.0-rc (hoped that 3.0 would be released by now, but it hasn't been a problem)
Features and customizations built for the app
Click-through consent to cookies, becoming very common with GDPR
Customized menus, building static pages with TinyMCE
Multi-lingual/multi-script metadata: Arabic original, Roman-script transliteration, and translation to English
Original documents and their English translations appear side by side
PDF handoff to browser instead of download or embedded viewer; thought that was a better experience
Hid technical metadata, other display simplifications
Added Captcha for login/signup/comment forms; still some spam, but it helps a lot
Release event in June with speakers
Considerable amount of press coverage, global interest
Did not notice any malicious attacks
Deployment
AWS instead of GWU infrastructure, helpful to isolate from local apps, and also allows auto-scaling
Full security review, including some remediations (not much for Hyrax other than upgrading jQuery), making source code private
What's next?
More content
More features: IIIF, etc.
Metadata values i18n
Adding UI languages like Arabic
Technical debt, and contributing local features back where it makes sense
Some modeling changes to differentiate original vs. translations
Things that make ISIS Files different
Sensitive documents with unusual provenance
High profile content
Critical success factors
Community support
Hyrax is a solid platform to build from: "Hyrax was the easy part"
Status check on Community Manager appointment
Position posted several weeks ago through Emory, have had many applicants
Have done preliminary phone screenings, and narrowed down to a shortlist to interview
Interviews will happen in the next couple of weeks
Status check on Code of Conduct work
Work is going ahead, but it is not expected to be complete by Connect because of scheduling
Connect 2020 Calls for proposals
Update: Call for workshops closes this week (may extend, but please put in proposals), call for presentations closes at the end of the month
Sanity check on proposed mechanism for posters
Have an approach we think is workable: ask for poster submissions, accompanied by a short video or notes
Presenter also signs up for 30-minute block to have a chat session with any interested parties (scheduled on sched.com)
Anything to raise with the Steering Group (standing item)
No issues raised
Date of next call
Friday 11th September, usual time slot