August 14, 2020
Samvera Partners Call |
Friday, August 14th, 2020 |
11:30 am | Eastern Daylight Time (New York, GMT-05:00) | 1 hr |
Meeting number: | 737 192 431 |
Meeting password: | h9cwrT45 |
Join from a video conferencing system or application |
Dial 738605799@tufts.webex.com |
Join by phone |
+1-617-627-6767 US Toll |
Access code: 737 192 431 |
Code of Conduct
We want Samvera Community to be a fun, informative, engaging event for all our partners and participants. We've got a few strategies to help make this happen:
We encourage everyone to apply the Samvera community principles of openness, inquiry, and respect in their interactions at the event.
We have officially adopted an Anti-Harassment Policy.
If you have any questions or concerns, please feel free to reach out to community helpers .
Date
Facilitator: Richard Green
Note Taker: Esmé Cowles
Attendees
Regrets: Simeon Warner
Presentation:
Agenda and notes
- Notetaker for next call (September 11th)
- Nabeela volunteered
- Any other items for the agenda?
- No other items added, but feel free to add things as we go
- Presentation from George Washington University on their ISIS files repository - Dan Kerchner, GWU
- Background: experience with Samvera
- Build GW Scholarsapce IR on Sufia 6/7 in 2015, rebuilt on Hyrax in 2017
- Built a batch loader, now have 10K+ works (Hyrax 2.8.0)
- ISIS files
- ISIS was the governing entity in the area they controlled, so basic government functions
- NY Times reporter embedded with Iraqi army that liberated Mosul, discovered huge trove of documents spanning the whole range of government activities, and got permission to digitize, curate, translate, and make them public
- NY Times reached out to Program on Extremism at GWU, got grant from Mellon Foundation to support this work
- The project involved staff in many roles, including project management, metadata, translation, etc. and some development time.
- Part of the goal was figuring out how to work with such sensitive documents in a repository in a responsible way
- Repository content and workflow
- 15K pages, ~1500 files, initial batch was 62 documents
- Ingesting both original documents, plus analysts' reports
- Workflow: digitize, sort/review, translate, describe, redact, convert to accessible PDFs, then ingest
- Accessibility is key both for the website and for the files that are ingested
- 15K pages, ~1500 files, initial batch was 62 documents
- Technology evaluation
- Evaluated Hyrax, Drupal, Wordpress, and Murkutu
- Chose Hyrax because: out-of-the-box feature set, accessibility, design, preservation, security
- Other factors: existing (positive) skills/experience, pretty low effort to get the features they needed, community support
- Had to decide between Hyrax 2.x or 3.0-rc (hoped that 3.0 would be released by now, but it hasn't been a problem)
- Features and customizations built for the app
- Click-through consent to cookies, becoming very common with GDPR
- Customized menus, building static pages with TinyMCE
- Multi-lingual/multi-script metadata: Arabic original, Roman-script transliteration, and translation to English
- Original documents and translated into English appear side-by-side
- PDF handoff to browser instead of download or embedded viewer, though that was a better experience
- Hid technical metadata, other display simplifications
- Added Captcha for login/signup/comment forms — still some spam, but helps a lot
- Release event in June with speakers
- Considerable amount of press coverage, global interest
- Did not notice any malicious attacks
- Deployment
- AWS instead of GWU infrastructure, helpful to isolate from local apps, and also allows auto-scaling
- Full security review, including some remediations (not much for Hyrax other than upgrading jQuery), making source code private
- What's next?
- More content
- More features: IIIF, etc.
- Metadata values i18n
- Adding UI languages like Arabic
- Technical debt, and contributing local features back where it makes sense
- Some modeling changes to differentiate original vs. translations
- Things that make ISIS Files different
- Sensitive documents with unusual provenance
- High profile content
- Critical success factors
- Community support
- Hyrax is a solid platform to build from: "Hyrax was the easy part"
- Background: experience with Samvera
- Status check on Community Manager appointment
- Position posted several weeks ago through Emory, have had many applicants
- Have done preliminary phone screenings, and narrowed down to shortlist to interview
- Interviews will happen in the next couple of weeks
- Status check on Code of Conduct work
- Work is going ahead, but don't expect to be complete by Connect because of scheduling
- Connect 2020 Calls for proposals
- Update: Call for workshops closes this week (may extend, but please put in proposals), call for presentations closes at the end of the month
- Sanity check on proposed mechanism for posters
- Have an approach we think is workable: ask for poster submissions, accompanied by a short video or notes
- Presenter also signs up for 30-minute block to have a chat session with any interested parties (scheduled on sched.com)
- Anything to raise with the Steering Group (standing item)
- No issues raised
- Date of next call
- Friday 11th September, usual time slot