2023-10-12 Hyrax Fedora 6 Working Group

 Date

Oct 12, 2023 10:00 AM EDT

 Participants

  • @Heather Greer Klein

  • @Jon Dunn

  • @Arran Griffith

  • @Daniel Pierce

  • @Tom Wrobel

  • @Emily Porter

  • @Bradley Watson

  • @Ayoub Belemlih

  • @Ben Pennell

  • @Dan Field

  • @Rebekah Kati

 Goals

  • Review the Use Cases completed by Stakeholders to identify commonalities between institutional needs.

 Discussion topics

Time

Item

Facilitator

Time

Item

Facilitator

5 minutes

Welcome

All

10 minutes

Update on Hyrax work

Daniel Pierce

45 mins

Review Use Cases

  • Identify common themes/ideas

  • Identify additional discussions needed

  • Review use cases

 

Notes

  • Valkyrization update

    • Brad and Ayoub have been helping and finding issues to fix in Hyrax and Valkyrie, big thanks to them! Another release candidate coming today or tomorrow. Need to get Koppie tests green, because they are looking for activefedora objects. Writing duplicate tests or migrating the tests when appropriate.

    • Brad got RDF predicates linked up. A few to assign for internal IDs form Fedora 4, on the whole should match closely. A few things at file set level are new to Valkyrie, but on the whole Fedora 6 was easy to get up and going. Daniel opened issues for two difficulties.

    • Daniel is feeling confident and is able to say that Fedora 6 really does work, not just should work, on a clean install of Hyrax 5. Migration of customized data will need customized migration, but we want to work for a basic set of migration guidance.

  • Use cases and determining pre-work

    • Need to come to an agreement about the pre-work needed, so the use cases document was the place to start. https://docs.google.com/spreadsheets/d/1Jxq2w8f_RjOBZwjXPen5Gx37lhn-MleJdeMKZfH4dk8/edit?usp=sharing

    • What should of Hyrax should be stored in Fedora 6? To make sure a repository should be rebuildable from scratch, but Oxford wants to choose what is okay to lose (admin and user data, templates). But what do others want to do? Store the binaries. One case of storing access controls, but not clear if that needs to be its own metadata object. Curious to see what OCFL through Valkyrie would look like. Access controls as they relate to what its public or not would be pretty important for us to retain when having to rebuild the application, so we wouldn’t lose what items are embargoed, etc

    • Assumptions around binaries and derivatives? Emory has customized to store up to five files per Fileset (1 primary and up to 4 derivatives) accessible through Hyrax. Bring them in via ingest, and store via same file system (Fedora 4 with S3 config). But the same front ends interact with all those files so might be different use case than others. One fileset with many files.

    • Valk has 2 locations, metadata adapter and storage adapter, can be configured individually. Can have multiple storage adapters. 4 out of the box: disk, versioned disk, fedora, memory (for quicker tests). Separate gem for S3 directly.

    • Want to take a look at performance issues for storage of large files that we know exist now. Storing triples metadata on the Fedora object – is this something that is problem for everyone?

    • The RDF stored by fedora is in an n-triples file right next to binaries for the same object in the ocfl directory

    • Postgres metatdata adapter uses a JSON file, the same would be possible for Fedora.

    • Does Hyrax use the triples for anything but storage in Fedora? No.

    • Separate metadata adapters would be useful to meet identified issues.

    • not sure that there are a huge number of users really taking full advantage of the LDP side of Fedora

    • Don’t see a migration path from fcrepo 4 to 6 without activefedora. Because of that we have the option of not taking advantage of LDP. In order to understand an object in Fedora 4, you need activefedora to make sense of relationships.

 Action items

Ask Rob Kaufman about what he has discovered re: migration
@Arran Griffith to share the use cases with the institutions not here that would be impacted.
Make arrangements for in-person/hybrid meeting at Samvera Connect

 

 

 Decisions