Hydra Tech Call 2015-03-04

Time: 9:00am PDT / Noon EDT

Call-In Info: 1-530-881-1400, access code 651025

Moderator: Carolyn Cole (Penn State)

Notetaker: Karen Estlund (U. Oregon)

Attendees - Please add yourself: 

  • Peter Binkley (U. Alberta)

  • Weiwei Shi (U. Alberta)

  • Piyapong Charoenwattana (U. Alberta)

  • Mike Giarlo (Penn State)

  • Justin Coyne (Data Curation Experts)

  • Steven Ng (Temple University) 

  • Corey Harper (NYU)

  • Marisa Strong (CDL)

  • Esmé Cowles (UC San Diego)

  • Trey Terrell (OSU)

  • Adam Wead (Penn State)

  • Anna Headley (Chemical Heritage Foundation)


Agenda:


  1. Call for Agenda Items

  2. Committing updates to README files (Marisa)
    1. Marisa updated a file with books and XML to make corrections to the "Dive into Hydra" tutorial on the GitHub wiki, and was able to commit the corrections without review: https://github.com/projecthydra/hydra/wiki/Dive-into-Hydra
    2. The usual practice is to submit a pull request even when you have access to commit directly, but pull requests don't work for wiki pages
    3. Should there be a practice to review wiki pages? 
      1. Proposal: open an issue describing the changes and post it for discussion
      2. The issue is then closed once the changes are accepted
    4. Thank you Marisa!
  3. Approximate date ranges (Justin)
    1. How to split dates, such as ranges, in MODS dateCreated
    2. Generally, UCSD and Oregon have split date ranges using either Europeana (begin and end) with a SKOS prefLabel, or VRA (earliest and latest) with a view date
  4. Strategies for caching authoritative linked data (Justin)
    1. It is slow to fetch terms from linked sites like id.loc.gov and the Getty vocabularies
    2. Oregon had been using MongoDB but is now switching to Marmotta (DPLA also uses Marmotta). Oregon's reasons for moving to Marmotta:
      1. Wanted real SPARQL endpoint
      2. Marmotta includes LDCache, so some of the fetching and indexing issues don't have to be managed
      3. Oregon is testing in a sprint ending this Friday with a controlled vocab manager, but we're not doing bulk inserts or queries yet
    3. Marmotta was throwing errors for Justin, so he wrote an RDF repository for Solr
      1. Just querying a single term from id.loc.gov was throwing errors in Marmotta
      2. SPARQL querying does not work as advertised
    4. ActiveTriples using LDP could help with that
    5. Corey suggests a Linked Data Fragments client as a possible solution
    6. Possible quick fix: cache in a SQL-backed repository instead of building in Solr
    7. Corey raised several other things to test; Trey mentioned that a SPARQL-based repository could be attached to some of them
  5. Ordered ontologies & ActiveFedora (Justin, DCE)
    1. Justin is looking for feedback on: https://github.com/curationexperts/alexandria-v2/commit/33f05f65895e7170a6c9d2ab8678eddcb6d2dad5
    2. Looking for volunteers who understand ORE to review: Esmé and Rob?
    3. Mike mentions an ORE validator: http://www.openarchives.org/ore/1.0/tools
    4. It would be good to have a test that checks the output
  6. Next call
    1. Date: March 11, 2015
    2. Moderator: Justin Coyne (DCE)
    3. Notetaker: Thomas Scherz (Cincinnati)
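
Illustration for agenda item 3 (approximate date ranges): a minimal Python sketch of splitting a free-text date into a display label plus begin and end years, roughly in the spirit of the begin/end and earliest/latest approaches mentioned above. The function name `split_date_range`, the `DateRange` type, and the accepted input patterns are all hypothetical and were not discussed on the call. Treating a single year as both begin and end is likewise an assumption.

```python
import re
from typing import NamedTuple, Optional


class DateRange(NamedTuple):
    label: str            # original string, kept unchanged for display
    begin: Optional[str]  # earliest year, or None if no year was found
    end: Optional[str]    # latest year, or None if no year was found


# Two four-digit years separated by "-", "/", or "to" (assumed input shapes).
_RANGE = re.compile(r"(\d{4})\s*(?:[-/]|to)\s*(\d{4})")
_YEAR = re.compile(r"\d{4}")


def split_date_range(value: str) -> DateRange:
    """Split a free-text date into (label, begin, end)."""
    label = value.strip()
    match = _RANGE.search(label)
    if match:
        return DateRange(label, match.group(1), match.group(2))
    year = _YEAR.search(label)
    if year:
        # Assumption: a single year is used as both begin and end.
        return DateRange(label, year.group(0), year.group(0))
    return DateRange(label, None, None)
```

For example, `split_date_range("ca. 1920-1935")` keeps "ca. 1920-1935" as the label and yields 1920 and 1935 as the begin and end values; a string with no year keeps its label and leaves both values empty.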
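
Illustration for agenda item 4 (caching authoritative linked data): a minimal Python sketch of the "quick fix" idea of caching term labels in a SQL-backed store so that slow remote lookups happen only once per URI. The `TermCache` class, its table layout, and the `fetch` callable are hypothetical; the call did not settle on a design, and a real fetcher for id.loc.gov or the Getty vocabularies would need to be supplied separately.

```python
import sqlite3
from typing import Callable


class TermCache:
    """Cache labels for vocabulary URIs in SQLite (hypothetical design)."""

    def __init__(self, fetch: Callable[[str], str], path: str = ":memory:"):
        # `fetch` is whatever slow remote lookup is being cached; it is
        # passed in, so this sketch makes no network calls of its own.
        self._fetch = fetch
        self._db = sqlite3.connect(path)
        self._db.execute(
            "CREATE TABLE IF NOT EXISTS terms "
            "(uri TEXT PRIMARY KEY, label TEXT NOT NULL)"
        )

    def label_for(self, uri: str) -> str:
        """Return the cached label, fetching and storing it on a miss."""
        row = self._db.execute(
            "SELECT label FROM terms WHERE uri = ?", (uri,)
        ).fetchone()
        if row is not None:
            return row[0]
        label = self._fetch(uri)
        self._db.execute(
            "INSERT OR REPLACE INTO terms (uri, label) VALUES (?, ?)",
            (uri, label),
        )
        self._db.commit()
        return label
```

With this shape, a second call to `label_for` with the same URI is answered from the local table and does not invoke `fetch` again. Cache expiry and bulk loading are left out of the sketch.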