2024-05-10 Partner Call
Samvera Partners Call
Friday, May 10th, 2024
11:30 am | Eastern Daylight Time (New York, GMT-05:00) | 1 hr
Join Zoom Meeting Link:
https://us06web.zoom.us/j/81973313501?pwd=dXVlcHZOODF2c2NiNGpoNEFad3NZQT09
Join by phone
Dial by your location
+1 646 931 3860 US
+1 646 558 8656 US (New York)
+1 253 215 8782 US (Tacoma)
+1 346 248 7799 US (Houston)
Meeting ID: 819 7331 3501
Passcode: 603236
Find your local number: https://us06web.zoom.us/u/kkbAJ584n
Code of Conduct
We want Samvera Community calls to be fun, informative, engaging events for all our partners and participants.
We encourage everyone to apply the Samvera community principles of openness, inquiry, and respect in their interactions at the event.
Please review the Code of Conduct and Anti-Harassment Policy.
If you have any questions or concerns, please feel free to reach out to community helpers.
Facilitator: @Heather Greer Klein
Note Taker: @Kevin Kochanski
Attendees
@Robin Lindley Ruggaber
@Karen Cariani
@Margaret Mellinger
@Collin Brittle
@Nicholas Mark Homenda
@Nora Zimmerman
@Kevin Kochanski
@Paul Walk
@Chris Awre
@Jill Morris
@Thomas Scherz
@Esmé Cowles
@Jon Dunn
@Carolyn Caizzi
@Emily Lynema
@david.schober
@Glen Horton
@Alberto Martinez
Agenda & Notes
Any other items for the agenda?
Partner: Welcome Paul from AntLeaf, new Partners
Board Election: Possibly two candidates; will do a vote when that is finalized
Samvera Virtual Connect reflections
120 Registered, with 70-85 participating live at any given time
Zoom - may move from Webinar to standard Zoom to encourage participation
Videos should be up on YouTube by the end of today
Thoughts:
Good mix of presentations
Good duration/scope
Please remember to fill out the feedback survey that Heather sent - https://forms.gle/QEeDibu4wmSRhXZb9
Discussion: AI and LLM (@david.schober @Brendan Quinn )
Vector databases - turn letters into numbers in a three dimensional space. Requires a special kind of database to do this, elastic search can do this now. Semantic relationships represented as numbers. See James Griffin’s presentation at Virtual Connect for the details.
Transformer-Based Language Models (LMs) - allows you to take words, pictures, etc. and embed it in a language model
Vector Search - similar to Transformer-Based, we embed in a model and ask “How close are these to the question asked?”
Retrieval-Augmented Generation - Takes the question, puts it through a vector search, and packets it together - Answers and questions. The system is given a prompt, which is akin to a persona. It answers with a specific tone and providing specific augmented information.
Northwestern Built:
Infrastructure - AWS Cloud
Chatbot
Sharing best practices
Scrolling - from a design prospective, important for the user to know that text is being generated.
Can toggle to public works only, limit to image audio or video
Gives LLM response about what it doesn’t know, and what the user needs to look at and documents to start for finding the answers. It knows enough from metadata to know what documents might have the requested information. Indexing metadata, grouped into archival collections.
Pulling from elastic search vector search.
Multi-lingual, 150 languages. Even with English-only metadata, responded cleanly in Italian. Colmex finds the quality of English results is better than Spanish; NW hasn’t yet evaluated.
Rough estimate for cost at launch is between $300-$500 a month with the current chatbot, limiting to logged in users to start. NW is currently evaluating the quality of response vs. cost to determine the right level (cost-benefit analysis).
Team is 6 including David. 2 Senior Devs, 1 Lead and 2 front end devs
NU applied for an IMLS grant for developing a tool for metadata description, transcribing handwritten text, element description.
Regional Meetings Updates and Planning
Midwest - Indiana University and PALNI (Private Academic Library Network of Indiana) are tentatively planning to host a Samvera Midwest Regional Meeting in Indianapolis, Indiana on September 25-26, 2024.
In order to gauge interest and help ensure that we are structuring the meeting to best meet the needs of Samvera community members in the Midwest, if you’re interested in this meeting, we would appreciate it if you could complete the following brief survey by Friday, May 17: https://forms.gle/Nq6MFuurwnmAtfU77Northeast - Princeton in October, save the date coming soon. Three small events in one week: Samvera, devops, and Blacklight. Samvera Regional event on 10/14-15, DevOps4Lib on 10/16-17, and a Blacklight on 10/18
West Coast - Chrissy at UCSB and Kevin at SoftServ met for an in initial viability conversation. Likely to host in San Diego or Portland, but will keep the community updated when we assess interest in hosting hopefully by the end of next week (May 17). This could happen as soon as August and hopefully before the end of October. Next steps are to contact Western US/Canada institutions to assess level of interest and gather broader input.
Europe - Chris is investigating
Other Updates:
Sign up to present or facilitate a topic (5 - 30 minutes) at a future Partner call. One requested topic without a facilitator is around hiring challenges and approaches.
Hyrax and Hyku development updates - see Virtual Connect presentations
Wishing Carolyn Caizzi well in new position at Harvard!
Anything for the Samvera Board? (Standing item)
Date of next call: June 14th
Notetaker: @Chris Awre