Loading…
2020 LD4 Conference on Linked Data in Libraries has ended
To see the "Open Zoom" button to join the sessions, sign in to your Sched account.
Welcome to the 2020 LD4 Conference on Linked Data in Libraries! There is no charge to participate. Attend one session or many! To join the sessions, create a Sched account. You will need a Sched account to see the "Open Zoom" button to join the actual session. Creating an account also enables you to create a personalized schedule of the sessions you want to attend and receive   reminders specific to those sessions. Session times are shown in Pacific Daylight Time (UTC -7). To view the schedule in your local time zone, use the Timezone dropdown; then you can print, email or download your schedule in your timezone.
Join the ld4 Slack workspace with channels for each conference track.
See the LD4 Conference website for information about:
• Conference goals and track descriptions
Zoom tips and settings for anonymous participation (most sessions will be recorded for sharing)
Community Participation Guidelines and how to report related issues
Questions? Technical difficulties? Post on #ld4_2020__troubleshooting Slack channel or email ld4conf_chairs@googlegroups.com
New: Playlist of session recordings (including pre-recorded sessions)
To see the "Open Zoom" button to join the sessions, sign in to your Sched account.
Back To Schedule
Friday, July 17 • 9:15am - 10:00am
Open and not very linked data: metadata quality in art museum data sets

Sign up or log in to save this to your schedule, view media, leave feedback and see who's attending!

Session recording

Open and Not Very Linked Data -- Metadata Quality in Art Museum Data Sets: A Review and Case Study of Example Data Sets from a Data Analyst Perspective, Yingying Han (University of Illinois)
The American Art Collaborative (AAC) is a consortium of 14 art museums. They strive to provide linked open data for their collections. Open data is gaining attention and adoption recently among museums. Even though these datasets are open, they typically are published in isolation from one another. Integrated datasets are more valuable than standalone subsets. This review examines geographic metadata from three representative AAC members: Princeton University Art Museum (PUAM), Gilcrease Museum (GM) and Smithsonian American Art Museum (SAAM) and explores metadata quality problems across three dimensions: interoperability, completeness and accuracy. Interoperability: Each museum used different data structure. Both GM and PUAM published geographic data about each object's original location. While SAAM shared geographic location data about the artists' place of birth and death. Furthermore, curators did not adopt a standard when assigning the data value, especially for country name and state name and hence created interoperability problems. Completeness: This review examined geographic data completeness to identify: (1) The records where every attribute value is "null"; (2) The records without a longitude value and latitude value. The longitude and latitude numbers are important geographic markers because other metadata can be inferred computationally to identify country, state, and city. This presentation will share python scripts to accomplish these tasks. Accuracy: Frequent errors in geographic metadata include: (1) Vague or incorrect values for continent, country, and state. For example, "Central America" was value for continent in PUAM dataset. The review presents python scripts to identify this kind of error. (2) Data uncertainty introduced by unsure curators assigning data value, such as "probably Chiapas". This points out the importance of longitude and latitude values which constrain options for curators and improve data accuracy. Audience: Library catalogers, museum curators, technologist and researchers seeking to understand the data analysts' needs regarding cultural Linked Open Data (LOD) and those who aim to improve LOD quality in data sets.

Session Facilitators
avatar for Merrilee Proffitt

Merrilee Proffitt

Senior Manager, OCLC
Helps manage the OCLC Research Library Partnership. Webinar & event wrangler. Editor of Leveraging Wikipedia: Connecting Communities of Knowledge (ALA Editions, 2018). Co-author of Reimagine Descriptive Workflows: A Community-informed Agenda for Reparative and Inclusive Descriptive Practice (OCLC Research, 2022).Her current projects include developing better relationships between Wikimedia projects and cultural heritage institutions, and developing more inclusive descriptive practices.Find m... Read More →
avatar for Elizabeth Russey Roke

Elizabeth Russey Roke

Discovery and Metadata Archivist, Emory University
avatar for Chela Scott Weber

Chela Scott Weber

Sr. Program Officer, RLP, OCLC
Chela Scott Weber is a Sr Program Officer for the OCLC Research Library Partnership. She previously helped to create and lead the Archival Collections Management department at NYU Libraries, where she also served as the Associate Head of the Tamiment Library & Wagner Labor Archive... Read More →

Presenters
avatar for Yingying Han

Yingying Han

Doctoral student, University of Illinois, Urbana-Champaign
Hello, I am a second-year Ph.D. student in the School of Information Sciences, University of Illinois, Urbana-Champaign. I got my Bachelor\\'s degree in Management from Sichuan University, China, and a Master\\'s degree in Library Science from Wuhan University. Inspired by my volunteer... Read More →


Friday July 17, 2020 9:15am - 10:00am PDT