This Is AuburnAU Scholarly Repository

Show simple item record

Reusable for who? Discussing "data science ready" repository design


Metadata FieldValueLanguage
dc.contributorAli Krzton, alk0043@auburn.eduen_US
dc.creatorKrzton, Ali
dc.date.accessioned2025-03-06T17:36:25Z
dc.date.available2025-03-06T17:36:25Z
dc.date.created2023-03-28
dc.identifier.urihttps://osf.io/uydpe/wiki/RDAP%20Summit%202023%20Public%20materials/en_US
dc.identifier.urihttps://aurora.auburn.edu/handle/11200/50669
dc.identifier.urihttps://doi.org/10.35099/5z65-2q84
dc.description.abstractResearch data repositories are intended to make data FAIR - findable, accessible, interoperable, and reusable - but implementing these principles in practical terms entails deciding how to evaluate FAIRness. Recent interpretations of interoperability and reusability, in particular, have based their metrics of progress in implementation around facilitating machine processes. In other words, improving the “I” and the “R” of data in a repository translates into structuring data to be crawled and ingested by automated agents and incorporated as seamlessly as possible into aggregate datasets. This is referred to as “data science ready” research data, and some repositories are working towards this vision in consultation with computer scientists. This talk explores the implications of prioritizing the machine digestibility of research data in repository curation processes. What does this perspective imply about the relative value of datasets intended for reuse by researchers working manually to interpret, restructure, and analyze the data? About the fields of study that primarily produce data of that nature? There are tradeoffs in metadata structure and content when the intended “audience” of data is human vs. machine, and there are also risks associated with stripping research data of its context, as can easily happen when employing big data methodologies. Finally, potential impacts to the research ecosystem are considered. For instance, when data is automatically scraped at scale, what happens if the researchers that made it available miss out on attribution and citation of their work?Who will have oversight over later use, and perhaps misuse, of that data? How would the original creators know if this happened, and what could they do about it?These considerations are important because design choices that can shape the future of research practice should not go forward unexamined and unchallenged.en_US
dc.format.mp4en_US
dc.publisherResearch Data Access and Preservation Associationen_US
dc.relation.ispartofRDAP Summit 2023en_US
dc.rightsCC-BYen_US
dc.subjectFAIR dataen_US
dc.titleReusable for who? Discussing "data science ready" repository designen_US
dc.typeAudiovisualen_US
dc.type.genrePresentation, Paper Presentationen_US
dc.locationvirtualen_US
dc.creator.orcid0000-0001-9979-2471en_US

Files in this item

This item appears in the following Collection(s)

Show simple item record