Session Abstract:
Data rescue requires resources that are often scarce, and can also hinge of the timeliness of rescue efforts. Efforts by ESIP, RDA, EarthCube, and the IMLS-funded Data Rescue Toolkit have all focused on the development of elements to help focus and motivate data rescue efforts and improve the efficiency of efforts in the future. A clear need from within the community is to help synthesize findings from each of these groups, and to obtain input from the broader geosciences community, those actively involved in ad hoc data rescue efforts, and other issues around data rescue that may be particular to that community.
This session will begin with a brief overview of the work accomplished by the various groups mentioned above, and provide an introduction to the goals of the newly-approved CODATA Task Group, Improving Data Access and Reusability (IDAR-TG). The IDAR-TG highlights the importance and timeliness of not only data rescue, but also the need to collect and/or develop tools and frameworks to assist with assessing what data is in most need of rescuing.
The majority of the session will be a working session where we will discuss what characteristics of the data and its environment (repository, funding stream, etc.) increase data’s risk level, what considerations need to be made when deciding whether the data needs to be rescued, what resources are already available for this task, and how the Data Nomination Tool and other activities in the Data Rescue Toolkit project might be able to help.
Agenda: Set-up of problem (20-30 minutes):
- Intro (Hills)
- RDA Data Rescue (Diggs
- EarthCube/GeoDeepDive (Goring)
- ESIP Data Stewardship (Mayernick) slides
- Data Nomination Toolkit (Boehm)
- CODATA IDAR TG (Diggs)
Working on defining the path to a solution (45-60 minutes) Topics might include:
- How to define risk
- categories of risk
- ways to quantify (or weight) various risk factors
- other geo-disciplines that we might need to reach out to
- other scientific disciplines that have addressed data at risk evaluation
- ??
Summary/wrap up (5-10 minutes)
Please help with collaborative note-taking
HERE!Session Takeaways (post-meeting):1) Sharing/telling data rescue stories (the process; what was enabled by the rescue; financial benefit) is necessary to get the essential support for data rescue efforts.
2) Coordination with aligned groups is key to be sure that effort is not duplicated and that efforts are aligned.
3) Harnessing Code for America style-infrastructure to pair motivated people/students with relevant skills to the opportunities to participate in data rescue efforts could be a good model to identify data heroes / data minions within the data nomination tool.