Data science training for Arctic research

Course overview

The Arctic Data Center conducts training in data science and data management, both of which are critical skills for the stewardship of the data, software, and other research products preserved at the Arctic Data Center.

Course topics will include:

  • Arctic Data Center and NSF Standards and Policies
  • Data Management Plans
  • Effective data management for data preservation
    • Storing and Preparing Data in Open Source Formats
    • Stability, longevity, interoperability
    • Metadata
  • Publishing data at the Arctic Data Center
    • Web-based submission
    • Automating submission for large data sets
  • Data and Metadata Quality
  • Provenance for data and software


Agenda

| Time        | Day 1                                             | Day 2                                   |
|-------------|---------------------------------------------------|-----------------------------------------|
| 8:30-9:00   | Welcome and introductions                         | Writing Good Data Management Plans      |
| 9:00-9:45   | Arctic Data Center and NSF Standards and Policies | Writing Good Data Management Plans      |
| 9:45-10:00  | Break                                             | Break                                   |
| 10:00-12:00 | Effective data modeling and management            | Data packaging and file hierarchies     |
| Noon-1:15   | Lunch                                             | Lunch                                   |
| 1:15-2:15   | Authoring Quality metadata                        | Authoring large data sets               |
| 2:15-2:30   | Break                                             | Break                                   |
| 2:30-4:30   | Authoring Quality metadata                        | Large data and Tracking data provenance |
| 4:30-5:00   | Question and Answer                               | Discussion                              |


Work on this package was supported by:

  • NSF award #1546024 to M. B. Jones, S. Baker-Yeboah, A. Budden, J. Dozier, and M. Schildhauer

Additional support was provided for working group collaboration by the National Center for Ecological Analysis and Synthesis, a Center funded by the University of California, Santa Barbara, and the State of California.