Curation and Publication of Simulation Data in DesignSafe, a Natural Hazards Engineering Open Platform and Repository
Maria Esteva,
Craig Jansen,
Pedro Arduino,
Mahyar Sharifi-Mood,
Clint N. Dawson and
Josue Balandrano-Coronel
Additional contact information
Maria Esteva: Texas Advanced Computing Center, University of Texas at Austin, Austin, TX 78758, USA
Craig Jansen: Texas Advanced Computing Center, University of Texas at Austin, Austin, TX 78758, USA
Pedro Arduino: Department of Civil and Environmental Engineering, University of Washington, Seattle, WA 98195, USA
Mahyar Sharifi-Mood: Texas Advanced Computing Center, University of Texas at Austin, Austin, TX 78758, USA
Clint N. Dawson: Aerospace Engineering and Engineering Mechanics, University of Texas at Austin, Austin, TX 78712, USA
Josue Balandrano-Coronel: Texas Advanced Computing Center, University of Texas at Austin, Austin, TX 78758, USA
Publications, 2019, vol. 7, issue 3, 1-17
Abstract:
Most open repositories present a similar interface and workflow to publish data resultant from different types of research methods. Publishing simulation datasets is challenging due to the iterative nature of simulations that generate large numbers and sizes of files, and their need for detailed documentation. DesignSafe is a web-based open platform for natural hazards engineering research where users can conduct simulations in high performance computing resources, curate, and publish their data. Working closely with experts, we completed a data design project for curation and representation of simulation datasets. The design involved the creation of a data and metadata model that captures the main processes, data, and documentation used in natural hazards simulation research. The model became the foundation to design an interactive curation pipeline integrated with the rest of the platform functions. In the curation interface, users are guided to move, select, categorize, describe, and register relations between files corresponding to the simulation model, the inputs and the outputs categories. Curation steps can be undertaken at any time during active research. To engage users, the web interactions were designed to facilitate managing large numbers of files. The resultant data landing pages show the structure and metadata of a simulation process both as a tree, and a browsing interface for understandability and ease of access. To evaluate the design, we mapped real simulation data to interactive mockups and sought out experts’ feed-back. Upon implementing a first release of the pipeline, we evaluated the data publications and made necessary enhancements.
Keywords: simulations; open repositories; data design; metadata; data curation; data representation; natural hazards engineering datasets (search for similar items in EconPapers)
JEL-codes: A2 D83 L82 (search for similar items in EconPapers)
Date: 2019
References: View complete reference list from CitEc
Citations:
Downloads: (external link)
https://www.mdpi.com/2304-6775/7/3/51/pdf (application/pdf)
https://www.mdpi.com/2304-6775/7/3/51/ (text/html)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:gam:jpubli:v:7:y:2019:i:3:p:51-:d:246783
Access Statistics for this article
Publications is currently edited by Ms. Jennifer Zhang
More articles in Publications from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().