Affiliation:
1. University of Saskatchewan, Saskatoon, SK, Canada
Abstract
Advances in scientific domains are led to an increase in the complexity of the experiments. To address this growing complexity, scientists from different domains require to work collaboratively. Scientific Workflow Management Systems (SWfMSs) are popular tools for data-intensive experiments. To the best of our knowledge, very few of the existing SWfMSs support collaboration, and it is not efficient in many cases. Researchers share a single version of the workflow in existing collaborative data analysis systems, which increases the chance of interference as the number of collaborators grows. Moreover, for effective collaboration, contributors require a clear view of the project's status, the information that existing SWfMSs do not provide. Another significant problem is most scientists are not capable of adding collaborative tools to existing SWfMSs, and they need software engineers to take on this responsibility. Even for software engineers such tasks could be challenging and time consuming. In this paper, we attempted to address this crucial issue in scientific workflow composition and doing so in a collaborative setting. Hence, we propose a tool to facilitate collaborative workflow composition. This tool provides branching and versioning, which are standard version control system features to allow multiple researchers to contribute to the project asynchronously. We also suggest some visualizations and a variety of reports to increase group awareness and help the scientists to realize the project's status and issues. As a proof of concept, we developed an API to capture the provenance data and provide collaborative tools. This API is developed as an example for software engineers to help them understand how to integrate collaborative tools into any SWfMS. We collect provenance information during workflow composition and then employ it to track workflow versions using the proposed collaborative tool. Prior to implementing the visualizations, we surveyed to discover how much the proposed visualizations could contribute to group awareness. Moreover, in the survey we investigated to what extent the proposed version control system could help address shortcomings in collaborative experiments. The survey participants provided us with valuable feedback. In future, we will use the survey responses to enhance the proposed version control system and visualizations.
Publisher
Association for Computing Machinery (ACM)
Subject
Computer Networks and Communications,Human-Computer Interaction,Social Sciences (miscellaneous)
Reference39 articles.
1. [n. d.]. Bioinformatics experiment. https://www.myexperiment.org/workflows/5147.html. [n. d.]. Bioinformatics experiment. https://www.myexperiment.org/workflows/5147.html.
2. [n. d.]. Collaboration and group awarenss survey link. https://forms.gle/DavjWTFi88nBN8rL8. [n. d.]. Collaboration and group awarenss survey link. https://forms.gle/DavjWTFi88nBN8rL8.
3. Youtube [n. d.]. Facilitate collaboration survey. Youtube. https://youtu.be/IzGJsPvaUDw Youtube [n. d.]. Facilitate collaboration survey. Youtube. https://youtu.be/IzGJsPvaUDw
4. Benchmark Requirements for Microservices Architecture Research
5. A Data Model for Analyzing User Collaborations in Workflow-Driven e-Science;Altintas Ilkay;Int. J. Comput. Their Appl.,2011
Cited by
3 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献