Revolutions Take Time
Abstract
:1. Introduction
2. Vision
2.1. Integrated Virtual Data Collection
2.2. Easy Plug-In
2.3. Self-Standing Entities
2.4. Built-In Security
- PID records that are so crucial for managing and accessing FDOs are protected using a PKI infrastructure ensuring that only accepted authorities, in general, the owners, are allowed to make changes. It is even possible to prevent access to attributes in the PID record in case that industry for example wants to protect crucial business information.
- A PID should persistently include a hash code in the PID record characterizing the bit-sequence and indicating to every user whether it is indeed the bit-sequence one is expecting and also allowing data providers to look for (unauthorized) copies in the data space (It should be noted that repositories need to state explicitly what their policies with respect to assigning PIDs are and whether they for example allow for mutable DOs where adding checksums does not make much sense.). Some repositories use such a hash code as a suffix of the PID which has the same effect. Tracking of data copying is made possible.
- Most important is that the PID record is always the anchor for dealing with digital artifacts encoded as bit-sequences of an FDO. The PID record includes all references to relevant information including license terms, access permissions, etc. This means that the rights specifications are persistently bound with the bit-sequence independent where the copies of these may be stored. Therefore, FDOs offer a solution for proper authorization in distributed landscapes where copies are being exchanged and traded (In general, when solutions for Authentication and Authorization Infrastructures (AAI) are being offered this only includes mechanisms for distributed authentication.).
- This could be extended to persistently include a reference to a blockchain that stores transaction events if this is requested by the data provider. This would enable data providers to point to a transaction of a specific object that is characterized among others by a hash value included probably as an attribute in the PID record.
- Of course, interested data providers could extend these measures by adding hidden information into the data stream, for example, using certificates to sign data or using cryptography to increase security. These are all mechanisms changing the bit-sequence which are beyond the FDO mechanisms.
2.5. Summary
3. Current Initiatives
3.1. Research Infrastructures
- Almost all researchers are aware of the FAIR guiding principles.
- A variety of communities can access well-organized repositories, have developed comprehensive metadata schemas and tools useful for their research work, and built a set of supporting semantic artifacts.
- Experts are starting to experiment with workflow frameworks to automate recurring processes.
- More early career experts are being employed who are ready to make use of new technology.
3.2. eInfrastructures
3.3. EOSC
3.4. NFDI
3.5. NIH Commons
4. Stakeholder Interests
4.1. Researchers
4.2. Tool Developers
- Tool developers are, of course, driven by the wishes of their customers and so functional extensions have the highest priority (Of course debugging and software maintenance have highest priority and the costs for this are in general underestimated.).
- Tool developers are partial to their tool and want to keep it alive in an area of dynamic competition. Since software maintenance, in general, is not funded, they are under constant pressure to find funding sources.
- Tool developer plans have short time horizons and therefore it is not surprising that FAIRness and other emerging principles and standards do not have high relevance.
- Relevant tools need to broaden their functionality to be attractive, which implies that infrastructural elements are being integrated leading to an “all-in-one” concept. This is the opposite of how infrastructures should be designed. Often the strict separation between data and operations is not maintained due to short-term design and development considerations, which leads to unwanted dependencies.
4.3. Industry
- Engagement of industry will be necessary to make real steps towards convergence. As in the case of the Internet standards, some form of public-private partnerships will be needed.
- Existing big data industry is not interested in new open standards, just as in the Internet case, since it would influence their market position where proprietary solutions dominate.
- New companies would have to emerge that see potential in new standards and are ready to take risks and/or the open standards would have to emerge from the science community (as in the Internet case) and then be taken up by such new companies.
4.4. Policy-Makers
5. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Conflicts of Interest
References
- Wittenburg, P.; Strawn, G. Common Patterns in Revolutionary Infrastructures and Data. B2SHARE Arch. 2018. [Google Scholar] [CrossRef]
- Wilkinson, M.D.; Dumontier, M.; Aalbersberg, I.J.; Appleton, G.; Axton, M.; Baak, A.; Blomberg, N.; Boiten, J.-W.; da Silva Santos, L.B.; Bourne, P.E.; et al. The FAIR Guiding Principles for scientific data management and stewardship. Sci. Data 2016, 3, 160018. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- De Smedt, K.; Koureas, D.; Wittenburg, P. Analysis of Scientific Practice towards FAIR Digital Objects. B2SHARE Arch. 2019. [Google Scholar] [CrossRef]
- FDO Forum. Available online: https://fairdo.org/ (accessed on 1 November 2021).
- Harari, Y.N. Homo Deus; Harvill Secker: London, UK, 2016. [Google Scholar]
- Floridi, L. The Fourth Revolution: How the Infosphere is Reshaping Human Reality; Oxford University Press: Oxford, UK, 2014. [Google Scholar]
- Strawn, G. Open Science, Business Analytics, and FAIR Digital Objects. In Proceedings of the 43rd Annual Computer Software and Applications Conference (COMPSAC), Milwaukee, WI, USA, 15–19 July 2019; Available online: https://ieeexplore.ieee.org/document/8754334 (accessed on 15 November 2021).
- Int. Data Space. Available online: https://internationaldataspaces.org/ (accessed on 1 November 2021).
- Kahn, R.; Wilensky, R. A framework for distributed digital object services. Int. J. Digit. Libr. 2006, 6, 115–123. [Google Scholar] [CrossRef]
- DOIP. Available online: https://www.dona.net/sites/default/files/2018-11/DOIPv2Spec_1.pdf (accessed on 1 November 2021).
- Linked Data. Available online: https://en.wikipedia.org/wiki/Linked_data (accessed on 1 November 2021).
- Hughes, T. Networks of Power; Johns Hopkins University Press: Baltimore, MD, USA, 1983. [Google Scholar]
- ResourceSync. Available online: http://www.openarchives.org/rs/toc (accessed on 1 November 2021).
- ESFRI. Available online: https://www.esfri.eu/ (accessed on 1 November 2021).
- Galaxy. Available online: https://galaxyproject.org/ (accessed on 1 November 2021).
- Icebear. Available online: https://www.icebear.fi/ (accessed on 1 November 2021).
- EBRAINS. Available online: https://ebrains.eu/ (accessed on 1 November 2021).
- CaosDB. Available online: https://caosdb.org/ (accessed on 1 November 2021).
- eInfrastructures. Available online: https://ec.europa.eu/programmes/horizon2020/en/h2020-section/e-infrastructures (accessed on 1 November 2021).
- eIRG Workshop. Available online: https://indico.fccn.pt/event/15/overview (accessed on 1 November 2021).
- EOSC. Available online: https://www.eoscsecretariat.eu/ (accessed on 1 November 2021).
- Jeffery, K.; Wittenburg, P.; Lannom, L.; Strawn, G.; Biniossek, C.; Betz, D.; Blanchi, C. Not Ready for Convergence in Data Infrastructures. Data Intell. 2021, 3, 116–135. [Google Scholar] [CrossRef]
- NFDI. Available online: https://www.nfdi.de/ (accessed on 1 November 2021).
- NFDI Cross-Cutting Topics. Available online: https://zenodo.org/record/4593770#.YT8VOX1CTb0 (accessed on 1 November 2021).
- NIH Commons. Available online: https://era.nih.gov/ (accessed on 1 November 2021).
1950s | many individual computers | separated data sets |
1990s | one virtual computer | separated data sets |
2030s | one virtual computer | one virtual data collection |
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Wittenburg, P.; Strawn, G. Revolutions Take Time. Information 2021, 12, 472. https://doi.org/10.3390/info12110472
Wittenburg P, Strawn G. Revolutions Take Time. Information. 2021; 12(11):472. https://doi.org/10.3390/info12110472
Chicago/Turabian StyleWittenburg, Peter, and George Strawn. 2021. "Revolutions Take Time" Information 12, no. 11: 472. https://doi.org/10.3390/info12110472
APA StyleWittenburg, P., & Strawn, G. (2021). Revolutions Take Time. Information, 12(11), 472. https://doi.org/10.3390/info12110472