The Management of Data for the Banking, Qualification, and Distribution of Induced Pluripotent Stem Cells: Lessons Learned from the European Bank for Induced Pluripotent Stem Cells
Abstract
:1. Introduction
- Determine which data are essential and must be generated, and hence, which data must be made available and how access is managed. This requires a set of mandatory data, which may change over time as use requirements change because of application evolution.
- Harmonise data from different sources. Donor material or pre-reprogrammed iPSCs may be provided by diverse sources, including individual researchers and clinical research groups, university laboratories, core facilities or academic institutions, or biobanks. These may, for example, apply various informed consent models to the donors and different quality standards using variable methods to characterise the generated cells. This variability also applies to the associated datasets, with variable metadata, data, and storage formats, as well as access requirements. These data must be harmonised, missing data be generated, and all data be made findable, accessible, interoperable, and reusable.
- Make existing inventories of iPSC lines easily discoverable to researchers. For the most part, each of the iPSC repositories in Table 1 has an online searchable catalogue of cell lines with some basic search functions to query for lines according health status, sex, and age. However, there is no minimum operational standard for data availability, a deficiency which applies to mandatory data content and formats and extends to the use of taxonomies, ontology terms, or standard interoperable identifiers (names). These are necessary to solve complex queries from users. For example, from a user perspective, is it possible to compose a genetically diverse, sex-balanced, neurodegenerative disease cohort of gene-edited isogenic cell lines for an organoid-based drug screen, ensuring all iPSCs come with the essential information needed to develop a commercial product or service? Such a query may also require interoperability across cell banks. Different approaches for stem cell data standardisation are emanating from various stakeholders [3,4,5]. However, there is no consensus yet from the stem cell community on how to achieve full interoperability, which would allow complex queries such as in the previous example.
2. EBiSC—The European Bank for Induced Pluripotent Stem Cells
3. EBiSC Data Management Infrastructure
3.1. Fair Data Principles
3.2. Identifier and Labelling
3.3. Data Types
3.4. Catalogue Search and Filter Functionality
4. Ethical and Legal Framework
4.1. Informed Consent Data
4.2. Downstream Usage Agreements
5. Discussion and Outlook
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
References
- Takahashi, K.; Yamanaka, S. Induction of pluripotent stem cells from mouse embryonic and adult fibroblast cultures by defined factors. Cell 2006, 126, 663–676. [Google Scholar] [CrossRef]
- Kim, J.-H.; Kawase, E.; Bharti, K.; Karnieli, O.; Arakawa, Y.; Stacey, G. Perspectives on the cost of goods for hPSC banks for manufacture of cell therapies. NPJ Regen. Med. 2022, 7, 54. [Google Scholar] [CrossRef] [PubMed]
- Under Development: Biotechnology—Data Interoperability for Stem Cell Data—Part 1: Framework. Available online: https://www.iso.org/standard/83185.html (accessed on 5 January 2023).
- International Society for Stem Cell Research Guidelines for Stem Cell Research and Clinical Translation 2021. Available online: https://www.isscr.org/s/isscr-guidelines-for-stem-cell-research-and-clinical-translation-2021.pdf (accessed on 5 September 2022).
- Sakurai, K.; Kurtz, A.; Stacey, G.; Sheldon, M.; Fujibuchi, W. First Proposal of Minimum Information About a Cellular Assay for Regenerative Medicine. Stem Cells Transl. Med. 2016, 5, 1345–1361. [Google Scholar] [CrossRef] [PubMed]
- de Sousa, P.A.; Steeg, R.; Kreisel, B.; Allsopp, T.E. Hot Start to European Pluripotent Stem Cell Banking. Trends Biotechnol. 2017, 35, 573–576. [Google Scholar] [CrossRef] [PubMed]
- Sousa, P.A.; de Steeg, R.; Wachter, E.; Bruce, K.; King, J.; Hoeve, M.; Khadun, S.; McConnachie, G.; Holder, J.; Kurtz, A.; et al. Rapid establishment of the European Bank for induced Pluripotent Stem Cells (EBiSC)—the Hot Start experience. Stem Cell Res. 2017, 20, 105–114. [Google Scholar] [CrossRef]
- Steeg, R.; Neubauer, J.C.; Müller, S.C.; Ebneth, A.; Zimmermann, H. The EBiSC iPSC bank for disease studies. Stem Cell Res. 2020, 49, 102034. [Google Scholar] [CrossRef]
- Kurtz, A.; Mah, N.; Chen, Y.; Fuhr, A.; Kobold, S.; Seltmann, S.; Müller, S.C. Human pluripotent stem cell registry: Operations, role and current directions. Cell Prolif. 2022, 55, e13238. [Google Scholar] [CrossRef] [PubMed]
- Mah, N.; Seltmann, S.; Aran, B.; Steeg, R.; Dewender, J.; Bultjer, N.; Veiga, A.; Stacey, G.N.; Kurtz, A. Access to stem cell data and registration of pluripotent cell lines: The Human Pluripotent Stem Cell Registry (hPSCreg). Stem Cell Res. 2020, 47, 101887. [Google Scholar] [CrossRef]
- Seltmann, S.; Lekschas, F.; Müller, R.; Stachelscheid, H.; Bittner, M.-S.; Zhang, W.; Kidane, L.; Seriola, A.; Veiga, A.; Stacey, G.; et al. hPSCreg—The human pluripotent stem cell registry. Nucleic Acids Res. 2016, 44, D757–D763. [Google Scholar] [CrossRef]
- Wilkinson, M.D.; Dumontier, M.; Aalbersberg, I.; Jsbrand, J.; Appleton, G.; Axton, M.; Baak, A.; Blomberg, N.; Boiten, J.-W.; da Silva Santos, L.B.; et al. The FAIR Guiding Principles for scientific data management and stewardship. Sci. Data 2016, 3, 160018. [Google Scholar] [CrossRef]
- Kurtz, A.; Seltmann, S.; Bairoch, A.; Bittner, M.-S.; Bruce, K.; Capes-Davis, A.; Clarke, L.; Crook, J.M.; Daheron, L.; Dewender, J.; et al. A Standard Nomenclature for Referencing and Authentication of Pluripotent Stem Cells. Stem Cell Rep. 2018, 10, 1–6. [Google Scholar] [CrossRef]
- Courtot, M.; Cherubin, L.; Faulconbridge, A.; Vaughan, D.; Green, M.; Richardson, D.; Harrison, P.; Whetzel, P.L.; Parkinson, H.; Burdett, T. BioSamples database: An updated sample metadata hub. Nucleic Acids Res. 2019, 47, D1172–D1178. [Google Scholar] [CrossRef] [PubMed]
- Athar, A.; Füllgrabe, A.; George, N.; Iqbal, H.; Huerta, L.; Ali, A.; Snow, C.; Fonseca, N.A.; Petryszak, R.; Papatheodorou, I. ArrayExpress update—From bulk to single-cell expression data. Nucleic Acids Res. 2019, 47, D711–D715. [Google Scholar] [CrossRef] [PubMed]
- Freeberg, M.A.; Fromont, L.A.; D’Altri, T.; Romero, A.F.; Ciges, J.I.; Jene, A.; Kerry, G.; Moldes, M.; Ariosa, R.; Bahena, S.; et al. The European Genome-phenome Archive in 2021. Nucleic Acids Res. 2022, 50, D980–D987. [Google Scholar] [CrossRef] [PubMed]
- Bairoch, A. The Cellosaurus, a Cell-Line Knowledge Resource. J. Biomol. Tech. JBT 2018, 29, 25–38. [Google Scholar] [CrossRef]
- Steeg, R.; Mueller, S.C.; Mah, N.; Holst, B.; Cabrera-Socorro, A.; Stacey, G.N.; De Sousa, P.A.; Courtney, A.; Zimmermann, H. EBiSC best practice: How to ensure optimal generation, qualification, and distribution of iPSC lines. Stem Cell Rep. 2021, 16, 1853–1867. [Google Scholar] [CrossRef]
- Diehl, A.D.; Meehan, T.F.; Bradford, Y.M.; Brush, M.H.; Dahdul, W.M.; Dougall, D.S.; He, Y.; Osumi-Sutherland, D.; Ruttenberg, A.; Sarntivijai, S.; et al. The Cell Ontology 2016: Enhanced content, modularization, and ontology interoperability. J. Biomed. Semant. 2016, 7, 44. [Google Scholar] [CrossRef]
- Mungall, C.J.; McMurry, J.A.; Köhler, S.; Balhoff, J.P.; Borromeo, C.; Brush, M.; Carbon, S.; Conlin, T.; Dunn, N.; Engelstad, M.; et al. The Monarch Initiative: An integrative data and analytic platform connecting phenotypes to genotypes across species. Nucleic Acids Res. 2017, 45, D712–D722. [Google Scholar] [CrossRef] [PubMed]
- Schriml, L.M.; Arze, C.; Nadendla, S.; Chang, Y.-W.W.; Mazaitis, M.; Felix, V.; Feng, G.; Kibbe, W.A. Disease Ontology: A backbone for disease semantic integration. Nucleic Acids Res. 2012, 40, D940–D946. [Google Scholar] [CrossRef]
- Vasant, D.; Chanas, L.; Malone, J.; Hanauer, M.; Olry, A.; Jupp, S.; Robinson, P.N.; Parkinson, H.; Rath, A. ORDO: An Ontology Connecting Rare Disease, Epidemiology and Genetic Data. In Proceedings of the 22nd Annual International Conference on Intelligent Systems for Molecular Biology, Boston, MA, USA, 11–15 July 2014; Available online: https://www.researchgate.net/publication/281824026_ORDO_An_Ontology_Connecting_Rare_Disease_Epidemiology_and_Genetic_Data, (accessed on 21 February 2023).
- Besser, D. Advancing Stem Cell Technologies and Applications: A Special Collection from the PluriCore Network in the German Stem Cell Network (GSCN). Curr. Protoc. Stem Cell Biol. 2020, 55, e129. [Google Scholar] [CrossRef]
- Dahéron, L.; Diecke, S.; Healy, L.; D’Souza, S. Cores laboratories: Organization for stem cell technology advancement. Stem Cell Res. 2021, 53, 102266. [Google Scholar] [CrossRef] [PubMed]
- Vasilevsky, N.; Johnson, T.; Corday, K.; Torniai, C.; Brush, M.; Segerdell, E.; Wilson, M.; Shaffer, C.; Robinson, D.; Haendel, M.; et al. Research resources: Curating the new eagle-i discovery system. Database J. Biol. Databases Curation 2012, 2012, bar067. [Google Scholar] [CrossRef] [PubMed]
- Chen, Y.; Sakurai, K.; Maeda, S.; Masui, T.; Okano, H.; Dewender, J.; Seltmann, S.; Kurtz, A.; Masuya, H.; Nakamura, Y. Integrated Collection of Stem Cell Bank Data, a Data Portal for Standardized Stem Cell Information. Stem Cell Rep. 2021, 16, 997–1005. [Google Scholar] [CrossRef]
- Kurtz, A.; Elsallab, M.; Sanzenbacher, R.; Abou-El-Enein, M. Linking Scattered Stem Cell-Based Data to Advance Therapeutic Development. Trends Mol. Med. 2019, 25, 8–19. [Google Scholar] [CrossRef] [PubMed]
iPSC Bank | No. of Human iPSC Lines in Catalogue (February 2023) | Description |
---|---|---|
BRC RIKEN 1 | 4239 | Contains lines from healthy/disease donors; holdings from primarily Japanese public institutions |
WiCell 2 | 1447 | Maintains collections of healthy/disease and gene-edited iPSC lines |
Fujifilm CDI 3 | 1554 | Hosts the California Institute for Regenerative Medicine (CIRM) and other collections |
Sampled 4 | 479 | Catalogue includes the National Institute of Neurological Disorders and Stroke (NINDS; 184 lines) and the National Institute of Mental Health Repository (NIMH) and Genetics Resource (NRGR; 295 lines) |
Coriell Institute for Medical Research 5 | 206 | Distributor for various collections, including the Allen Institute Cell Catalog (56 fluorescently tagged iPSC lines); National Institute of General Medical Sciences (NIGMS; 108 lines); Orphan Disease Center (ODC; 9 lines); and National Aging Institute (NIA; 33 lines) |
EBiSC 6 | 928 | Diverse collection of healthy/disease iPSC lines produced from EC- and UK-funded projects |
Data Area | Data Types | Examples |
---|---|---|
The donation of original primary material/tissue | Descriptors Clinical features Genetic features | Donor age, sex, ethnicity/geographical ancestry, tissue type, and source Diseases and phenotypes Mutations, omics, and authenticators (e.g., short tandem repeats) |
Manufacturing and cryopreservation | Protocols Process data Batch data | Reprogramming methods Culture protocols (e.g., media, passaging, environment, and bioreactors) and genetic modifications of cells |
Cells | Characterisation Genetic features Safety Genetic stability | Cell morphology, differentiation potency (such as teratoma formation or marker expression), passage number, karyotype, omics, authenticators, cell line quality standard (e.g., research vs. clinical use), vector clearance, and microbiology/virus screening |
Applications | Drug screening Disease models Clinical | Organoids, differentiated cell types, and clinical trials |
Legal and ethics | Patents MTAs Consent documentation templates | Informed consent data Ethics approvals Restrictions for the use of cells and data (e.g., commercial, research, clinical, and global access) |
Management Task | hPSCreg® | IMS | EBiSC Catalogue | EBI/EGA |
---|---|---|---|---|
Cell line registration by depositors | X | |||
Unique identifier/standard name through an automated programming interface (API) | X | X | ||
Standardised iPSC data | X | |||
Ontology lookup | X | |||
Searchable, publicly accessible database | X | X | ||
iPSC data validation | X | |||
Ethics provenance assessment/validation | X | |||
Genetic data storage and access | X | |||
Data management tools/dashboard | X | |||
Inventory/batch management | X | |||
Batch certificate of analysis management | X | |||
Catalogue | X | X | ||
Operational alignment, internal data exchange, and transfer between components/banking facilities | X | |||
Protocols, user guidance, and support | X |
FAIR Principle | Features |
---|---|
Findable |
|
Accessible |
|
Interoperable |
|
Reusable |
|
A. Basic data for cell line registration | |
Data type | Specific information |
Unique cell line identifiers | Standard stem cell nomenclature |
Cell line provider | Generator Owner Distributor |
Donor information | Donor characteristics: sex, disease status, clinical features, age, and genotype Ethics/consent: consent documentation and provenance has been established Genetic/clinical data: managed access through EBiSC |
B. iPSC Line Data | |
Data type | Specific information |
iPSC derivation | Source cell type (donated tissue) Potency Morphology Marker expression |
Genetic data | STR profiles HLA typing Karyotype status |
Cultivation conditions | Surface coating, passage method, medium, and use of ROCK inhibitor |
Sterility | Microbiology/virology screening status |
Genetic modification | Confirmation of gene edits (if applicable) Type of modification: transgene expression, gene knock-in/-out, and isogenic modification |
Batch information | Growth characteristics, morphology, differentiation potency, and passage number Authentication results and karyotype Cultivation conditions: matrix, culture medium, and O2 and CO2 concentrations |
Supplementary data | Publications iPSC line applications Omics data at the cell line or donor level |
Terms and conditions of use | Allowed usage: research/clinical/commercial Third-party obligations Application grade: research/clinical tracking Access to sensitive personal data (e.g., genetic or clinical data) managed by EBiSC DAC |
Operational data | Sales inventory Aggregated cell line sales data |
No. | EBiSC Mandatory Item for Consent Documentation Review | Relevant Questionnaire Statements: Ethics/Usage |
---|---|---|
1 | Was participation and sample/data donation voluntary? Yes: deposition in EBiSC possible No: ineligible for deposition in EBiSC |
|
2 | Was the generation of iPSCs or cell lines explicitly mentioned? Yes: deposition in EBiSC possible No: deposition in EBiSC is still possible on a case-by-case basis |
|
3 | Can donated samples and derivatives be shared to other researchers at non-profit and for-profit organisations internationally? Yes: deposition in EBiSC possible No: ineligible for deposition in EBiSC |
|
4 | Is the generation, analysis, storage and sharing of genomic data included? Yes: deposition in EBiSC possible No: ineligible for EBiSC deposition |
|
5 | Is undefined ‘future research’ consented for? Yes: deposition in EBiSC possible No: eligibility for deposition in EBiSC is subject to ad hoc ethical review considering the status quo of research at the time of consent vs. present day |
|
6 | What are the consequences of consent withdrawal? Remark: Discussion with clinical representative to clarify that once cell lines are deposited in EBiSC and distributed, fully halting their use cannot be guaranteed. |
|
7 | Have consent templates undergone independent ethical review according to local procedure? Yes: deposition in EBiSC possible No: ineligible for EBiSC deposition |
|
8 | Do consent templates contain restrictions for use of the cell line/derivatives? Yes: these restrictions should be explained explicitly and deposition in EBiSC is possible No: deposition in EBiSC is possible. |
|
9 | When did sample collection take place, e.g., before or after GDPR came into effect? Remark: for samples collected after 2018 and in the EU, data protection clarification is mandatory for deposition in EBiSC. |
|
10 | Do consent templates prevent or limit potential commercial exploitation of derivatives? Yes: this limitation should be explained explicitly and deposition in EBiSC is possible No: deposition in EBiSC is possible as long as cells can still be used by commercial companies for research purposes. |
|
Cells | ||
EMDA | EBiSC Material Deposit Agreement | Legal agreement between the depositor and EBiSC to grant EBiSC the right to bank, QC, and distribute a depositor’s cell line |
CLIP/PIP | Cell Line or Product Information Pack | Contains all iPSC information relating to permitted and restricted uses (as IP or TPOs) |
EAUA | EBiSC Access and Use Agreement | Legal agreement between EBiSC and the user to purchase and use a cell line, including information on existing TPOs |
Data | ||
DAC | Data Access Committee | Manages user access to sensitive personal data associated with an EBiSC cell line |
DAA | Data Access Agreement | Legal agreement between EBiSC and the user to use sensitive personal data |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Mah, N.; Kurtz, A.; Fuhr, A.; Seltmann, S.; Chen, Y.; Bultjer, N.; Dewender, J.; Lual, A.; Steeg, R.; Mueller, S.C. The Management of Data for the Banking, Qualification, and Distribution of Induced Pluripotent Stem Cells: Lessons Learned from the European Bank for Induced Pluripotent Stem Cells. Cells 2023, 12, 2756. https://doi.org/10.3390/cells12232756
Mah N, Kurtz A, Fuhr A, Seltmann S, Chen Y, Bultjer N, Dewender J, Lual A, Steeg R, Mueller SC. The Management of Data for the Banking, Qualification, and Distribution of Induced Pluripotent Stem Cells: Lessons Learned from the European Bank for Induced Pluripotent Stem Cells. Cells. 2023; 12(23):2756. https://doi.org/10.3390/cells12232756
Chicago/Turabian StyleMah, Nancy, Andreas Kurtz, Antonie Fuhr, Stefanie Seltmann, Ying Chen, Nils Bultjer, Johannes Dewender, Ayuen Lual, Rachel Steeg, and Sabine C. Mueller. 2023. "The Management of Data for the Banking, Qualification, and Distribution of Induced Pluripotent Stem Cells: Lessons Learned from the European Bank for Induced Pluripotent Stem Cells" Cells 12, no. 23: 2756. https://doi.org/10.3390/cells12232756
APA StyleMah, N., Kurtz, A., Fuhr, A., Seltmann, S., Chen, Y., Bultjer, N., Dewender, J., Lual, A., Steeg, R., & Mueller, S. C. (2023). The Management of Data for the Banking, Qualification, and Distribution of Induced Pluripotent Stem Cells: Lessons Learned from the European Bank for Induced Pluripotent Stem Cells. Cells, 12(23), 2756. https://doi.org/10.3390/cells12232756