Rogue Scholar

IdentifiersIdentifier designReproducibilitye-ScienceBig data

10 Simple Rules For Design, Provision, And Reuse Of Persistent Identifiers For Life Science Data

Publié 26 mai 2015

Auteurs Julie McMurry, Niklas Blomberg, Tony Burdett, Nathalie Conte, Michel Dumontier, Donal K Fellows, Alejandra Gonzalez-Beltran, Philipp Gormanns, Janna Hastings, Melissa A Haendel, Henning Hermjakob, Jean-Karim Hériché, Jon C Ison, Rafael C Jimenez, Simon Jupp, Nick Juty, Camille Laibe, Nicolas Le Novère, James Malone, Maria J Martin, Johanna R McEntyre, Chris Morris, Juha Muilu, Wolfgang Müller, Christopher J Mungall, Philippe Rocca-Serra, Susanna-Assunta Sansone, Murat Sariyar, Jacky L Snoep, Natalie J Stanford, Neil Swainston, Nicole Washington, Alan R Williams, Katherine Wolstencroft, Carole Goble, Helen Parkinson

In the life sciences, problems with identifiers impede the flow and integrity of information. This is especially challenging within “synthesis research” disciplines such as systems biology, translational medicine, and ecology. Implementation-driven initiatives such as ELIXIR, BD2K, and others have therefore been actively working to understand and address underlying problems with identifiers. Good, global-scale, persistent identifier design is harder than it appears, and is essential for data to be Findable, Accessible, Interoperable, and Reusable (Data FAIRport principles). Here, we build on emerging conventions and existing general recommendations and summarise the identifier characteristics most important to optimising the utility of life-science data. We propose actions to take in the identifier ‘green field’ and offer guidance for using real-world identifiers from diverse sources.

Human-readable and machine-readable Persistent Identifiers

References

10 Simple Rules For Design, Provision, And Reuse Of Persistent Identifiers For Life Science Data