New paper: pyBiodatafuse: Extending interoperability of data using modular queries across biomedical resources
The number of data and knowledge source relevant to your biological or chemical question increases every year. They all come with different API and different data models. These need to be documented and mapped. What better way to do that than actually do that and then use that. I never asked, but I can imagine that was the original idea of Tooba and Yojana. At the very least, it demonstrates the level of interoperability we need in the life sciences.
In a recent paper, Yojana Gadiya, Javier Millán Acosta, and Tooba Abbassi-Daloii led a project called BioDataFuse (worked on at the biohackathons of ELIXIR in 2023 and 2024 and of SWAT4HCLS in 2024 and 2025) and the matching Python package, pyBiodatafuse (doi:10.1093/bioinformatics/btag064).
With a group of researchers from The Netherlands, Switzerland, Czech Republic, and the USA, multiple databases are wrapped in a uniform data model. The package allows the generation of a graph across the imported databases which can then be further analyzed and visualized. This is an example (RDF) graph that was generated:

Seeing this kind of interoperability brings back good memories.
Congrats to all authors!
Additional details
Description
The number of data and knowledge source relevant to your biological or chemical question increases every year. They all come with different API and different data models. These need to be documented and mapped. What better way to do that than actually do that and then use that. I never asked, but I can imagine that was the original idea of Tooba and Yojana.
Identifiers
- GUID
- https://doi.org/10.59350/7n2bs-zsm80
- URL
- https://chem-bla-ics.linkedchemistry.info/2026/05/30/new-paper-pybiodatafuse.html
Dates
- Issued
-
2026-05-30T02:00:00
- Updated
-
2026-05-30T02:00:00
References
- Gadiya, Y., Millán Acosta, J., Ammar, A., Adriaque Lozano, A., Wetstede, D., Martinát, D., Sima, A. C., Mei, H., Willighagen, E., Abbassi-Daloii, T., & Wren, J. (2026). pyBiodatafuse: extending interoperability of data using modular queries across biomedical resources. Bioinformatics, 42(3). https://doi.org/10.1093/bioinformatics/btag064
- Gadiya, Y., Ammar, A., Willighagen, E., Martinat, D., Sima, A. C., Balci, H., & Abbassi-Daloii, T. (2023). BioHackEU23 report: Extending interoperability of experimental data using modular queries across biomedical resources. In BioHackrXiv. Center for Open Science. https://doi.org/10.37044/osf.io/mhsqp
- Acosta, J. M., Kawashima, S., Katayama, T., Bolleman, J., Martinat, D., Detering, H., Gayo, J. E. L., Gadiya, Y., & Abbassi-Daloii, T. (2025). BioHackEU24 report: Expanding FAIR database integration through elucidation and transformation of underlying graph schemas. In BioHackrXiv. Center for Open Science. https://doi.org/10.37044/osf.io/ptmg5_v1