I've put a short note up on bioRxiv about ways to geocode nucleotide sequences in databases such as GenBank. The preprint is "Geocoding genomic databases using GBIF" https://doi.org/10.1101/469650.
I've put a short note up on bioRxiv about ways to geocode nucleotide sequences in databases such as GenBank. The preprint is "Geocoding genomic databases using GBIF" https://doi.org/10.1101/469650.
Quick note to express my delight and surprise that my entry for the 2018 GBIF Ebbe Nielsen Challenge come in joint first! My entry was Ozymandias - a biodiversity knowledge graph which built upon data from sources such as ALA, AFD, BioStor, CrossRef, ORCID), Wikispecies, and BLR.
rgbif was seven years old yesterday! What is rgbif? rgbif gives you access to data from the Global Biodiversity Information Facility (GBIF) via their API. A samping of use cases covered in rgbif: Search for datasets Get metrics on usage of datasets Get metadata about organizations providing data to GBIF Search taxonomic names Get quick taxonomic name suggestions Search occurrences by taxonomic name/country/collector/etc.
I've submitted an entry for the 2018 GBIF Ebbe Nielsen Challenge. It's a couple of weeks before the deadline but I will be away then so have decided to submit early. My entry is Ozymandias - a biodiversity knowledge graph.
GBIF has reached 1 billion occurrences which is, of course, something to celebrate: An achievement on this scale represents a lot of work by many people over many years, years spent developing simple standards for sharing data, agreeing that sharing is a good thing in the first place, tools to enable sharing, and a place to aggregate all that shared data (GBIF). So, I asked a question: My point is not to do this: Rather it is to encourage a
First off, let me say that what follows is a lot of arm waving to try and obscure how little I understand what I'm talking about. I'm going to sketch out what I think is a "radical" idea for a GBIF Challenge entry. The motivation for this idea comes from several sources: 1. GBIF is (under-)funded by direct contributions from governments, hence each year it essentially "begs" for money.
Last year I finished my four-year stint as Chair of the GBIF Science Committee. During that time, partly as a result of my urging, GBIF launched an annual "GBIF Ebbe Nielsen Challenge", and I'm please that this year GBIF is continuing to run the challenge. In 2015 and 2016 the challenge received some great entries.
The following is a guest post by Bob Mesibov. Nico Franz and Beckett Sterner created a stir last year with a preprint in bioRxiv about expert validation (or the lack of it) in the "backbone" classifications used by aggregators.
Notes on how many georeferenced DNA sequences there are in GenBank, and how many could potentially be georeferenced.
Some random notes on the first day of TDWG 2017. First off, great organisation with the first usable conference calendar app that I've seen (https://tdwg2017.sched.com). I gave the day's keynote address in the morning (slides below). Towards a biodiversity knowledge graph from Roderic Page It was something of a stream of consciousness brain dump, and tried to cover a lot of (maybe too much) stuff.