Rogue Scholar

Publicado 9 de diciembre de 2015 in iPhylo

It's a nice feeling when work that one did ages ago seems relevant again. Markus Döring has been working on a new backbone classification of all the species which occur in taxonomic checklists harvested by GBIF. After building a new classification the obvious question arises "how does this compare to the previous GBIF classification?" A simple question, answering it however is a little tricky.

DOIGBIFGithubORCIDInformática y Ciencias de la InformaciónInglés

Thoughts on ReCon 15: DOIs, GitHub, ORCID, altmetric, and transitive credit

https://doi.org/10.59350/cgdvk-qhq18

Publicado 24 de junio de 2015 in iPhylo

Autor Roderic Page

I spent last Friday and Saturday at ( Research in the 21st Century: Data, Analytics and Impact , hashtag #ReCon_15) in Edinburgh. Friday 19th was conference day, followed by a hackday at CodeBase. There's a Storify archive of the tweets so you can get a sense of the meeting. Sitting in the audience a few things struck me. No identifier wars, DOIs have won and are everywhere.

Creative CommonsGeoJSONGeophylogenyGithubPLoSInformática y Ciencias de la InformaciónInglés

Visualising Geophylogenies in Web Maps Using GeoJSON

https://doi.org/10.59350/q7mg0-yq203

Publicado 24 de junio de 2015 in iPhylo

Autor Roderic Page

I've published a short note on my work on geophylogenies and GeoJSON in PLoS Currents Tree of Life : At the time of writing the DOI hasn't registered, so the direct link is here. There is a GitHub repository for the manuscript and code. I chose PLoS Currents Tree of Life because it is (supposedly) quick and cheap.

GitHubInformática y Ciencias de la InformaciónInglés

Introducing a Wishlist for Scientific R Packages

https://doi.org/10.59350/nvgd4-t4s24

Publicado 10 de marzo de 2015 in rOpenSci - open tools for open science

Autor Os Keyes

There are two things that make R such a wonderful programming environment - the vast number of packages to access, process and interpretdata, and the enthusiastic individuals and subcommunities (of which rOpenSci is a great example). One, of course, flows from the other:R programmers write R packages to provide language users with more features, which makes everyone’s jobs easier and (hopefully!)attracts more users and more contributions.

AnnotationDOIGBIFGithubNanopublicationInformática y Ciencias de la InformaciónInglés

Annotating GBIF, from datasets to nanopublications

https://doi.org/10.59350/zfdfv-82093

Publicado 28 de enero de 2015 in iPhylo

Autor Roderic Page

Below I sketch what I believe is a straightforward way GBIF could tackle the issue of annotating and cleaning its data. It continues a series of posts Annotating GBIF: some thoughts, Rethinking annotating biodiversity data, and More on annotating biodiversity data: beyond sticky notes and wikis on this topic. Let's simplify things a little and state that GBIF at present is essentially an aggregation of Darwin Core Archive files.

ChameleonsGBIFGithubInformática y Ciencias de la InformaciónInglés

Exploring the chameleon dataset: broken GBIF links and lack of georeferencing

https://doi.org/10.59350/qrffj-ncr79

Publicado 23 de septiembre de 2014 in iPhylo

Autor Roderic Page

Following on from the discussion of the African chameleon data, I've started to explore Angelique Hjarding's data in more detail. The data is available from figshare (doi:10.6084/m9.figshare.1141858), so I've grabbed a copy and put it in github. Several things are immediately apparent.There is a lot of ungeoreferenced data.

BioStorGeoreferencingGistGithubJournalMapInformática y Ciencias de la InformaciónInglés

Geotagging stats for BioStor

https://doi.org/10.59350/scxvs-swq77

Publicado 25 de agosto de 2014 in iPhylo

Autor Roderic Page

Note to self for upcoming discussion with JournalMap.As of Monday August 25th, BioStor has 106,617 articles comprising 1,484,050 BHL pages. From the full text for these articles, I have extracted 45,452 distinct localities (i.e., geotagged with latitude and longitude). 15,860 BHL pages in BioStor pages have at least one geotag, these pages belong to 5,675 BioStor articles.In summary, BioStor has 5,675 full-text articles that are geotagged.

GithubPhylogenyVisualisationInformática y Ciencias de la InformaciónInglés

Very large phylogeny viewer

https://doi.org/10.59350/19z31-gm249

Publicado 6 de mayo de 2014 in iPhylo

Autor Roderic Page

As announced on phylobabble I've started to revisit visualising large phylogenies, building on some work I did a couple of years ago (my how time flies). This time, there is actual code (see https://github.com/rdmpage/deep-tree) as well as a live demo http://iphylo.org/~rpage/deep-tree/demo/. You can see the amphibian tree below at http://iphylo.org/~rpage/deep-tree/demo/show.php?id=5369171e32b7a:You can upload or paste a tree (for now in NEXUS

FeaturedForkGitHubGoogle DocsOpen AccessOtras Ciencias SocialesInglés

Fork, merge and crowd-sourcing data curation

https://doi.org/10.59350/sfea0-0sy53

Publicado 26 de abril de 2014 in Science in the Open

Autor Cameron Neylon

Over the past few weeks there has been a sudden increase in the amount of financial data on scholarly communications in the public domain. This was triggered in large part by the Wellcome Trust releasing data on the prices paid for Article Processing Charges by the institutions it funds.

FigShareGBIFGithubInformática y Ciencias de la InformaciónInglés

Publishing biodiversity data directly from GitHub to GBIF

https://doi.org/10.59350/7xxdj-gx657

Publicado 13 de marzo de 2014 in iPhylo

Autor Roderic Page

Today I managed to publish some data from a GitHub repository directly to GBIF. Within a few minutes (and with Tim Robertson on hand via Skype to debug a few glitches) the data was automatically indexed by GBIF and its maps updated. You can see the data I uploaded here.The data I uploaded came from this paper:This is the data I used to build the geophylogeny for Banza using Google Earth.

Publicaciones de Rogue Scholar

Visualising the difference between two taxonomic classifications

Thoughts on ReCon 15: DOIs, GitHub, ORCID, altmetric, and transitive credit

Visualising Geophylogenies in Web Maps Using GeoJSON

Introducing a Wishlist for Scientific R Packages

Annotating GBIF, from datasets to nanopublications

Exploring the chameleon dataset: broken GBIF links and lack of georeferencing

Geotagging stats for BioStor

Very large phylogeny viewer

Fork, merge and crowd-sourcing data curation

Publishing biodiversity data directly from GitHub to GBIF