Rogue Scholar

Publicados 19 de outubro de 2022

Autor Pachá (aka Mauricio Vargas Sepúlveda)

Summary This post is about the surprising uses I’ve noticed and the questionsabout the censo2017 R package, a tool foraccessing the Chilean census 2017 data, I’ve gotten since it was peer-reviewedthrough rOpenSci one year ago.

Open-scienceReproducible-researchData-accessData-extractionGeospatialCiências da Computação e da InformaçãoInglês

The Story Behind censo2017, the First rOpenSci Package to be Reviewed in Spanish

https://doi.org/10.59350/2jakv-t6d66

Publicados 27 de julho de 2021

Autor Pachá (aka Mauricio Vargas Sepúlveda)

Summary censo2017 is an R package designed toorganize the Redatam ¹ filesprovided by the Chilean National Bureau of Statistics (Instituto Nacional deEstadísticas de Chile in spanish) in DVD format ² . This package was inspiredby citesdb(Noam Ross, 2020) and taxadb(Carl Boettiger et al, 2021).This post is about thispackage, the problem it solves, how to use it, and the fact that the package andits review process were all

Software Peer ReviewPackagesCommunityReproducible ResearchHigh Performance ComputingCiências da Computação e da InformaçãoInglês

targets: Democratizing Reproducible Analysis Pipelines

https://doi.org/10.59350/bbx9k-qgh44

Publicados 3 de fevereiro de 2021

Autor Will Landau

Make ¹ -like pipelines enhance the integrity, transparency, shelf life, efficiency, and scale of large analysis projects.With pipelines, data science feels smoother and more rewarding, and the results are worthy of more trust. targets install.packages("targets") The targets ² package is a new pipeline toolkit for R.It recently cleared software review, and it is now on CRAN.

Open-scienceReproducible-researchData-accessDatasharingOsfCiências da Computação e da InformaçãoInglês

OSF: A Project Management Service Built for Research

https://doi.org/10.59350/jf1mh-j5w74

Publicados 4 de agosto de 2020

Autor Aaron Wolen

osfr provides a ( hopefully ) convenient R interface to OSF (Open Science Framework, https://www.osf.io), a free service for managing research developed by the Center for Open Science (COS). osfr completed its rOpenSci peer-review earlier this year and has been available on CRAN since February.

CommunityEventsCommunity CallReproducibilityReproducible-researchCiências da Computação e da InformaçãoInglês

Community Call - Last Night, Testing Saved my Life

https://doi.org/10.59350/512az-r1y21

Publicados 12 de novembro de 2019

Autor Stefanie Butland

To the uninitiated, software testing may seem variously boring, daunting or bogged down in obscure terminology. However, it has the potential to be enormously useful for people developing software at any level of expertise, and can often be put into practice with relatively little effort. Our 1-hour Call will include two speakers and at least 20 minutes for Q &

CommunityEventsCommunity CallReproducibilityReproducible-researchCiências da Computação e da InformaçãoInglês

Community Call - Reproducible Workflows at Scale with drake

https://doi.org/10.59350/v6kvt-xb523

Publicados 8 de agosto de 2019

Autor Stefanie Butland

Ambitious workflows in R, such as machine learning analyses, can be difficult to manage. A single round of computation can take several hours to complete, and routine updates to the code and data tend to invalidate hard-earned results. You can enhance the maintainability, hygiene, speed, scale, and reproducibility of such projects with the drake R package.

CommunityEventsCommunity CallReproducibilityReproducible-researchCiências da Computação e da InformaçãoInglês

Community Call - Reproducible Research with R

https://doi.org/10.59350/v7xr3-6sn63

Publicados 11 de julho de 2019

Autor Stefanie Butland

Our 1-hour Call on Reproducible Research with R will include three speakers and 20 minutes for Q & A. Ben Marwick will introduce you to a research compendium, which accompanies, enhances, or is a scientific publication providing data, code, and documentation for reproducing a scientific workflow.

ReproducibilityReproducible ResearchTidydataDatasharingSoftwareCiências da Computação e da InformaçãoInglês

Building Reproducible Data Packages with DataPackageR

https://doi.org/10.59350/fwpdf-qd151

Publicados 18 de setembro de 2018

Autor Greg Finak

Sharing data sets for collaboration or publication has always been challenging, but it’s become increasingly problematic as complex and high dimensional data sets have become ubiquitous in the life sciences. Studies are large and time consuming; data collection takes time, data analysis is a moving target, as is the software used to carry it out.

Reproducible ResearchArchivingOpen ScienceCiências da Computação e da InformaçãoInglês

The challenge of combining 176 otherpeoplesdata to create the Biomass And Allometry Database

https://doi.org/10.59350/ec8b2-x1a35

Publicados 3 de junho de 2015

Autores Daniel Falster, Rich FitzJohn, Remko Duursma, Diego Barneche

Despite the hype around “big data”, a more immediate problem facing many scientific analyses is that large-scale databases must be assembled from a collection of small independent and heterogeneous fragments – the outputs of many and isolated scientific studies conducted around the globe. Collecting and compiling these fragments is challenging at both political and technical levels.

Reproducible ResearchCiências da Computação e da InformaçãoInglês

Introducing Rocker: Docker for R

https://doi.org/10.59350/j1028-pq649

Publicados 23 de outubro de 2014

Autores Carl Boettiger, Dirk Eddelbuettel

So what is Docker? Docker is a relatively new opensource applicationand service, which is seeing interest across a number of areas. Ituses recent Linux kernel features (containers, namespaces) to shieldprocesses.

rOpenSci - open tools for open science

Interesting Uses of censo2017 a Year After Publishing

The Story Behind censo2017, the First rOpenSci Package to be Reviewed in Spanish

targets: Democratizing Reproducible Analysis Pipelines

OSF: A Project Management Service Built for Research

Community Call - Last Night, Testing Saved my Life

Community Call - Reproducible Workflows at Scale with drake

Community Call - Reproducible Research with R

Building Reproducible Data Packages with DataPackageR

The challenge of combining 176 otherpeoplesdata to create the Biomass And Allometry Database

Introducing Rocker: Docker for R