Messages de Rogue Scholar

language
Publié in tarleb
Auteur Albert Krewinkel

Sitemaps are an easy way to list all pages that a search engine should crawl and index. Quarto supports the standard, XML based Sitemap Protocol, although that fact is a bit hidden in the docs. Quarto will automatically produce a sitemap if the website’s URL is given as site-url property: website: title: tarleb site-url: 'https://tarleb.com' site-path: '/' The URL will contain a colon :, which is why the YAML value must be put in quotes.

Publié in tarleb
Auteur Albert Krewinkel

Sitemaps are an easy way to list all pages that a search engine should crawl and index. Quarto supports the standard, XML based Sitemap Protocol, although that fact is a bit hidden in the docs. Quarto will automatically produce a sitemap if the website’s URL is given as site-url property: website: title: tarleb site-url: 'https://tarleb.com' site-path: '/' The URL will contain a colon :, which is why the YAML value must be put in quotes.

Publié in Stories by Adam Day on Medium
Auteur Adam Day

This post is about The Papermill Alarm: an API for detecting potential papermill-products. There’s a field of study called ‘stylometry’ where we look at the statistical properties of someone’s writing and use that to model their ‘style’. People write in idiosyncratic ways.

Publié in tarleb
Auteur Albert Krewinkel

These are the resources that I’d want to have if I was to learn about pandoc (the universal document converter) all over again: Installing latest release The GitHub release page has installers for Windows, macOS, and Linux. Docker images There are three types of pandoc Docker images: minimal – very small, just the bare pandoc binary; core – includes pandoc-crossref and helpers programs, e.g. those used by pandoc for SVG image conversion;

Publié in tarleb
Auteur Albert Krewinkel

These are the resources that I’d want to have if I was to learn about pandoc (the universal document converter) all over again: Installing latest release The GitHub release page has installers for Windows, macOS, and Linux. Docker images There are three types of pandoc Docker images: minimal – very small, just the bare pandoc binary; core – includes pandoc-crossref and helpers programs, e.g. those used by pandoc for SVG image conversion;

Publié in iPhylo

Just some thoughts as I work through some datasets linking taxonomic names to the literature. In the diagram above I've tried to capture the different situatios I encounter. Much of the work I've done on this has focussed on case 1 in the diagram: I want to link a taxonomic name to an identifier for the work in which that name was published. In practise this means linking names to DOIs.

Publié in re3data COREF Project Blog
Auteur re3data Team

Data quality assurance is a fundamental issue for repositories that seek to ensure trust in their services. This workshop focused on measures research data repositories undertake to ensure data quality. The objective of this workshop was to raise awareness and make the various activities repositories perform to assure, assess, and improve data quality visible.

Publié in Front Matter

Since last year this blog is powered by the Ghost open source blogging platform. Two important and long-standing shortcomings of the platform were search and comments, which I added via integrating third-party tools (Typesense and Discourse, respectively). In the last several weeks Ghost team has worked hard to add these features to the core platform, described here and here.

Publié in lab.sub - Articles

(Dependency Track, Best Practices) Introduction In May 2021 President of the USA Joe Biden has issued the Executive Order 14028 on “Improving the Nation’s Cybersecurity”. The document makes several demands on federal agencies concerning the implementation of Cybersecurity and identifies the security and integrity of the software supply chain as one vital claim.