Pubblicato in rOpenSci - open tools for open science

You might have read my blog post analyzing the social weather ofrOpenScionboarding,based on a text analysis of GitHub issues. I extracted text out ofMarkdown-formatted threads with regular expressions. I basicallyhammered away at the issues using tools I was familiar with until itworked! Now I know there’s a much better and cleaner way, that I’llpresent in this note. Read on if you want to extract insights abouttext, code, links, etc.