How to use GROBID to extract text from PDF

Author: Aland Astudillo (ORCID: 0009-0008-8672-3168)

GROBID is a machine-learning-based tool that extracts text information from PDF files and other documents and converts it into a structured format. One of the key challenges in knowledge mining from academic articles is reading the content of PDF files.
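To make the idea concrete, here is a minimal sketch of sending a PDF to a GROBID server and reading the title and paragraphs from the structured result. It assumes a GROBID server is already running locally on its default port (8070) and that the full-text endpoint is `/api/processFulltextDocument`, which takes the PDF in a multipart form field named `input` and returns TEI XML. The helper names (`build_multipart`, `grobid_fulltext`, `tei_to_text`) are hypothetical and exist only for this illustration. Only the Python standard library is used.

```python
import uuid
import urllib.request
import xml.etree.ElementTree as ET

# Namespace used by TEI XML documents.
TEI_NS = {"tei": "http://www.tei-c.org/ns/1.0"}


def build_multipart(field_name, filename, payload):
    """Encode one file as a multipart/form-data body; return (body, content_type)."""
    boundary = uuid.uuid4().hex
    head = (
        f"--{boundary}\r\n"
        f'Content-Disposition: form-data; name="{field_name}"; filename="{filename}"\r\n'
        "Content-Type: application/pdf\r\n\r\n"
    ).encode("utf-8")
    tail = f"\r\n--{boundary}--\r\n".encode("utf-8")
    return head + payload + tail, f"multipart/form-data; boundary={boundary}"


def grobid_fulltext(pdf_path, base_url="http://localhost:8070"):
    """POST a PDF to a running GROBID server and return the TEI XML as bytes.

    Assumption: the server is reachable at base_url and exposes
    /api/processFulltextDocument with a form field named "input".
    """
    with open(pdf_path, "rb") as fh:
        body, content_type = build_multipart("input", "document.pdf", fh.read())
    request = urllib.request.Request(
        f"{base_url}/api/processFulltextDocument",
        data=body,
        headers={"Content-Type": content_type},
        method="POST",
    )
    with urllib.request.urlopen(request, timeout=120) as response:
        return response.read()


def tei_to_text(tei_xml):
    """Return (title, paragraphs) from a TEI XML document given as str or bytes."""
    root = ET.fromstring(tei_xml)
    title_el = root.find(".//tei:titleStmt/tei:title", TEI_NS)
    title = "".join(title_el.itertext()).strip() if title_el is not None else ""
    paragraphs = [
        # Join nested inline elements (e.g. references) and normalize whitespace.
        " ".join("".join(p.itertext()).split())
        for p in root.iterfind(".//tei:body//tei:p", TEI_NS)
    ]
    return title, paragraphs
```

With a server running, a call such as `title, paragraphs = tei_to_text(grobid_fulltext("paper.pdf"))` would return the article title and a list of body paragraphs; `paper.pdf` is a placeholder path. Parsing is kept separate from the HTTP call so that the TEI handling can be tried on a saved XML file without a server.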