Bing Zhang, Mikio Takeuchi, et al.
NAACL 2025
This technical report introduces Docling, an easy-to-use, self-contained, MIT-licensed open-source package for PDF document conversion. It is powered by state-of-the-art specialized AI models for layout analysis (DocLayNet) and table structure recognition (TableFormer), and runs efficiently on commodity hardware within a small resource budget. The code interface allows for easy extensibility and the addition of new features and models.
Ryan Johnson, Ippokratis Pandis
CIDR 2013
Hannah Kim, Celia Cintas, et al.
IJCAI 2023
Shashank Ahire, Melissa Guyre, et al.
CUI 2025