Saeel Sandeep Nachane, Ojas Gramopadhye, et al.
EMNLP 2024
This technical report introduces Docling, an easy-to-use, self-contained, MIT-licensed open-source package for PDF document conversion. It is powered by state-of-the-art specialized AI models for layout analysis (DocLayNet) and table structure recognition (TableFormer), and runs efficiently on commodity hardware within a small resource budget. The code interface allows for easy extensibility and the addition of new features and models.
Merve Unuvar, Yurdaer Doganata, et al.
CLOUD 2014
Ella Barkan, Ibrahim Siddiqui, et al.
Computational And Structural Biotechnology Journal
Hagen Soltau, Lidia Mangu, et al.
ASRU 2011