Ask a Question

Prefer a chat interface with context about you and your work?

DSG: An End-to-End Document Structure Generator

DSG: An End-to-End Document Structure Generator

Information in industry, research, and the public sector is widely stored as rendered documents (e.g., PDF files, scans). Hence, to enable downstream tasks, systems are needed that map rendered documents onto a structured hierarchical format. However, existing systems for this task are limited by heuristics and are not end-to-end trainable. …