Skip to main content

Handle messy PDFs

Real-world documents aren't always clean. Here's how to handle common issues.

Split bundled PDFs

If one PDF contains multiple documents (e.g., a bundle of invoices), split it before matching or extraction.

When to split:

  • Multiple invoices in one file
  • Bank statements with multiple months
  • Bundled contracts and amendments

How to do it: See Workspace Tools → Split PDF.

Merge PDFs

Merge related PDFs into a single file when you want one combined document.

When to merge:

  • Invoice with separate attachments
  • Multi-part contracts
  • Related documents that belong together

How to do it: See Workspace Tools → Merge PDFs.

Poor scan quality

For documents with poor OCR quality:

  1. Try re-scanning at higher resolution (300 DPI minimum)
  2. Improve contrast — Dark text on white background works best
  3. Straighten images — Skewed scans reduce accuracy
  4. If re-scanning isn't possible — Manual verification may be needed

Other common issues

IssueSolution
Password-protected PDFRemove password before uploading
Image-based PDFUse OCR to Excel for best results
Rotated pagesMoby handles rotation automatically
Multi-column layoutMay require manual verification