Handle messy PDFs
Real-world documents aren't always clean. Here's how to handle common issues.
Split bundled PDFs
If one PDF contains multiple documents (e.g., a bundle of invoices), split it before matching or extraction.
When to split:
- Multiple invoices in one file
- Bank statements with multiple months
- Bundled contracts and amendments
How to do it: See Workspace Tools → Split PDF.
Merge PDFs
Merge related PDFs into a single file when you want one combined document.
When to merge:
- Invoice with separate attachments
- Multi-part contracts
- Related documents that belong together
How to do it: See Workspace Tools → Merge PDFs.
Poor scan quality
For documents with poor OCR quality:
- Try re-scanning at higher resolution (300 DPI minimum)
- Improve contrast — Dark text on white background works best
- Straighten images — Skewed scans reduce accuracy
- If re-scanning isn't possible — Manual verification may be needed
Other common issues
| Issue | Solution |
|---|---|
| Password-protected PDF | Remove password before uploading |
| Image-based PDF | Use OCR to Excel for best results |
| Rotated pages | Moby handles rotation automatically |
| Multi-column layout | May require manual verification |