Extraction stops early
Extraction only processed part of your document? Here's why and what to do.
Page limit
Extraction processes the first 100 pages of each document. This is a system limit.
Solution for long documents
- Split the PDF into smaller files (under 100 pages each)
- Process each part separately
- Combine results in Excel after extraction
How to split a PDF
Option 1: Use Moby's split feature
- Select the PDF in Workspace
- Click Split PDF
- Choose how to split (by page count or document boundaries)
Option 2: Use external tools
- Adobe Acrobat
- Preview on Mac
Job timeout
Very complex documents may time out before completion.
Signs of a timeout
- Job shows "Failed" status
- Partial results available
- Error mentions "timeout"
Solutions
- Split into smaller batches
- Try a simpler extraction model
- Process during off-peak hours
Unsupported content
Some page types may be skipped:
| Content Type | Behavior |
|---|---|
| Blank pages | Skipped |
| Cover pages (no data) | Skipped |
| Image-only pages (poor OCR) | May have limited extraction |
| Complex layouts | May have reduced accuracy |
Checking what was processed
- Open the extraction results
- Check the page count in the summary
- Compare to original document pages
- Review any warnings or notes
Large documents
For documents over 100 pages, consider splitting proactively before upload. This gives you more control over what's extracted and makes review easier.