Skip to main content

Extraction stops early

Extraction only processed part of your document? Here's why and what to do.

Page limit

Extraction processes the first 100 pages of each document. This is a system limit.

Solution for long documents

  1. Split the PDF into smaller files (under 100 pages each)
  2. Process each part separately
  3. Combine results in Excel after extraction

How to split a PDF

Option 1: Use Moby's split feature

  1. Select the PDF in Workspace
  2. Click Split PDF
  3. Choose how to split (by page count or document boundaries)

Option 2: Use external tools

  • Adobe Acrobat
  • Preview on Mac

Job timeout

Very complex documents may time out before completion.

Signs of a timeout

  • Job shows "Failed" status
  • Partial results available
  • Error mentions "timeout"

Solutions

  • Split into smaller batches
  • Try a simpler extraction model
  • Process during off-peak hours

Unsupported content

Some page types may be skipped:

Content TypeBehavior
Blank pagesSkipped
Cover pages (no data)Skipped
Image-only pages (poor OCR)May have limited extraction
Complex layoutsMay have reduced accuracy

Checking what was processed

  1. Open the extraction results
  2. Check the page count in the summary
  3. Compare to original document pages
  4. Review any warnings or notes
Large documents

For documents over 100 pages, consider splitting proactively before upload. This gives you more control over what's extracted and makes review easier.