PDF Resume Parser

Handle any PDF resume format

Parse PDF resumes including scanned documents, multi-column layouts, and complex formatting.

How it works

1. PDF analysis

Detect the PDF type: native text, scanned image, or hybrid.

2. Text extraction

Extract text with the method suited to each PDF type: native text extraction for text-based PDFs and OCR for scanned pages.

3. Layout understanding

AI identifies multi-column layouts, tables, and sections.

4. Structured data

Output clean JSON regardless of PDF complexity.
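
To make step 1 concrete, here is a minimal sketch of how a PDF might be classified as native, scanned, or hybrid. It is an illustration under stated assumptions, not the parser's actual method: the function names, the per-page input shape (`textChars`, `imageCoverage`), and both thresholds are hypothetical.

```javascript
// Hypothetical heuristic, not the service's real detection logic.
// Assumed input: one entry per page, where
//   textChars     = number of characters in the page's embedded text layer
//   imageCoverage = fraction (0..1) of the page area covered by images
function classifyPage(page, minChars = 50, minCoverage = 0.8) {
  if (page.textChars >= minChars) return 'native';        // real text layer present
  if (page.imageCoverage >= minCoverage) return 'scanned'; // page is mostly one image
  return 'empty';                                          // blank or nearly blank page
}

function classifyPdf(pages) {
  const kinds = new Set(pages.map((page) => classifyPage(page)));
  kinds.delete('empty'); // blank pages do not affect the document type
  if (kinds.size === 0) return 'unknown';
  if (kinds.size === 2) return 'hybrid'; // both native and scanned pages found
  return kinds.has('native') ? 'native' : 'scanned';
}
```

For example, a two-page document whose first page has a text layer and whose second page is a full-page image would be classified as hybrid under these assumed thresholds.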

Key benefits

Multiple PDF types

Native, scanned, and hybrid PDFs supported.

Layout handling

Multi-column and table layouts supported.

OCR support

Scanned resumes processed with OCR.

Fast processing

Quick turnaround on PDF parsing.

Code examples

cURL example
curl -X POST https://<base-url>/v1/parse \
  -H "authorization: Bearer <api-key>" \
  -F "file=@resume.pdf"
JavaScript example
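// pdfFile is a File or Blob, e.g. from an <input type="file"> element
const formData = new FormData();
formData.append('file', pdfFile);

// fetch sets the multipart Content-Type (with boundary) from the FormData body
const response = await fetch('https://<base-url>/v1/parse', {
  method: 'POST',
  headers: { authorization: 'Bearer <api-key>' },
  body: formData
});

// Surface HTTP errors instead of reading fields from an error body
if (!response.ok) {
  throw new Error(`Parse request failed with status ${response.status}`);
}

const parsed = await response.json();
console.log(parsed.firstName, parsed.lastName);
console.log(parsed.workExperiences);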
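
The example above reads `firstName`, `lastName`, and `workExperiences` from the response. As a hedged sketch, a small helper can read those same fields defensively. The helper name `summarizeParsedResume` is hypothetical, and the idea that a field may be missing (for instance on a low-quality scan) is an assumption, not documented behavior.

```javascript
// Hypothetical helper: builds a short summary from a parsed resume object.
// Only the three fields shown in the example above are used; the structure
// of each work experience entry is not assumed.
function summarizeParsedResume(parsed) {
  // Join whichever name parts are present, skipping missing or empty ones
  const name = [parsed.firstName, parsed.lastName].filter(Boolean).join(' ');
  // Treat a missing or non-array workExperiences as an empty list
  const experiences = Array.isArray(parsed.workExperiences)
    ? parsed.workExperiences
    : [];
  return { name: name || null, experienceCount: experiences.length };
}
```

Calling `summarizeParsedResume(parsed)` after `response.json()` gives a value that is safe to log or display even when some fields are absent.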

Technical details

Our PDF parser combines native text extraction, OCR for scanned documents, and computer vision for layout analysis. Text in complex multi-column layouts is sequenced in its correct reading order.
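
To illustrate what sequencing a multi-column layout involves, here is a simplified sketch that groups text blocks into columns by horizontal position and then reads each column top to bottom. It is not the parser's actual algorithm: the function name, the block shape (`text`, `x`, `y`), and the `columnGap` threshold are all assumptions made for illustration.

```javascript
// Hypothetical sketch of column-aware reading order.
// Assumed input: blocks with a top-left position, where y grows downward.
function orderByColumns(blocks, columnGap = 50) {
  // Sort left to right so blocks in the same column end up adjacent
  const sortedByX = [...blocks].sort((a, b) => a.x - b.x);
  const columns = [];
  for (const block of sortedByX) {
    const last = columns[columns.length - 1];
    // Start a new column when the block sits far to the right of the current one
    if (!last || block.x - last.x > columnGap) {
      columns.push({ x: block.x, blocks: [block] });
    } else {
      last.blocks.push(block);
    }
  }
  // Read each column top to bottom, columns left to right
  return columns.flatMap((column) =>
    column.blocks.sort((a, b) => a.y - b.y).map((block) => block.text)
  );
}
```

A plain top-to-bottom sort would interleave the two columns of a resume. Grouping by column first keeps each column's text together.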

Frequently asked questions

Can you parse scanned PDF resumes?
Yes, our built-in OCR handles scanned resumes with high accuracy. We automatically detect whether a PDF needs OCR.

How do you handle multi-column layouts?
Our layout analysis AI identifies columns and reading order, so data is extracted in the right sequence.

Do you support creative resume formats?
We handle most creative formats, including infographic resumes, though highly artistic designs may have lower accuracy.

What is the maximum file size?
PDFs up to 1 MB are supported. Most resumes are well under this limit.
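
Because of the size limit, it can help to check a file before uploading it. The sketch below is one way to do that. The helper name `checkPdfSize` is hypothetical, and reading "1 MB" as 1,000,000 bytes is an assumption chosen as the conservative interpretation, since the exact byte count is not stated here.

```javascript
// Assumption: 1 MB is taken as 1,000,000 bytes, the stricter reading of the limit
const MAX_PDF_BYTES = 1000000;

// Hypothetical helper: reports whether a file of the given size fits the limit
function checkPdfSize(sizeInBytes, maxBytes = MAX_PDF_BYTES) {
  if (!Number.isInteger(sizeInBytes) || sizeInBytes < 0) {
    throw new TypeError('sizeInBytes must be a non-negative integer');
  }
  return { ok: sizeInBytes <= maxBytes, sizeInBytes, maxBytes };
}
```

In a browser, `checkPdfSize(pdfFile.size).ok` could gate the upload so that oversized files are rejected before any request is sent.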

Parse PDFs now

Try our PDF parser with your most challenging resumes.

Contact us