PDF Vector

ID Document Data Extraction API for developers

Extract structured data from passports, driver's licenses, and ID cards using AI. Define your JSON schema and get perfectly formatted data.

  • Custom JSON SchemaDefine any data structure you need - the AI extracts information to match your exact schema
  • Multiple Format SupportExtract data from PDFs, Word documents (DOCX), and image files (PNG, JPG) with consistent accuracy
  • Simple IntegrationClean REST API and TypeScript SDK for easy integration into any application

Easy to use APIs

Use our simple APIs directly or our TypeScript SDK with just a few lines of code.

ID Extract

API Docs
import { PDFVector } from "pdfvector";

const client = new PDFVector({
  apiKey: "pdfvector_xxxxxxx"
});

// From URL
const results = await client.idExtract({
  url: "https://example.com/passport.pdf",
  prompt: "Extract all personal information from this ID document",
  schema: {
    type: "object",
    properties: {
      fullName: { type: "string" },
      dateOfBirth: { type: "string" },
      documentNumber: { type: "string" },
      expirationDate: { type: "string" },
      nationality: { type: "string" }
    }
  }
});

// From file
import { readFile } from "fs/promises";
const results = await client.idExtract({
  data: await readFile("drivers-license.jpg"),
  contentType: "image/jpeg",
  prompt: "Extract driver's license information",
  schema: {
    type: "object",
    properties: {
      fullName: { type: "string" },
      licenseNumber: { type: "string" },
      expirationDate: { type: "string" },
      address: { type: "string" }
    }
  }
});

What people are saying

See how PDF Vector is helping teams improve their document processing workflows

Abdo El-Mobayad

Can't recommend PDF Vector enough! It boosts your AI workflow accuracy to 100% while dropping your costs! Especially if you're a T4 Org in the $150/m spend range!

Abdo El-Mobayad

Trent

Gotta give a shoutout to PDF Vector team for helping me set up PDF Vector for a project. They even delivered on a feature request before I purchased. Incredible customer service. 👏

Trent

Praneeth Pike

Been implementing RAG and changing a lot of things under the hood for @rabbitholesai, came across PDF Vector and it was a huge time saver. I got a document parsing solution for the rag pipeline within minutes! one less thing to worry about

Praneeth Pike

Extract Structured Data from ID Documents

Extract structured data from ID documents using AI. Define your own JSON schema and get perfectly formatted data that matches your exact requirements.

Get started

AI-Powered Extraction

Advanced AI understands ID document layouts and personal information to extract data accurately into your custom schema.

Custom JSON Schema

Define any JSON schema structure you need. Extract nested objects, arrays, specific fields, or complex identity data structures.

Database-Ready Output

Get structured data that's ready to insert into your database, process in your application, or integrate with your verification systems.

Identity Document Expertise

Specialized for passports, driver's licenses, and national ID cards. Handles various layouts and extracts personal information with precision.

Example Output

Real examples of structured data extracted from passports, driver's licenses, and ID cards based on custom JSON schemas

Original Document

Passport Extraction

Output

AI-generated answer to your question

Question

Extract all passport details

Schema

{
  "type": "object",
  "properties": {
    "documentType": {
      "type": "string"
    },
    "documentNumber": {
      "type": "string"
    },
    "firstName": {
      "type": "string"
    },
    "lastName": {
      "type": "string"
    },
    "dateOfBirth": {
      "type": "string"
    },
    "sex": {
      "type": "string"
    },
    "nationality": {
      "type": "string"
    },
    "placeOfBirth": {
      "type": "string"
    },
    "issueDate": {
      "type": "string"
    },
    "expirationDate": {
      "type": "string"
    },
    "issuingAuthority": {
      "type": "string"
    }
  }
}

Answer

{
  "documentType": "PASSPORT",
  "documentNumber": "0002028373",
  "firstName": "LI",
  "lastName": "JUAN",
  "dateOfBirth": "1982-05-01T00:00:00",
  "sex": "F",
  "nationality": "USA",
  "placeOfBirth": "NEW YORK CITY",
  "issueDate": "2019-05-09T00:00:00",
  "expirationDate": "2029-05-09T00:00:00",
  "issuingAuthority": "USA"
}

One subscription, all APIs

Start for free, then scale as you grow. No hidden fees.

Save one month

Free

$0

Credit Card Required

Perfect for testing and small projects

  • Access to all APIs
  • 100 credits
Subscribe to Free

Basic

$23/month

$275 billed annually

Great for personal projects and small businesses

  • Access to all APIs
  • 3,000 credits
Subscribe to Basic
Most Popular

Pro

$89/month

$1067 billed annually

Most popular plan for growing businesses

  • Access to all APIs
  • 100,000 credits
Subscribe to Pro

Enterprise

$457/month

$5489 billed annually

For large-scale applications and enterprises

  • Access to all APIs
  • 500,000 credits
Subscribe to Enterprise

Ready to Structure Your ID Document Data?

Transform unstructured ID documents into structured data for your applications. Define your schema and get perfectly formatted JSON in minutes.

No setup fees • Integrate in minutes • Cancel anytime

Everything you need for document processing

Transform any document into clean text. Extract structured data from PDFs, Word files, Excel spreadsheets, and images with precision and speed. Also search 5M+ academic papers and fetch papers by paper IDs.

Get started for free today

No setup fees • Integrate in minutes • Cancel anytime

Frequently asked questions