PDF Vector

Image Q&A API for developers

Ask questions and get AI-powered answers about any image. Perfect for understanding screenshots, diagrams, photos, and scanned documents with natural language Q&A.

  • Visual + Text Understanding: Combines OCR with visual AI to understand both text content and visual context in images
  • Intelligent Analysis: AI analyzes the entire image to provide accurate, contextual answers to your questions
  • Natural Language Interface: Ask questions in plain English and get detailed, markdown-formatted responses

Easy to use APIs

Use our simple APIs directly or our TypeScript SDK with just a few lines of code.

Image Ask

API Docs
import { readFile } from "fs/promises";
import { PDFVector } from "pdfvector";

const client = new PDFVector({
  apiKey: "pdfvector_xxxxxxx" // replace with your own API key
});

// From URL: ask a question about a hosted image
const urlResult = await client.ask({
  url: "https://example.com/document.jpg",
  prompt: "What are the payment terms in this contract?"
});

// From file: send the raw bytes along with their content type
const fileResult = await client.ask({
  data: await readFile("document.jpg"),
  contentType: "image/jpeg",
  prompt: "Summarize the key findings in this report"
});
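
The two calls above take either a `url` or a `data` plus `contentType` pair, always with a `prompt`. As a minimal sketch, that shape can be checked locally before a request is sent. The type names and the `validateAskRequest` helper below are hypothetical and are not part of the SDK; the http/https and `image/` checks are assumptions about what an image endpoint would accept.

```typescript
// Hypothetical types modeled on the two call shapes shown above;
// the SDK's real type definitions may differ.
type AskFromUrl = { url: string; prompt: string };
type AskFromData = { data: Uint8Array; contentType: string; prompt: string };
type AskRequest = AskFromUrl | AskFromData;

// Returns a list of problems; an empty list means the request looks well-formed.
function validateAskRequest(req: AskRequest): string[] {
  const problems: string[] = [];
  if (req.prompt.trim() === "") {
    problems.push("prompt must not be empty");
  }
  if ("url" in req) {
    try {
      const parsed = new URL(req.url);
      // Assumption: only http(s) URLs are fetchable by a remote service.
      if (parsed.protocol !== "http:" && parsed.protocol !== "https:") {
        problems.push("url must use http or https");
      }
    } catch {
      problems.push("url is not a valid URL");
    }
  } else {
    if (req.data.length === 0) {
      problems.push("data must not be empty");
    }
    // Assumption: the image endpoint expects an image/* content type.
    if (!req.contentType.startsWith("image/")) {
      problems.push("contentType must be an image MIME type");
    }
  }
  return problems;
}
```

Running this check before calling `client.ask` can surface an empty prompt or a malformed URL without a network round trip.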

What people are saying

See how PDF Vector is helping teams improve their document processing workflows

Abdo El-Mobayad

Can't recommend PDF Vector enough! It boosts your AI workflow accuracy to 100% while dropping your costs! Especially if you're a T4 Org in the $150/m spend range!

Trent

Gotta give a shoutout to PDF Vector team for helping me set up PDF Vector for a project. They even delivered on a feature request before I purchased. Incredible customer service. 👏

Praneeth Pike

Been implementing RAG and changing a lot of things under the hood for @rabbitholesai, came across PDF Vector and it was a huge time saver. I got a document parsing solution for the rag pipeline within minutes! one less thing to worry about

AI-Powered Image Q&A

Transform images into intelligent conversational interfaces. Our AI understands text, visual context, and relationships to provide accurate answers about any image.

Get started

Visual Understanding

Advanced AI analyzes the whole image: it reads text, understands layouts, identifies objects, and grasps visual context.

Natural Language Q&A

Ask questions in plain language and get human-readable answers about any aspect of the image, from text content to visual elements.

Multi-Format Support

Works with screenshots, photos, scanned documents, diagrams, charts, and more. Supports PNG, JPEG, GIF, BMP, and TIFF formats.
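
When sending a file rather than a URL, the request needs a content type. The sketch below maps the file extensions of the formats listed above to their standard MIME types; `contentTypeForFile` is a hypothetical helper, not something the SDK provides.

```typescript
// Standard MIME types for the image formats listed above.
const IMAGE_CONTENT_TYPES = new Map<string, string>([
  ["png", "image/png"],
  ["jpg", "image/jpeg"],
  ["jpeg", "image/jpeg"],
  ["gif", "image/gif"],
  ["bmp", "image/bmp"],
  ["tif", "image/tiff"],
  ["tiff", "image/tiff"],
]);

// Hypothetical helper: returns the MIME type for a file name,
// or undefined when the extension is missing or not in the list.
function contentTypeForFile(fileName: string): string | undefined {
  const dot = fileName.lastIndexOf(".");
  if (dot < 0 || dot === fileName.length - 1) {
    return undefined;
  }
  const extension = fileName.slice(dot + 1).toLowerCase();
  return IMAGE_CONTENT_TYPES.get(extension);
}
```

A caller could pass the result as `contentType` and skip files for which the helper returns `undefined`.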

Contextual Intelligence

Goes beyond simple OCR: it understands relationships and layout meaning, and can make intelligent inferences from visual information.

Example Output

Real examples of questions and AI-generated answers from images

Original Document

With invoice

Output

AI-generated answer to your question

Question

What information is shown in this invoice?

Answer

Here’s a breakdown of the information shown in the invoice:

### **Vendor (Seller) Information**
- **Company Name:** East Repair Inc.
- **Address:** 1912 Harvest Lane, New York, NY 12210

### **Billing (Customer) Information**
- **Name:** John Smith
- **Address:** 2 Court Square, New York, NY 12210

### **Shipping Information**
- **Name:** John Smith
- **Address:** 3787 Pineview Drive, Cambridge, MA 12210

### **Invoice Details**
- **Invoice Number:** (Missing, but likely intended to be **2312/2019**)
- **Invoice Date:** **11/02/2019** (November 2, 2019)
- **P.O. Number:** (Blank)
- **Due Date:** **26/02/2019** (February 26, 2019) *(Note: This seems inconsistent with the invoice date—likely a typo, as the due date should be after the invoice date.)*

### **Line Items (Products/Services)**
| **Qty** | **Description**             | **Unit Price** | **Amount** |
|---------|-----------------------------|----------------|------------|
| 1       | Front and rear brake cables | $100.00        | $100.00    |
| 2       | New set of pedal arms       | $15.00         | $30.00     |
| 3       | Labor (3 hrs)               | $5.00/hr       | $15.00     |

### **Financial Summary**
- **Subtotal:** $145.00
- **Sales Tax (6.25%):** $9.06
- **Total Due:** **$154.06**

### **Payment Terms & Conditions**
- **Payment Due:** Within 15 days
- **Payable To:** East Repair Inc. (checks accepted)

### **Notes on Potential Errors**
1. **Due Date Issue:** The due date (**26/02/2019**) is *before* the invoice date (**11/02/2019**), which is likely a typo. It should probably be **26/11/2019** (November 26, 2019) or **11/03/2019** (March 11, 2019).
2. **Invoice Number Placement:** The invoice number appears misaligned (under "P.O.#" instead of "INVOICE #").
3. **P.O.# Field:** Left blank.

Would you like help correcting or reformatting this invoice?
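
Because the answer comes back as markdown, it can be post-processed like any other markdown text. The sketch below splits an answer such as the one above into sections keyed by their `###` headings. It assumes the answer is available as a plain string; `splitAnswerSections` is a hypothetical helper, and the field that carries the answer in the SDK response is not shown here.

```typescript
// Hypothetical helper: splits a markdown answer into sections keyed by
// the text of each "###" heading (bold markers removed).
// Text before the first heading is ignored.
function splitAnswerSections(markdown: string): Map<string, string> {
  const sections = new Map<string, string>();
  let currentTitle: string | null = null;
  let body: string[] = [];

  // Stores the section collected so far, if any.
  const flush = (): void => {
    if (currentTitle !== null) {
      sections.set(currentTitle, body.join("\n").trim());
    }
  };

  for (const line of markdown.split(/\r?\n/)) {
    const heading = /^###\s+(.*)$/.exec(line);
    if (heading) {
      flush();
      currentTitle = heading[1].replace(/\*\*/g, "").trim();
      body = [];
    } else if (currentTitle !== null) {
      body.push(line);
    }
  }
  flush();
  return sections;
}
```

With the sample answer above, `splitAnswerSections(answer).get("Financial Summary")` would return the three bullet lines under that heading. Any closing text after the last section, such as a follow-up question, ends up in the final section's body.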

One subscription, all APIs

Start for free, then scale as you grow. No hidden fees.

Save one month with annual billing

Free

$0

Credit Card Required

Perfect for testing and small projects

  • Access to all APIs
  • 100 credits

Basic

$23/month

$275 billed annually

Great for personal projects and small businesses

  • Access to all APIs
  • 3,000 credits

Pro

$89/month

$1067 billed annually

Most popular plan for growing businesses

  • Access to all APIs
  • 100,000 credits

Enterprise

$457/month

$5489 billed annually

For large-scale applications and enterprises

  • Access to all APIs
  • 500,000 credits

Ready to Make Your Images Conversational?

Transform your images into conversational knowledge sources. Start asking questions about any image and get intelligent AI-powered answers in seconds.

No setup fees • Integrate in minutes • Cancel anytime

Everything you need for document processing

Transform any document into clean text. Extract structured data from PDFs, Word files, Excel spreadsheets, and images with precision and speed. You can also search 5M+ academic papers and fetch individual papers by ID.

Get started for free today

Frequently asked questions