Need to get the words out of a photo? Whether it's a scanned receipt, a whiteboard snapshot, a screenshot of a document, or a photo of a sign, OCR (Optical Character Recognition) turns image pixels into actual text you can copy, edit, and search. This guide explains how to use the free Image to Text tool on this site — entirely in your browser, with no file ever leaving your device.
What Is OCR?
OCR stands for Optical Character Recognition. It's the technology that analyses an image and identifies the shapes of letters, digits, and punctuation to produce machine-readable text.
Modern OCR engines use neural networks trained on millions of printed and handwritten characters. They can recognise text in a wide range of fonts, sizes, languages, and image conditions — even at moderate angles or with some blur.
OCR is how:
- Banks digitise cheques and identity documents
- Google Search indexes text inside images
- Postal services read handwritten addresses on envelopes
- Archivists make millions of scanned books searchable
The same technology is available free in your browser here.
How to Extract Text from an Image — Step by Step
The Image to Text tool runs Tesseract OCR compiled to WebAssembly in your browser. No file is uploaded to any server.
Step 1 — Open the Image to Text tool
Go to want2convert.com/image-to-text. The page loads the Tesseract OCR engine. On the first visit this may take a few seconds as the WASM binary downloads.
Step 2 — Upload your image
Drag and drop your image into the upload zone or click to browse your device. Supported formats include JPG, PNG, WebP, and BMP. For scanned documents, use the highest resolution scan available — 300 DPI or higher produces the best results.
Step 3 — Select the language (if needed)
By default, the engine recognises English. If your image contains text in another language (French, Spanish, German, Portuguese, and others), select it from the language dropdown. Using the correct language model significantly improves accuracy for accented characters and language-specific letter patterns.
Step 4 — Click "Extract Text"
Click the button. Tesseract analyses the image pixel by pixel, identifies text regions, and converts them to characters. Processing typically takes 2–10 seconds depending on image size and text density.
Step 5 — Copy or download the result
The extracted text appears in a text area. You can:
- Select all and copy — paste directly into Word, Google Docs, or any text editor
- Download as TXT — save the result as a
.txtfile
What Makes a Good Source Image for OCR
OCR accuracy depends heavily on the quality of the input image. Here's what makes the biggest difference:
Resolution: Text needs enough pixels to be legible. Mobile phone photos of documents taken in good light at reasonable distance (30–50 cm) typically work well. The minimum practical resolution for clear small print is around 150–200 DPI; 300 DPI is the OCR standard for archival scanning.
Contrast: Dark text on a light background gives the engine the clearest signal. Faded ink, light grey text on white paper, or coloured text on a patterned background all reduce accuracy.
Sharpness: Blurry images — from camera shake or focus problems — are the single biggest cause of OCR errors. Use a stable surface when photographing documents. Phone macro modes work well for small-print documents.
Orientation: Text that is severely tilted or rotated reduces accuracy. The OCR engine can handle minor skew (a few degrees), but pages photographed at a steep angle perform worse. Use the Rotate Image tool to straighten your photo first if needed.
Noise and shadows: Creases, shadows across the page, and background textures reduce accuracy. Photograph documents flat on a light-coloured desk with even lighting.
Accuracy: What OCR Can and Can't Do
OCR is highly accurate on clean, high-contrast, printed documents — often 99%+ character accuracy on good input. The technology has real limits to understand though:
| Input type | Expected accuracy | |------------|------------------| | Clean scanned printed document (300 DPI) | 98–99%+ | | Sharp phone photo of printed text | 90–97% | | Faded or low-contrast print | 70–85% | | Handwriting (neat, consistent) | 60–80% | | Handwriting (cursive, varied) | 40–65% | | Heavily stylised fonts or logos | 30–60% | | Text in an image with heavy noise/background | 50–75% |
For printed text in good conditions, expect near-perfect results. Always proofread the output — a single character error in a number or name can have real consequences.
Common Uses for Image to Text
Scanned Documents and PDFs
Offices still deal with mountains of paper. When you receive a scanned contract, form, or letter as a PDF or image, OCR converts it into a searchable, editable document. After extracting the text, paste it into Word or Google Docs for editing, or save it as a file you can search with Ctrl+F.
Receipts and Invoices
Photographing receipts is a common workflow for expense tracking. OCR extracts the line items, totals, and vendor names so you can paste them directly into a spreadsheet or expense report without retyping.
Business Cards
A quick phone photo of a business card, run through OCR, gives you the contact's name, title, phone number, email, and company — ready to copy into your contacts app.
Screenshots of Articles and Web Pages
If you need to quote from a screenshot or copy text from an image-based PDF (a scanned book, a non-selectable PDF), OCR extracts it as actual text you can use.
Whiteboard Photos
After a meeting, photographs of whiteboard notes often sit in a camera roll never acted upon. OCR turns them into text files or searchable notes.
Book and Archive Digitisation
Researchers and genealogists scanning old books, newspapers, and historical records use OCR to make those archives searchable. The Tesseract engine used here is the same one powering major archival digitisation projects worldwide.
Translation Workflow
Extract text from an image first, then paste it into a translation tool. This is far faster than manually retyping text from a photo of a foreign-language document, menu, or sign.
OCR vs Copy-Paste: When to Use Each
If you're working with a digital PDF where the text layer is present, Ctrl+A + Ctrl+C (select all, copy) is faster and more accurate than OCR. Use OCR when:
- The PDF is a scanned image (no text layer)
- The source is a photo or screenshot
- The document doesn't allow text selection (locked PDFs — though you might also want to unlock it first)
- The text is in a physical document, sign, or whiteboard
Frequently Asked Questions
Is the OCR tool really free?
Yes, completely free. No account, no subscription, no watermark on the output.
Are my images safe?
Yes. Processing happens entirely in your browser using Tesseract WebAssembly. Your images never leave your device.
What languages does it support?
English is the default. Additional language models are available via the language selector including Spanish, French, German, Portuguese, Italian, Dutch, Polish, Russian, Chinese (Simplified and Traditional), Japanese, Korean, Arabic, and more.
How accurate is it?
For clean printed text at adequate resolution, accuracy is very high (95–99%). For handwriting or poor-quality images, accuracy is lower. Always proofread the output for critical documents.
Can it read handwriting?
Tesseract handles clear, block handwriting reasonably well, but accuracy varies significantly by handwriting style. It is not reliable for cursive script.
What's the maximum image size?
There's no hard limit — processing is local. In practice, very large images (50 MB+) may be slow. Resizing to 2000–3000 pixels wide with the Resize Image tool before OCR often speeds things up without reducing accuracy.
Can I extract text from a PDF?
If the PDF is image-based (scanned), save each page as a JPG or PNG first using the PDF to JPG tool, then run OCR on each image.
Why are some characters wrong?
OCR errors most often come from low resolution, blur, low contrast, or unusual fonts. Try improving the input image quality and re-running. For systematic errors (e.g. l being read as 1), manually correct the output after extraction.
Related Tools
- Resize Image — scale up a small image before OCR to improve accuracy
- Rotate Image — straighten a tilted photo before OCR
- PDF to JPG — convert scanned PDF pages to images for OCR
- Compress Image — reduce image size without losing the resolution that OCR needs
- JPG to PNG — convert to PNG for lossless quality before OCR on fine-print documents
- Crop Image — isolate the text area of a photo before running OCR
Try it free — no signup needed
All tools run in your browser. Your files never leave your device.