About OCR
OCR (Optical Character Recognition) extracts text from images and scanned PDFs across 13 languages. Powered by Tesseract running in WebAssembly — your files never upload, even though the OCR engine is several megabytes. The first run downloads language data; after that it's offline.
- No uploads
- Browser-only
- Works offline
- 100% free
How it works
- 1
Drop an image or PDF
JPG, PNG, WebP, or PDF (each page is recognised separately).
- 2
Pick the language
13 languages including English, Spanish, French, German, Chinese, Japanese, Korean, Arabic, Hindi, Russian.
- 3
Copy or download
Recognised text appears with confidence and word count. Copy to clipboard or save as .txt.
Related tools
Browse allFrequently asked questions
How accurate is the OCR?
For clean printed text, 90%+ word accuracy. For low-resolution photos or handwriting, accuracy drops sharply. Always proofread before publishing.
Why is the first run slower?
Tesseract downloads ~5 MB of language training data on first use. The browser caches it; subsequent runs are instant.
Are my files uploaded to a server?
No. Every tool on SnapToolz runs entirely inside your browser using JavaScript and WebAssembly. Your file is read locally, processed in memory, and the result is offered as a download. Nothing is sent to a server — there isn't one to send to.
Does it work offline?
Yes. SnapToolz is a Progressive Web App. After your first visit, the app is cached on your device and every tool keeps working without an internet connection.