About PDF to Excel
Pull tabular data out of a PDF into an editable Excel workbook, entirely in your browser. SnapToolz uses pdfjs-dist to extract every positioned text run, then runs a heuristic table detector that clusters items into rows by Y-position, finds consistent column anchors across rows, and snaps each row's cells to those anchors. The result is a .xlsx file you can open directly in Excel, Google Sheets, Numbers, or LibreOffice Calc — no upload, no watermark. The trade-off is honest: PDFs don't store tables as tables, only as positioned glyphs, so this works very well on PDFs that started as spreadsheets or invoices and less well on PDFs of free-flowing prose. The output is always editable text — no broken formulas, no proprietary lock-in.
- No uploads
- Browser-only
- Works offline
- 100% free
How it works
- 1
Drop your PDF
Up to 100 MB. Single PDF. Each page is parsed in sequence with progress.
- 2
Pick the layout
One worksheet per PDF page (default), or all pages combined into one sheet with blank-row separators.
- 3
Convert and download
Output is a standards-compliant .xlsx file. Open in Excel, Google Sheets, Numbers, or LibreOffice Calc.
Related tools
Browse allFrequently asked questions
Are my files uploaded to a server?
No. Every tool on SnapToolz runs entirely inside your browser using JavaScript and WebAssembly. Your file is read locally, processed in memory, and the result is offered as a download. Nothing is sent to a server — there isn't one to send to.
How accurate is the table detection?
Excellent on PDFs that were originally exported from spreadsheets, accounting software, or invoice generators — those have consistent column positions that the algorithm picks up cleanly. Lower accuracy on PDFs with free-flowing text, multi-line cells, merged cells, or complex layouts. The algorithm uses no machine learning — just geometry — so its behaviour is predictable and inspectable.
What about merged cells?
Merged cells in the source PDF aren't honoured — each visual region renders as a single cell at its top-left position; downstream cells stay empty in the row. After conversion, you can re-merge in Excel if needed.
Why does my output have empty cells in unexpected places?
When the detector finds N column anchors, every row is padded out to N columns. If a particular row was visually shorter than the table's normal column count, the missing columns appear as empty cells. Toggle the “skip empty rows” option to drop rows that are entirely blank.
Will numbers in my PDF appear as numbers in Excel?
Cells are written as text values, not numbers. Excel treats them as strings — you can use Excel's Text-to-Columns or VALUE() formula to convert them. We default to text because PDF text doesn't reliably distinguish “1,234.56” (a number with thousands separator) from “1.234,56” (a European-format number) from “1234, 56” (two unrelated cells).
Why is my scanned PDF empty?
Scanned PDFs are images of tables, not actual text — the extractor has nothing to read. Run the scanned PDF through the OCR tool first to add a text layer, then convert that to Excel.
Does it work offline?
Yes. SnapToolz is a Progressive Web App. After your first visit, the app is cached on your device and every tool keeps working without an internet connection.
Is SnapToolz free?
Yes — every tool is 100% free with no sign-up, no watermark, no hidden tier. The whole platform is open source and we have no plan to gate features.