You need a specific quote from a contract you signed sometime last year. You have 200 contracts in your archive. The old workflow: open each one, search inside, give up after 15.
New workflow: OS searches all 200 simultaneously. Find the quote in 10 seconds.
Make sure PDFs are searchable
PDFs come in two flavours: text-based (real text, fully searchable) and image-based (scanned, just pixels). Image-based PDFs are not searchable without OCR.
For scanned PDFs you'll need to find later, run OCR before archiving. Most scanners offer OCR; cloud services like Adobe and Google Drive can OCR existing PDFs.
Enable OS-level PDF content indexing
macOS Spotlight indexes PDF content by default. Windows Search needs to be configured to include PDF content (Indexing Options → Advanced → File Types → PDF → Index Properties and File Contents).
Once indexed, OS search finds content inside PDFs along with filenames. Type a quote, see every PDF containing it.
Use dedicated tools for advanced search
For frequent content search, dedicated tools like DEVONthink (Mac) or PDF-XChange (Windows) offer more powerful search than OS-level. Useful for legal, research, or compliance work with thousands of PDFs.
Most users don't need these — OS search plus good naming covers 95% of cases.
Combine content search with filename search
The fastest searches combine both. 'Find Acme NDA from 2024 mentioning indemnity' = filename starts with `2024-` plus `Acme` plus contains `indemnity`.
Good naming gives you the rough date and counterparty; content search finds the specific clause. The two together are powerful.
FAQ
What's the fastest way to search a specific PDF?
Open in any PDF viewer, Cmd/Ctrl+F. Searches the current PDF instantly. For across-folder search, use OS-level search.
Why aren't my scanned PDFs searchable?
Because they're images, not text. Run OCR (built into many tools including Adobe and Google Drive) to convert pixels into searchable text.
Can I search PDFs in the cloud without downloading?
Google Drive, OneDrive, and Dropbox all OCR and index PDFs server-side. Search works directly from their interfaces.
What if my PDF has both text and images?
Text portions are searchable; image portions aren't. OCR the image portions to make the whole document searchable.
Good search starts with good archiving and OCR. Set up your PDFs in Flint so future you can find anything.