Category: Software

Search Inside a PDF in File Explorer

Reading Time: 3 minutes

A practical guide to fixing PDF content search on Windows using TET PDF IFilter

If you’ve ever tried to search inside a folder full of PDFs using Windows File Explorer, you’ve probably run into this maddening scenario:

  • You type content:keyword
  • Windows returns nothing
  • But when you open the PDF, the word is right there
  • Your PDF viewer (PDFgear, PDF‑XChange, Foxit) finds it instantly
  • Windows Search acts like the word doesn’t exist

This is not your imagination. It’s how Windows Search actually works — and why you need the right PDF iFilter.

This post explains:

  • Why Windows Search often fails to find text inside PDFs
  • Why your PDF viewer can find words Windows cannot
  • How to fix the problem permanently using TET PDF IFilter
  • When you need to re‑OCR PDFs
  • How to avoid re‑OCRing hundreds of files manually

1. Why Windows Search can’t find text inside many PDFs

Windows Search does not read the visible text on a PDF page.

It only indexes the hidden text layer inside the PDF — the layer created by OCR or embedded fonts.

This means:

  • If the PDF is a scan
  • If the OCR layer is missing
  • If the OCR layer is corrupted
  • If the OCR layer contains mis‑recognised characters
  • If the PDF uses weird embedded fonts
  • If only some pages were OCR’d

…then Windows Search will not find the word, even though you can see it.

Example from real life

A PDF titled How Bombay Was Ceded contains the author name:

J. H. GENSE

PDFgear finds “Gense” instantly.

Windows Search (content:gense) finds nothing.

Why?

Because the title page has no OCR text layer. The word “Gense” exists only visually — not in the text layer Windows indexes.

2. Why PDF viewers can find text Windows Search cannot

PDF viewers like PDFgear, Foxit, and PDF‑XChange use visual text extraction:

  • They search the rendered glyphs on the page
  • They don’t rely on the hidden OCR layer
  • They can find text even when the PDF’s internal text layer is broken

Windows Search cannot do this.

It only indexes what the PDF’s iFilter gives it — and if the iFilter sees nothing, Windows sees nothing.

3. The fix: Install a proper PDF iFilter (TET PDF IFilter)

Windows needs a PDF iFilter to read inside PDFs.

Adobe Reader used to provide one, but I don’t want Adobe on my system — and modern Windows versions don’t include a reliable PDF iFilter by default.

The best non‑Adobe solution:

TET PDF IFilter (PDFlib)

  • Free for personal use (old version 5.4)
  • 64‑bit native
  • Fast and accurate
  • Works with Windows Search indexing
  • No Adobe components required
  • Ideal for large collections of historical or scanned PDFs

Once installed, Windows Search can finally index PDF contents properly.

4. How to enable PDF content indexing

After installing TET:

  1. Open Control Panel → Indexing Options
  2. Click Advanced
  3. Go to File Types
  4. Scroll to .pdf
  5. Ensure it says: Filter Description: TET PDF IFilter
  6. Select Index Properties and File Contents
  7. Rebuild the index (optional but recommended)

Now Windows Search can read inside PDFs — if the text layer exists.

5. Why some PDFs still won’t index (even with TET installed)

If a PDF has no text layer, Windows Search still cannot index it.

This is common in:

  • DjVu → PDF conversions
  • Old scanned books
  • Microfilm scans
  • PDFs with partial OCR
  • PDFs with broken embedded fonts
  • PDFs where only some pages were OCR’d

In these cases, Windows Search will only find:

  • Words in the filename
  • Words in the metadata
  • Words in pages that do have a text layer

But not words that exist only visually.

6. How to fix PDFs with missing or broken text layers

You have three options:

A. Re‑OCR the PDF

This regenerates a clean text layer.

PDFgear can do this, but doing it manually for 200+ files is painful.

B. Batch‑OCR everything automatically

Use a tool like:

  • OCRmyPDF (free, open‑source)
  • ABBYY FineReader PDF (paid, industrial‑strength)

These tools can process entire folders unattended.

C. Use a search tool that ignores the text layer

If you don’t want to fix the PDFs:

  • PDF‑XChange Editor
  • DocFetcher
  • Recoll

These tools search the rendered text, not the OCR layer.

7. When to add folders to the Windows index

If you want instant search results:

  • Add your PDF folders to Indexing Options → Modify

If you don’t:

  • Windows will still search them, but it will do a live scan
  • This is slower but works fine for occasional searches

8. Summary: The reliable workflow

Here’s the practical, repeatable setup:

✔ Install TET PDF IFilter

This gives Windows the ability to read PDF contents.

✔ Ensure .pdf is set to “Index Properties and File Contents”

This enables content indexing.

✔ Add your PDF folders to the Windows index (optional)

This makes searches instant.

✔ Re‑OCR only the PDFs that matter

Or batch‑OCR everything once.

✔ Use PDFgear/PDF‑XChange for visual searches

These tools can find text Windows cannot.

9. The bottom line

Windows Search is powerful — but only when paired with a proper PDF iFilter and clean text layers.

If a PDF contains text visually but not in its OCR layer, Windows Search will never find it. TET PDF IFilter fixes the indexing side. OCR fixes the PDF side.

Once both are in place, content:keyword becomes a reliable, fast way to search inside thousands of PDFs.

How to remove “You’re almost done setting up your PC”

Reading Time: < 1 minute

Every few weeks, Windows 11 pops up a startup message, “You’re almost done setting up your PC”. It’s easy to dismiss with the “Remind me in 3 days” — and it is a lot more than three days that you will see it again. I can live with it, but many are irritated at the constant attempts to upsell unwanted services. I don’t trust any online service to store my files or passwords, so it does irritate me for a second or two.

Some PCs show a slightly different wording – “Let’s finish setting up your PC”

Windows prompt to finish setting up the PC

The Solution

Go to to Settings: Open the Start menu, go to Settings.

In Notifications: Select System, then Notifications.

Disable Suggestions: Scroll down to Additional settings, expand it and uncheck the following option:

“Suggest ways to get the most out of Windows and finish setting up this device”

Disabling Windows setup prompt

That will disable this irritation permanently.

How to Create Large Checkbox Lists in Word Using Segoe MDL2 Assets

Reading Time: 2 minutes

If you’ve ever tried to build a visually clean checklist in Microsoft Word using a serif font like Bitter, you’ve probably run into this problem: standard Unicode checkbox symbols render too small. Even the usual suspects ☐, ◻, □ look tiny and misaligned next to body text.

Here’s the fix: use Segoe MDL2 Assets, a built-in Windows font that contains large, crisp checkbox icons designed for UI elements. With one quick setup, you can create reusable checklist styles that look professional, scale well, and work perfectly with any body font.

Why Segoe MDL2 Assets Works

Segoe MDL2 Assets is a system font used by Windows to render interface icons. It includes a large hollow checkbox glyph that’s visually balanced and clean. Unlike emoji or shaded squares, it renders consistently across Word documents and prints beautifully.

Step-by-Step: Create a Large Checkbox Bullet Style in Word

1. Open the “Define New Bullet” dialog

  • Go to the Home tab
  • Click the dropdown next to the Bullets icon
  • Choose Define New Bullet…

2. Click “Symbol…”

This opens the symbol picker.

3. Manually enter the font name

In the Font box, type:

Segoe MDL2 Assets

(You won’t find it in the dropdown — you must type it manually.)

4. Scroll to find the checkbox symbol

Look for a large, clean, hollow checkbox.
It’s usually near the top third of the grid.
Select it and click OK.

5. Save as a reusable style

  • Right-click any paragraph using your new bullet
  • Choose Styles → Save Selection as a New Quick Style
  • Name it something like:

Checklist – Large Box

6. Set your body font to Bitter

Select the checklist text and apply Bitter as the font.
Your bullet stays large and clean, while your text uses your preferred typeface.

Result

You now have a checklist that looks like this:

Large checkboxes in Word …and it will indent cleanly across levels if you adjust the list formatting.

Bonus: Fixing Indentation for Sub-Items

If your second-level items don’t indent properly:

  • Right-click the item → Adjust List Indents…
  • Set Bullet position and Text indent to your preferred spacing
  • Reapply the checkbox symbol if needed

This ensures sub-items align visually and maintain the same checkbox style.

Why This Method Beats Unicode Hacks

  • No shading or emoji artifacts
  • Works with any body font
  • Scales cleanly across print and digital
  • Easy to reuse as a style
  • Professional appearance for publishing, documentation, or editorial workflows

If you want to add a second bullet level with a filled checkbox or a tick icon, you can define a multilevel list using the same Segoe MDL2 Assets font.

 

Mastodon