Question 1

What is a soft 404?

Accepted Answer

A soft 404 is a page that tells a human "this content does not exist" but tells crawlers everything is fine by returning HTTP 200 (or by redirecting a missing URL to the homepage). Google then wastes crawl budget on it and may report it as "Soft 404" in Search Console. The fix is to return a real 404 or 410 status for content that is genuinely gone.

Question 2

How does soft404scan detect a soft 404?

Accepted Answer

It fetches your URL and, at the same time, a made-up URL in the same directory that is guaranteed not to exist. It then compares the two: the HTTP status code, whether either redirects to the homepage, and how similar the page content is. If your URL returns 200 but is the same page a non-existent URL returns — or its title/heading says "not found", or it redirects to the homepage like a missing page does — that is a soft 404. Every rule is published in the methodology.

Question 3

Is it free?

Accepted Answer

Yes — free, no account, no sign-up. Enter a URL and get an instant verdict. We keep no logs of the URLs you check.

Question 4

Why does this matter for SEO?

Accepted Answer

Search engines have a limited crawl budget per site. Soft 404s burn that budget on pages that should not be indexed, can keep dead URLs lingering in the index, and muddy your coverage reports. Returning a clean 404/410 lets engines drop missing pages quickly and spend crawl budget on pages that matter.

Question 5

Can I use it to confirm I fixed a soft 404?

Accepted Answer

Yes — that is a primary use case. After you change a missing URL to return a real 404/410, paste it here again: a "True 404" verdict confirms the fix is live and that engines will now drop the URL cleanly.

Question 6

Does it run JavaScript / work on SPAs?

Accepted Answer

No — it reads the server-rendered HTML, like a crawler that does not execute JavaScript. Many single-page apps serve the same HTML shell for every URL, including ones that do not exist, so the static comparison can look identical. soft404scan detects that case and tells you to verify in a browser or Search Console rather than giving a false "soft 404" verdict.

Question 7

Is this the same heuristic Google uses?

Accepted Answer

It reproduces the well-known, published approach (compare a URL against a guaranteed-missing baseline by status and content similarity). It is not Google's private classifier, so treat the result as a strong, transparent signal — not a guarantee of exactly what Search Console will say.

Question 8

Is my data safe? Any SSRF concerns?

Accepted Answer

The scan runs on Cloudflare and only fetches public http(s) URLs; requests to private, loopback, link-local and cloud-metadata addresses are blocked, redirects are re-validated on every hop, and responses are size- and time-capped. We keep no logs of what you check.

Is it a real 404
— or a soft 404?

How a soft 404 gives itself away

Status code

The missing-URL baseline

Content match

Not-found wording

Redirect-to-home

JS-app honesty

Every rule, in the open

Frequently asked questions

Is it a real 404— or a soft 404?