Link rot is the gradual decay of hyperlinks as the destinations they point to disappear, change ownership, or fundamentally change purpose. A 2014 study of legal academic papers found that 50% of URLs cited had rotted within seven years. A 2021 study of New York Times articles found 25% of links from 1998 were dead by 2019. Whatever your content's longevity, a portion of the outbound links in it will eventually break — and the longer your content lives, the higher that portion gets.
You can't prevent external link rot. The destinations are outside your control. But you can detect it consistently, fix it efficiently, and stop accumulating rotten links faster than you fix them.
What Counts as Rot
External link rot includes more than just plain 404s:
- Hard 404s — destination explicitly returns "not found".
- Connection failures — domain has been taken down, DNS no longer resolves, or the server doesn't respond.
- SSL errors — destination's HTTPS certificate has expired or become invalid. (Run SSL Checker if you suspect this.)
- Domain squatters — the domain expired and was bought by a parking page or unrelated content. The link technically works but goes somewhere meaningless.
- Content drift — the page still exists but the content has changed completely. A link that pointed to an authoritative guide now lands on a generic homepage. HTTP-200 but contextually rotten.
- Soft 404s on the destination — the page returns 200 but the body says "page not found" (see our soft 404s guide).
- Redirects to unrelated content — the original URL now 301s to a sales page, the company's homepage, or an unrelated section of the site.
The first three are easy to detect with any link checker. The last four are harder and need either manual review or smarter detection (e.g. checking whether the destination's title still resembles what your link's anchor text describes).
Why External Rot Hurts You
You might assume that if a link rots, it just doesn't work — bad for the visitor, but no real cost to you. That's only partly true. The actual costs:
- Visitor frustration. A reader hits a dead link in your article, loses trust in the article's currency, and may not return.
- Search engine signals. Lots of broken outbound links signal an unmaintained site. Google has historically used outbound link quality as a ranking factor.
- Lost SEO value of the citation. When you cite a high-authority external source, search engines see your link as evidence your content is well-researched. A broken citation provides none of that signal.
- Spread of misinformation. If a domain gets repurposed by a squatter and your old link now points to spam, you're effectively endorsing spam from your archived content.
- Cumulative trust erosion. A few broken links is fine. A site with 30% of its outbound links broken looks abandoned even if everything else is fresh.
Building a Detection Pipeline
For small sites (under a few hundred pages), running Broken Link Finder against your sitemap once a month catches the bulk of rot. For larger sites, set up a scheduled scan:
- Crawl your site to enumerate every outbound link. Most CMS-driven sites embed external links in post content stored in the database — pulling these out via SQL is faster than crawling.
- Check each link's HTTP status. Use HEAD requests where possible to save bandwidth; fall back to GET if the destination doesn't support HEAD.
- Categorise the responses — 200 OK, 301 redirect (resolve to final), 4xx client errors, 5xx server errors, connection failures, SSL errors.
- Spot-check 200 responses for content drift. If the destination's
<title>tag has changed dramatically, flag for manual review. - Track historical state. A link that rotted last month is harder to fix than one that rotted last week (the original content may have been archived; the URL may have been re-bought).
Triage: Which Rot to Fix First
You'll find more rotten links than you have time to fix. Triage them by:
- Page traffic. A rotten link on your top-traffic post hurts more than the same link on an obscure archive page. Use analytics to rank affected pages.
- Recency of content. A rotten link in an article published last week looks worse than one in a 10-year-old archive. Visitors expect old content to age.
- Anchor text criticality. If your text reads "as detailed in this study", the link is loadbearing for the article's credibility. Fix urgently.
- Domain trustworthiness post-rot. A link to a former government website now pointing to a casino is a credibility hazard. Fix immediately, even on low-traffic pages.
Repair Strategies
For each rotten link, you have five options:
1. Replace with the current canonical URL
Many "rotten" links are just URL changes — the content moved to a new path. Search the destination domain for the cited material; if it's still there at a new URL, update the link. This is the cleanest fix.
2. Replace with a different authoritative source
If the original source is gone but the underlying information is still authoritative elsewhere, find an equivalent. The article's argument doesn't change; only the citation does.
3. Link to the Wayback Machine
The Internet Archive's Wayback Machine has snapshots of most public websites going back decades. If the original page is gone but was archived, link to the archive instead:
https://web.archive.org/web/2020*/example.com/original-url
The asterisk auto-resolves to the closest snapshot. Wayback links are stable — once a URL is archived, the archive URL doesn't change. Many publications now proactively archive every external link they cite to insulate against future rot.
4. Remove the link entirely
If the cited content is gone and no replacement exists, remove the link. Leaving the anchor text without the link is fine — readers won't notice, and the content is no worse without a citation than with a broken one. For longer-running content, a short note like "[archived source no longer available]" is sometimes appropriate.
5. Rewrite the section
If the link was load-bearing for an entire paragraph or section, the rotted link may mean the underlying content needs updating too. A study that's been retracted, statistics that have been superseded, or a guide that no longer reflects current practice — sometimes link rot is a symptom of content rot.
Preventing Rot Pile-Up
You can't stop external sites from disappearing, but you can keep your repair pace ahead of your rot rate.
- Schedule a monthly scan. A consistent cadence prevents the "haven't checked in three years" pile-up. Even 30 minutes a month is enough for most sites.
- Archive at publish time. When you publish a new post, immediately submit the cited URLs to the Wayback Machine via
web.archive.org/save/<URL>. Even if the original rots, you have a stable archive link to swap in. - Prefer authoritative domains. Wikipedia, government sites, peer-reviewed journals, and major publications rot far less often than personal blogs and small businesses. When you have a choice of citation, pick the more durable one.
- Avoid linking to URLs that look fragile. URLs with session tokens, search parameters, or numeric IDs are more likely to rot than clean canonical URLs. If you have a choice, link to the cleaner URL.
- Use a redirect/citation tracker. Some sites maintain an internal "citation registry" — every external link is stored once, and updates propagate. Overkill for a small site, valuable for one with thousands of articles.
Tooling
For one-off scans: Broken Link Finder, free, no signup.
For scheduled site-wide scans: a paid crawler like Screaming Frog (one-off purchase) or Ahrefs (subscription). Both cover thousands of URLs and produce filterable reports.
For Wayback Machine snapshots: https://web.archive.org/save/<URL> triggers a fresh snapshot. The Save Page Now browser extension makes this one-click.
For server-log-driven detection: parse access logs for outbound link clicks (if you proxy them) or use the Internet Archive's API to check whether each link's destination is still in active rotation.
The Honest Truth
External link rot is a tax on running content for any length of time. Articles published a decade ago will have a meaningful percentage of dead outbound links no matter how diligent you are. The goal isn't to eliminate rot — it's to keep the rot rate visible, to fix the highest-impact rot first, and to insulate your most important pages with archived snapshots.
A regular scan with Broken Link Finder is the single best habit. Fifteen minutes a month, applied consistently, is enough to keep most sites' outbound link health in good shape over the long term.