Discussion:
Mod Rewrite to eliminate broken links caused by Disqus
(too old to reply)
akirah
2012-09-25 16:38:09 UTC
Permalink
As noted elsewhere, when Google crawls a Wordpress site with Disqus enabled, Google attempts to index some links that are dynamically generated by Disqus (these links do not appear in the page source, so I presume these are Javascript-based links).

So for example, Google Webmaster Tools attempts to crawl URL (A) below and throws a Page Not Found error because the correct URL (B) has been modified by Disqus:

(A) www.example.com/blog/2012/09/blog-post-title/2147423647/1346789815000

(B) www.example.com/blog/2012/09/blog-post-title/

The Dynamic URL created by Disqus always includes the "2147423647" component; these digits do not change. The "1346789815000" portion may or may not exist, and the digits change from page-to-page.

I'd like to use mod-rewrite so that attempts to access URLs of the form (A) get a 301 redirect to (B) so that I stop getting crawl errors.

Please advise. Note that I'm a mod-rewrite noob so any and all help is appreciated! Thanks in advance.
akirah
2012-09-25 21:29:00 UTC
Permalink
I believe I've found a fix that works:

RewriteRule ^([0-9]{4})/([0-9]{2})/([^/]+)/[0-9]+ /blog/$1/$2/$3/ [L,R=301]
Loading...