akirah
2012-09-25 16:38:09 UTC
As noted elsewhere, when Google crawls a Wordpress site with Disqus enabled, Google attempts to index some links that are dynamically generated by Disqus (these links do not appear in the page source, so I presume these are Javascript-based links).
So for example, Google Webmaster Tools attempts to crawl URL (A) below and throws a Page Not Found error because the correct URL (B) has been modified by Disqus:
(A) www.example.com/blog/2012/09/blog-post-title/2147423647/1346789815000
(B) www.example.com/blog/2012/09/blog-post-title/
The Dynamic URL created by Disqus always includes the "2147423647" component; these digits do not change. The "1346789815000" portion may or may not exist, and the digits change from page-to-page.
I'd like to use mod-rewrite so that attempts to access URLs of the form (A) get a 301 redirect to (B) so that I stop getting crawl errors.
Please advise. Note that I'm a mod-rewrite noob so any and all help is appreciated! Thanks in advance.
So for example, Google Webmaster Tools attempts to crawl URL (A) below and throws a Page Not Found error because the correct URL (B) has been modified by Disqus:
(A) www.example.com/blog/2012/09/blog-post-title/2147423647/1346789815000
(B) www.example.com/blog/2012/09/blog-post-title/
The Dynamic URL created by Disqus always includes the "2147423647" component; these digits do not change. The "1346789815000" portion may or may not exist, and the digits change from page-to-page.
I'd like to use mod-rewrite so that attempts to access URLs of the form (A) get a 301 redirect to (B) so that I stop getting crawl errors.
Please advise. Note that I'm a mod-rewrite noob so any and all help is appreciated! Thanks in advance.