Google Webmaster Tools reports a bunch of crawl errors (404) for URLs to non-existent pages on my site. I never link to these URLs. How can I solve this problem? It is affecting my AdWords campaign. I tried a 302 redirect, but that did not work.
Related
I have an e-commerce website and I want to share a link on my Facebook profile.
When I do, I get a "404 - File or Directory not found" error, even though the URL works properly when you click on it. Below is exactly what I see:
I have tried:
Posting the URL with and without https://
https://developers.facebook.com/tools/debug/og/object
https://developers.facebook.com/tools/debug/sharing/
None of these solutions work!
The Sharing debugger shows this error:
But I don't understand how the error is legitimate, since I have the parameters it is asking for in my page's head tags.
What exactly is wrong?
I had the same problem at some point. The issue was that the team who manages our server had blocked Facebook's crawler and the search engine bots from accessing our website, and they forgot to undo this! Check whether this is the cause in your case.
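One rough way to test for this kind of block is to request the page with a browser-like user agent and with the crawlers' published user-agent strings, then compare the status codes. The sketch below assumes the block is based on the User-Agent header; a block based on IP ranges would not show up this way. The function names `http_status` and `blocked_agents` are made up for illustration.

```python
import urllib.error
import urllib.request

# User-agent strings published by Facebook and Google for their crawlers.
CRAWLER_AGENTS = {
    "browser": "Mozilla/5.0",
    "facebook": "facebookexternalhit/1.1 (+http://www.facebook.com/externalhit_uatext.php)",
    "googlebot": "Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)",
}


def http_status(url, user_agent):
    """Return the final HTTP status for url when requested with user_agent."""
    request = urllib.request.Request(url, headers={"User-Agent": user_agent})
    try:
        with urllib.request.urlopen(request, timeout=10) as response:
            return response.status
    except urllib.error.HTTPError as error:
        return error.code  # urllib raises 4xx/5xx responses as HTTPError


def blocked_agents(url, fetch_status=http_status, agents=CRAWLER_AGENTS):
    """Return the crawler names whose status differs from the browser's."""
    baseline = fetch_status(url, agents["browser"])
    return sorted(
        name
        for name, agent in agents.items()
        if name != "browser" and fetch_status(url, agent) != baseline
    )
```

Calling `blocked_agents("https://your-site.example/some-page")` (a placeholder URL) returns an empty list when every agent gets the same status as the browser. The `fetch_status` parameter exists only so the comparison can be tried without network access.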
On April 12th, Facebook upgraded the FB App we use to share new articles to our page to API version 2.6. Since then, stories get posted, but the image is usually not added to the story.
When I check with the opengraph debugger, I see strange errors like:
Circular Redirect
We could not resolve the canonical URL because the redirect path contained a cycle.
With redirect paths of:
Redirect Path
Input URL: https://www.bleepingcomputer.com/news/government/34-tech-firms-sign-accord-not-to-assist-government-hacking-operations/
301 HTTP Redirect: https://www.bleepingcomputer.com/news/government/34-tech-firms-sign-accord-not-to-assist-government-hacking-operations/
Even stranger, sometimes the redirect paths show urls like below, which are not from my site:
Input URL: https://www.bleepingcomputer.com/news/security/crooks-hijack-router-dns-settings-to-redirect-users-to-android-malware/
301 HTTP Redirect: https://www.bleepingcomputer.com/news/security/crooks-hijack-router-dns-settings-to-redirect-users-to-android-malware/?utm_content=70198165&utm_medium=social&utm_source=facebook
og:url Meta Tag: https://www.bleepingcomputer.com/news/security/crooks-hijack-router-dns-settings-to-redirect-users-to-android-malware/
Once I click on the scrape button again, it gets rid of the error, but the images still do not show in the post on my FB page.
It is almost as if the scraper is targeting URLs that are not mine, even though I am sending them correctly and I have the same urls in my og:url and canonical tags.
Any ideas?
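To reason about what the debugger calls a circular redirect, it can help to trace the hops yourself and stop as soon as a URL repeats. This is only a sketch of the idea, not how Facebook's scraper works internally. `next_hop` is a hypothetical callable that returns the redirect target for a URL, or None when the URL does not redirect. Here it is backed by a plain dictionary, but it could just as well issue a request without following redirects.

```python
def trace_redirects(start_url, next_hop, max_hops=10):
    """Follow redirects from start_url and return (path, cycle_found)."""
    path = [start_url]
    seen = {start_url}
    current = start_url
    for _ in range(max_hops):
        target = next_hop(current)
        if target is None:
            return path, False  # reached a URL that does not redirect
        path.append(target)
        if target in seen:
            return path, True  # the same URL appeared twice: a cycle
        seen.add(target)
        current = target
    return path, False  # gave up after max_hops without finding a repeat


# The first debugger output above, where the URL 301-redirects to itself.
article = (
    "https://www.bleepingcomputer.com/news/government/"
    "34-tech-firms-sign-accord-not-to-assist-government-hacking-operations/"
)
observed_hops = {article: article}
path, cycle_found = trace_redirects(article, observed_hops.get)
```

In the second output, the redirect adds utm_* parameters and the og:url tag points back to the clean URL, so the same kind of loop appears one step later.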
It's only happening on a handful of URLs. For example:
https://gateless.com/articles/generate-leads/happy-thanksgiving-to-all-115-million-american-households
Facebook Debugger:
https://developers.facebook.com/tools/debug/sharing/?q=https%3A%2F%2Fgateless.com%2Farticles%2Fgenerate-leads%2Fhappy-thanksgiving-to-all-115-million-american-households
But this one works great:
https://gateless.com/articles/generate-leads/your-blueprint-for-converting-inbound-leads
Facebook Debugger:
https://developers.facebook.com/tools/debug/sharing/?q=https%3A%2F%2Fgateless.com%2Farticles%2Fgenerate-leads%2Fyour-blueprint-for-converting-inbound-leads
Via DNS, this site is only accessible within the US.
It was the SSL certificate. We reissued one from within AWS and now the URLs scrape OK. Don't buy SSL certificates from GeoTrust.
The articles are throwing an error:
This site can’t be reached
gateless.com’s server DNS address could not be found.
Since your site is inaccessible, the crawler is unable to fetch the data for og tags.
The second URL seems to be working as it is showing cached data.
You can click on "See exactly what our scraper sees for your URL" in the Debugger to see what content is being scraped.
In this case, the page is blank which is a good indication that the crawler is unable to see the content or your page is not returning the correct data that the crawler expects.
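If you save the HTML that the scraper received, a quick way to see which og tags it actually contains is to parse the meta elements yourself. The following is a minimal sketch using only the standard library. `OpenGraphParser` and `extract_og_tags` are illustrative names, and the parsing rules (first occurrence wins, only `property="og:..."` is read) are my assumptions, not Facebook's documented behaviour.

```python
from html.parser import HTMLParser


class OpenGraphParser(HTMLParser):
    """Collect <meta property="og:..." content="..."> pairs from a page."""

    def __init__(self):
        super().__init__()
        self.tags = {}

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        attributes = dict(attrs)
        prop = attributes.get("property") or ""
        if prop.startswith("og:") and "content" in attributes:
            # Keep the first value seen for each property.
            self.tags.setdefault(prop, attributes["content"])


def extract_og_tags(html):
    """Return a dict of og:* properties found in the given HTML string."""
    parser = OpenGraphParser()
    parser.feed(html)
    return parser.tags
```

An empty result for a page that should have og tags matches the "blank page" symptom described above: the scraper never received the markup.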
I made an error when bulk 301 redirecting from an old domain to a new domain with the same URL structure.
Googlebot followed the 301 redirect for each page on the old site to my new site, which returned a 404 error. I tested it in a browser and it worked for a user, but somehow it did not work for Googlebot, and I detected it too late.
I have now fixed the error, and Googlebot should (hopefully) be able to access the pages at their new URLs.
Question: will Google recrawl 301 redirects that led to a 404?
They'll crawl it for a little while but eventually the 404 status will tell them the page is gone and they will stop crawling it and remove it from their index. If you fixed the error before they stopped crawling the original URL then they will follow the redirect and associate the new URL with the old URL.
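Since the window for fixing this is limited, it may be worth auditing the redirects rather than spot-checking in a browser. The sketch below assumes, as in the question, that the new domain uses the same URL structure as the old one. `final_status` is a hypothetical callable that returns the HTTP status for a URL, ideally fetched with a Googlebot-like user agent, since that is where the problem showed up. The host names in the example are placeholders.

```python
from urllib.parse import urlsplit, urlunsplit


def expected_target(old_url, new_host):
    """Map an old URL onto the new host, keeping scheme, path and query."""
    parts = urlsplit(old_url)
    return urlunsplit(parts._replace(netloc=new_host))


def broken_redirects(old_urls, new_host, final_status):
    """Return (old_url, target, status) for every target that is not 200."""
    broken = []
    for old_url in old_urls:
        target = expected_target(old_url, new_host)
        status = final_status(target)
        if status != 200:
            broken.append((old_url, target, status))
    return broken


# Example with a stand-in status lookup instead of real requests.
known_statuses = {"https://new.example/a": 200, "https://new.example/b": 404}
report = broken_redirects(
    ["https://old.example/a", "https://old.example/b"],
    "new.example",
    lambda url: known_statuses.get(url, 404),
)
```

Any entry in `report` is an old URL whose redirect would currently land on an error page.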
Today, I tried to post a link from my website on my wall, but no OG meta information was fetched. Therefore, I went on to Facebook URL Linter to check things for myself and see if OG meta tags are fetched properly.
To my surprise, every link on my website including the domain itself, generated Response code: 302.
My OG tags are set correctly and were working fine a few days ago. The following is what the debugger shows for my domain: http://www.price-tag.org
Response Code: 302
Fetched URL: http://price-tag.org/
Canonical URL: http://price-tag.org/
Final URL: http://price-tag.org/WpjZW/
For every page on my website, the Facebook linter appends an arbitrary string like the WpjZW above.
Please let me know whether this is a Facebook error or something has gone wrong on my end.
It seems the linter is being sent on a circular redirect loop. It is seeing a 302 redirect.
For http://www.price-tag.org I don't see anything but a 200 response code when I navigate to your website using Firefox.
However, if I remove the www from the url and go to http://price-tag.org, I do see the strange behaviour of the 302s and then a random url like http://price-tag.org/UKXRN/
I would suggest you contact your server admin or webmaster to find out what they have in place that is producing this 302 redirect and the random characters in the URL.
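To reproduce the difference between the www and bare hosts outside a browser, you could request both forms without following redirects and compare the status and Location header. This is a rough sketch. `host_variants`, `first_response` and `compare_hosts` are names I made up, and the `fetch` parameter is there only so the comparison can be exercised without network access.

```python
import http.client
from urllib.parse import urlsplit, urlunsplit


def host_variants(url):
    """Return the bare-domain and www forms of url, in that order."""
    parts = urlsplit(url)
    host = parts.netloc
    bare = host[4:] if host.startswith("www.") else host
    return [urlunsplit(parts._replace(netloc=name)) for name in (bare, "www." + bare)]


def first_response(url):
    """Return (status, Location header) for url without following redirects."""
    parts = urlsplit(url)
    if parts.scheme == "https":
        connection = http.client.HTTPSConnection(parts.netloc, timeout=10)
    else:
        connection = http.client.HTTPConnection(parts.netloc, timeout=10)
    try:
        target = parts.path or "/"
        if parts.query:
            target += "?" + parts.query
        connection.request("GET", target)
        response = connection.getresponse()
        return response.status, response.getheader("Location")
    finally:
        connection.close()


def compare_hosts(url, fetch=first_response):
    """Map each host variant of url to its (status, Location) pair."""
    return {variant: fetch(variant) for variant in host_variants(url)}
```

If the bare domain returns a 302 with a changing Location while the www host returns 200, that points at a server-side rewrite rule or injected redirect rather than at Facebook.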