alexm Posted July 5, 2021 Posted July 5, 2021 Does anyone else get a lot on /http404/ from crawlers like facebook? See screenshot. I'm just trying to think why I'd be getting so many and replicate the issue.
horst Posted July 5, 2021 Posted July 5, 2021 You should inspect what the bots tried to find. Maybe this will shed some light? You can use the jumplinks module for this, for example. 2
alexm Posted July 5, 2021 Author Posted July 5, 2021 Perfect! Cheers @horst Jumplinks looks to be perfect for monitoring it. If it's anything of interest, I'll report back. Otherwise thankyou and have a good evening! 1
wbmnfktr Posted July 5, 2021 Posted July 5, 2021 Just in case it gets worse: https://processwire.com/blog/posts/optimizing-404s-in-processwire/ 2
alexm Posted July 5, 2021 Author Posted July 5, 2021 @wbmnfktr Ah awesome, thank you! That's a great post. I've taken those steps now as they definitely seem like good additions! 1
alexm Posted July 5, 2021 Author Posted July 5, 2021 Digging around in logs, I think I've managed to figure what's happening. If you type a URL on Facebook rather than paste the complete correct URL, it tries to request the URL as you type. So until the full, valid URL is completed it's trying to request a page that doesn't exist. 1
Recommended Posts
Create an account or sign in to comment
You need to be a member in order to leave a comment
Create an account
Sign up for a new account in our community. It's easy!
Register a new accountSign in
Already have an account? Sign in here.
Sign In Now