alexm Posted July 5, 2021 Share Posted July 5, 2021 Does anyone else get a lot on /http404/ from crawlers like facebook? See screenshot. I'm just trying to think why I'd be getting so many and replicate the issue. Link to comment Share on other sites More sharing options...
horst Posted July 5, 2021 Share Posted July 5, 2021 You should inspect what the bots tried to find. Maybe this will shed some light? You can use the jumplinks module for this, for example. 2 Link to comment Share on other sites More sharing options...
alexm Posted July 5, 2021 Author Share Posted July 5, 2021 Perfect! Cheers @horst Jumplinks looks to be perfect for monitoring it. If it's anything of interest, I'll report back. Otherwise thankyou and have a good evening! 1 Link to comment Share on other sites More sharing options...
wbmnfktr Posted July 5, 2021 Share Posted July 5, 2021 Just in case it gets worse: https://processwire.com/blog/posts/optimizing-404s-in-processwire/ 2 Link to comment Share on other sites More sharing options...
alexm Posted July 5, 2021 Author Share Posted July 5, 2021 @wbmnfktr Ah awesome, thank you! That's a great post. I've taken those steps now as they definitely seem like good additions! 1 Link to comment Share on other sites More sharing options...
alexm Posted July 5, 2021 Author Share Posted July 5, 2021 Digging around in logs, I think I've managed to figure what's happening. If you type a URL on Facebook rather than paste the complete correct URL, it tries to request the URL as you type. So until the full, valid URL is completed it's trying to request a page that doesn't exist. 1 Link to comment Share on other sites More sharing options...
Recommended Posts
Create an account or sign in to comment
You need to be a member in order to leave a comment
Create an account
Sign up for a new account in our community. It's easy!
Register a new accountSign in
Already have an account? Sign in here.
Sign In Now