Jump to content

Recommended Posts

Posted

Hi

I notice if you add a hashtag after a PW domain, you can still access that page as opposed to a 404 page

IE
processwire.com/#
processwire.com/about/#

both load the correct page.

An SEO company one of my clients use has pointed out that the duplicate pages are getting indexed in Google.
Shouldn't the URLs above load the 404 page instead?

 

 

 

Posted

# and everything after that are technically not part of the site structure, but fragments in the (same) page (see links to #id). I'd imagine that site crawlers treat them as different pages, because of all those single page apps miss-using that for actually different pages.

  • Like 2
Posted

I vaguely remember this coming up a while ago when Google was changing the way it crawled AJAX enabled pages. I would check to see if Google is actually caching the page with the # as a separate page. If so, grabbing the url and determining if the # is present might help with creating a dynamically created meta tag to tell the crawler not to index the page.

Create an account or sign in to comment

You need to be a member in order to leave a comment

Create an account

Sign up for a new account in our community. It's easy!

Register a new account

Sign in

Already have an account? Sign in here.

Sign In Now
  • Recently Browsing   0 members

    • No registered users viewing this page.
×
×
  • Create New...