Search engine bots do not index pages from sitemap.xml

Date: 2019-12-03 10:13:09

Tags: indexing google-analytics sitemap robots.txt google-search-console

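I submitted a sitemap for my site, but not all of its URLs have been indexed. A large number of URLs are not indexed by Google, and I don't know why.

At the moment, 716 URLs are not indexed.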

[screenshot]

Looking at which URLs are not indexed, these are some examples:

[screenshot]

All of these URLs are fully accessible. If you click on any of them, the page loads correctly:

https://www.calzadosniza.es/es/mujer/zapatos-mujer/zapato-descubierto-puntera-charol-ancho-juan-mastre-108-7920#/62-tallas_grandes-40/116-color-azul
https://www.calzadosniza.es/es/mujer/sandalias-mujer/sandalia-cuna-pala-cruzada-combi-plata-glenda-porronet-6551-porronet-8751#/62-tallas_grandes-40/114-color-blanco
https://www.calzadosniza.es/es/mujer/botas-y-botines-mujer/bota-militar-cordon-piso-volumen-2670-tekila-3999#/63-tallas_grandes-41/113-color-negro

If I inspect one of them, for example: https://www.calzadosniza.es/es/mujer/zapatos-mujer/zapato-descubierto-puntera-charol-ancho-juan-mastre-108-7920#/62-tallas_grandes-40/116-color-azul

I get this result:

[screenshot]

My robots.txt file is:

# Allow Directives
Allow: */modules/*.css
Allow: */modules/*.js
Allow: */modules/*.png
Allow: */modules/*.jpg
# Private pages
Disallow: /*?orderby=
Disallow: /*?orderway=
Disallow: /*?tag=
Disallow: /*?id_currency=
Disallow: /*?search_query=
Disallow: /*?back=
Disallow: /*?n=
Disallow: /*&orderby=
Disallow: /*&orderway=
Disallow: /*&tag=
Disallow: /*&id_currency=
Disallow: /*&search_query=
Disallow: /*&back=
Disallow: /*&n=
Disallow: /*controller=addresses
Disallow: /*controller=address
Disallow: /*controller=authentication
Disallow: /*controller=cart
Disallow: /*controller=discount
Disallow: /*controller=footer
Disallow: /*controller=get-file
Disallow: /*controller=header
Disallow: /*controller=history
Disallow: /*controller=identity
Disallow: /*controller=images.inc
Disallow: /*controller=init
Disallow: /*controller=my-account
Disallow: /*controller=order
Disallow: /*controller=order-slip
Disallow: /*controller=order-detail
Disallow: /*controller=order-follow
Disallow: /*controller=order-return
Disallow: /*controller=order-confirmation
Disallow: /*controller=pagination
Disallow: /*controller=password
Disallow: /*controller=pdf-invoice
Disallow: /*controller=pdf-order-return
Disallow: /*controller=pdf-order-slip
Disallow: /*controller=product-sort
Disallow: /*controller=search
Disallow: /*controller=statistics
Disallow: /*controller=attachment
Disallow: /*controller=guest-tracking
# Directories
Disallow: */cache/
Disallow: */classes/
Disallow: */config/
Disallow: */controllers/
Disallow: */css/
Disallow: */download/
Disallow: */js/
Disallow: */localization/
Disallow: */log/
Disallow: */mails/
Disallow: */modules/
Disallow: */override/
Disallow: */pdf/
Disallow: */src/
Disallow: */tools/
Disallow: */translations/
Disallow: */upload/
Disallow: */vendor/
Disallow: */web/
Disallow: */webservice/
# Files
Disallow: /*es/password-recovery
Disallow: /*es/address
Disallow: /*es/addresses
Disallow: /*es/login
Disallow: /*es/cart
Disallow: /*es/discount
Disallow: /*es/order-history
Disallow: /*es/identity
Disallow: /*es/my-account
Disallow: /*es/order-follow
Disallow: /*es/credit-slip
Disallow: /*es/order
Disallow: /*es/search
Disallow: /*es/guest-tracking
Disallow: /*es/order-confirmation
Disallow: /*ca/password-recovery
Disallow: /*ca/address
Disallow: /*ca/addresses
Disallow: /*ca/login
Disallow: /*ca/cart
Disallow: /*ca/discount
Disallow: /*ca/order-history
Disallow: /*ca/identity
Disallow: /*ca/my-account
Disallow: /*ca/order-follow
Disallow: /*ca/credit-slip
Disallow: /*ca/order
Disallow: /*ca/search
Disallow: /*ca/guest-tracking
Disallow: /*ca/order-confirmation
Disallow: /*gl/password-recovery
Disallow: /*gl/address
Disallow: /*gl/addresses
Disallow: /*gl/login
Disallow: /*gl/cart
Disallow: /*gl/discount
Disallow: /*gl/order-history
Disallow: /*gl/identity
Disallow: /*gl/my-account
Disallow: /*gl/order-follow
Disallow: /*gl/credit-slip
Disallow: /*gl/order
Disallow: /*gl/search
Disallow: /*gl/guest-tracking
Disallow: /*gl/order-confirmation
Disallow: /*eu/password-recovery
Disallow: /*eu/address
Disallow: /*eu/addresses
Disallow: /*eu/login
Disallow: /*eu/cart
Disallow: /*eu/discount
Disallow: /*eu/order-history
Disallow: /*eu/identity
Disallow: /*eu/my-account
Disallow: /*eu/order-follow
Disallow: /*eu/credit-slip
Disallow: /*eu/order
Disallow: /*eu/search
Disallow: /*eu/guest-tracking
Disallow: /*eu/order-confirmation
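
One way to rule out robots.txt as the cause is to test a URL against the rules above. The following is only a minimal sketch of wildcard matching (`*` and a trailing `$`) with longest-match precedence, as described in Google's robots.txt documentation. It assumes every rule applies to the crawler, since the listing shows no `User-agent` line, and the function names and the `SAMPLE_RULES` excerpt are chosen here for illustration.

```python
import re
from urllib.parse import urlsplit


def rule_to_regex(pattern):
    """Turn a robots.txt path pattern into a regex anchored at the path start."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # '*' matches any run of characters; everything else is literal.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def parse_rules(robots_txt):
    """Collect (is_allow, pattern) pairs, ignoring comments and other fields."""
    rules = []
    for raw in robots_txt.splitlines():
        line = raw.split("#", 1)[0].strip()
        if ":" not in line:
            continue
        field, value = (s.strip() for s in line.split(":", 1))
        if field.lower() in ("allow", "disallow") and value:
            rules.append((field.lower() == "allow", value))
    return rules


def is_allowed(url, rules):
    """Apply the longest matching rule; on a tie, Allow wins."""
    parts = urlsplit(url)  # the '#...' fragment is not part of the path
    path = parts.path or "/"
    if parts.query:
        path += "?" + parts.query
    best = None  # (pattern length, is_allow)
    for allow, pattern in rules:
        if rule_to_regex(pattern).match(path):
            candidate = (len(pattern), allow)
            if best is None or candidate > best:
                best = candidate
    return True if best is None else best[1]


# A short excerpt of the rules from the question (not the full file).
SAMPLE_RULES = parse_rules("""
# Allow Directives
Allow: */modules/*.css
# Private pages
Disallow: /*?orderby=
Disallow: /*controller=cart
# Directories
Disallow: */modules/
# Files
Disallow: /*es/cart
Disallow: /*es/order
""")

product_url = (
    "https://www.calzadosniza.es/es/mujer/zapatos-mujer/"
    "zapato-descubierto-puntera-charol-ancho-juan-mastre-108-7920"
    "#/62-tallas_grandes-40/116-color-azul"
)
print(is_allowed(product_url, SAMPLE_RULES))
```

With this excerpt, the product URL is not matched by any Disallow rule, which suggests looking elsewhere for the cause. Running the same check against the complete file would be the more reliable test.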

So why aren't all of these URLs indexed after I submitted the sitemap in Google Search Console?

Am I doing something wrong?

1 answer:

Answer 0: (score: 1)

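A sitemap is not a directive to search engines; it is only a suggestion. Search engines may not crawl every page, so use the "priority" field in the sitemap.

Try manually checking the HTML of the unindexed pages; they may contain a tag that forbids indexing: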
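
As a rough illustration of the "priority" field, the sketch below builds a small sitemap with Python's standard library. The `build_sitemap` helper and the example.com URLs are hypothetical. The sitemap protocol defines `<priority>` as a value from 0.0 to 1.0; it is only a hint, and Google's sitemap documentation says it ignores this value, so it is unlikely to be enough on its own.

```python
import xml.etree.ElementTree as ET

SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"


def build_sitemap(entries):
    """Build a sitemap XML string from (url, priority) pairs."""
    urlset = ET.Element("urlset", xmlns=SITEMAP_NS)
    for loc, priority in entries:
        # The protocol only allows priorities between 0.0 and 1.0.
        if not 0.0 <= priority <= 1.0:
            raise ValueError(f"priority out of range for {loc}: {priority}")
        url = ET.SubElement(urlset, "url")
        ET.SubElement(url, "loc").text = loc
        ET.SubElement(url, "priority").text = f"{priority:.1f}"
    return ET.tostring(urlset, encoding="unicode")


# Hypothetical URLs, for illustration only.
sitemap_xml = build_sitemap([
    ("https://www.example.com/es/mujer/", 0.8),
    ("https://www.example.com/es/mujer/zapatos-mujer/", 0.5),
])
print(sitemap_xml)
```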

<meta name="robots" content="noindex" />
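
Checking many pages by hand is tedious, so the check can be scripted. This is a minimal sketch using Python's standard `html.parser`; the class and function names are made up for illustration, and it assumes the page HTML has already been downloaded into a string. It only looks at `<meta>` tags, not at an `X-Robots-Tag` HTTP header.

```python
from html.parser import HTMLParser


class RobotsMetaParser(HTMLParser):
    """Collect directives from <meta name="robots"> and <meta name="googlebot">."""

    def __init__(self):
        super().__init__()
        self.directives = set()

    def handle_starttag(self, tag, attrs):
        # Also called for self-closing tags such as <meta ... />.
        if tag != "meta":
            return
        attr = {name: (value or "") for name, value in attrs}
        if attr.get("name", "").lower() in ("robots", "googlebot"):
            for token in attr.get("content", "").split(","):
                self.directives.add(token.strip().lower())


def has_noindex(html):
    """Return True if the HTML asks crawlers not to index the page."""
    parser = RobotsMetaParser()
    parser.feed(html)
    # "none" is shorthand for "noindex, nofollow".
    return "noindex" in parser.directives or "none" in parser.directives


blocked = '<html><head><meta name="robots" content="noindex" /></head></html>'
indexable = '<html><head><meta name="robots" content="index, follow"></head></html>'
print(has_noindex(blocked), has_noindex(indexable))
```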