The meaning of the tag <meta name="robots" content=""> in HTML (transfer)

<meta name="robots" content="">

  <meta name="robots" content="noarchive">

  The above piece of code restricts all search engines from creating snapshots of your web pages. If we need to limit only one search engine to create snapshots, we can write as follows

  <meta name="Baiduspider" content="noarchive">

  It should be noted that such a mark only prohibits search engines from creating snapshots of your website. If you want to prohibit search engines from indexing your page, please refer to the following method.

  The second case: prohibit search engines from crawling this page.

  In SEO, it is often used to prohibit search engines from crawling this page or to allow search engines to crawl this page. So we need a discussion on this part of the focus.

  In order to prevent search engines from crawling this page, our general practice is to add the following code to the meta tag of the page:

  <META NAME="ROBOTS" CONTENT="NOINDEX,FOLLOW">

  Here, META NAME="ROBOTS" refers to all search engines in general, and here we can also refer to a search engine specifically, such as META NAME="Googlebot", META NAME="Baiduspide", etc. There are four commands in the content section: index, noindex, follow, and nofollow. The commands are separated by English ",".

  INDEX command: tell search engines to crawl this page

  FOLLOW command: Tell search engines to find links from this page, and then continue to visit and crawl.

  NOINDEX command: tell search engines not to crawl this page

  NOFOLLOW command: Tell search engines not to allow links to be found from this page and deny them further access.

  According to the above command, we have the following four combinations

  <META NAME="ROBOTS" CONTENT="INDEX,FOLLOW">: This page can be crawled and other links can be indexed along this page

  <META NAME="ROBOTS" CONTENT="NOINDEX,FOLLOW">: This page is not allowed to be crawled, but other links can be indexed along this page

  <META NAME="ROBOTS" CONTENT="INDEX,NOFOLLOW">: You can crawl this page, but you are not allowed to crawl other links in the index along this page

  <META NAME="ROBOTS" CONTENT="NOINDEX,NOFOLLOW">: Do not crawl this page or crawl other links to index along this page.

  It should be noted here that two opposite antonyms cannot be written together, such as

  <META NAME="ROBOTS" CONTENT="INDEX,NOINDEX">

  Or just write two sentences at the same time

  <META NAME="ROBOTS" CONTENT="INDEX,FOLLOW">

  <META NAME="ROBOTS" CONTENT="NOINDEX,FOLLOW">

  Here is a convenient way to write it, if it is

  <META NAME="ROBOTS" CONTENT="INDEX,FOLLOW"> can be written as:

  <META NAME="ROBOTS" CONTENT="ALL">

  if

  <META NAME="ROBOTS" CONTENT="NOINDEX,NOFOLLOW"> can be written as:

  <META NAME="ROBOTS" CONTENT="NONE">

  Of course, we can also write commands that prohibit snapshotting and search engines into a command meta tag. From the above article, we know that the command to prohibit the creation of webpage snapshots is noarchive, then we can write it in the following form: <META NAME="ROBOTS" CONTENT="INDEX,FOLLOW,noarchive">

  If it is not allowed to create snapshots for a single search engine, such as Baidu, we can write:

  <META NAME=" Baiduspider" CONTENT="INDEX,FOLLOW,noarchive">

  If you disdain the command about spiders in the meta tag, then the default command is as follows

  <META NAME="ROBOTS" CONTENT="INDEX,FOLLOW, archive">

  Therefore, if we are not sure about this part, we can directly write the above line of command, or leave it blank.

  In SEO, the control of spiders is a very important part of the content, so I hope you can accurately grasp this part of the content.

 

Guess you like

Origin http://43.154.161.224:23101/article/api/json?id=326015232&siteId=291194637