Understanding robots.txt on a URL || Web crawlers
Published: 2021-05-07 14:23:17 Category: Technical articles

  • robots.txt

    The robots exclusion standard, also known as the robots exclusion protocol or simply robots.txt, is a standard used by websites to communicate with web crawlers and other web robots.

    The standard specifies how to tell a web robot which areas of a website should not be processed or scanned. Search engines often use robots to categorize websites.
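
As a rough illustration of how a crawler might consult these rules, here is a minimal sketch using Python's standard `urllib.robotparser` module. The robots.txt content, the `ExampleBot` user agent, and the `example.com` URLs are all hypothetical and chosen only for the example; a real crawler would normally download the file from the site's `/robots.txt` path instead of using an inline string.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt content, used only for illustration.
ROBOTS_TXT = """\
User-agent: *
Disallow: /private/
"""


def build_parser(robots_text: str) -> RobotFileParser:
    """Parse robots.txt text that is already in memory (no network access)."""
    parser = RobotFileParser()
    parser.parse(robots_text.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)

# "ExampleBot" is an assumed crawler name; it falls under the "*" rule group.
allowed_public = parser.can_fetch("ExampleBot", "https://example.com/index.html")
allowed_private = parser.can_fetch("ExampleBot", "https://example.com/private/data.html")

print(allowed_public)   # path is not covered by any Disallow rule
print(allowed_private)  # path falls under "Disallow: /private/"
```

Note that the file only expresses the site's wishes: it is up to each robot to read the rules and honor them, which is why the check happens in the crawler's own code.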
