{"id":2657,"date":"2024-08-19T11:15:42","date_gmt":"2024-08-19T11:15:42","guid":{"rendered":"https:\/\/ewebtoolz.com\/blog\/crawl-me-maybe-how-website-crawlers-work\/"},"modified":"2024-08-19T11:15:42","modified_gmt":"2024-08-19T11:15:42","slug":"crawl-me-maybe-how-website-crawlers-work","status":"publish","type":"post","link":"https:\/\/ewebtoolz.com\/blog\/crawl-me-maybe-how-website-crawlers-work\/","title":{"rendered":"Crawl Me Maybe? How Website Crawlers Work"},"content":{"rendered":"<p> <br \/>\n<\/p>\n<div id=\"\">\n<p> You might have heard of website crawling before \u2014 you may even have a vague idea of what it\u2019s about \u2014 but do you know why it\u2019s important, or what differentiates it from web crawling? (yes, there is a difference!)\u00a0<\/p>\n<p>Search engines are increasingly ruthless when it comes to the quality of the sites they allow into the search results.<\/p>\n<p>If you don\u2019t grasp the basics of optimizing for web crawlers (and eventual users), your organic traffic may well pay the\u00a0price.<\/p>\n<p>A good web<span style=\"text-decoration: underline;\">site<\/span> crawler can show you how to protect and even enhance your site\u2019s visibility.<\/p>\n<p>Here\u2019s what you need to know about both web crawlers and site crawlers.<\/p>\n<p>A web crawler is a software program or script that automatically scours the internet, analyzing and indexing web\u00a0pages.<\/p>\n<p>Also known as a web spider or spiderbot, web crawlers assess a page\u2019s content to decide how to prioritize it in their indexes.<\/p>\n<p>Googlebot, Google\u2019s web crawler, meticulously browses the web, following links from page to page, gathering data, and processing content for inclusion in Google\u2019s search engine.<\/p>\n<h3 class=\"wp-block-heading\"><a id=\"post-178325-_hp80z6yb58oh\"\/>How do web crawlers impact SEO?<\/h3>\n<p>Web crawlers analyze your page and decide how indexable or rankable it is, which ultimately determines your ability to drive organic traffic.<\/p>\n<p>If you want to be discovered in search results, then it\u2019s important you ready your content for crawling and indexing.<\/p>\n<div class=\"recommendation\">\n<p>Did you\u00a0know?<\/p>\n<div class=\"recommendation-content\"> <a href=\"https:\/\/ahrefs.com\/robot\">AhrefsBot<\/a> is a web crawler that:<\/p>\n<ul class=\"wp-block-list\">\n<li>Visits over 8 billion web pages every 24\u00a0hours<\/li>\n<li>Updates every 15\u201330 minutes<\/li>\n<li>Is the #1 most active SEO crawler (and 4th most active crawler worldwide)<\/li>\n<\/ul>\n<figure class=\"wp-block-image\"><img fetchpriority=\"high\" decoding=\"async\" width=\"2048\" height=\"1226\" src=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/graphic-showing-ahrefsbot-crawler-as-the-1-most-a-2.jpg\" alt=\"Graphic showing AhrefsBot crawler as the #1 most active SEO crawler and #4 most active web crawler in the world\" class=\"wp-image-178361\" srcset=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/graphic-showing-ahrefsbot-crawler-as-the-1-most-a-2.jpg 2048w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/graphic-showing-ahrefsbot-crawler-as-the-1-most-a-2-680x407.jpg 680w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/graphic-showing-ahrefsbot-crawler-as-the-1-most-a-2-768x460.jpg 768w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/graphic-showing-ahrefsbot-crawler-as-the-1-most-a-2-1536x920.jpg 1536w\" sizes=\"(max-width: 2048px) 100vw, 2048px\"\/><\/figure>\n<\/div>\n<\/div>\n<div class=\"post-nav-link clearfix\" id=\"section1\"><a class=\"subhead-anchor\" data-tip=\"tooltip__copielink\" rel=\"#section1\"><svg width=\"19\" height=\"19\" viewbox=\"0 0 14 14\" style=\"\"><g fill=\"none\" fill-rule=\"evenodd\"><path d=\"M0 0h14v14H0z\"\/><path d=\"M7.45 9.887l-1.62 1.621c-.92.92-2.418.92-3.338 0a2.364 2.364 0 0 1 0-3.339l1.62-1.62-1.273-1.272-1.62 1.62a4.161 4.161 0 1 0 5.885 5.884l1.62-1.62L7.45 9.886zM5.527 5.135L7.17 3.492c.92-.92 2.418-.92 3.339 0 .92.92.92 2.418 0 3.339L8.866 8.473l1.272 1.273 1.644-1.643A4.161 4.161 0 1 0 5.897 2.22L4.254 3.863l1.272 1.272zm-.66 3.998a.749.749 0 0 1 0-1.06l2.208-2.206a.749.749 0 1 1 1.06 1.06L5.928 9.133a.75.75 0 0 1-1.061 0z\" style=\"\"\/><\/g><\/svg><\/a><\/p>\n<div class=\"link-text\" data-anchor=\"How do web crawlers actually work?\" data-section=\"web-crawlers-work\">\n<h2 class=\"wp-block-heading\" id=\"web-crawlers-work\"><a id=\"post-178325-_ipdhfcfygpql\"\/>How do web crawlers actually work?<\/h2>\n<\/div>\n<\/div>\n<p>There are roughly seven stages to web crawling:<\/p>\n<h3 class=\"wp-block-heading\"><a id=\"post-178325-_1khc7cvozvho\"\/>1. URL Discovery<\/h3>\n<p>When you publish your page (e.g. to your sitemap), the web crawler discovers it and uses it as a \u2018seed\u2019 URL. Just like seeds in the cycle of germination, these starter URLs allow the crawl and subsequent crawling loops to\u00a0begin.<\/p>\n<h3 class=\"wp-block-heading\"><a id=\"post-178325-_dnt3yksn4b0t\"\/>2. Crawling<\/h3>\n<p>After URL discovery, your page is scheduled and then crawled. Content like meta tags, images, links, and structured data are <strong>downloaded<\/strong> to the search engine\u2019s servers, where they await parsing and indexing.<\/p>\n<h3 class=\"wp-block-heading\"><a id=\"post-178325-_iym2irt0pa0l\"\/>3. Parsing<\/h3>\n<p>Parsing essentially means <strong>analysis<\/strong>. The crawler bot extracts the data it\u2019s just crawled to determine how to index and rank the\u00a0page.<\/p>\n<h3 class=\"wp-block-heading\"><a id=\"post-178325-_x9lgu7hxq0ru\"\/>3a. The URL Discovery Loop<\/h3>\n<p>Also during the parsing phase, but worthy of its own subsection, is the URL discovery loop. This is when newly discovered links (including links discovered via redirects) are added to a queue of URLs for the crawler to visit. These are effectively new \u2018seed\u2019 URLs, and steps 1\u20133 get repeated as part of the \u2018URL discovery loop\u2019.<\/p>\n<h3 class=\"wp-block-heading\"><a id=\"post-178325-_261szsn2g657\"\/>4. Indexing<\/h3>\n<p>While new URLs are being discovered, the original URL gets indexed. Indexing is when search engines store the data collected from web pages. It enables them to quickly retrieve relevant results for user queries.<\/p>\n<h3 class=\"wp-block-heading\"><a id=\"post-178325-_d3319lc3mohf\"\/>5. Ranking<\/h3>\n<p>Indexed pages get ranked in search engines based on quality, relevance to search queries, and ability to meet certain other ranking factors. These pages are then served to users when they perform a search.<\/p>\n<h3 class=\"wp-block-heading\"><a id=\"post-178325-_vkg35vmry3bj\"\/>6. Crawl\u00a0ends<\/h3>\n<p>Eventually the entire crawl (including the URL rediscovery loop) ends based on factors like time allocated, number of pages crawled, depth of links followed etc.<\/p>\n<h3 class=\"wp-block-heading\"><a id=\"post-178325-_2w3bglshtafw\"\/>7. Revisiting<\/h3>\n<p>Crawlers periodically <strong>revisit <\/strong>the page to check for updates, new content, or changes in structure.<\/p>\n<figure class=\"wp-block-image\"><noscript><img decoding=\"async\" width=\"1600\" height=\"1808\" src=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/graphic-showing-a-7-step-flow-diagram-of-how-web-c-2.jpg\" alt=\"Graphic showing a 7 step flow diagram of how web crawlers work\" class=\"wp-image-178362\" srcset=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/graphic-showing-a-7-step-flow-diagram-of-how-web-c-2.jpg 1600w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/graphic-showing-a-7-step-flow-diagram-of-how-web-c-2-376x425.jpg 376w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/graphic-showing-a-7-step-flow-diagram-of-how-web-c-2-768x868.jpg 768w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/graphic-showing-a-7-step-flow-diagram-of-how-web-c-2-1359x1536.jpg 1359w\" sizes=\"(max-width: 1600px) 100vw, 1600px\"\/><\/noscript><img decoding=\"async\" width=\"1600\" height=\"1808\" src=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/graphic-showing-a-7-step-flow-diagram-of-how-web-c-2.jpg\" alt=\"Graphic showing a 7 step flow diagram of how web crawlers work\" class=\"lazyload wp-image-178362\" srcset=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/graphic-showing-a-7-step-flow-diagram-of-how-web-c-2.jpg 1600w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/graphic-showing-a-7-step-flow-diagram-of-how-web-c-2-376x425.jpg 376w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/graphic-showing-a-7-step-flow-diagram-of-how-web-c-2-768x868.jpg 768w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/graphic-showing-a-7-step-flow-diagram-of-how-web-c-2-1359x1536.jpg 1359w\" data-sizes=\"(max-width: 1600px) 100vw, 1600px\"\/><\/figure>\n<p>As you can probably guess, the number of URLs discovered and crawled in this process grows exponentially in just a few\u00a0hops.<\/p>\n<figure class=\"wp-block-image\"><noscript><img decoding=\"async\" width=\"1430\" height=\"924\" src=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/a-graphic-visualizing-website-crawlers-following-l-2.png\" alt=\"A graphic visualizing website crawlers following links exponentially\" class=\"wp-image-178363\" srcset=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/a-graphic-visualizing-website-crawlers-following-l-2.png 1430w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/a-graphic-visualizing-website-crawlers-following-l-2-658x425.png 658w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/a-graphic-visualizing-website-crawlers-following-l-2-768x496.png 768w\" sizes=\"(max-width: 1430px) 100vw, 1430px\"\/><\/noscript><img decoding=\"async\" width=\"1430\" height=\"924\" src=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/a-graphic-visualizing-website-crawlers-following-l-2.png\" alt=\"A graphic visualizing website crawlers following links exponentially\" class=\"lazyload wp-image-178363\" srcset=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/a-graphic-visualizing-website-crawlers-following-l-2.png 1430w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/a-graphic-visualizing-website-crawlers-following-l-2-658x425.png 658w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/a-graphic-visualizing-website-crawlers-following-l-2-768x496.png 768w\" data-sizes=\"(max-width: 1430px) 100vw, 1430px\"\/><\/figure>\n<div class=\"post-nav-link clearfix\" id=\"section1\"><a class=\"subhead-anchor\" data-tip=\"tooltip__copielink\" rel=\"#section1\"><svg width=\"19\" height=\"19\" viewbox=\"0 0 14 14\" style=\"\"><g fill=\"none\" fill-rule=\"evenodd\"><path d=\"M0 0h14v14H0z\"\/><path d=\"M7.45 9.887l-1.62 1.621c-.92.92-2.418.92-3.338 0a2.364 2.364 0 0 1 0-3.339l1.62-1.62-1.273-1.272-1.62 1.62a4.161 4.161 0 1 0 5.885 5.884l1.62-1.62L7.45 9.886zM5.527 5.135L7.17 3.492c.92-.92 2.418-.92 3.339 0 .92.92.92 2.418 0 3.339L8.866 8.473l1.272 1.273 1.644-1.643A4.161 4.161 0 1 0 5.897 2.22L4.254 3.863l1.272 1.272zm-.66 3.998a.749.749 0 0 1 0-1.06l2.208-2.206a.749.749 0 1 1 1.06 1.06L5.928 9.133a.75.75 0 0 1-1.061 0z\" style=\"\"\/><\/g><\/svg><\/a><\/p>\n<div class=\"link-text\" data-anchor=\"How do you get search engines to crawl your site in the first place?\" data-section=\"search-engine-crawling\">\n<h2 class=\"wp-block-heading\" id=\"search-engine-crawling\"><a id=\"post-178325-_qcawfonuzpb\"\/>How do you get search engines to crawl your site in the first\u00a0place?<\/h2>\n<\/div>\n<\/div>\n<p>Search engine web crawlers are autonomous, meaning you <a href=\"https:\/\/www.searchenginejournal.com\/how-to-trigger-a-complete-re-indexing\/506211\/\">can\u2019t trigger them to crawl or switch them on\/off<\/a> at\u00a0will.<\/p>\n<p>You can, however, notify crawlers of site updates via:<\/p>\n<h3 class=\"wp-block-heading\"><a id=\"post-178325-_hk8v51mu9m1w\"\/>XML sitemaps<\/h3>\n<p>An <a href=\"https:\/\/developers.google.com\/search\/docs\/crawling-indexing\/sitemaps\/build-sitemap\">XML sitemap<\/a> is a file that lists all the important pages on your website to help search engines accurately discover and index your content.<\/p>\n<h3 class=\"wp-block-heading\"><a id=\"post-178325-_pyv3avc6xxia\"\/>Google\u2019s URL inspection tool<\/h3>\n<p>You can ask Google to consider recrawling your site content via its <a href=\"https:\/\/developers.google.com\/search\/docs\/crawling-indexing\/ask-google-to-recrawl\">URL inspection tool<\/a> in Google Search Console. You may get a message in GSC if Google knows about your URL but hasn\u2019t yet crawled or indexed it. If so, find out <a href=\"https:\/\/ahrefs.com\/blog\/discovered-currently-not-indexed\/\">how to fix \u201cDiscovered \u2014 currently not indexed\u201d<\/a>.<\/p>\n<h3 class=\"wp-block-heading\"><a id=\"post-178325-_b6do1mbkhsl8\"\/>IndexNow<\/h3>\n<p>Instead of waiting for bots to re-crawl and index your content, you can use <a href=\"https:\/\/ahrefs.com\/blog\/indexnow-yep-ahrefs\/\">IndexNow<\/a> to automatically ping search engines like Bing, Yandex, Naver, Seznam.cz, and <a href=\"https:\/\/yep.com\/\">Yep<\/a>, whenever you:<\/p>\n<ul class=\"wp-block-list\">\n<li>Add new\u00a0pages<\/li>\n<li>Update existing content<\/li>\n<li>Remove outdated pages<\/li>\n<li>Implement redirects<\/li>\n<\/ul>\n<p>You can <a href=\"https:\/\/help.ahrefs.com\/en\/articles\/9317209-how-to-submit-pages-to-indexnow-within-site-audit\">set up automatic IndexNow submissions via Ahrefs Site\u00a0Audit.<\/a><\/p>\n<figure class=\"wp-block-image\"><noscript><img decoding=\"async\" width=\"2048\" height=\"965\" src=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/screenshot-of-indexnow-api-key-in-ahrefs-site-audi-2.jpg\" alt=\"screenshot of IndexNow API key in Ahrefs Site Audit\" class=\"wp-image-178364\" srcset=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/screenshot-of-indexnow-api-key-in-ahrefs-site-audi-2.jpg 2048w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/screenshot-of-indexnow-api-key-in-ahrefs-site-audi-2-680x320.jpg 680w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/screenshot-of-indexnow-api-key-in-ahrefs-site-audi-2-768x362.jpg 768w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/screenshot-of-indexnow-api-key-in-ahrefs-site-audi-2-1536x724.jpg 1536w\" sizes=\"(max-width: 2048px) 100vw, 2048px\"\/><\/noscript><img decoding=\"async\" width=\"2048\" height=\"965\" src=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/screenshot-of-indexnow-api-key-in-ahrefs-site-audi-2.jpg\" alt=\"screenshot of IndexNow API key in Ahrefs Site Audit\" class=\"lazyload wp-image-178364\" srcset=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/screenshot-of-indexnow-api-key-in-ahrefs-site-audi-2.jpg 2048w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/screenshot-of-indexnow-api-key-in-ahrefs-site-audi-2-680x320.jpg 680w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/screenshot-of-indexnow-api-key-in-ahrefs-site-audi-2-768x362.jpg 768w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/screenshot-of-indexnow-api-key-in-ahrefs-site-audi-2-1536x724.jpg 1536w\" data-sizes=\"(max-width: 2048px) 100vw, 2048px\"\/><\/figure>\n<div class=\"post-nav-link clearfix\" id=\"section1\"><a class=\"subhead-anchor\" data-tip=\"tooltip__copielink\" rel=\"#section1\"><svg width=\"19\" height=\"19\" viewbox=\"0 0 14 14\" style=\"\"><g fill=\"none\" fill-rule=\"evenodd\"><path d=\"M0 0h14v14H0z\"\/><path d=\"M7.45 9.887l-1.62 1.621c-.92.92-2.418.92-3.338 0a2.364 2.364 0 0 1 0-3.339l1.62-1.62-1.273-1.272-1.62 1.62a4.161 4.161 0 1 0 5.885 5.884l1.62-1.62L7.45 9.886zM5.527 5.135L7.17 3.492c.92-.92 2.418-.92 3.339 0 .92.92.92 2.418 0 3.339L8.866 8.473l1.272 1.273 1.644-1.643A4.161 4.161 0 1 0 5.897 2.22L4.254 3.863l1.272 1.272zm-.66 3.998a.749.749 0 0 1 0-1.06l2.208-2.206a.749.749 0 1 1 1.06 1.06L5.928 9.133a.75.75 0 0 1-1.061 0z\" style=\"\"\/><\/g><\/svg><\/a><\/p>\n<div class=\"link-text\" data-anchor=\"How to get Google to crawl more of your pages, more often\" data-section=\"crawling-frequency\">\n<h2 class=\"wp-block-heading\" id=\"crawling-frequency\"><a id=\"post-178325-_dfh5ovbl8na8\"\/>How to get Google to crawl more of your pages, more\u00a0often<\/h2>\n<\/div>\n<\/div>\n<p>Search engine crawling decisions are dynamic and a <em>little<\/em> obscure.<\/p>\n<p>Although we don\u2019t know the definitive criteria Google uses to determine when or how often to crawl content, we\u2019ve deduced three of the most important areas.<\/p>\n<p>This is based on breadcrumbs dropped by Google, both in support documentation and during rep interviews.<\/p>\n<h3 class=\"wp-block-heading\"><a id=\"post-178325-_ervkgcjvveb\"\/>1. Prioritize quality<\/h3>\n<p><a href=\"https:\/\/ahrefs.com\/seo\/glossary\/pagerank\">Google PageRank<\/a> evaluates the number and quality of links to a page, considering them as \u201cvotes\u201d of importance.<\/p>\n<p>Pages earning quality links are deemed more important and are ranked higher in search results.<\/p>\n<p>PageRank is a foundational part of Google\u2019s algorithm. It makes sense then that the quality of your links and content plays a big part in how your site is crawled and indexed.<\/p>\n<p>To judge your site\u2019s quality, Google looks at factors such\u00a0as:<\/p>\n<p>To assess the pages on your site with the most links, check out the Best by Links report.<\/p>\n<p>Pay attention to the \u201cFirst seen\u201d, \u201cLast check\u201d column, which reveals which pages have been crawled most often, and\u00a0when.<\/p>\n<figure class=\"wp-block-image\"><noscript><img decoding=\"async\" width=\"1600\" height=\"907\" src=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/ahrefs-best-by-links-report-highlighting-first-see-2.jpg\" alt=\"Ahrefs Best by Links report highlighting first seen last check column\" class=\"wp-image-178365\" srcset=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/ahrefs-best-by-links-report-highlighting-first-see-2.jpg 1600w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/ahrefs-best-by-links-report-highlighting-first-see-2-680x385.jpg 680w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/ahrefs-best-by-links-report-highlighting-first-see-2-768x435.jpg 768w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/ahrefs-best-by-links-report-highlighting-first-see-2-1536x871.jpg 1536w\" sizes=\"(max-width: 1600px) 100vw, 1600px\"\/><\/noscript><img decoding=\"async\" width=\"1600\" height=\"907\" src=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/ahrefs-best-by-links-report-highlighting-first-see-2.jpg\" alt=\"Ahrefs Best by Links report highlighting first seen last check column\" class=\"lazyload wp-image-178365\" srcset=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/ahrefs-best-by-links-report-highlighting-first-see-2.jpg 1600w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/ahrefs-best-by-links-report-highlighting-first-see-2-680x385.jpg 680w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/ahrefs-best-by-links-report-highlighting-first-see-2-768x435.jpg 768w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/ahrefs-best-by-links-report-highlighting-first-see-2-1536x871.jpg 1536w\" data-sizes=\"(max-width: 1600px) 100vw, 1600px\"\/><\/figure>\n<h3 class=\"wp-block-heading\"><a id=\"post-178325-_fj9jszl40cw9\"\/>2. Keep things fresh<\/h3>\n<p>According to Google\u2019s Senior Search Analyst, <a href=\"https:\/\/www.linkedin.com\/in\/johnmu\/\">John Mueller<\/a>\u2026<\/p>\n<blockquote class=\"small\">\n<p>Search engines recrawl URLs at different rates, sometimes it\u2019s multiple times a day, sometimes it\u2019s once every few months.<\/p>\n<div class=\"quote-info clearfix\">\n<div class=\"quote-photo\"><noscript><img decoding=\"async\" alt=\"John Mueller\" src=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2022\/02\/john-mueller-google.png\"\/><\/noscript><img class=\"lazyload\" decoding=\"async\" alt=\"John Mueller\" src=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2022\/02\/john-mueller-google.png\"\/><\/div>\n<\/div>\n<\/blockquote>\n<p>But if you regularly update your content, you\u2019ll see crawlers dropping by more\u00a0often.<\/p>\n<p>Search engines like Google want to deliver accurate and up-to-date information to remain competitive and relevant, so updating your content is like dangling a carrot on a\u00a0stick.<\/p>\n<p>You can examine just how quickly Google processes your updates by checking your <a href=\"https:\/\/support.google.com\/webmasters\/answer\/9679690?hl=en\">crawl stats in Google Search Console<\/a>.<\/p>\n<p>While you\u2019re there, look at the breakdown of crawling \u201cBy purpose\u201d (i.e. percent split of pages refreshed vs pages newly discovered). This will also help you work out just how often you\u2019re encouraging web crawlers to revisit your\u00a0site.<\/p>\n<figure class=\"wp-block-image\"><noscript><img decoding=\"async\" width=\"783\" height=\"307\" src=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/word-image-178325-6-1.png\" alt=\"\" class=\"wp-image-178366\" srcset=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/word-image-178325-6-1.png 783w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/word-image-178325-6-1-680x267.png 680w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/word-image-178325-6-1-768x301.png 768w\" sizes=\"(max-width: 783px) 100vw, 783px\"\/><\/noscript><img decoding=\"async\" width=\"783\" height=\"307\" src=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/word-image-178325-6-1.png\" alt=\"\" class=\"lazyload wp-image-178366\" srcset=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/word-image-178325-6-1.png 783w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/word-image-178325-6-1-680x267.png 680w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/word-image-178325-6-1-768x301.png 768w\" data-sizes=\"(max-width: 783px) 100vw, 783px\"\/><\/figure>\n<p>To find specific pages that need updating on your site, head to the Top Pages report in Ahrefs Site Explorer, then:<\/p>\n<ol class=\"wp-block-list\">\n<li>Set the traffic filter to \u201cDeclined\u201d<\/li>\n<li>Set the comparison date to the last year or\u00a0two<\/li>\n<li>Look at Content Changes status and update pages with only minor changes<\/li>\n<\/ol>\n<figure class=\"wp-block-image\"><noscript><img decoding=\"async\" width=\"976\" height=\"573\" src=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/3-part-process-of-updating-pages-based-on-content-2.png\" alt=\"3 part process of updating pages based on content changes in Ahrefs\" class=\"wp-image-178367\" srcset=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/3-part-process-of-updating-pages-based-on-content-2.png 976w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/3-part-process-of-updating-pages-based-on-content-2-680x399.png 680w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/3-part-process-of-updating-pages-based-on-content-2-768x451.png 768w\" sizes=\"(max-width: 976px) 100vw, 976px\"\/><\/noscript><img decoding=\"async\" width=\"976\" height=\"573\" src=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/3-part-process-of-updating-pages-based-on-content-2.png\" alt=\"3 part process of updating pages based on content changes in Ahrefs\" class=\"lazyload wp-image-178367\" srcset=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/3-part-process-of-updating-pages-based-on-content-2.png 976w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/3-part-process-of-updating-pages-based-on-content-2-680x399.png 680w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/3-part-process-of-updating-pages-based-on-content-2-768x451.png 768w\" data-sizes=\"(max-width: 976px) 100vw, 976px\"\/><\/figure>\n<p>Top Pages shows you the content on your site driving the most organic traffic. Pushing updates to these pages will encourage crawlers to visit your best content more often, and (hopefully) boost any declining traffic.<\/p>\n<h3 class=\"wp-block-heading\"><a id=\"post-178325-_jszwkwwfbpvp\"\/>3. Refine your site structure<\/h3>\n<p>Offering a clear site structure via a logical sitemap, and backing that up with relevant internal links will help crawlers:<\/p>\n<ul class=\"wp-block-list\">\n<li>Better navigate your\u00a0site<\/li>\n<li>Understand its hierarchy<\/li>\n<li>Index and rank your most valuable content<\/li>\n<\/ul>\n<p>Combined, these factors will also please users, since they support easy navigation, reduced bounce rates, and increased engagement.<\/p>\n<p>Below are some more elements that can potentially influence how your site gets discovered and prioritized in crawling:<\/p>\n<figure class=\"wp-block-image\"><noscript><img decoding=\"async\" width=\"1850\" height=\"1730\" src=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/graphic-showing-the-factors-that-can-affect-web-cr-2.png\" alt=\"Graphic showing the factors that can affect web crawl discoverability\" class=\"wp-image-178368\" srcset=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/graphic-showing-the-factors-that-can-affect-web-cr-2.png 1850w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/graphic-showing-the-factors-that-can-affect-web-cr-2-454x425.png 454w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/graphic-showing-the-factors-that-can-affect-web-cr-2-768x718.png 768w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/graphic-showing-the-factors-that-can-affect-web-cr-2-1536x1436.png 1536w\" sizes=\"(max-width: 1850px) 100vw, 1850px\"\/><\/noscript><img decoding=\"async\" width=\"1850\" height=\"1730\" src=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/graphic-showing-the-factors-that-can-affect-web-cr-2.png\" alt=\"Graphic showing the factors that can affect web crawl discoverability\" class=\"lazyload wp-image-178368\" srcset=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/graphic-showing-the-factors-that-can-affect-web-cr-2.png 1850w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/graphic-showing-the-factors-that-can-affect-web-cr-2-454x425.png 454w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/graphic-showing-the-factors-that-can-affect-web-cr-2-768x718.png 768w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/graphic-showing-the-factors-that-can-affect-web-cr-2-1536x1436.png 1536w\" data-sizes=\"(max-width: 1850px) 100vw, 1850px\"\/><\/figure>\n<div class=\"recommendation\">\n<p>What is crawl budget?<\/p>\n<div class=\"recommendation-content\"> Crawlers mimic the behavior of human users. Every time they visit a web page, the site\u2019s server gets pinged. Pages or sites that are difficult to crawl will incur errors and slow load times, and if a page is visited too often by a crawler bot, servers and webmasters will block it for overusing resources.<\/p>\n<p>For this reason, each site has a crawl budget, which is the number of URLs a crawler <strong>can<\/strong> and <strong>wants<\/strong> to crawl. Factors like site speed, mobile-friendliness, and a logical site structure impact the efficacy of crawl budget.<\/p>\n<p>For a deeper dive into crawl budgets, check out Patrick Stox\u2019s guide: <a href=\"https:\/\/ahrefs.com\/blog\/crawl-budget\/\">When Should You Worry About Crawl Budget?<\/a><\/p>\n<\/div>\n<\/div>\n<div class=\"post-nav-link clearfix\" id=\"section1\"><a class=\"subhead-anchor\" data-tip=\"tooltip__copielink\" rel=\"#section1\"><svg width=\"19\" height=\"19\" viewbox=\"0 0 14 14\" style=\"\"><g fill=\"none\" fill-rule=\"evenodd\"><path d=\"M0 0h14v14H0z\"\/><path d=\"M7.45 9.887l-1.62 1.621c-.92.92-2.418.92-3.338 0a2.364 2.364 0 0 1 0-3.339l1.62-1.62-1.273-1.272-1.62 1.62a4.161 4.161 0 1 0 5.885 5.884l1.62-1.62L7.45 9.886zM5.527 5.135L7.17 3.492c.92-.92 2.418-.92 3.339 0 .92.92.92 2.418 0 3.339L8.866 8.473l1.272 1.273 1.644-1.643A4.161 4.161 0 1 0 5.897 2.22L4.254 3.863l1.272 1.272zm-.66 3.998a.749.749 0 0 1 0-1.06l2.208-2.206a.749.749 0 1 1 1.06 1.06L5.928 9.133a.75.75 0 0 1-1.061 0z\" style=\"\"\/><\/g><\/svg><\/a><\/p>\n<div class=\"link-text\" data-anchor=\"What is a website crawler?\" data-section=\"website-crawler-definition\">\n<div class=\"wp-block-group is-nowrap is-layout-flex wp-container-core-group-is-layout-1 wp-block-group-is-layout-flex\">\n<h2 class=\"wp-block-heading\" id=\"website-crawler-definition\"><a id=\"post-178325-_7873si2f62zu\"\/>What is a web<span style=\"text-decoration: underline;\">site<\/span> crawler?<\/h2>\n<\/div>\n<\/div>\n<\/div>\n<p>Web crawlers like Google crawl the entire internet, and you can\u2019t control which sites they visit, or how\u00a0often.<\/p>\n<p>But you <em>can <\/em>use website crawlers, which are like your own private bots.<\/p>\n<p>Ask them to crawl your website to find and fix important SEO problems, or study your competitors\u2019 site, turning their biggest weaknesses into your opportunities.<\/p>\n<p>Site crawlers essentially simulate search performance. They help you understand how a search engine\u2019s web crawlers might interpret your pages, based on\u00a0their:<\/p>\n<ul class=\"wp-block-list\">\n<li>Structure<\/li>\n<li>Content<\/li>\n<li>Meta data<\/li>\n<li>Page load\u00a0speed<\/li>\n<li>Errors<\/li>\n<li>Etc<\/li>\n<\/ul>\n<h3 class=\"wp-block-heading\"><a id=\"post-178325-_89scbnngpwqg\"\/>Example: Ahrefs Site\u00a0Audit<\/h3>\n<p>The <a href=\"https:\/\/ahrefs.com\/robot\/site-audit\">Ahrefs Site Audit<\/a> crawler powers the tools: RankTracker, Projects, and Ahrefs\u2019 main website crawling tool: Site\u00a0Audit.<\/p>\n<p>Site Audit helps SEOs\u00a0to:<\/p>\n<ul class=\"wp-block-list\">\n<li>Analyze 170+ technical SEO issues<\/li>\n<li>Conduct on-demand crawls, with live site performance data<\/li>\n<li>Assess up to 170k URLs a minute<\/li>\n<li>Troubleshoot, maintain, and improve their visibility in search engines<\/li>\n<\/ul>\n<p>From URL discovery to revisiting, website crawlers operate very similarly to web crawlers \u2013 only instead of indexing and ranking your page in the SERPs, they store and analyze it in their own database.<\/p>\n<p>You can crawl your site either locally or remotely. Desktop crawlers like ScreamingFrog let you download and customize your site crawl, while cloud-based tools like Ahrefs Site Audit perform the crawl without using your computer\u2019s resources \u2013 helping you work collaboratively on fixes and site optimization.<\/p>\n<div class=\"post-nav-link clearfix\" id=\"section1\"><a class=\"subhead-anchor\" data-tip=\"tooltip__copielink\" rel=\"#section1\"><svg width=\"19\" height=\"19\" viewbox=\"0 0 14 14\" style=\"\"><g fill=\"none\" fill-rule=\"evenodd\"><path d=\"M0 0h14v14H0z\"\/><path d=\"M7.45 9.887l-1.62 1.621c-.92.92-2.418.92-3.338 0a2.364 2.364 0 0 1 0-3.339l1.62-1.62-1.273-1.272-1.62 1.62a4.161 4.161 0 1 0 5.885 5.884l1.62-1.62L7.45 9.886zM5.527 5.135L7.17 3.492c.92-.92 2.418-.92 3.339 0 .92.92.92 2.418 0 3.339L8.866 8.473l1.272 1.273 1.644-1.643A4.161 4.161 0 1 0 5.897 2.22L4.254 3.863l1.272 1.272zm-.66 3.998a.749.749 0 0 1 0-1.06l2.208-2.206a.749.749 0 1 1 1.06 1.06L5.928 9.133a.75.75 0 0 1-1.061 0z\" style=\"\"\/><\/g><\/svg><\/a><\/p>\n<div class=\"link-text\" data-anchor=\"How to crawl your own website\" data-section=\"crawl-site\">\n<h2 class=\"wp-block-heading\" id=\"crawl-site\"><a id=\"post-178325-_ftnx4vnoe6j3\"\/>How to crawl your own website<\/h2>\n<\/div>\n<\/div>\n<p>If you want to scan entire websites in real time to detect technical SEO problems, configure a crawl in Site\u00a0Audit.<\/p>\n<p>It will give you visual data breakdowns, site health scores, and detailed fix recommendations to help you understand how a search engine interprets your\u00a0site.<\/p>\n<h3 class=\"wp-block-heading\"><a id=\"post-178325-_ktfiu6h7yz75\"\/>1. Set up your\u00a0crawl<\/h3>\n<p>Navigate to the Site Audit tab and choose an existing project, or <a href=\"https:\/\/help.ahrefs.com\/en\/articles\/4455322-setting-up-your-first-project-in-ahrefs-webmaster-tools-awt\">set one up<\/a>.<\/p>\n<figure class=\"wp-block-image\"><noscript><img decoding=\"async\" width=\"949\" height=\"706\" src=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/screenshot-of-import-add-project-page-in-ahrefs-si-2.png\" alt=\"Screenshot of import\/add project page in Ahrefs Site Audit\" class=\"wp-image-178369\" srcset=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/screenshot-of-import-add-project-page-in-ahrefs-si-2.png 949w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/screenshot-of-import-add-project-page-in-ahrefs-si-2-571x425.png 571w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/screenshot-of-import-add-project-page-in-ahrefs-si-2-768x571.png 768w\" sizes=\"(max-width: 949px) 100vw, 949px\"\/><\/noscript><img decoding=\"async\" width=\"949\" height=\"706\" src=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/screenshot-of-import-add-project-page-in-ahrefs-si-2.png\" alt=\"Screenshot of import\/add project page in Ahrefs Site Audit\" class=\"lazyload wp-image-178369\" srcset=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/screenshot-of-import-add-project-page-in-ahrefs-si-2.png 949w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/screenshot-of-import-add-project-page-in-ahrefs-si-2-571x425.png 571w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/screenshot-of-import-add-project-page-in-ahrefs-si-2-768x571.png 768w\" data-sizes=\"(max-width: 949px) 100vw, 949px\"\/><\/figure>\n<p>A project is any domain, subdomain, or URL you want to track over\u00a0time.<\/p>\n<p>Once you\u2019ve <a href=\"https:\/\/help.ahrefs.com\/en\/articles\/9082329-how-should-i-configure-my-site-audit-settings\">configured your crawl settings<\/a> \u2013 including your crawl schedule and URL sources \u2013 you can start your audit and you\u2019ll be notified as soon as it\u2019s complete.<\/p>\n<p>Here are some things you can do right\u00a0away.<\/p>\n<h3 class=\"wp-block-heading\"><a id=\"post-178325-_ptntag70nrpz\"\/>2. Diagnose top errors<\/h3>\n<p>The Top Issues overview in Site Audit shows you your most pressing errors, warnings, and notices, based on the number of URLs affected.<\/p>\n<figure class=\"wp-block-image\"><noscript><img decoding=\"async\" width=\"823\" height=\"414\" src=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/word-image-178325-10-1.png\" alt=\"\" class=\"wp-image-178370\" srcset=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/word-image-178325-10-1.png 823w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/word-image-178325-10-1-680x342.png 680w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/word-image-178325-10-1-768x386.png 768w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/word-image-178325-10-1-400x200.png 400w\" sizes=\"(max-width: 823px) 100vw, 823px\"\/><\/noscript><img decoding=\"async\" width=\"823\" height=\"414\" src=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/word-image-178325-10-1.png\" alt=\"\" class=\"lazyload wp-image-178370\" srcset=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/word-image-178325-10-1.png 823w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/word-image-178325-10-1-680x342.png 680w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/word-image-178325-10-1-768x386.png 768w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/word-image-178325-10-1-400x200.png 400w\" data-sizes=\"(max-width: 823px) 100vw, 823px\"\/><\/figure>\n<p>Working through these as part of your SEO roadmap will help\u00a0you:<\/p>\n<p>1. Spot <strong>errors (red icons)<\/strong> impacting crawling \u2013 e.g.<\/p>\n<ul class=\"wp-block-list\">\n<li>HTTP status code\/client errors<\/li>\n<li>Broken links<\/li>\n<li>Canonical issues<\/li>\n<\/ul>\n<p>2. Optimize your content and rankings based on <strong>warnings (yellow) <\/strong>\u2013 e.g.<\/p>\n<ul class=\"wp-block-list\">\n<li>Missing alt\u00a0text<\/li>\n<li>Links to redirects<\/li>\n<li>Overly long meta descriptions<\/li>\n<\/ul>\n<p>3. Maintain steady visibility with <strong>notices (blue icon)<\/strong> \u2013 e.g.<\/p>\n<ul class=\"wp-block-list\">\n<li>Organic traffic drops<\/li>\n<li>Multiple H1s<\/li>\n<li>Indexable pages not in sitemap<\/li>\n<\/ul>\n<h4 class=\"wp-block-heading\"><a id=\"post-178325-_vnztyjm20gsj\"\/>Filter issues<\/h4>\n<p>You can also prioritize fixes using filters.<\/p>\n<p>Say you have thousands of pages with missing meta descriptions. Make the task more manageable and impactful by targeting high traffic pages\u00a0first.<\/p>\n<ol class=\"wp-block-list\">\n<li>Head to the Page Explorer report in Site\u00a0Audit<\/li>\n<li>Select the advanced filter dropdown<\/li>\n<li>Set an internal pages filter<\/li>\n<li>Select an \u2018And\u2019 operator<\/li>\n<li>Select \u2018Meta description\u2019 and \u2018Not exists\u2019<\/li>\n<li>Select \u2018Organic traffic &gt;\u00a0100\u2019<\/li>\n<\/ol>\n<figure class=\"wp-block-image\"><noscript><img decoding=\"async\" width=\"1080\" height=\"332\" src=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/screenshot-of-how-to-find-pages-with-missing-meta-2.png\" alt=\"Screenshot of how to find pages with missing meta descriptions, over 100 organic traffic, in Ahrefs Page Explorer\" class=\"wp-image-178371\" srcset=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/screenshot-of-how-to-find-pages-with-missing-meta-2.png 1080w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/screenshot-of-how-to-find-pages-with-missing-meta-2-680x209.png 680w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/screenshot-of-how-to-find-pages-with-missing-meta-2-768x236.png 768w\" sizes=\"(max-width: 1080px) 100vw, 1080px\"\/><\/noscript><img decoding=\"async\" width=\"1080\" height=\"332\" src=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/screenshot-of-how-to-find-pages-with-missing-meta-2.png\" alt=\"Screenshot of how to find pages with missing meta descriptions, over 100 organic traffic, in Ahrefs Page Explorer\" class=\"lazyload wp-image-178371\" srcset=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/screenshot-of-how-to-find-pages-with-missing-meta-2.png 1080w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/screenshot-of-how-to-find-pages-with-missing-meta-2-680x209.png 680w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/screenshot-of-how-to-find-pages-with-missing-meta-2-768x236.png 768w\" data-sizes=\"(max-width: 1080px) 100vw, 1080px\"\/><\/figure>\n<h4 class=\"wp-block-heading\"><a id=\"post-178325-_x4d7jfac3ecj\"\/>Crawl the most important parts of your\u00a0site<\/h4>\n<p>Segment and zero-in on the most important pages on your site (e.g. subfolders or subdomains) using Site Audit\u2019s 200+ filters \u2013 whether that\u2019s your blog, ecommerce store, or even pages that earn over a certain traffic threshold.<\/p>\n<figure class=\"wp-block-image\"><noscript><img decoding=\"async\" width=\"2048\" height=\"713\" src=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/screenshot-of-ahrefs-site-audit-pointing-out-confi-2.jpg\" alt=\"Screenshot of Ahrefs Site Audit pointing out configure segment option\" class=\"wp-image-178372\" srcset=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/screenshot-of-ahrefs-site-audit-pointing-out-confi-2.jpg 2048w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/screenshot-of-ahrefs-site-audit-pointing-out-confi-2-680x237.jpg 680w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/screenshot-of-ahrefs-site-audit-pointing-out-confi-2-768x267.jpg 768w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/screenshot-of-ahrefs-site-audit-pointing-out-confi-2-1536x535.jpg 1536w\" sizes=\"(max-width: 2048px) 100vw, 2048px\"\/><\/noscript><img decoding=\"async\" width=\"2048\" height=\"713\" src=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/screenshot-of-ahrefs-site-audit-pointing-out-confi-2.jpg\" alt=\"Screenshot of Ahrefs Site Audit pointing out configure segment option\" class=\"lazyload wp-image-178372\" srcset=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/screenshot-of-ahrefs-site-audit-pointing-out-confi-2.jpg 2048w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/screenshot-of-ahrefs-site-audit-pointing-out-confi-2-680x237.jpg 680w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/screenshot-of-ahrefs-site-audit-pointing-out-confi-2-768x267.jpg 768w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/screenshot-of-ahrefs-site-audit-pointing-out-confi-2-1536x535.jpg 1536w\" data-sizes=\"(max-width: 2048px) 100vw, 2048px\"\/><\/figure>\n<h3 class=\"wp-block-heading\"><a id=\"post-178325-_r7gxs9mzjifo\"\/>3. Expedite fixes<\/h3>\n<p>If you don\u2019t have coding experience, then the prospect of crawling your site and implementing fixes can be intimidating.<\/p>\n<p>If you <em>do <\/em>have dev support, issues are easier to remedy, but then it becomes a matter of bargaining for another person\u2019s time.<\/p>\n<p>We\u2019ve got a new feature on the way to help you solve for these kinds of headaches.<\/p>\n<p>Coming soon, <a href=\"https:\/\/ahrefs.com\/blog\/site-audit-patches\/\">Patches<\/a> are fixes you can make autonomously in Site\u00a0Audit.<\/p>\n<figure class=\"wp-block-image\"><noscript><img decoding=\"async\" width=\"1500\" height=\"650\" src=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/screenshot-of-ahrefs-patches-tool-calling-out-the-2.png\" alt=\"Screenshot of Ahrefs Patches tool calling out the Patch It feature\" class=\"wp-image-178373\" srcset=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/screenshot-of-ahrefs-patches-tool-calling-out-the-2.png 1500w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/screenshot-of-ahrefs-patches-tool-calling-out-the-2-680x295.png 680w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/screenshot-of-ahrefs-patches-tool-calling-out-the-2-768x333.png 768w\" sizes=\"(max-width: 1500px) 100vw, 1500px\"\/><\/noscript><img decoding=\"async\" width=\"1500\" height=\"650\" src=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/screenshot-of-ahrefs-patches-tool-calling-out-the-2.png\" alt=\"Screenshot of Ahrefs Patches tool calling out the Patch It feature\" class=\"lazyload wp-image-178373\" srcset=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/screenshot-of-ahrefs-patches-tool-calling-out-the-2.png 1500w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/screenshot-of-ahrefs-patches-tool-calling-out-the-2-680x295.png 680w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/screenshot-of-ahrefs-patches-tool-calling-out-the-2-768x333.png 768w\" data-sizes=\"(max-width: 1500px) 100vw, 1500px\"\/><\/figure>\n<p>Title changes, missing meta descriptions, site-wide broken links \u2013 when you face these kinds of errors you can hit \u201cPatch it\u201d to publish a fix directly to your website, without having to pester a\u00a0dev.<\/p>\n<p>And if you\u2019re unsure of anything, you can roll-back your patches at any\u00a0point.<\/p>\n<figure class=\"wp-block-image\"><noscript><img decoding=\"async\" width=\"1500\" height=\"229\" src=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/screenshot-of-ahrefs-patches-tool-calling-out-draf-2.png\" alt=\"Screenshot of Ahrefs Patches tool calling out drafts, published, and unpublished statuses\" class=\"wp-image-178374\" srcset=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/screenshot-of-ahrefs-patches-tool-calling-out-draf-2.png 1500w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/screenshot-of-ahrefs-patches-tool-calling-out-draf-2-680x104.png 680w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/screenshot-of-ahrefs-patches-tool-calling-out-draf-2-768x117.png 768w\" sizes=\"(max-width: 1500px) 100vw, 1500px\"\/><\/noscript><img decoding=\"async\" width=\"1500\" height=\"229\" src=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/screenshot-of-ahrefs-patches-tool-calling-out-draf-2.png\" alt=\"Screenshot of Ahrefs Patches tool calling out drafts, published, and unpublished statuses\" class=\"lazyload wp-image-178374\" srcset=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/screenshot-of-ahrefs-patches-tool-calling-out-draf-2.png 1500w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/screenshot-of-ahrefs-patches-tool-calling-out-draf-2-680x104.png 680w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/screenshot-of-ahrefs-patches-tool-calling-out-draf-2-768x117.png 768w\" data-sizes=\"(max-width: 1500px) 100vw, 1500px\"\/><\/figure>\n<h3 class=\"wp-block-heading\"><a id=\"post-178325-_jeroofko2hvh\"\/>4. Spot optimization opportunities<\/h3>\n<p>Auditing your site with a website crawler is as much about spotting opportunities as it is about fixing bugs.<\/p>\n<h4 class=\"wp-block-heading\"><a id=\"post-178325-_gfeuj0p4xihm\"\/>Improve internal linking<\/h4>\n<p>The Internal Link Opportunities report in Site Audit shows you relevant internal linking suggestions, by taking the top 10 keywords (by traffic) for each crawled page, then looking for mentions of them on your other crawled pages.<\/p>\n<p>\u2018Source\u2019 pages are the ones you should link <strong>from<\/strong>, and \u2018Target\u2019 pages are the ones you should link <strong>to<\/strong>.<\/p>\n<figure class=\"wp-block-image\"><noscript><img decoding=\"async\" width=\"1100\" height=\"435\" src=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/screenshot-of-internal-link-opportunities-report-i-2.png\" alt=\"Screenshot of Internal Link Opportunities report in Ahrefs Site Audit highlighting source page and target page\" class=\"wp-image-178375\" srcset=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/screenshot-of-internal-link-opportunities-report-i-2.png 1100w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/screenshot-of-internal-link-opportunities-report-i-2-680x269.png 680w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/screenshot-of-internal-link-opportunities-report-i-2-768x304.png 768w\" sizes=\"(max-width: 1100px) 100vw, 1100px\"\/><\/noscript><img decoding=\"async\" width=\"1100\" height=\"435\" src=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/screenshot-of-internal-link-opportunities-report-i-2.png\" alt=\"Screenshot of Internal Link Opportunities report in Ahrefs Site Audit highlighting source page and target page\" class=\"lazyload wp-image-178375\" srcset=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/screenshot-of-internal-link-opportunities-report-i-2.png 1100w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/screenshot-of-internal-link-opportunities-report-i-2-680x269.png 680w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/08\/screenshot-of-internal-link-opportunities-report-i-2-768x304.png 768w\" data-sizes=\"(max-width: 1100px) 100vw, 1100px\"\/><\/figure>\n<p>The more high quality connections you make between your content, the easier it will be for Googlebot to crawl your\u00a0site.<\/p>\n<h2 class=\"wp-block-heading\"><a id=\"post-178325-_4hpkglbpmvkt\"\/>Final thoughts<\/h2>\n<p>Understanding website crawling is more than just an SEO hack \u2013 it\u2019s foundational knowledge that directly impacts your traffic and\u00a0ROI.<\/p>\n<p>Knowing how crawlers work means knowing how search engines \u201csee\u201d your site, and that\u2019s half the battle when it comes to ranking.<\/p>\n<\/p><\/div>\n<p><br \/>\n<br \/><a href=\"https:\/\/ahrefs.com\/blog\/website-crawlers\/\">Source link <\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>You might have heard of website crawling before \u2014 you may even have a vague idea of what it\u2019s about \u2014 but do you know why it\u2019s important, or what differentiates it from web crawling? (yes, there is a difference!)\u00a0 Search engines are increasingly ruthless when it comes to the quality of the sites they [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":2658,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[22],"tags":[],"class_list":["post-2657","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-seo"],"_links":{"self":[{"href":"https:\/\/ewebtoolz.com\/blog\/wp-json\/wp\/v2\/posts\/2657","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/ewebtoolz.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/ewebtoolz.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/ewebtoolz.com\/blog\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/ewebtoolz.com\/blog\/wp-json\/wp\/v2\/comments?post=2657"}],"version-history":[{"count":0,"href":"https:\/\/ewebtoolz.com\/blog\/wp-json\/wp\/v2\/posts\/2657\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/ewebtoolz.com\/blog\/wp-json\/wp\/v2\/media\/2658"}],"wp:attachment":[{"href":"https:\/\/ewebtoolz.com\/blog\/wp-json\/wp\/v2\/media?parent=2657"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/ewebtoolz.com\/blog\/wp-json\/wp\/v2\/categories?post=2657"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/ewebtoolz.com\/blog\/wp-json\/wp\/v2\/tags?post=2657"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}