You can improve scan job times by following the tips below:
1. Separate Large Jobs (multiple sites):
Scan Jobs are run in a single thread to ensure logs are captured correctly and aid in validating links and making sure content is scanned correctly. However, if you have a site collection with many sub sites or a single site with many lists or large lists, this may not be the most efficient approach. We recommend to break large site collections (many sub sites), or sites with large lists into multiple jobs. To do this, uncheck “Include Sub Sites” in the Scan Options page and schedule each site individually. You can save the Job Template to make this easier, or you can also hold SHIFT or CTRL and select all sites (you want to build a report on) from the Home Page Dashboard, right click, and click “Create Reports -> Broken Links” – this will allow you create multiple jobs (multi-threaded) at once.
2. Separate Large Jobs (large lists or many lists) - Use Inclusion Rules:
If you have a single site with many lists or one or more large lists (greater than 10K items), do not run this site in a single job; Instead, execute or schedule this job and use “Inclusion Rules -> List Inclusions” to separate each large list into a single job; or multiple lists if you have many, into a few jobs - such as a site with 50 lists, separate into 10 per job. Again, you can create a Job Template (click “Save Job”) to make this easier.
Then, you can then schedule these jobs to run in parallel (at the same Start Time).
NOTE: This may require you to have more CPU or RAM or to run on a different machine to get the best results.
3. Use Exclusions Rules:
Exclude content you do not need in the report. Set the “Exclusion Rules -> List Exclusions” to skip lists that are not needed in the report. Add any SharePoint URLs (such as folders or sections of a site) and Link URLs (links found within pages and files – such as navigation menu items) that you do not need in the report, or that you know are valid.
4. Disable Detailed Logging:
Disable “Show Detailed Logging”. Detailed Logging should be used when you are first setting up jobs to ensure they are working correctly and capturing the URLs and links you expect. This option is also useful when troubleshooting issues. Once jobs are tested and configured to run as you need them, uncheck “Show Detailed Logging” to minimize the report time spent on writing each link found and other details.
5. Limit Large Files from being scanned:
If you are using “Scan List Attachment File Contents” or “Scan Document File Contents” - Specify a lower “File Size Limit (MB)” in the Scan Options page (“Scan Links (tab) -> File Size Limit (MB)”) – such as 10MB. This setting is used to skip large files when scanning file contents. The default is set to 20MB.
6. Turn off: Scanning MS Office text links, PDF text links, Linked CSS and JS files, Debug Mode:
Go to the "Scan Links (tab) -> Options" page, make sure the following are turned off these 'optional' settings: (they are OFF by default)
"Search text links within MS Office documents"
"Search text links within PDF documents"
"Scan linked CSS and JS files"
"Scan list form pages"
This information is also included in the Documentation (with screenshots) included with your download.
If you need further assistance, please feel free to contact us!