Uses Screaming Frog Internal HTML data with text extraction, together with a shingling algorithm, to measure content duplication across the pages of a crawled site.
Updated Oct 2, 2019 - Python
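
For readers unfamiliar with shingling, here is a minimal sketch of the general technique. It is not the repository's actual code. It assumes the extracted page text is already loaded into a dict mapping URL to text. The function names (`shingles`, `jaccard`, `duplicate_pairs`), the word-level shingle size `k=3`, and the `threshold` value are all illustrative choices.

```python
import re
from itertools import combinations


def shingles(text, k=3):
    """Return the set of word-level k-shingles (k-word tuples) in text."""
    words = re.findall(r"\w+", text.lower())
    if len(words) < k:
        # Text shorter than k words: treat the whole text as one shingle.
        return {tuple(words)} if words else set()
    return {tuple(words[i:i + k]) for i in range(len(words) - k + 1)}


def jaccard(a, b):
    """Jaccard similarity of two shingle sets: |A & B| / |A | B|."""
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


def duplicate_pairs(pages, k=3, threshold=0.5):
    """Compare every pair of pages and return those at or above threshold.

    pages is a hypothetical {url: extracted_text} mapping; how it is built
    from the crawl data is outside the scope of this sketch.
    """
    sets = {url: shingles(text, k) for url, text in pages.items()}
    results = []
    for (url_a, set_a), (url_b, set_b) in combinations(sets.items(), 2):
        score = jaccard(set_a, set_b)
        if score >= threshold:
            results.append((url_a, url_b, score))
    # Most similar pairs first.
    return sorted(results, key=lambda r: r[2], reverse=True)
```

Comparing every pair is quadratic in the number of pages, which is fine for small crawls. Larger sites would typically need an approximation such as MinHash, which this sketch does not attempt.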