GitHub - ratalla816/regex-tutorial: The student is becoming the master. Tutorial that teaches others how to use regular expressions to validate URLs and how they work!
An effective detection approach for phishing websites using URL and HTML features
system-design-primer/solutions/system_design/web_crawler/README.md at master · donnemartin/system-design-primer · GitHub
GitHub - salimk/Rcrawler: An R web crawler and scraper
WinRAR and MOTW · Issue #1 · nmantani/archiver-MOTW-support-comparison · GitHub
GitHub - karust/gogetcrawl: Extract web archive data using Wayback Machine and Common Crawl
GitHub - urbanadventurer/urlcrazy: Generate and test domain typos and variations to detect and perform typo squatting, URL hijacking, phishing, and corporate espionage.
SEO Crawler: Crawl up to 10,000 pages of a domain - SEORCH
crawlers · GitHub Topics · GitHub
Masashi VulnHub – Walk through – Research Blog
GoSpider - Fast web spider written in Go - GeeksforGeeks
Java Web Crawler Libraries - Stack Overflow
GitHub - mvdan/xurls: Extract urls from text