crwlr/robots-txt

Robots Exclusion Standard/Protocol Parser for Web Crawling/Scraping

Maintainers

Package info

github.com/crwlrsoft/robots-txt

Homepage

Documentation

pkg:composer/crwlr/robots-txt

Fund package maintenance!

otsch

Statistics

Installs: 24 194

Dependents: 1

Suggesters: 0

Stars: 10

Open Issues: 0

v1.1.2 2025-01-27 17:32 UTC

This package is auto-updated.

Last update: 2026-02-06 22:10:24 UTC


README

crwlr.software logo

Robots Exclusion Standard/Protocol Parser

for Web Crawling/Scraping

Use this library within crawler/scraper programs to parse robots.txt files and check if your crawler user-agent is allowed to load certain paths.

Documentation

You can find the documentation at crwlr.software.

Contributing

If you consider contributing something to this package, read the contribution guide (CONTRIBUTING.md).