hedii / extractors
Extractors is a package that finds targeted types of resources in an HTML DOM.
Requires
- php: >=7.1
- symfony/css-selector: ^4.0
- symfony/dom-crawler: ^4.0
Requires (Dev)
- laravel/lumen-framework: 5.5.*
- phpunit/phpunit: ~6.0
This package is auto-updated.
Last update: 2024-12-18 19:44:52 UTC
Extractors
Extractors is a package that finds targeted types of resources in an HTML DOM. Currently, only two types of resources are available: URLs and emails. The resources found are returned as an array.
The Extractor class can be extended to add more resource types.
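To make the idea of pulling URLs and emails out of an HTML DOM concrete, here is a minimal sketch that uses only PHP's built-in DOM extension. It is not the package's implementation, and it does not show the package's extension mechanism: the function names extractUrls() and extractEmails() are hypothetical, and the choice to keep only absolute http/https links is an assumption made for this example.

```php
<?php
// Illustrative sketch only: NOT the code used by hedii/extractors.
// extractUrls() and extractEmails() are hypothetical helper names.

/**
 * Collect the absolute http(s) URLs found in <a href="..."> attributes.
 */
function extractUrls(string $html): array
{
    // Collect libxml parse errors internally instead of emitting warnings,
    // since real-world markup is rarely perfectly valid.
    $previous = libxml_use_internal_errors(true);
    $dom = new DOMDocument();
    $dom->loadHTML($html);
    libxml_clear_errors();
    libxml_use_internal_errors($previous);

    $urls = [];
    foreach ($dom->getElementsByTagName('a') as $anchor) {
        $href = trim($anchor->getAttribute('href'));
        // Assumption for this sketch: keep only absolute http/https links.
        $scheme = parse_url($href, PHP_URL_SCHEME);
        if (in_array($scheme, ['http', 'https'], true)) {
            $urls[] = $href;
        }
    }

    // Remove duplicates and reindex from 0.
    return array_values(array_unique($urls));
}

/**
 * Collect strings that look like email addresses anywhere in the markup.
 * The pattern is deliberately simple and does not cover every valid address.
 */
function extractEmails(string $html): array
{
    preg_match_all('/[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}/', $html, $matches);

    return array_values(array_unique($matches[0]));
}

// Usage on an inline HTML string (no network access needed).
$html = '<html><body>'
    . '<a href="http://example.com/about">About</a> '
    . '<a href="/relative">Relative</a> '
    . '<a href="mailto:info@example.com">Mail</a>'
    . '</body></html>';

$found = [
    'urls' => extractUrls($html),
    'emails' => extractEmails($html),
];
print_r($found);
```

A new resource type (phone numbers, for instance) would follow the same shape: a function or class that takes the page markup and returns an array of matches.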
Install
Via Composer
composer require hedii/extractors
Usage
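Pass the resource types to searchFor(), the page address to at(), then call get() to retrieve the results. Currently, only two types of resources are available: urls and emails.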
// require composer autoloader
require '/path/to/vendor/autoload.php';

// instantiate
$extractor = new \Hedii\Extractors\Extractor();

// get all the urls on example.com page dom
$urls = $extractor->searchFor(['urls'])
    ->at('http://example.com')
    ->get();

// get all the emails on example.com page dom
$emails = $extractor->searchFor(['emails'])
    ->at('http://example.com')
    ->get();

// get all the urls and emails on example.com page dom
$urlsAndEmails = $extractor->searchFor(['urls', 'emails'])
    ->at('http://example.com')
    ->get();
Testing
composer test