markuspoerschke/extractum

Extract information from web pages.


README

Extractum is a PHP library that extracts information from web pages.

Getting Started

Installation

composer require markuspoerschke/extractum

Usage

$uri = 'https://www.example.com/';
$html = file_get_contents($uri);

$extractor = new Extractum\Extractor();
$essence = $extractor->extract($html, $uri);

Extracted Information

The extracted information are returned as an object of type Extractum\Essence.

License

This package is released under the MIT license.