clientbg/php-boiler-pipe

PhpBoilerPipe. Boilerplate Removal and Fulltext Extraction from HTML pages. Based on dotpack's PHP implementation.

Installs: 40

Dependents: 0

Suggesters: 0

Security: 0

Stars: 0

Watchers: 1

Forks: 12

pkg:composer/clientbg/php-boiler-pipe

v1.1 2018-08-21 07:46 UTC

This package is not auto-updated.

Last update: 2025-09-28 10:59:36 UTC


README

  • forked from dotpack/php-boiler-pipe

Boilerplate Removal and Fulltext Extraction from HTML pages.

Partial implementation of https://github.com/kohlschutter/boilerpipe in PHP. Requires PHP >= 5.4.

Example

# html
$path = "http://example.com/some-article.html";
$data = file_get_contents($path);

# code
$ae = new Clientbg\PhpBoilerPipe\ArticleExtractor();
echo $ae->getContent($data) . "\n";