jansuchanek/nscraper

Web scraper for competitor product analysis — Heureka, Google, JSON-LD extraction

Package info

github.com/JanSuchanek/nscraper

pkg:composer/jansuchanek/nscraper

v1.0.0 2026-03-19 08:33 UTC

This package is auto-updated.

Last update: 2026-04-20 15:35:24 UTC


README

Competitor product scraping from Heureka.cz, Google search snippets, and JSON-LD structured data. No dependencies beyond ext-curl.

Installation

composer require jansuchanek/nscraper

Usage

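<?php
// Composer autoloader; the path assumes this script sits in the project root
require __DIR__ . '/vendor/autoload.php';

use NScraper\WebScraper;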

$scraper = new WebScraper();

// Scrape Heureka.cz product descriptions
$data = $scraper->scrapeHeureka('Samsung Galaxy S24');
// ['descriptions' => [...], 'source' => 'heureka']

// Google search snippets
$data = $scraper->scrapeGoogle('Samsung Galaxy S24');

// Scrape any URL (JSON-LD, meta, content extraction)
$desc = $scraper->scrapeUrl('https://example.com/product/123');

// Build context for AI pipelines
$context = $scraper->buildCompetitorContext('Samsung Galaxy S24');

Extraction Strategies

  1. JSON-LD structured data (@type: Product)
  2. Meta description tag
  3. CSS selectors: .product-desc, .description, #description
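
To illustrate how a fallback chain over these three strategies could look, here is a minimal sketch. It is not NScraper's actual implementation: the function name extractDescription is hypothetical, the order simply follows the numbering above, and it relies on PHP's DOM extension (ext-dom), which the package itself does not list as a requirement. The CSS selectors are translated to XPath by hand because PHP has no built-in CSS selector engine.

```php
<?php
/**
 * Hypothetical helper, not part of NScraper: returns the first product
 * description found in $html, trying the three strategies in list order.
 */
function extractDescription(string $html): ?string
{
    $dom = new DOMDocument();
    // Real-world HTML is rarely valid, so collect libxml warnings silently.
    $previous = libxml_use_internal_errors(true);
    // The XML prolog hints UTF-8 so non-ASCII text (e.g. Czech) survives parsing.
    $dom->loadHTML('<?xml encoding="UTF-8">' . $html);
    libxml_clear_errors();
    libxml_use_internal_errors($previous);
    $xpath = new DOMXPath($dom);

    // 1. JSON-LD structured data (@type: Product)
    foreach ($xpath->query('//script[@type="application/ld+json"]') as $script) {
        $data = json_decode($script->textContent, true);
        if (!is_array($data)) {
            continue; // Invalid or non-object JSON, try the next block.
        }
        // A block may hold a single object or a list of objects.
        $items = isset($data[0]) ? $data : [$data];
        foreach ($items as $item) {
            if (is_array($item)
                && ($item['@type'] ?? null) === 'Product'
                && is_string($item['description'] ?? null)
                && trim($item['description']) !== ''
            ) {
                return trim($item['description']);
            }
        }
    }

    // 2. Meta description tag
    $meta = $xpath->query('//meta[@name="description"]/@content')->item(0);
    if ($meta !== null && trim($meta->textContent) !== '') {
        return trim($meta->textContent);
    }

    // 3. CSS selectors .product-desc, .description, #description as XPath
    $queries = [
        '//*[contains(concat(" ", normalize-space(@class), " "), " product-desc ")]',
        '//*[contains(concat(" ", normalize-space(@class), " "), " description ")]',
        '//*[@id="description"]',
    ];
    foreach ($queries as $query) {
        $node = $xpath->query($query)->item(0);
        if ($node !== null && trim($node->textContent) !== '') {
            return trim($node->textContent);
        }
    }

    return null; // No strategy produced a description.
}
```

Calling extractDescription($html) on a fetched page returns a string as soon as one strategy succeeds, or null if none does. Putting structured data first is a design choice in this sketch: JSON-LD is usually the cleanest source, while selector matches can pick up surrounding markup text.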

Requirements

  • PHP >= 8.1
  • ext-curl