vstelmakh/url-highlight

Library to parse urls from string input

v3.1.1 2024-09-15 17:36 UTC

README

Url highlight logo

Build status Packagist version PHP version License

Url highlight - PHP library to parse URLs from string input. Works with complex URLs, edge cases and encoded input.

Features:

  • Replace URLs in string by HTML tags (make clickable)
  • Match URLs without scheme by top-level domain
  • Work with HTML entities encoded input
  • Extract URLs from string
  • Check if string is URL

🚀 See examples 👀

Installation

Install the latest version with Composer:

composer require vstelmakh/url-highlight

Also, there are Twig logo Twig extension and Symfony logo Symfony bundle available.

Quick start

<?php
require __DIR__ . '/vendor/autoload.php';

use VStelmakh\UrlHighlight\UrlHighlight;

$urlHighlight = new UrlHighlight();
echo $urlHighlight->highlightUrls('Hello, http://example.com.');

// Output:
// Hello, <a href="http://example.com">http://example.com</a>.

💡 Tip: For customizing highlight see Highlighter. To properly handle HTML entity escaped string see Encoder.

Usage

Check if string is URL

<?php
$urlHighlight->isUrl('http://example.com'); // return: true
$urlHighlight->isUrl('Other string'); // return: false

Parse URLs from string

<?php
$urlHighlight->getUrls('Hello, http://example.com.');
// return: ['http://example.com']

Replace URLs by HTML tags (make clickable)

<?php
$urlHighlight->highlightUrls('Hello, http://example.com.');
// return: 'Hello, <a href="http://example.com">http://example.com</a>.'

Configuration

There are 3 parts which could be configured according to your needs:

  • Validator - define if match is valid and should be recognized as URL (e.g. allow/disallow specific schemes)
  • Highlighter - define the way how URL should be highlighted (e.g. replaced by html <a> tag)
  • Encoder - define how to work with encoded input (e.g. html special chars)

Configuration provided via constructor implementing corresponding interface instance. Use null to keep default:

<?php
use VStelmakh\UrlHighlight\Encoder\HtmlSpecialcharsEncoder;
use VStelmakh\UrlHighlight\UrlHighlight;
use VStelmakh\UrlHighlight\Validator\Validator;

$validator = new Validator();
$encoder = new HtmlSpecialcharsEncoder();
$urlHighlight = new UrlHighlight($validator, null, $encoder);

Validator

There is one validator bundled with the library. Which is used by default with settings listed in example below.

🛠️ Validator usage example
<?php
use VStelmakh\UrlHighlight\UrlHighlight;
use VStelmakh\UrlHighlight\Validator\Validator;

$validator = new Validator(
    true, // bool - if should use top level domain to match urls without scheme
    [],   // string[] - array of blacklisted schemes
    [],   // string[] - array of whitelisted schemes
    true  // bool - if should match emails (if match by TLD set to "false" - will match only "mailto" urls)
);
$urlHighlight = new UrlHighlight($validator);

💡 Tip: If you need custom behavior - create and use your own validator implementing ValidatorInterface.

Highlighter

There are 2 highlighters bundled with the library:

  • HtmlHighlighter - convert matches to html tags.
    Example: http://example.com<a href="http://example.com">http://example.com</a>

  • MarkdownHighlighter - convert matches to markdown format.
    Example: http://example.com[http://example.com](http://example.com)

By default, HtmlHighlighter is used, with settings listed in example below.

🛠️ Highlighter usage example
<?php
use VStelmakh\UrlHighlight\Highlighter\HtmlHighlighter;
use VStelmakh\UrlHighlight\UrlHighlight;

$highlighter = new HtmlHighlighter(
    'http', // string - scheme to use for urls matched by top level domain
    [],     // string[] - key/value map of tag attributes, e.g. ['rel' => 'nofollow', 'class' => 'light']
    '',     // string - content to add before highlight: {here}<a...
    ''      // string - content to add after highlight: ...</a>{here}
);
$urlHighlight = new UrlHighlight(null, $highlighter);

💡 Tip: If you need custom behavior - extend HtmlHighlighter or implement HighlighterInterface.
For more details and examples see 🖍️ Custom highlighter.

Encoder

Encoder should be used to handle encoded input properly. For example HTML escaped string could contain something like: http://example.com?a=1&quot; or http://example.com?a=1&amp;b=2 which will be wrongly matched as URL.

By default, there is no encoder used. There are 2 encoders bundled with library:

  • HtmlEntitiesEncoder - to work with HTML entities encoded string (any character expected to be HTML entity encoded)
  • HtmlSpecialcharsEncoder - to work with HTML escaped string (only & " ' < > expected to be encoded)
🛠️ Encoder usage example
<?php
use VStelmakh\UrlHighlight\Encoder\HtmlSpecialcharsEncoder;
use VStelmakh\UrlHighlight\UrlHighlight;

$encoder = new HtmlSpecialcharsEncoder();
$urlHighlight = new UrlHighlight(null, null, $encoder);

$urlHighlight->highlightUrls('&lt;a href=&quot;http://example.com&quot;&gt;Example&lt;/a&gt;');
// return: '&lt;a href=&quot;<a href="http://example.com">http://example.com</a>&quot;&gt;Example&lt;/a&gt;'

💡 Tip: For custom behavior - create and use your own encoder implementing EncoderInterface.
Keep in mind - using encoder require more regex operations and could have performance impact. Better to not use encoder if you don't expect encoded string.

Credits

Volodymyr Stelmakh
Licensed under the MIT License. See LICENSE for more information.