orieg / judy
PHP Judy implements sparse dynamic arrays (aka Judy Arrays).
Requires
- php: >=8.0.0
- dev-main
- v2.4.0
- v2.3.0
- v2.2.0
- dev-docs/automate-api-docs-generation
- dev-docs/v2.4.0-pre-release-docs
- dev-release/v2.4.0
- dev-fix/adaptive-double-traversal-and-null-check
- dev-optimize/phase-4-tiered-storage
- dev-optimize/phase-3-advanced-performance
- dev-optimize/phase-2-api-expansion
- dev-optimize/phase-1-sota-infrastructure
- dev-optimize/bulk-insert
- dev-optimize/packed-fastpath
- dev-optimize/iterator-overhaul
- dev-optimize/o1-count
- dev-optimize/trivial-wins
- dev-feat/string-to-int-hash
- dev-feat/judy-hs
- dev-feat/int-to-packed
- dev-test/memuse-coverage
- dev-docs/memory-patterns
- dev-test/coverage-improvements
- dev-feat/batch-and-increment
- dev-feat/serialization
- dev-feat/slice
- dev-feat/bitset-set-ops
- dev-perf/fast-zpp
- dev-fix/pecl-notes-ascii
- dev-fix/php-judy-version
- dev-release/2.3.0
- dev-fix/php81-compat-bugs-modernization
- dev-fix/windows-build-workflow
This package is auto-updated.
Last update: 2026-03-03 07:33:53 UTC
README
PHP Judy - Extension for creating and accessing dynamic arrays
Table of Contents
Introduction
php-judy is an extension by Nicolas Brousse for the Judy C library. It is compatible with PHP 8.0 and newer.
- PECL Package: http://pecl.php.net/package/Judy
- Packagist Package: https://packagist.org/packages/orieg/judy
- GitHub Repository: http://github.com/orieg/php-judy
A Judy array is a complex but very fast associative array data structure for storing and looking up values using integer or string keys. Unlike normal arrays, Judy arrays may be sparse; that is, they may have large ranges of unassigned indices.
- Wikipedia: http://en.wikipedia.org/wiki/Judy_array
The PHP extension is based on the Judy C library that implements a dynamic array. A Judy array consumes memory only when populated yet can grow to take advantage of all available memory. Judy's key benefits are: scalability, performance, memory efficiency, and ease of use. Judy arrays are designed to grow without tuning into the peta-element range, scaling near O(log-base-256) -- 1 more RAM access at 256 X population.
- Judy C Library: http://judy.sourceforge.net
For a detailed performance comparison with native PHP arrays, please see the BENCHMARK.md file.
Directory Contents
README.md This file
API.md Complete API reference
BENCHMARK.md Performance benchmarks and analysis
MIGRATION_2.2.0.md Migration guide for version 2.2.0
LICENSE The PHP License used by this project
tests/ Unit tests (176 tests)
examples/ Benchmark and example scripts
libjudy/ Bundled libJudy
*.c, *.h C source and header files
Judy.stub.php PHP stub for IDE autocompletion
Installation
A. Using PHP PIE (Recommended)
PHP PIE (PHP Extension Installer) is the easiest way to install PHP Judy on supported platforms:
# Install PHP PIE if you don't have it curl -sSL https://pie.dev/installer | php # Install PHP Judy using PIE pie install judy
Note: PHP PIE automatically handles dependencies and builds the extension for your specific PHP version and platform.
B. Using PECL
You can also install PHP Judy using PECL:
# Install the extension with pecl
pecl install judy
Note: You may need to install the Judy C library first on some systems.
C. Linux (Manual Build)
From the PHP Judy sources:
phpize ./configure --with-judy[=DIR] make make test make install
If you are using Ubuntu or Debian, you can install libJudy with apt:
apt-get install libjudydebian1 libjudy-dev
phpize
./configure --with-judy=/usr
make
make test
make install
D. Windows
On Windows, you will need to build LibJudy yourself.
Download the sources at http://sourceforge.net/projects/judy/
Extract the sources, and open the Visual Studio command prompt and navigate to the source directory. Then execute:
build
This creates "Judy.lib", copy this into the php-sdk library folder and name it libJudy.lib
Then copy the include file "judy.h" into the php-sdk includes folder. Now it's time to build pecl/judy, extract the pecl/judy into your build folder where the build scripts will be able to pick it up, e.g.:
C:\php\pecl\judy\
If your source of PHP is located in:
C:\php\src\
The rest of the steps is pretty straightforward, like any other external extension:
buildconf configure --with-judy=shared nmake
E. Mac OS X
The recommended way to install php-judy on Mac OS X is by using pie or pecl. You will need to have the Judy C library installed first, which can be done easily with Homebrew.
Using PHP PIE (Recommended)
# Install PHP PIE if you don't have it curl -sSL https://pie.dev/installer | php # Install PHP Judy using PIE pie install judy
Using PECL
# First, install the Judy C library brew install judy # Then, install the extension with pecl pecl install judy
Manual Install
If you prefer to compile from source, you will need to install the libJudy first. Download the sources at http://sourceforge.net/projects/judy/
Extract the sources, then cd into the source directory and execute:
./configure make make install
Usage Examples
Judy arrays can be used like usual PHP arrays. The difference will be in the type of key/values that you can use. Judy arrays are optimized for memory usage but it forces some limitations in the PHP API.
There are 10 types of PHP Judy Arrays, organized into three families:
Integer-Keyed Types
1. Judy::BITSET
A Judy array with only 1 bit per index. It can be used to store boolean values.
$judy = new Judy(Judy::BITSET); $judy[100] = true; $judy[200] = true; $judy[300] = false; if ($judy[100]) { echo "Index 100 is set\n"; }
2. Judy::INT_TO_INT
A Judy array with integer keys and integer values.
$judy = new Judy(Judy::INT_TO_INT); $judy[1] = 100; $judy[2] = 200; $judy[3] = 300; echo $judy[2]; // Outputs: 200
3. Judy::INT_TO_MIXED
A Judy array with integer keys and mixed values (strings, integers, etc.).
$judy = new Judy(Judy::INT_TO_MIXED); $judy[1] = "Hello"; $judy[2] = 42; $judy[3] = [1, 2, 3]; echo $judy[1]; // Outputs: Hello
4. Judy::INT_TO_PACKED
A Judy array with integer keys and serialized ("packed") values. Values are stored as opaque byte buffers outside PHP's garbage collector using php_var_serialize/php_var_unserialize. This trades serialize/deserialize CPU cost for reduced GC pressure, making it suitable for large datasets where GC pauses are a concern.
Supports any serializable PHP value (strings, integers, floats, arrays, objects). Closures and generators cannot be stored.
$judy = new Judy(Judy::INT_TO_PACKED); $judy[0] = "Hello"; $judy[1] = 42; $judy[2] = [1, 2, 3]; $judy[3] = new DateTimeImmutable(); echo $judy[0]; // Outputs: Hello // Values are fully reconstructed on read $arr = $judy[2]; // Returns [1, 2, 3]
When to use INT_TO_PACKED vs INT_TO_MIXED:
- Use
INT_TO_MIXEDfor small-to-medium arrays or when read/write speed is critical - Use
INT_TO_PACKEDfor large arrays (100K+ elements) where GC pause reduction matters more than individual read/write latency
String-Keyed Types (Trie-Based)
Trie-based types use JudySL internally. Keys are stored in sorted lexicographic order, making iteration ordered and range queries efficient. Lookup is O(key-length).
5. Judy::STRING_TO_INT
A Judy array with string keys and integer values.
$judy = new Judy(Judy::STRING_TO_INT); $judy["apple"] = 1; $judy["banana"] = 2; $judy["cherry"] = 3; echo $judy["banana"]; // Outputs: 2
6. Judy::STRING_TO_MIXED
A Judy array with string keys and mixed values.
$judy = new Judy(Judy::STRING_TO_MIXED); $judy["name"] = "John Doe"; $judy["age"] = 30; $judy["scores"] = [85, 92, 78]; echo $judy["name"]; // Outputs: John Doe
String-Keyed Types (Hash-Based)
Hash-based types use JudyHS for O(1) average-case lookups, with a parallel JudySL key index that maintains sorted iteration order. Best for workloads dominated by random key access where you still need ordered iteration.
7. Judy::STRING_TO_INT_HASH
A hash-backed Judy array with string keys and integer values.
$judy = new Judy(Judy::STRING_TO_INT_HASH); $judy["session_abc"] = 1; $judy["session_xyz"] = 2; echo $judy["session_abc"]; // Outputs: 1 // Iteration is still sorted (via the key index) foreach ($judy as $key => $value) { echo "$key => $value\n"; }
8. Judy::STRING_TO_MIXED_HASH
A hash-backed Judy array with string keys and mixed values.
$judy = new Judy(Judy::STRING_TO_MIXED_HASH); $judy["config_a"] = ["enabled" => true]; $judy["config_b"] = 42;
String-Keyed Types (Adaptive / SSO)
Adaptive types use Short-String Optimization (SSO): keys of 7 bytes or fewer are packed into a 64-bit integer and stored in a JudyL array, avoiding hashing overhead entirely. Longer keys fall back to JudyHS. A JudySL key index maintains sorted iteration. Best for mixed-length key workloads with many short keys.
9. Judy::STRING_TO_INT_ADAPTIVE
An adaptive Judy array with string keys and integer values.
$judy = new Judy(Judy::STRING_TO_INT_ADAPTIVE); $judy["us"] = 1; // SSO: packed into JudyL (2 bytes) $judy["uk"] = 2; // SSO: packed into JudyL (2 bytes) $judy["a_very_long_country_name"] = 3; // Falls back to JudyHS echo $judy["us"]; // Outputs: 1
10. Judy::STRING_TO_MIXED_ADAPTIVE
An adaptive Judy array with string keys and mixed values.
$judy = new Judy(Judy::STRING_TO_MIXED_ADAPTIVE); $judy["id"] = 12345; $judy["name"] = "Alice"; $judy["metadata"] = ["role" => "admin"];
Iterator Interface (PHP 8+)
Judy arrays implement the PHP Iterator interface, allowing you to use them in foreach loops:
$judy = new Judy(Judy::INT_TO_MIXED); $judy[1] = "First"; $judy[5] = "Fifth"; $judy[10] = "Tenth"; // Iterate through all elements foreach ($judy as $key => $value) { echo "Key: $key, Value: $value\n"; } // Manual iteration $judy->rewind(); while ($judy->valid()) { $key = $judy->key(); $value = $judy->current(); echo "Key: $key, Value: $value\n"; $judy->next(); }
Performance Considerations
- Memory Efficiency: Judy arrays use 2-4x less memory than PHP arrays
- Sequential Access: Excellent performance for ordered iteration
- Range Queries: Native support via
slice(),deleteRange(), andpopulationCount() - Random Access: Trie types are slower than PHP arrays (O(log n) vs O(1)); Hash types offer O(1) average-case lookups for string keys
- String Lookups: Use
STRING_TO_*_HASHorSTRING_TO_*_ADAPTIVEtypes for faster string key access when sorted traversal is not the primary use case
Batch Operations and Conversion
Judy arrays provide batch methods for efficient bulk operations:
// Convert a PHP array to a Judy array $judy = Judy::fromArray(Judy::INT_TO_INT, [0 => 100, 5 => 200, 10 => 300]); // Convert a Judy array back to a PHP array $arr = $judy->toArray(); // [0 => 100, 5 => 200, 10 => 300] // Bulk-insert from an existing array $judy->putAll([20 => 400, 30 => 500]); // Retrieve multiple values at once (missing keys return null) $values = $judy->getAll([0, 5, 99]); // [0 => 100, 5 => 200, 99 => null]
Atomic Increment
For INT_TO_INT, STRING_TO_INT, and STRING_TO_INT_HASH types, increment() performs an efficient counter update:
$counters = new Judy(Judy::STRING_TO_INT); // Increment creates the key with the given amount if it doesn't exist $counters->increment("page_views"); // returns 1 $counters->increment("page_views"); // returns 2 $counters->increment("page_views", 10); // returns 12 $counters->increment("page_views", -3); // returns 9
For detailed performance analysis, see BENCHMARK.md.
Expanded API
Beyond basic array access, Judy provides a rich API including:
- Set operations:
union(),intersect(),diff(),xor(),mergeWith() - Functional iteration:
forEach(),filter(),map()(C-level, bypasses Iterator overhead) - Range operations:
slice(),deleteRange(),populationCount() - Aggregation:
sumValues(),averageValues() - Batch operations:
putAll(),getAll(),keys(),values(),toArray(),fromArray() - Serialization:
serialize()/unserialize(),json_encode() - Comparison:
equals()
For complete method signatures, parameter details, and type compatibility, see API.md.
Reporting Bugs
Please report bugs and issues on the GitHub repository:
https://github.com/orieg/php-judy/issues
Roadmap
- Eliminate redundant JLG+JLI double traversal in write hot paths for MIXED/PACKED types
- C-level
forEach()/filter()/map()performance tuning (vtable dispatch) - Binary serialization format for faster
__serialize/__unserialize - Extend set operations (
union/intersect/diff/xor) to adaptive types - Extend
increment()to adaptive types
License
This project is licensed under the PHP License - see the LICENSE file for details.
Contributing
Contributions are welcome! Please feel free to submit a Pull Request.
Support
- API Reference: API.md for complete method documentation
- Benchmarks: BENCHMARK.md for performance analysis
- Migration Guide: MIGRATION_2.2.0.md for version 2.2.0 changes
- Examples: Check the
examples/directory for usage examples