teppokoivula / search-engine-pdf-indexer
Optional and experimental PDF file indexing add-on for the ProcessWire CMS/CMF module SearchEngine
                                    Fund package maintenance!
                                                                            
                                                                                                                                        teppokoivula
                                                                                    
                                                                
Installs: 5
Dependents: 0
Suggesters: 0
Security: 0
Stars: 1
Watchers: 1
Forks: 0
Open Issues: 0
Type:pw-module
pkg:composer/teppokoivula/search-engine-pdf-indexer
Requires
- php: >=7.4
- smalot/pdfparser: ^2.2
- spatie/pdf-to-text: ^1.52
- symfony/process: ^5.0
- wireframe-framework/processwire-composer-installer: ^1.0.0
This package is auto-updated.
Last update: 2025-10-19 01:28:18 UTC
README
This module adds (experimental) PDF indexing support for the SearchEngine module.
Please note that in order to parse PDF files, we need to install some third party dependencies. Currently two PDF parsing libraries are supported: smalot/pdfparser and spatie/pdf-to-text. These are automatically installed along with this module when you install it via Composer, but if you install the module via file upload or using the modules manager in admin, please run composer install in the directory of the module after installing it.
Note also that spatie/pdf-to-text requires the pdftotext CLI tool, which needs to be installed on your OS. Please check out the spatie/pdf-to-text GitHub repository at https://github.com/spatie/pdf-to-text for more details.
Usage
WARNING: this module is currently considered highly experimental. There's a good chance that installing it will cause fatal errors on your site. Please backup your data before installing the module and/or enabling it. If you run into any problems, please open a GitHub issue at https://github.com/teppokoivula/SearchEnginePdfIndexer/issues/new.
- Install and configure SearchEngine
- Install SearchEnginePdfIndexer, preferably via Composer (composer require teppokoivula/search-engine-pdf-indexer)
- If you installed SearchEnginePdfIndexer via modules manager or file upload, run composer installin the directory of the module
- Configure SearchEnginePdfIndexer
Installing
This module can be installed by downloading or cloning the SearchEnginePdfIndexer directory into the /site/modules/ directory of your site, but the recommended method installign it using Composer: composer require teppokoivula/search-engine-pdf-indexer. Composer installation takes care of dependencies automatically, which makes following steps easier.
License
This project is licensed under the Mozilla Public License Version 2.0. For licensing of any third party dependencies that this module interfaces with, see their respective README or LICENSE files.