Apache Tika for TYPO3

Fund package maintenance!
www.typo3-solr.com

Installs: 448 637

Dependents: 0

Suggesters: 1

Security: 0

Stars: 6

Watchers: 9

Forks: 29

Open Issues: 7

Type:typo3-cms-extension

13.0.0-alpha-1 2024-10-07 14:46 UTC

README

Build Status Latest Stable Version License Total Downloads Monthly Downloads TYPO3 13

A TYPO3 CMS extension that provides Apache Tika functionality including

  • text extraction
  • meta data extraction
  • language detection (from strings or files)

Tika can be used as standalone Tika app/jar, Tika server, and via SolrCell integrated in Apache Solr.

We're open for contributions !

Please find further information regarding Apache Tika on the project's homepage

Powered by the TYPO3 community and

dkd Internet Service GmbH

Continuous Integration

We use GitHub Actions for continuous integration.

To run the test suite locally, please use our DDEV docker environment https://github.com/TYPO3-Solr/solr-ddev-site.

Note: This requires a proper combination of branches:

  • solr-ddev-site on main branch
    • packages/ext-solr on main
    • packages/ext-tika on main
  • Please refer to version matrix for proper combination of branches
ddev solr:enable tika
ddev composer t3:standards:fix packages/ext-tika/
ddev composer tests:tika:phpstan
ddev composer tests:tika:unit
ddev composer tests:tika:integration

Contributions

  1. Fork the repository
  2. Clone repository
  3. Create a new branch
  4. Make your changes
  5. Commit your changes to your fork. In your commit message refer to the issue number if there is already one, e.g. [BUGFIX] short description of fix (resolves #4711)
  6. Submit a Pull Request (here are some hints on How to write the perfect pull request)

Keep your fork in sync with original repository

  1. git remote add upstream https://github.com/TYPO3-Solr/ext-tika.git
  2. git fetch upstream
  3. git checkout master
  4. git merge upstream/master
  5. git push origin master