A package to perform OCR on the pages in a IIIF Manifest.
ocr_manifest is a Python package that provides a script, ocr-manifest, to download OCR files from a Manifest URI and write them to a specified directory.
To install the ocr_manifest package, ensure you have uv installed to manage your project dependencies. Then, follow these steps:
- Clone the repository or download the source code.
- Install the package and its dependencies using uv:
uv install
- Verify that the ocr-manifest script is installed and available in
your PATH:
ocr-manifest --help
The ocr-manifest script processes a Manifest URI and downloads the corresponding OCR files to a specified directory.
ocr-manifest <manifest_uri> <output_directory>
- <manifest_uri>: The URI of the Manifest to process.
- <output_directory>: The directory where OCR files will be written.
ocr-manifest https://example.com/manifest.json output_files
After execution, all the OCR files will be available in the output_files directory.