Releases: OCR-D/ocrd_all

v2025-07-04

04 Jul 16:37
@kba kba

cor-asv-ann 7ce4631..0746af1

Release: v0.2.0

  • docker: prepackage ocrd-all-module-dir.json
  • docker: use latest core base stage
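
Each module heading in these notes, such as `cor-asv-ann 7ce4631..0746af1`, pairs a module name with the commit range it moved across. The following is a minimal sketch for reading such a heading, assuming the `name old..new` shape seen here with abbreviated hexadecimal SHAs. `ModuleUpdate`, `parse_module_heading` and `git_log_command` are hypothetical names chosen for illustration, not part of ocrd_all, and the sketch further assumes each module is checked out in a directory named after it.

```python
import re
from typing import List, NamedTuple, Optional


class ModuleUpdate(NamedTuple):
    """One module heading: module name plus old and new commit."""
    name: str
    old: str
    new: str


# "<name> <old>..<new>", where both commits are 7 to 40 hex digits
_HEADING = re.compile(
    r"^(?P<name>\S+)\s+(?P<old>[0-9a-f]{7,40})\.\.(?P<new>[0-9a-f]{7,40})$"
)


def parse_module_heading(line: str) -> Optional[ModuleUpdate]:
    """Return the parsed heading, or None if the line is not a heading."""
    match = _HEADING.match(line.strip())
    if match is None:
        return None
    return ModuleUpdate(match["name"], match["old"], match["new"])


def git_log_command(update: ModuleUpdate) -> List[str]:
    """Build a git command listing the commits in old..new.

    `git log A..B` shows commits reachable from B but not from A.
    Assumption: the module lives in a directory named like the module.
    """
    return ["git", "-C", update.name, "log", "--oneline",
            f"{update.old}..{update.new}"]
```

Lines such as `Release: v0.2.0` or the bullet entries do not match the pattern, so the parser returns `None` for them and a caller can skip them.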

core 66bb7bc..8b854e9

Release: v3.5.1

  • 📦 v3.5.1
  • pyproject.toml: Saner pinning of numpy build requirements
  • pin numpy<2 shapely <2.0.2 for py3.8
  • 📦 v3.5.0
  • 📝 changelog
  • Merge branch 'master' into page-id-logical-filter
  • Merge remote-tracking branch 'bertsky/fix-network-agent-type-click' into page-id-logical-filter
  • OcrdMets.get_physical_pages(for_pageIds): more syntax checks
  • OcrdMets.get_physical_pages(for_pageIds): store has_matched in pat
  • OcrdMets.get_physical_pages(for_pageIds): keep matching pages
  • OcrdMets.get_physical_pages(for_pageIds): prettier types
  • tests: adapt to Click 8.2 API change
  • OcrdMets.get_physical_pages(for_pageIds): introduce attribute disambiguation prefixes
  • OcrdMets.get_physical_pages(for_pageIds): refactor with dataclasses for patterns
  • OcrdMets.get_physical_pages(for_pageIds): also support logical @Label
  • test_ocrd_mets: cover logical div attribute selection
  • OcrdMets.get_physical_pages(for_pageIds): support logical structMap div attribute matching
  • test_workspace: cover negative page selection
  • test_processor: cover negative page selection
  • test_ocrd_mets: cover negative page selection
  • OcrdMets.get_physical_pages(for_pageIds): support negative subpatterns
  • require frozendict 2.4 (needed for json compatibility)
  • 📦 v3.4.0
  • 📝 changelog
  • Merge branch 'network-logging-permissions'
  • 📝 changelog
  • Merge remote-tracking branch 'bertsky/add-test-cuda'
  • 📝 changelog
  • cli.workspace.find: fix --undo-download
  • ClientSideOcrdMets: fix typo in exception
  • Workspace.resolve_image_exif|resolve_image_as_pil: try as local path first
  • mets_server.OcrdFileModel: do not convert None to str

dinglehopper 946d77d..7cbecd7

Release: v1.0.0.pre3

  • docker: prepackage ocrd-all-module-dir.json
  • docker: use latest core base stage

docstruct 1183e1a..2d33f7e

  • docker: prepackage ocrd-all-module-dir.json
  • docker: use latest core base stage

eynollah 41318f0..b7b218f

Release: v0.4.0

  • OCR-D processor: same behavior as standalone wrt light_version/textline_light
  • Fixed duplicate textline_light assignments (true and false) in the OCR-D framework for the Eynollah light version, which caused rectangles to be used instead of contours for textlines
  • Merge pull request #159 from bertsky/main
  • Fix model name to return the correct machine-based model name
  • Fix OCR text cleaning to correctly handle 'U', 'K', and 'N' starting sentence; update text line splitting size
  • Resolve remaining issue with #158 and resolving #124
  • resolving issue #158
  • Merge pull request #157 from qurator-spk/kba-patch-1
  • Merge pull request #154 from qurator-spk/ci-pypi

format-converters 0894801..7667455

Release: v0.1.0

  • Merge pull request #28 from bertsky/setuptools

nmalign 50edf46..7c37ad9

Release: v0.0.5

  • docker: prepackage ocrd-all-module-dir.json
  • docker: use latest core base stage

ocrd_anybaseocr 0aa1d57..8a78663

Release: v2.0.0

  • docker: use latest core base stage
  • docker: prepackage ocrd-all-module-dir.json

ocrd_calamari 416420f..4d66fa1

Release: v2.1.0

  • 📝 v2.1.0
  • 📝 changelog
  • Merge pull request #124 from bertsky/master

ocrd_detectron2 10aea24..6ff0a12

Release: v0.2.0

  • docker: prepackage ocrd-all-module-dir.json
  • docker: use latest core base stage

ocrd_doxa a6df2f2..d391f05

Release: v0.0.3

  • dockerfile: fix typo
  • docker: prepackage ocrd-all-module-dir.json
  • docker: use latest core base stage

ocrd_fileformat 2e821ea..aee593f

Release: v0.12.0

  • 📦 v0.12.0
  • 📝 changelog
  • Merge pull request #64 from bertsky/pythonize
  • 📝 changelog
  • Update ocr-fileformat to v0.8.0

ocrd_froc 323a6ba..8af9441

Release: v1.1.0

  • 📦 v1.1.0
  • 📝 changelog
  • Merge pull request #22 from bertsky/main

ocrd_keraslm 676cc00..71dc491

Release: v0.5.0

  • docker: prepackage ocrd-all-module-dir.json
  • docker: use latest core base stage

ocrd_kraken 4309d91..b98dd48

Release: v1.0.1

  • docker: prepackage ocrd-all-module-dir.json
  • docker: use latest core base stage

ocrd_neat b9cb043..5d9d9f0

Release: v0.0.1

  • Update README.md
  • Update cli help text.
  • Update README
  • Update cli help text.
  • Update required Python version; update README

ocrd_olahd_client 3a90373..4a9c9d5

Release: v1.1.0

  • 📦 v1.1.0
  • 📝 changelog
  • Merge pull request #16 from bertsky/master

ocrd_olena bf5cf76..7f153df

Release: v2.0.0

  • docker: prepackage ocrd-all-module-dir.json
  • docker: use latest core base stage

ocrd_page2alto 841b614..c35b160

Release: v2.1.0

  • CI: build with python 3.9
  • CI: make build
  • 📦 v2.1.0
  • 📝 changelog
  • Merge pull request #48 from bertsky/master

ocrd_pagetopdf c6b414d..393ae0b

Release: v2.0.2

  • 📦 v2.0.2
  • altotopdf: support running on missing ALTO files for some pages (by creating an empty PAGE for the image)
  • multipage: re-enter workspace.directory for correct relative paths
  • package-data: fix missing PageConverter.java
    *...

v2025-04-17

17 Apr 18:17
@kba kba

Added:

  • make ocrd-all-meta.json: Generate mapping from processor to dockerhub/version/commit SHA, #461

Removed:

  • ocrd_pc_segmentation: long unmaintained, and better alternatives exist, #464
  • ocrd_repair_inconsistencies: upstream is archived and unsupported, #464

cor-asv-ann eb47a54..7ce4631

Release: v0.2.0

  • ocrd-tool.json: fix typo
  • ocrd-tool.json: forgot to add dockerhub key
  • CD PyPI: use latest packaging to fix metadata issue with twine
  • fix license string to conform to PyPI req
  • CD PyPI: use Pythonic base image
  • docker: add vars
  • Partial revert of "CI: install hunspell for ocrd-cor-asv-ann-mark test"…
  • CI: differentiate store_test_results / store_artifacts
  • requirements_test: fix typo (xdist instead of isolate)
  • CI: reduce workload further, isolate tests to free memory
  • CI: install hunspell for ocrd-cor-asv-ann-mark test
  • tests: reduce workload of combinations to fit into 1h without GPU
  • CI: cache model download, increase timeout
  • CI: forgot to prefix model download directory
  • CI: forgot to stage actual removal of Orb
  • 📦 0.2.0
  • CI: switch from Orb to CLI for Codecov
  • CI: change codecov orb version
  • setup.py → pyproject.toml (w/ dynamic setuptools-ocrd versioning)
  • ocrd-tool.json: add 2 model files as examples
  • add coverage, add tests to CI and pypi to CD
  • docker: update+improve…
  • increase default batch size for better GPU utilisation
  • autosized generators: make work with Keras 2.3.x (stateful metrics)
  • train: add model checkpointing callback
  • seq2seq: be robust if EOL is missing in binary (prob/confmat) line input
  • seq2seq: cover non-string line input (prob/confmat) in fallback
  • transcode: simplify updating sequence, improve logging…
  • transcode: pass all input TextEquiv alternatives…
  • fix charmap kwarg init
  • transcode test: abort on failure
  • add test for transcode only, cover more fileGrps and parameters, check and compare confidences
  • transcode: also remove lower hierarchy levels for consistency
  • transcode: utilise .parent_object_, simplify
  • tests: eval all against 'MULTI', test 'mark' separately, assert equal CER for both 'eval' strategies
  • replace editdistance with rapidfuzz
  • add tests
  • migrate all processors to ocrd>=3.0
  • CircleCI: separate CD and CI

core 80eb58c..66bb7bc

Release: v3.3.2

  • 📦 v3.3.2
  • 📝 changelog
  • test_mets_server: escape '.' in assertion
  • ClientSideOcrdMets.init: also escape '.' in UDS paths
  • Workspace.init / Resolver.resolve_mets_arguments: also resolve UDS path
  • 📦 v3.3.1
  • 📝 changelog
  • ocrd-tool.json: add dockerhub key
  • Processor.zip_input_files: fix check for OCRD_MISSING_INPUT
  • deps-ubuntu: do apt-get update first
  • ocrd_cli_wrap_processor: strictly require non-empty input_file_grp kwarg
  • GHA CI: remove u20 runner here, too
  • ocrd_cli_wrap_processor: allow 'None' for file grp arguments
  • run_processor/get_processor: replace 'None' with empty str

dinglehopper 071e6a8..9573539

Release: v1.0.0pre2

  • pyproject: support 3.8 as well
  • Merge remote-tracking branch 'bertsky/allow-uniseg-py38'
  • typo: report_prefix -> file_id
  • ocrd_cli: but do check for existing output files
  • ocrd_cli: no need to check fileGrp dir exists
  • OCR-D processor: properly handle missing or non-downloaded GT/OCR file
  • Dockerfile: build ocrd-all-tool.json
  • adapt tests
  • use Levenshtein.normalized_distance instead of distance
  • 🧹Don't pin uniseg and rapidfuzz
  • Port to OCR-D/core API v3
  • Merge pull request #136 from qurator-spk/chore/update-liccheck
  • Merge pull request #135 from qurator-spk/chore/update-python-version
  • Merge pull request #132 from qurator-spk/fix/uniseg-removed-index-parameter
  • Merge pull request #131 from qurator-spk/chore/update-pre-commit
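
The entry above about using `Levenshtein.normalized_distance` instead of `distance` changes the scale of the metric: a raw edit distance grows with text length, while a normalized one stays between 0 and 1. The following is a pure-Python sketch of that difference, assuming unit costs for every edit and normalization by the length of the longer string. It is an illustration only, not the rapidfuzz or dinglehopper code, and both function names are hypothetical.

```python
def levenshtein(a: str, b: str) -> int:
    """Edit distance with unit cost for insert, delete and substitute."""
    prev = list(range(len(b) + 1))  # distances from "" to each prefix of b
    for i, ca in enumerate(a, start=1):
        cur = [i]  # distance from a[:i] to ""
        for j, cb in enumerate(b, start=1):
            cur.append(min(
                prev[j] + 1,               # delete ca
                cur[j - 1] + 1,            # insert cb
                prev[j - 1] + (ca != cb),  # substitute, or free match
            ))
        prev = cur
    return prev[-1]


def normalized_levenshtein(a: str, b: str) -> float:
    """Edit distance divided by the longer length (0.0 for two empty strings)."""
    longest = max(len(a), len(b))
    return levenshtein(a, b) / longest if longest else 0.0
```

With this normalization, three edits in a seven-character word weigh far more than three edits in a full page of text, which makes scores comparable across inputs of different length.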

docstruct 004e6ec..1183e1a

  • ocrd-tool.json: forgot to add dockerhub key
  • Merge pull request #2 from bertsky/migrate-ocrd-v3

eynollah 8080bd8..41318f0

Release: v0.4.0

  • 📝 changelog
  • Restoring the contour in the original image caused an error due to an empty tuple. This issue has been resolved, and as expected, the confidence score for this contour is set to zero

format-converters fa8b4b5..0894801

  • Merge pull request #25 from stweil/PEP8

nmalign 1698cd8..50edf46

Release: v0.0.5

  • ocrd-tool.json: forgot to add dockerhub key

ocrd_anybaseocr 0c67107..0aa1d57

Release: v2.0.0

  • 📦 v2.0.0
  • 📝 changelog
  • Merge pull request #113 from bertsky/v3-api
  • ocrd-tool: don't restrict resource_locations
  • Be defensive about uninitializing possibly uninitialized model and label_mapping
  • 3.8 typing
  • self.reset at the end of process_workspace, del self.model in shutdown
  • move models to root of package
  • adapt tests
  • rewrite with process_workspace and deferred METS modification
  • rely on resolve_resource for model path
  • use contextmanager to open pickled label mapping
  • Merge branch 'v3-api' of https://github.com/OCR-D/ocrd_anybaseocr into v3-api
  • remove redundant model-exists check
  • CI: larger resource_class
  • Merge branch 'v3-api' of https://github.com/OCR-D/ocrd_anybaseocr into v3-api
  • pickle was ignored
  • add model to package data
  • reintroduce layout-analysis
  • remove constants
  • convert to src-layout
  • remove old mets files
  • remove pix2pixhd
  • fix setuptools packages
  • CI: remove references to models
  • adapt cropper and test
  • remove everything that is not cropping-related
  • convert cropper to v3 API, setup.py -> pyproject.toml

ocrd_cis db65d7f..5cf22f5

Release: v0.2.0

  • fix license classifier for PyPI
  • PyPI CD: forgot the actual build!
  • PyPI CD: use whatever is in dist
  • PyPI CD: strip 'v' prefix from git tag
  • add PyPI CD
  • 📦 0.2.0
  • add 'build' and 'help' targets
  • setup→pyproject
  • Docker: shortcut ocrd-all-tool.json via ocrd-tool.json
  • Merge pull request #5 from MehmedGIT/port-to-v3

ocrd_detectron2 218e0b5..10aea24

Release: v0.2.0

  • ocrd-tool.json: forgot to add dockerhub key
  • relax setuptools version requirement
  • update setuptools to fix detectron2 build
  • CI: remove Python 3.11 (does not work with detectron2)
  • Update CHANGELOG.md
  • Merge pull request #31 from bertsky/migrate-ocrd-v3

ocrd_doxa 15e8423..a6df2f2

Release: v0.0.3

  • update changelog
  • update dockerfile:
  • setup.py → pyproject.toml (w/ versioning based on ocrd-tool.json)
  • add CI
  • try to avoid doxapy allocation problems
  • add tests (from ocrd_olena)
  • migrate to ocrd >= 3 API
  • improve makefile

ocrd_froc 5a5d774..323a6ba

Release: v1.0.1

  • 📦 v1.0.1
  • 📝 changelog
  • Merge pull request #20 from bertsky/patch-1
  • 📦 v1.0.0
  • 📝 changelog
  • Dockerfile: COPY ocrd-tool.json to avoid symlink
  • make docker: podman does not support --progress
  • CI/GHA: actually run make test
  • CI: docker
  • Dockerfile: set locale/XDG...

v2025-04-11

11 Apr 16:20
@kba kba

Changed:

  • Disable ocrd_detectron2 and ocrd_segment for OSX, #460

cor-asv-ann eb47a54..bc6a590

Release: v0.2.0

  • CD PyPI: use latest packaging to fix metadata issue with twine
  • fix license string to conform to PyPI req
  • CD PyPI: use Pythonic base image
  • docker: add vars
  • Partial revert of "CI: install hunspell for ocrd-cor-asv-ann-mark test"…
  • CI: differentiate store_test_results / store_artifacts
  • requirements_test: fix typo (xdist instead of isolate)
  • CI: reduce workload further, isolate tests to free memory
  • CI: install hunspell for ocrd-cor-asv-ann-mark test
  • tests: reduce workload of combinations to fit into 1h without GPU
  • CI: cache model download, increase timeout
  • CI: forgot to prefix model download directory
  • CI: forgot to stage actual removal of Orb
  • 📦 0.2.0
  • CI: switch from Orb to CLI for Codecov
  • CI: change codecov orb version
  • setup.py → pyproject.toml (w/ dynamic setuptools-ocrd versioning)
  • ocrd-tool.json: add 2 model files as examples
  • add coverage, add tests to CI and pypi to CD
  • docker: update+improve…
  • increase default batch size for better GPU utilisation
  • autosized generators: make work with Keras 2.3.x (stateful metrics)
  • train: add model checkpointing callback
  • seq2seq: be robust if EOL is missing in binary (prob/confmat) line input
  • seq2seq: cover non-string line input (prob/confmat) in fallback
  • transcode: simplify updating sequence, improve logging…
  • transcode: pass all input TextEquiv alternatives…
  • fix charmap kwarg init
  • transcode test: abort on failure
  • add test for transcode only, cover more fileGrps and parameters, check and compare confidences
  • transcode: also remove lower hierarchy levels for consistency
  • transcode: utilise .parent_object_, simplify
  • tests: eval all against 'MULTI', test 'mark' separately, assert equal CER for both 'eval' strategies
  • replace editdistance with rapidfuzz
  • add tests
  • migrate all processors to ocrd>=3.0
  • CircleCI: separate CD and CI

docstruct 004e6ec..f8c4e46

  • Merge pull request #2 from bertsky/migrate-ocrd-v3

format-converters fa8b4b5..0894801

  • Merge pull request #25 from stweil/PEP8

ocrd_anybaseocr 0c67107..0aa1d57

Release: v2.0.0

  • 📦 v2.0.0
  • 📝 changelog
  • Merge pull request #113 from bertsky/v3-api
  • ocrd-tool: don't restrict resource_locations
  • Be defensive about uninitializing possibly uninitialized model and label_mapping
  • 3.8 typing
  • self.reset at the end of process_workspace, del self.model in shutdown
  • move models to root of package
  • adapt tests
  • rewrite with process_workspace and deferred METS modification
  • rely on resolve_resource for model path
  • use contextmanager to open pickled label mapping
  • Merge branch 'v3-api' of https://github.com/OCR-D/ocrd_anybaseocr into v3-api
  • remove redundant model-exists check
  • CI: larger resource_class
  • Merge branch 'v3-api' of https://github.com/OCR-D/ocrd_anybaseocr into v3-api
  • pickle was ignored
  • add model to package data
  • reintroduce layout-analysis
  • remove constants
  • convert to src-layout
  • remove old mets files
  • remove pix2pixhd
  • fix setuptools packages
  • CI: remove references to models
  • adapt cropper and test
  • remove everything that is not cropping-related
  • convert cropper to v3 API, setup.py -> pyproject.toml

ocrd_cis db65d7f..5cf22f5

Release: v0.2.0

  • all: adapt to more Numpy and Shapely changes/deprecations
  • get rid of remaining python-levenshtein (in favour of rapidfuzz)
  • all: migrate importlib instead of EOL pkg_resources via ocrd_utils
  • all: migrate to OCR-D v3 API (including METS Server and page multiprocessing)
  • all: some code style, type hints and optimization
  • tests: simplify, update, improve, test serially as well as page-parallel
  • CI: ditch CircleCI, add Github Actions
  • ocrd-tool.json: add dockerhub=ocrd/cis
  • docker: add labels metadata conforming to spec
  • docker: preinstall shortcut ocrd-all-tool.json via ocrd-tool.json
  • setup.py → pyproject.toml, versioning based on ocrd-tool.json
  • add continuous deployment for Dockerhub
  • add continuous deployment for PyPI

ocrd_detectron2 218e0b5..3e5f838

Release: v0.2.0

  • relax setuptools version requirement
  • update setuptools to fix detectron2 build
  • CI: remove Python 3.11 (does not work with detectron2)
  • Update CHANGELOG.md
  • Merge pull request #31 from bertsky/migrate-ocrd-v3

ocrd_froc 5a5d774..323a6ba

Release: v1.0.1

  • 📦 v1.0.1
  • 📝 changelog
  • Merge pull request #20 from bertsky/patch-1
  • 📦 v1.0.0
  • 📝 changelog
  • Dockerfile: COPY ocrd-tool.json to avoid symlink
  • make docker: podman does not support --progress
  • CI/GHA: actually run make test
  • CI: docker
  • Dockerfile: set locale/XDG variables, use COPY . . and .dockerignore
  • Dockerfile: base on ocrd/core-cuda-torch
  • Update requirements-test.txt
  • tests: use subtests.test instead of plain for loop
  • unit tests
  • processor: handle ocr_method=none vs ocr_method=adaptive correctly
  • OCR-D processor: restrict to 1 worker due to single CUDA context
  • Update pyproject.toml
  • Convert to OCR-D v3

ocrd_pagetopdf 00baf15..c6b414d

Release: v2.0.1

  • 📦 v2.0.1
  • 📝 changelog
  • add changelog
  • improve multipage structmap extraction (more robust, more cases)
  • tests: debug logging only for processors
  • deps-ubuntu: add apt-get update; add deps-test
  • Merge pull request #29 from OCR-D/missing-import-re

v2025-04-08

08 Apr 10:23
@kba kba

Removed:

  • sbb_binarization: Integrated into eynollah as ocrd-sbb-binarize
  • ocrd_im6convert: Same functionality with improvements available as ocrd-preprocess-image in ocrd_wrap

core 71c0c19..80eb58c

Release: v3.3.0

  • 📦 3.3.0
  • 📝 changelog
  • Revert "Merge remote-tracking branch 'bertsky/resmgr-type-checking'"
  • 📦 v3.2.0
  • 📝 changelog
  • Merge remote-tracking branch 'bertsky/resmgr-type-checking'
  • 📝 changelog
  • update spec to v3.37.0
  • 📝 changelog
  • Merge remote-tracking branch 'bertsky/update-validator'
  • 📝 changelog
  • fix workdir in Dockerfile.cuda-torch
  • fix deps-tf1 requirements DIRSTACK fix
  • 📦 v3.1.2
  • 📝 changelog
  • deps-tf1: fix 7a2c4c7
  • 📦 v3.1.1
  • 📝 changelog
  • test_processor timeouts: more tolerant
  • test_resmgr: avoid name clash with preconfigured ocrd-dummy
  • update ocrd-all-tool.json
  • Dockerfile: repeat ARG BASE_IMAGE so it can be referenced below
  • nw integration test: pass build ctxt for ocrd_core_test everywhere
  • nw integration test: never pull ocrd_core_test (wait for build in PS service)
  • deps-conda: new target for non-ubuntu platforms
  • deps-cuda: expose get-conda independently, re-use if present, otherwise use Micromamba for current OS/platform
  • docker*: pass build args, also tag with version if gitref is release
  • Dockerfile*: always provide defaults for ARG, use upper case AS
  • docker-{torch,tf{1,2}}: smoke-test build, too
  • deps-{torch,tf{1,2}}: ensure core's requirements.txt is compatible
  • 📦 v3.1.0
  • 📝 changelog
  • Merge branch 'update-spec'
  • 📝 changelog
  • ocrd-tool: add command 'dump-tools'
  • adapt processor tests some more
  • adapt processor tests
  • Processor.download: apply config.OCRD_DOWNLOAD_INPUT no sooner than available, remove useless download_files kwarg
  • 📦 v3.0.4
  • 📝 changelog
  • update assets and adapt tests to match
  • tests.processor: increase timeout tolerance slightly
  • run_processor: handle frozendict or empty parameter in profile log
  • tests.processor: check profile log
  • tests.processor: add case adding document-global file (in METS Server mode, too)
  • tests.processor: add case adding document-global file
  • tests: OcrdPage roundtrip w/ full assertions
  • OcrdPage: re-generate
  • fix c47ae77c (pass on namespaceprefix_)
  • tests: add OcrdPage roundtrip
  • mets server: fix init of 'page_id' in POST /file
  • mets server: downgrade log messages INFO→DEBUG
  • test_workspace: use different asset w/ loctype URL
  • 📦 v3.0.3
  • update assets and spec
  • 📝 changelog
  • Merge remote-tracking branch 'bertsky/fix-processpool-loglistener-shutdown'
  • 📦 v3.0.2
  • 📝 changelog
  • fixup 39f76258: get queue type from mp.get_ctxt
  • Processor.ProcessPoolExecutor: use queue.Queue instead of mp.Queue for QueueHandler
  • Workspace.image_from_*: remove PIL workaround
  • ocrd_network.cli.client processing: fix CLI arg order
  • ocrd.decorators: update/improve docstring
  • config.OCRD_MAX_PARALLEL_PAGES: improve docstring
  • 📦 v3.0.1
  • 📝 changelog
  • Processor.ProcessPoolExecutor: start QueueListener only after executor to avoid thread interference
  • Processor.DummyExecutor.shutdown: reset global processor instance, too
  • CI: Use token authentication for pypi
  • 📦 🔥 v3.0.0
  • remove 3.8 breaking typing hints
  • 📝 changelog
  • Merge remote-tracking branch 'bertsky/new-processor-api-xpath-filter' into v3-release
  • 📝 changelog
  • Merge remote-tracking branch 'origin/initLogging-before-imports' into v3-release
  • 📝 changelog
  • Merge branch 'fix-help-output' into v3-release
  • 📝 changelog
  • Merge remote-tracking branch 'origin/master' into v3-release
  • merge master
  • 📦 v3.0.0b7
  • Processor parallel pages: log via QueueHandler in subprocess, QueueListener in main
  • Processor: replace weakref with del to trigger shutdown
  • initLogging: call disableLogging if already initialized and force_reinit
  • Processor: update max_workers docstring
  • logging: increase default root (not ocrd) level from INFO to WARNING
  • setOverrideLogLevel: override all currently active loggers' level
  • disableLogging: remove all handlers, reset all levels
  • initLogging: only add root handler instead of multiple redundant handlers with propagate=false
  • initLogging: do not remove any previous handlers/levels
  • tests: prevent side effects from ocrd_logging
  • 📦 v3.0.0b6
  • Merge pull request #23 from bertsky/new-processor-api-process-worker
  • Processor.process_page_file: avoid process_page_pcgts() if OCRD_EXISTING_OUTPUT!=OVERWRITE
  • ocrd.cli.workspace: use physical_pages if possible, fix default output_field
  • OcrdMets.get_physical_pages: cover return_divs w/o for_fileIds for_pageIds
  • ocrd.cli.workspace: assert non-server in cmds mutating METS
  • 📝 improve b5 changelog
  • 📦 v3.0.0b5
  • update OcrdPage from generateds
  • OcrdPage: add PageType.get_ReadingOrderGroups()
  • tests: make sure ocrd_utils.config gets reset whenever changing it globally
  • test processors: no need for 'force' kwarg anymore
  • Processor / Workspace.add_file: always force if config.OCRD_EXISTING_OUTPUT==OVERWRITE
  • lib.bash input-files: do not try to validate tasks here (impossible to get right with required parameters, and now covered by wrapped Processor.verify() already)
  • lib.bash input-files: pass on --mets-server-url, --overwrite, and parameters (necessary for required params)
  • lib.bash: fix errexit
  • run_processor: be robust if ocrd_tool is missing steps
  • Processor.verify: check output fileGrps as well (or OCRD_EXISTING_OUTPUT=OVERWRITE|SKIP or disjoint --page-id)
  • Processor.process_workspace(): do not show NotImplementedError context if fallback process() raises anything itself
  • ocrd.cli.validate tasks: pass on --mets-server-url, too
  • ocrd.cli.bashlib input-files: pass on --mets-server-url, too
  • ocrd.cli.workspace server: add 'reload' and 'save'
  • ocrd.cli.workspace: consistently pass on --mets-server-url and --backup (also, simplify)
  • METS Server: also export+delegate physical_pages
  • processor CLI: delegate --resolve-resource, too
  • ocrd.cli.ocrd-tool resolve-resource: fix (forgot to print result)
  • PcGts.Page.id / make_xml_id: replace '/' with '_'
  • Processor.process_page_file / OcrdPageResultImage: allow None instead of AlternativeImageType
  • Processor.verify: revert 5819c816 (we still have no defaults in json loaded from v2)
  • add typing, extend docs
  • test_processor: add test for force (overwrite) w/ METS Server
  • test_mets_server: add test for force (overwrite)
  • OcrdMetsServer.add_file: pass on 'force' kwarg, too
  • 📦 v3.0.0b4
  • 📝 changelog
  • Workspace.reload_mets: fix for METS server case
  • Processor: log when max_workers / max_page_seconds are in effect
  • Processor.metadata_location: find location package prefix (necessary for namespace packages)
  • 📦 v3.0.0b3
  • ocrd.cli doc: don't rewrap description lists
  • Processor: improve processing log messages
  • add test for OcrdEnvConfig.reset_defaults()
  • ocrd_utils.config: add reset_defaults()
  • 📦 v3.0.0b2
  • 📝 update readmes (esp. new config variables)
  • 📝 update changelog
  • test_processor: add legacy (v2-style) dummy case
  • bashlib: re-add --log-filename, implement as stderr redirect
  • Processor.generate_processor_help: forgot to include --log-filename
  • Processor: fix 7966057f (deprecated passing of ocrd_tool or version via init)
  • test-logging: also remove ocrd.log from tempdir
  • disableLogging: re-instate root logger, too
  • ClientSideOcrdMets: partial revert of 9b80ae17ef
  • tests: adapt to wording of exceptions
  • test processor timeout/parallel: avoid side effects to dummy tool json
  • test_mets_server: use tmpdir to avoid side effects between suites
  • Processor: fix ignore (negative/zero) cases for max_workers / max_page_seconds
  • ClientSideOcrdMets: use same logger name prefix as server
  • 📝 update changelog
  • add tests for processor per-page timeout and parallelism
  • Processor: add per-page timeouts and parallelism…
  • Merge pull request #19 from OCR-D/new-processor-api-fix-editable
  • adapt tests for OCRD_MAX_MISSING_OUTPUTS
  • Processor.process_workspace: abort anyway if too many failures (OCRD_MAX_MISSING_OUTPUTS)
  • generate_processor_help: avoid repeating docstrings from superclass
  • 📦 v3.0.0b1
  • Merge branch 'master' into new-processor-api
  • Processor.process_page_file / OcrdPageResultImage: allow PageType instead of AlternativeImageType
  • test_bashlib: allow testing prereleases successfully
  • 📝 update changelog
  • Merge pull request #14 from bertsky/new-processor-api-parameter-setup
  • remove make *-workaround, we will not do that for v3+
  • 📦 v3.0.0a2
  • 📝 changelog
  • processor CLI --debug: set all to ABORT (not just MISSING_OUTPUT)
  • predefine union types OcrdFileType and OcrdPageType
  • JsonValidator: fix deprecation warning (by actually checking instance)
  • tests report.is_valid: improve output on failure...
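
The v3.0.0b7 entry above, "Processor parallel pages: log via QueueHandler in subprocess, QueueListener in main", refers to a standard-library logging pattern: workers only enqueue log records, and a single listener hands them to the real handlers. The following is a single-process sketch of that pattern using `queue.Queue`. It is not the code from core; `make_queue_logging`, `ListHandler` and the logger name `demo.worker` are hypothetical, and logging across real subprocesses would need a queue that can cross process boundaries.

```python
import logging
import logging.handlers
import queue


def make_queue_logging(target: logging.Handler):
    """Return a (QueueHandler, QueueListener) pair sharing one queue."""
    log_queue = queue.Queue()
    listener = logging.handlers.QueueListener(log_queue, target)
    handler = logging.handlers.QueueHandler(log_queue)
    return handler, listener


class ListHandler(logging.Handler):
    """Stand-in for a real handler: collects formatted messages in a list."""

    def __init__(self):
        super().__init__()
        self.messages = []

    def emit(self, record):
        self.messages.append(record.getMessage())


collector = ListHandler()
queue_handler, listener = make_queue_logging(collector)

# The "worker" side only knows about the queue handler.
worker_log = logging.getLogger("demo.worker")
worker_log.setLevel(logging.INFO)
worker_log.propagate = False
worker_log.addHandler(queue_handler)

listener.start()                      # the "main" side drains the queue
worker_log.info("page %s done", "PHYS_0001")
listener.stop()                       # handles queued records, then joins
```

Because only the listener touches the target handler, concurrent workers never write to the same stream or file at once.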

v2025-02-11

12 Feb 12:13
@kba kba

core 85bde15..71c0c19

Release: v2.71.1

  • 📦 v2.71.1
  • 📝 changelog
  • ocrd_cli_wrap_processor: always do initLogging
  • 📦 v2.71.0
  • 📝 changelog
  • Merge remote-tracking branch 'github/initlogging-root'
  • 📝 changelog
  • Merge remote-tracking branch 'github/fix-make-docs'
  • 📝 changelog
  • Merge remote-tracking branch 'github/fix-network-logging'
  • fix internal_callback logging message
  • deps-cuda: make sure cudnn gets installed, pin torch version for CUDA 11

cor-asv-ann ff6bf3f..eb47a54

Release: v0.1.15

  • CircleCI: add CD via Dockerhub
  • CD: rebase Docker on ocrd/core-cuda-tf1 stage

eynollah 51f6ef6..bc9dddd

Release: v0.3.1

  • Update README.md
  • relax tf2 requirement to < 2.13

ocrd_anybaseocr 3459b41..483848f

Release: v1.10.0

  • Merge remote-tracking branch 'bertsky/patch-3'
  • Merge branch 'fix-ci-versions'
  • Merge branch 'distutils'
  • Merge pull request #101 from bertsky/master
  • 📦 v1.10.0
  • docker: build on top of 2.63
  • Base Docker on core-cuda
  • Set docker metadata with makefile
  • Add things required by spec to Dockerfile
  • Clear pip cache after installing in Dockerfile
  • Rename build folder to not interfere with core-img
  • Update dockerfile

ocrd_calamari d9cde1f..d7c30b5

Release: v1.0.6

  • Merge pull request #119 from OCR-D/dockerfile

ocrd_fileformat 8ab078d..2e821ea

Release: v0.11.2

  • 📦 v0.11.2
  • 📝 changelog
  • update repo/ocr-fileformat to current master

ocrd_froc 42f1ce0..5a5d774

Release: v0.6.1

  • Merge pull request #18 from OCR-D/dockerfile

ocrd_kraken a6160ce..17cefc6

Release: v0.4.1

  • Merge pull request #45 from OCR-D/dockerfile

ocrd_neat 06c8b38..1982216

Release: v0.0.1

  • fix command line output
  • make code more robust
  • remove spam
  • add tsv2tsv tool; make easy re-processing of TSV files possible
  • Merge pull request #17 from r0man-ist/patch-1

ocrd_pagetopdf 7c5ab70..7e8c860

Release: v1.1.0

  • Fix dockerimage creation

opencv-python 6a181ce..255564a

Release: 86

  • Merge pull request #1076 from asmorkalov:as/github_checkout_again
  • Merge pull request #1074 from asmorkalov:as/python3.13
  • Merge pull request #1071 from asmorkalov:as/opencv_4.11_tags
  • Merge pull request #1031 from mayeut:system-cmake
  • Merge pull request #1061 from SaikWolf:4.x
  • Merge pull request #1000 from zhanwenchen:patch-1
  • Merge pull request #1073 from asmorkalov:as/nasm_windows
  • Merge pull request #1065 from asmorkalov:pre_4.11

sbb_binarization d259795..b2d0d5c

Release: v0.1.0

  • add citation to README
  • Update README.md
  • Merge pull request #70 from bertsky/add-docker

v2024-10-15

15 Oct 18:33
3e33522

Changes:

  • added new module ocrd_page2alto (also in ocrd_fileformat, now with standalone processor)
  • new fixup recipes for shared venv without dependency conflicts
  • protect venv creation by semaphore as well
  • docker: update OCRD_MODULES (default selection for custom make docker)
  • docker: fix minimum and medium module lists
  • docker: do not rm venv created by previous stage
  • CI/CD: rewrite CircleCI config to split up mini/medi/maxi into interdependent incremental jobs
  • CI/CD: fix storing test results

core 92b217e..85bde15

Release: v2.70.0

  • PyPI: do not upload deprecated distribution aliases anymore
  • deps-cuda: retry micro.mamba.pm even more
  • 📦 v2.70.0
  • 📝 changelog
  • create PyPI CD
  • 📝 changelog
  • Merge remote-tracking branch 'github/cli-decorator-import-network'
  • deps-cuda: retry if micromamba is unresponsive
  • Merge branch 'master' of https://github.com/OCR-D/core
  • 📝 changelog
  • Merge remote-tracking branch 'github/fix_mets_server_zombies'
  • 📝 changelog
  • Merge remote-tracking branch 'github/deps-torch-torchvision'
  • 📝 changelog
  • Merge branch 'network_client_block_prints'
  • Merge pull request #1280 from OCR-D/fix-docker-cuda-torch
  • 📦 v2.69.0
  • 📝 changelog
  • Merge branch 'mexthecat-master'
  • 📝 update changelog again
  • 📝 changelog: remove spurious entries
  • 📝 changelog
  • disableLogging: clearer comment
  • ocrd.cli.workspace: use physical_pages if possible, fix default output_field
  • OcrdMets.get_physical_pages: cover return_divs w/o for_fileIds for_pageIds
  • update OcrdPage from generateds
  • OcrdPage: add PageType.get_ReadingOrderGroups()
  • ocrd.cli.workspace: assert non-server in cmds mutating METS
  • tests: make sure ocrd_utils.config gets reset whenever changing it globally
  • lib.bash: fix errexit
  • run_processor: be robust if ocrd_tool is missing steps
  • ocrd.cli.validate tasks: pass on --mets-server-url, too
  • ocrd.cli.workspace server: add 'reload' and 'save'
  • ocrd.cli.workspace: consistently pass on --mets-server-url and --backup (also, simplify)
  • METS Server: also export+delegate physical_pages
  • PcGts.Page.id / make_xml_id: replace '/' with '_'
  • test_mets_server: add test for force (overwrite)
  • OcrdMetsServer.add_file: pass on 'force' kwarg, too
  • Workspace.reload_mets: fix for METS server case
  • add test for OcrdEnvConfig.reset_defaults()
  • ocrd_utils.config: add reset_defaults()
  • bashlib: re-add --log-filename, implement as stderr redirect
  • test-logging: also remove ocrd.log from tempdir
  • disableLogging: re-instate root logger, to
  • test_mets_server: use tmpdir to avoid side effects between suites
  • ClientSideOcrdMets: use same logger name prefix as server
  • pylint: try ignoring generateds (again)
  • update pylintrc
  • OcrdMets.add_agent: does not have positional args
  • cli.workspace: pass fileGrp as well, improve description
  • adapt to PIL.Image moved constants
  • fix exception
  • fix --log-filename (6fc606027a): apply in ocrd_cli_wrap_processor
  • tests report.is_valid: improve output on failure
  • Processor.zip_input_files: more verbose log msg
  • Processor.zip_input_files: warning instead of exception for missing input files
  • fix imports
  • ocrd_utils: forgot to export scale_coordinates at toplvl
  • allow "from ocrd_models import OcrdPage
  • improve output in case of assertion failures
  • hide/test expected deprecation warnings
  • use up-to-date kwargs (avoiding old deprecations)
  • mets_server: ClientSideOcrdMets needs OcrdMets-like kwargs (without deprecation)
  • test_mets_server: fix arg vs kwarg
  • processor CLI: delegate --resolve-resource, too
  • 📦 v2.68.0
  • 📝 changelog
  • refactor client cli: process -> run
  • Merge branch 'master' into extend-network-client
  • 📝 changelog
  • Merge pull request #1270 from OCR-D/fix-parsing
  • fix: exception handling
  • add: check processing job log file
  • add: discovery cli, processors and processor
  • add sort to network agents
  • add: parameter_override
  • fix: the annoying string dict
  • fix: check report validation outside try block
  • fix: set ps address if None in constructor
  • Fix: server_utils.py > 404 to 400
  • Fix: rename to block
  • add docstring to cli commands
  • fix: required job id
  • add: help section to the cli
  • add cli job status check
  • add help for new env
  • refine status check methods
  • Update src/ocrd_network/client_utils.py
  • add timeout and wait to configs
  • add: client workflow run
  • fix: client processing request
  • fix test
  • refactor status checks
  • remove the client server
  • try docker host ip
  • Fix flag typo
  • integration test for client
  • update network client
  • fix the test dir path in docker
  • add integration test for client
  • Merge branch 'resolve-1257'
  • 📝 changelog
  • revert, and just use < v43.0.0
  • set paramiko logging to INFO
  • fix: suppress paramiko warnings
  • set: propagate 0, logging config
  • set: paramiko logging to ERROR
  • remove downloading tool json
  • add: default ocrd-all-tool.json
  • download tool json if missing
  • Merge branch 'master' into resolve-1257
  • load tool json locally

dinglehopper 129e6eb..071e6a8

Release: v0.9.7

  • Merge pull request #120 from joschrew/dockerfile
  • Merge pull request #113 from qurator-spk/python-3.13
  • ✔ pre-commit: Add license check
  • 🐛 Fix --version option in OCR-D CLI
  • ✨ Support --version option in CLI
  • ⚙ pyproject.toml: Add license.file
  • ⚙ pre-commit: Update hooks

docstruct a7ffdda..004e6ec

  • add GHA CD via Dockerhub

eynollah 032a99e...51f6ef6

  • Merge pull request #137 from qurator-spk/dockerfile
  • Merge pull request #132 from qurator-spk/extracting_images_only
  • Merge pull request #133 from qurator-spk/src-layout
  • 📦 v0.3.1
  • 📝 changelog
  • Merge pull request #129 from qurator-spk/resolving_issue_106
  • update Makefile model location
  • update pyproject.toml for v0.3.1
  • update pyproject.toml
  • Update README.md
  • rename GH action
  • create draft pyproject.toml
  • format options table
  • Update README.md
  • improve huggingface url
  • remove CircleCI
  • Update model download url
  • Merge pull request #127 from bertsky/new-namespace-pkg
  • update GitHub actions
  • Update README.md
  • update supported Python+Tensorflow version combinations
  • pin tf2 version to 2.12.1
  • use tf1 compatibility for keras backend
    < adapt to OcrdFile.local_filename now :Path
    < adapt to ocrd>=2.54 url vs local_filename
  • comment unnecessary print commands
  • add supported OS to readme
  • filtering separators in a correct way without missing them
  • Merge pull request #117 from qurator-spk/tf-2.12-or-greater
  • apply missed commit #a56988a back
  • Merge pull request #116 from qurator-spk/fix-typos
  • Merge pull request #113 from qurator-spk/tf_<2.12.0
  • Update citation
  • Update bibtex entry
  • format citation info as bibtex
  • add HIP'23 paper reference
  • Merge pull request #109 from bertsky/patch-3
  • Merge pull request #105 from bertsky/fix-model-archive-path
    < Revert "Merge pull request #97 from qurator-spk/420-namespace-package"
  • Merge pull request #104 from bertsky/reinstate-namespace-pkg
  • Merge pull request #102 from qurator-spk/right2left_reading_order
  • delete printing resized image shape
  • issue #67 solved
  • improve links to GT guidelines
  • Update README.md
  • Update CHANGELOG.md
  • Update ocrd-tool.json
  • Merge pull request #86 from qurator-spk/eynollah_light

nmalign 7832c90..1426dbc

Release: v0.0.3

  • fix dockerfile
  • add GHA CD via Dockerhub

ocrd_calamari caac953..d9cde1f

Release: v1.0.6

  • Merge pull request #116 from bertsky/limit-batch-size
  • Merge pull request #115 from OCR-D/pyproject_toml
  • ✔ CircleCI: Don't test on Python 3.7 anymore
  • ⚙ pre-commit: Update hooks
  • 🧹 ruff: Update config
  • 🎨 Reformat (Black)
  • 🧹 Update OCR-D API usage w.r.t. workspace.add_file()
  • ⚙ pre-commit: Update hooks

ocrd_cis 38ce45b..db65d7f

Release: [v0.1.5](https://github.com/...

v2024-07-01

01 Jul 16:07
@kba kba

core c5b5580..79c61e3

Release: v2.66.1

  • 📦 v2.66.1
  • 📝 changelog
  • GHA Docker: build docker.io first, then tag ghcr.io
  • Dockerfile.cuda*: adapt to #1225 cc6ea57
  • 📦 v2.66.0
  • 📝 changelog
  • Merge remote-tracking branch 'origin/download-file-no-absolute-urls'
  • 📝 changelog
  • Merge remote-tracking branch 'bertsky/param-preset-resolve-anew'
  • 📝 changelog
  • Merge remote-tracking branch 'bertsky/workspace-clean'
  • 📝 changelog
  • Merge remote-tracking branch 'origin/utilize-ps-proxy-to-ms'
  • 📝 changelog
  • Merge remote-tracking branch 'bertsky/add-docker-tf1'
  • 📝 changelog
  • Merge remote-tracking branch 'origin/fix-ci-no37'
  • 📝 changelog
  • Merge remote-tracking branch 'origin/ocrd-logging-debug-true'
  • 📝 changelog
  • cli.workspace.find --undo-download: add --keep-files
  • cli.workspace.find --undo-download: only if .url exists
  • 📝 changelog
  • Merge branch 'master' of https://github.com/OCR-D/core
  • Merge branch 'rm-coverage'
  • 📝 changelog
  • Merge pull request #1225 from bertsky/docker-editable
  • 📝 changelog;
  • Merge pull request #1227 from OCR-D/fix-ocrd-file-remove-url
  • fix CD by fetching tags, too
  • 📦 v2.65.0
  • 📝 changelog
  • ci: fix integration test
  • Merge branch 'master' into test-workflow
  • make network-integration-test: disable ocrd_all test
  • disable ocrd all test in core
  • ci: disable scrutinizer build
  • Remove ocrd_all-tests from core makefile
  • make ocrd all tests callable from Makefile
  • merge master
  • Merge branch 'test-workflow' of github.com:OCR-D/core into test-workflow merge master
  • Make `make assets` in Dockerfile skippable
  • remove duplicates
  • Add a test for workflow run in ocrd_all

dinglehopper f8e3108..bc5818d

Release: v0.9.6

  • ✔ GitHub Actions: Update used actions
  • ⚙ pre-commit: Update hooks
  • 🐛 Fix reading plain text files
  • 📦 v0.9.6
  • Revert "✔ Test on Python 3.13"
  • 🐛 GHA: Install possible shapely build requirements (if building from source)
  • Merge pull request #111 from stweil/typos
  • 🐛 GHA: Install possible lxml build requirements (if building from source)
  • ✔ Test on Python 3.13
  • 🐛 Fix Python 3.12 support by requiring ocrd >= 2.65.0
  • ⚙ pre-commit: Update hooks
  • ✔ Test using empty files
  • ⚙ pre-commit: Update hooks
  • 🧹 tests: Move comment out of the code (bad style + weird formatting)
  • ⚙ cli: Annotate types in process_dir()
  • ⚙ pre-commit: Update hooks
  • 🧹 Make from_text_segment()'s textequiv_level keyword-only
  • 🧹 Make process_dir() keyword arguments keyword-only
  • ✒ README-DEV: Releasing a new version
  • 📦 v0.9.5
  • ⚙ pre-commit: Add mypy dependencies
  • 🐛 Check that we always get a valid ALTO namespace (satisfies mypy)
  • 🎨 Reformat (Black)
  • ⚙ pre-commit: Update hooks
  • Merge branch 'master' of https://github.com/qurator-spk/dinglehopper
  • 🐛 Fix word segmentation with uniseg 0.8.0
  • 🚧 GitLab CI Test: Depend on child pipeline
  • 🚧 GitLab CI Test: Push after pulling

ocrd_detectron2 a8402d8..1f56273

Release: v0.1.8

  • patch -f instead of -t
  • suppress question if patch detectron2#5010 already applied

ocrd_fileformat ba79de9..fb769ff

Release: v0.10.0

  • CI: add Dockerhub CD
  • update textract2page again
  • forgot to add AWS-Textract→PAGE
  • update ocr-fileformat (with newer textract2page)
  • Merge pull request #53 from OCR-D/osx-cp

ocrd_froc 45d5dcd..42f1ce0

Release: v0.6.1

  • 📦 v0.6.1
  • Merge pull request #16 from stweil/typos
  • 📦 v0.6.0
  • 📝 changelog
  • Merge branch 'main' into pr-8-feedback
  • add parameter min_score_style to filter low-confidence font classification results
  • don't replace textstyle, only fontFamily if overwrite_style
  • add parameter "overwrite_text" (default: false): whether to add or replace TextEquiv
  • rename parameter network -> model
  • rename parameter replace_textstyle -> overwrite_style

ocrd_keraslm 472197f..ea79b2a

Release: v0.4.3

  • test: with initLogging
  • test: allow running with published model_dta_full instead of training model_dta_test
  • improve/update readme
  • wrapper.rate: improve/update docstring
  • wrapper.rate: use resolve_resource for model file path, split off setup()
  • save training history (metrics), add cmd to print them
  • train: allow passing single file for val_data, too
  • fix continuing from chkpt
  • update readme
  • add model_dta_full.h5 ref
  • chlog
  • 📦 0.4.3
  • generate: add option --variants (nr of nbest seqs)
  • plot_context_embeddings_projection: add years
  • test: allow passing directory for data, too

ocrd_kraken bdbe6fc..a6160ce

Release: v0.4.1

  • Merge pull request #40 from stweil/python
  • fix/add build
  • 📦 0.4.1
  • Merge pull request #43 from bertsky/segment-regions

ocrd_neat 2f7d01c..06c8b38

Release: v0.0.1

  • character normalization based on aletheia mapping
  • Merge pull request #13 from qurator-spk/fix-ppn-xpath

ocrd_pagetopdf 24f77d3..7c5ab70

Release: v1.1.0

  • Merge pull request #26 from UB-Mannheim/clean
  • Fix typos in README.md and in ocrd-tool.json (#25)
  • Fix typo in LICENSE

ocrd_segment 3993139..de824e9

Release: v0.1.24

  • 📦 v0.1.24
  • repair: add params spread / spread_level, update/improve docs
  • repair: validate/repair polygons with 0 px tolerance
  • project/repair join_polygons: fix rare case of adjacent rings
  • from_masks: skip if no seg file
  • 📦 v0.1.23
  • Merge pull request #67 from OCR-D/project-parent

ocrd_tesserocr ed73d96..d23992b

Release: v0.19.1

  • 📦 v0.19.1
  • 📦 v0.19.0
  • Update submodule tesseract
  • Merge pull request #213 from OCR-D/osx-cp
  • Merge pull request #212 from stweil/tesseract
  • Merge pull request #211 from stweil/libcurl-dev
  • model resources: fix frak2021 description
  • recognize models: use _best instead of _fast
  • sanitise loggers: no points, but warnings
  • update changelog
  • recognize: more robust polygon handling
  • Merge pull request #208 from bertsky/subclass-docstrings
  • Merge pull request #205 from stweil/update
  • Merge pull request #203 from stweil/build

sbb_binarization b89ec49..978f425

Release: v0.1.0

  • document tested Python+TF2 versions
  • remove Python 3.11 from test
  • Update README.md
  • Update tensorflow version requirements

workflow-configuration bd149f8..eeea260

Release: 0.1.3

  • ocrd-make -X: no --cleanup to prevent race for --bf
  • ocrd-make:...

v2024-03-08

12 Mar 14:39
@kba kba

Removed:

  • tesserocr and tesseract (which are now submodules of ocrd_tesserocr)

core f54b002..c5b5580

Release: v2.63.3

  • 📦 v2.63.3
  • 📝 changelog
  • OcrdMets.add_file: fix finding existing el_pagediv
  • 📝 changelog
  • expose uninstall-workaround
  • 📦 v2.63.2
  • 📦 v2.63.1
  • Merge branch 'fix-get-physical-pages'
  • make coverage: omit generateDS code

cor-asv-ann 0a4f684..f7ebb74

  • compare/evaluate: ensure lines are shown verbatim on single file level
  • compare/evaluate: do not record worst lines verbatim
  • compare/evaluate: also record 1% worst lines

ocrd_keraslm 759805b..472197f

Release: v0.4.2

  • update assets
  • 📦 v0.4.2
  • generators: do not require input length > window size
  • train: allow passing directory for training data, too
  • add checkpointing, allow continuing from ckpt
  • suppress TF gibberish

ocrd_tesserocr 08a020f..ed73d96

Release: v0.18.1

  • 📦 v0.18.1
  • 📝 changelog
  • update tesseract/tesserocr to most recent
  • update/improve readme
  • simplify dockerfile
  • CI: add make test
  • add repo/assets as proper submodule, rename -clean → clean-
  • make Tesseract build configurable
  • also install minimal needed models
  • explify dependencies install ← install-tesserocr ← install-tesseract
  • test: download files of data assets
  • test: also set up logging system during tests

workflow-configuration 8418b3f..bd149f8

Release: 0.1.3

  • unprefix regex paths by directory argument, if any
  • forgot to convert exit to continue in 6fe3c6b3
  • no need to close logger FDs on exit
  • ocrd-import: use coproc instead of handcrafted FIFOs for loggers
  • ocrd-import: rewrite (no parallel jobs, but parallel logging)…
  • ocrd-import: add option --basename, default to using directory as well
  • ocrd-import: simplify+speedup…

v2024-02-01

01 Feb 16:40
@kba kba

Added:

Removed:

cor-asv-fst 076e04e..4211371

  • Merge pull request #4 from stweil/master

core ac1f15b..b94b185

Release: v2.62.0

  • reenable circle because docker failed to build on ghcr.io
  • 📦 v2.62.0
  • {pypi,build}-workaround: missed a commit, s/get_distribution(...).version/dist_version(...)
  • Merge branch 'circle-to-gha'
  • 📝 changelog
  • Merge branch 'master' into ocrd-tool-json-root
  • expose ocrd-tool.json for ocrd-dummy in root like processors
  • 📦 v2.60.3
  • 📝 changelog
  • Fix --editable install for setuptools>=64, setuptools#3548
  • 📦 v2.60.2
  • 📝 changelog
  • Merge pull request #1161 from OCR-D/logging-downgrade-level
  • Merge pull request #1160 from OCR-D/is-oai-content-loglevel-debug

dinglehopper f077ce2..f8e3108

Release: v0.9.4

  • 🚧 GitLab CI Test: Push after pulling
  • 🚧 GitLab CI Test: Trigger only on default branch (and do not hardcode it)
  • 🚧 GitLab CI Test
  • 🔍 ruff: Remove ignore configuration, we use multimethods in a compatible way now
  • ⚙ pre-commit: Update hooks
  • 🚧 GitLab CI Test
  • 🔍 mypy: Use an almost strict mypy configuration, and fix any issues
  • 🔍 mypy: Use a compatible syntax for multimethod
  • 🔍 mypy: Remove ExtractedText.segments converter
  • 🔍 mypy: Avoid using check() for all attr validators
  • 🔍 mypy: Make cli.process() typed so mypy checks it (and issues no warning)
  • Merge branch 'pr103'
  • ⚙ Update ruff+mypy dependencies
  • ⚙ pre-commit: Update hooks
  • ⬆ Move on to supporting Python >= 3.8 only
  • 🐛 Use typing.List instead of list, for Python <3.9
  • 🐛 Use Optional instead of | none, for Python <3.10
  • ⚙ pre-commit: Update hooks
  • 🐛 Fix generating word differences
  • ⚙ pre-commit: Update hooks
  • Merge branch 'master' of https://github.com/qurator-spk/dinglehopper
  • Merge branch 'master' into performance
  • ⬆ Update uniseg dependency
  • ❎ Make joining grapheme clusters more robust by checking joiner and handling an empty joiner
  • 🐛 Fix score_hint call in cli_line_dirs
  • 🐛 Fix docstring of distance() for grapheme clusters
  • 🐛 Fix calculation of score_hint for edge cases, e.g. when CER is infinite
  • 🕸 Do not use deprecated ID, pageId options
  • ✔ Add mets:FLocat's @LOCTYPE/OTHERLOCTYPE to test data
  • ⬆ Update multimethod dependency
  • 🐛 Update tests for ExtractedText
  • use uniseg again
  • update rapidfuzz version
  • replace uniseg with uniseg2
  • apply black
  • move grapheme clusters to ExtractedText
  • remove python2.7 futures
  • remove unused includes
  • only call words_normalized once

eynollah 706433c..032a99e

Release: v0.2.0

  • adapt to OcrdFile.local_filename now :Path
  • adapt to ocrd>=2.54 url vs local_filename

ocrd_fileformat c5f0c52..ba79de9

Release: v0.10.0

  • 📦 v0.10.0
  • 📝 changelog
  • Update ocr-fileformat to include UB-Mannheim/ocr-fileformat#172
  • Merge branch 'fix-textract2page'
  • update ocr-fileformat to latest
  • Update ocr-fileformat to v0.6.0

ocrd_repair_inconsistencies cf879c1..94c482f

  • 🕸 README: Mention archival of the project

opencv-python 7cfd1ee..8ad8ec1

Release: 80

  • Merge branch 'as/4.9.0-readme-update' into 4.x
  • Merge pull request #941 from asmorkalov/as/mac_m1_venv_for_test
  • Merge pull request #940 from asmorkalov:as/donation
  • Merge pull request #938 from asmorkalov"as/4.9.0-pre
  • Merge pull request #934 from asmorkalov/as/native_mac_m1_runner
  • Merge pull request #936 from asmorkalov:as/mac_intel_update
  • Merge pull request #932 from asmorkalov/as/pre-4.9.0_linux_upgrade
  • Merge pull request #931 from asmorkalov:as/ipp_icv_license
  • Merge pull request #904 from asmorkalov:as/python_3.12
  • Merge pull request #927 from dkurt:try_enable_dependents

sbb_binarization f3c6ac8..b89ec49

Release: v0.1.0

  • Merge pull request #65 from rettinghaus/update-tests

tesseract ea0b245..8ee020e

Release: 5.3.4

  • Create new release 5.3.4
  • Set User-Agent: header field in HTTP request for curl downloads
  • Merge pull request #4178 from sadra-barikbin/patch-1
  • Merge pull request #4174 from stweil/warnings

workflow-configuration cbc3234..f54c91a

Release: 0.1.3

  • ocrd-page-transform: local_filename instead of url
  • ocrd-page-transform: fix unbound variable

v2023-12-15

15 Dec 16:52
@kba kba

core 742906e..ac1f15b

Release: v2.60.1

  • 📦 v2.60.1
  • docker: we need .git during build for setuptools_scm
  • 📝 changelog
  • Merge branch 'git-versioning'
  • 📝 changelog
  • defaults for mets_basename and mets_server_url
  • Merge branch 'master' of https://github.com/OCR-D/core
  • 📝 changelog
  • ocrd workspace list-page: ignore files without pageId, fix #1148