feat: Support Actor schema storages with Alias mechanism by Pijukatel · Pull Request #797 · apify/apify-sdk-python

Pijukatel · 2026-02-17T08:58:17Z

Description

Update Configuration to include actor_storages, which is loaded from the actor_storages_json environment variable.
Update AliasResolver so that it can resolve alias mappings from Configuration.
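
As a rough illustration of the first point, the sketch below parses a JSON payload into a typed mapping of alias to storage ID. The `parse_actor_storages` helper, the JSON key names, and the sample IDs are assumptions made for this example; the field names follow the unit test quoted later in this thread, and the actual wiring in `Configuration` may differ.

```python
from __future__ import annotations

import json
from typing import TypedDict


class ActorStorages(TypedDict):
    """Alias -> storage ID mappings, grouped by storage type."""

    datasets: dict[str, str]
    key_value_stores: dict[str, str]
    request_queues: dict[str, str]


def parse_actor_storages(raw: str | None) -> ActorStorages | None:
    """Parse the JSON payload from the environment; return None when it is absent."""
    if not raw:
        return None
    data = json.loads(raw)
    # Assumed JSON keys; the payload provided by the platform may name them differently.
    return ActorStorages(
        datasets=dict(data.get('datasets', {})),
        key_value_stores=dict(data.get('key_value_stores', {})),
        request_queues=dict(data.get('request_queues', {})),
    )


# Hypothetical payload, as it might arrive in the environment variable.
raw = json.dumps({'datasets': {'default': 'id-1', 'custom': 'id-2'}})
storages = parse_actor_storages(raw)
```

Storage types missing from the payload default to empty mappings here, which keeps later lookups free of `KeyError` handling; that default is a design choice of the sketch, not something the PR states.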

Issues

Closes: Adapt to Apify's "multiple storages" functionality #762

Testing

Added E2E test
Added unit tests
Manual Actor test

Checklist

CI passed

codecov · 2026-02-17T08:59:37Z

Codecov Report

❌ Patch coverage is 80.95238% with 4 lines in your changes missing coverage. Please review.
✅ Project coverage is 86.12%. Comparing base (9e0aa56) to head (a7306a8).
⚠️ Report is 2 commits behind head on master.

Files with missing lines	Patch %	Lines
...c/apify/storage_clients/_apify/_alias_resolving.py	60.00%	4 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##           master     #797      +/-   ##
==========================================
+ Coverage   85.83%   86.12%   +0.28%     
==========================================
  Files          46       46              
  Lines        2761     2768       +7     
==========================================
+ Hits         2370     2384      +14     
+ Misses        391      384       -7

Flag	Coverage Δ
e2e	`37.21% <42.85%> (+0.15%)`	⬆️
integration	`58.77% <57.14%> (+0.17%)`	⬆️
unit	`73.44% <80.95%> (+0.32%)`	⬆️


Copilot

Pull request overview

Adds support for Actor “schema storages” / multiple pre-created storages by loading storage IDs into Configuration and pre-registering alias→ID mappings so open_* (alias=...) resolves to those platform-provided storages when running on Apify.

Changes:

Introduces Configuration.actor_storages (parsed from an env-provided JSON object) via new ActorStorages model.
Extends AliasResolver with register_aliases() and calls it during Actor initialization on the Apify platform to seed alias mappings.
Adds unit/integration/E2E tests validating configuration parsing and alias resolution behavior.
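
The pre-registration step described above can be pictured with a toy, in-memory resolver. This is only a sketch: `SketchAliasResolver`, its cache-key format, and `resolve_id` are invented for illustration and do not reproduce the real `AliasResolver`, which also persists the mapping.

```python
from __future__ import annotations


class SketchAliasResolver:
    """Toy stand-in for AliasResolver that keeps alias mappings only in memory."""

    _alias_map: dict[str, str] = {}

    @staticmethod
    def _cache_key(storage_type: str, alias: str) -> str:
        # Hypothetical key format, invented for this sketch.
        return f'{storage_type}:{alias}'

    @classmethod
    def register_aliases(cls, storages: dict[str, dict[str, str]]) -> None:
        """Seed the cache with every alias -> ID pair found in the configuration."""
        for storage_type, aliases in storages.items():
            for alias, storage_id in aliases.items():
                cls._alias_map[cls._cache_key(storage_type, alias)] = storage_id

    @classmethod
    def resolve_id(cls, storage_type: str, alias: str) -> str | None:
        """Return the pre-registered storage ID, or None for an unknown alias."""
        return cls._alias_map.get(cls._cache_key(storage_type, alias))


# Seed once at startup; later lookups by alias need no API call.
SketchAliasResolver.register_aliases({'datasets': {'custom': 'abc123'}})
```

With the mapping seeded at startup, opening a storage by alias becomes a dictionary lookup, and only unknown aliases would need to fall back to creating a new storage.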

Reviewed changes

Copilot reviewed 8 out of 9 changed files in this pull request and generated 4 comments.

File	Description
`src/apify/_configuration.py`	Adds `ActorStorages` + env/validator wiring for `Configuration.actor_storages`.
`src/apify/storage_clients/_apify/_alias_resolving.py`	Adds `AliasResolver.register_aliases()` to bulk write alias mappings into default KVS + in-memory cache.
`src/apify/_actor.py`	Calls `register_aliases()` on Actor startup when running on-platform.
`tests/unit/actor/test_configuration.py`	Unit test for parsing storages JSON env var into `actor_storages`.
`tests/integration/test_storages.py`	Integration test asserting alias mapping is preserved/extended in default KVS.
`tests/e2e/test_schema_storages/test_schema_storages.py`	E2E test ensuring schema-defined storages are usable via alias at runtime.
`tests/e2e/test_schema_storages/actor_source/main.py`	Actor code used by the E2E test to open a dataset by alias and validate ID.
`tests/e2e/test_schema_storages/actor_source/actor.json`	Actor schema defining storages for the E2E scenario.
`tests/e2e/test_schema_storages/__init__.py`	Marks the new E2E package.

Copilot · 2026-02-18T13:07:51Z

src/apify/storage_clients/_apify/_alias_resolving.py

+        client = await cls._get_default_kvs_client(configuration=configuration)
+        existing_mapping = ((await client.get_record(cls._ALIAS_MAPPING_KEY)) or {'value': {}}).get('value', {})
+
+        # Update the existing mapping with the configuration mapping.
+        existing_mapping.update(configuration_mapping)
+        # Store the updated mapping back in the KVS and in memory.
+        await client.set_record(cls._ALIAS_MAPPING_KEY, existing_mapping)


existing_mapping = ((await client.get_record(...)) or {'value': {}}).get('value', {}) assumes the record always has a value key containing a dict. However, this module already documents/handles get_record sometimes returning the mapping dict directly (without value). In that case, this code will treat the mapping as missing and overwrite it with only configuration_mapping. Also, if value is present but not a dict (e.g. None), existing_mapping.update(...) will raise. Please mirror the normalization logic used in _get_alias_map/store_mapping: normalize record into a dict[str, str] whether it comes wrapped in {key,value} or as a raw mapping, otherwise default to {}.
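
One way to read the normalization requested in this comment is sketched below. The helper names are hypothetical and the two record shapes are taken from the comment itself (a `{key, value}` wrapper or a raw mapping); the module's real `_get_alias_map`/`store_mapping` logic is not shown in this thread and may differ.

```python
from __future__ import annotations

from typing import Any


def normalize_alias_record(record: Any) -> dict[str, str]:
    """Return the alias mapping whether the record is wrapped or raw, else {}."""
    # Wrapped shape: {'key': ..., 'value': {...}}.
    if isinstance(record, dict) and record.keys() >= {'key', 'value'}:
        record = record['value']
    # Missing record, or a value that is not a mapping (for example None).
    if not isinstance(record, dict):
        return {}
    # Copy so that callers can update the result without mutating the input.
    return dict(record)


def merge_mappings(record: Any, configuration_mapping: dict[str, str]) -> dict[str, str]:
    """Extend the stored mapping with the entries that come from the configuration."""
    merged = normalize_alias_record(record)
    merged.update(configuration_mapping)
    return merged
```

For example, `merge_mappings({'key': 'k', 'value': None}, {'b': '2'})` yields `{'b': '2'}` instead of raising, and a raw mapping passed in directly is preserved rather than overwritten.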

src/apify/_configuration.py

tests/integration/test_storages.py

tests/e2e/test_schema_storages/actor_source/actor.json

vdusek

a few comments

src/apify/storage_clients/_apify/_alias_resolving.py

tests/integration/test_storages.py

tests/e2e/test_schema_storages/actor_source/actor.json

Co-authored-by: Vlada Dusek <v.dusek96@gmail.com> Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

Pijukatel · 2026-02-19T09:28:31Z

We can probably make the storage mechanism a little bit more complicated, but avoid some API calls on actor init, which is probably worth it.

valekjo

Looks good, but I'm not strong in python :)

One question

tests/e2e/test_schema_storages/actor_source/main.py

src/apify/storage_clients/_apify/_alias_resolving.py

src/apify/_configuration.py

src/apify/storage_clients/_apify/_alias_resolving.py

tests/unit/actor/test_configuration.py

janbuchar

LGTM

janbuchar · 2026-02-20T13:55:33Z

src/apify/_configuration.py

        BeforeValidator(lambda data: json.loads(data) if isinstance(data, str) else data or None),
    ] = None

+    actor_storages: Annotated[


I'm not sure about the naming here - we usually strip the actor_ prefix. But I do agree that a plain storages would look odd. So let's keep it this way, I guess.

src/apify/_configuration.py

tests/e2e/test_schema_storages/actor_source/main.py

vdusek · 2026-02-23T12:42:48Z

tests/unit/storage_clients/test_alias_resolver.py

+    datasets = {'default': 'default_dataset_id', 'custom': 'custom_Dataset_id'}
+    request_queues = {'default': 'default_request_queue_id', 'custom': 'custom_RequestQueue_id'}
+    key_value_stores = {'default': 'default_key_value_store_id', 'custom': 'custom_KeyValueStore_id'}


I assume the mixing of camel and snake cases is not intended

Suggested change

-    datasets = {'default': 'default_dataset_id', 'custom': 'custom_Dataset_id'}
-    request_queues = {'default': 'default_request_queue_id', 'custom': 'custom_RequestQueue_id'}
-    key_value_stores = {'default': 'default_key_value_store_id', 'custom': 'custom_KeyValueStore_id'}
+    datasets = {'default': 'default_dataset_id', 'custom': 'custom_dataset_id'}
+    request_queues = {'default': 'default_request_queue_id', 'custom': 'custom_request_queue_id'}
+    key_value_stores = {'default': 'default_key_value_store_id', 'custom': 'custom_key_value_store_id'}

That value can be any string, so I used one that made the test more convenient to write.

Look at the assert loop and the f-string:
f'custom_{storage_type}_id'
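
The actual assert loop is not shown in this thread. The sketch below is a guess at its shape, assuming `storage_type` is the storage class name (`Dataset`, `KeyValueStore`, `RequestQueue`), which would explain the mixed casing in the IDs; the `configured` dictionary and the `checked` list are invented for the example.

```python
# Hypothetical mapping from storage class name to its configured aliases.
configured = {
    'Dataset': {'default': 'default_dataset_id', 'custom': 'custom_Dataset_id'},
    'KeyValueStore': {'default': 'default_key_value_store_id', 'custom': 'custom_KeyValueStore_id'},
    'RequestQueue': {'default': 'default_request_queue_id', 'custom': 'custom_RequestQueue_id'},
}

checked = []
for storage_type, aliases in configured.items():
    # The expected ID is derived from the class name, so no per-type lookup table is needed.
    assert aliases['custom'] == f'custom_{storage_type}_id'
    checked.append(storage_type)
```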

Co-authored-by: Vlada Dusek <v.dusek96@gmail.com>

Pijukatel added 4 commits February 10, 2026 16:21

WIP, many open quiestions. Have to start testing to see the real situations

c4adb74

Add debug

19113e7

Fix type issues, prepare for tests. Merge first

b12e27e

Merge remote-tracking branch 'origin/master' into alias-storage-configuration-mapping

fd0716c

github-actions bot assigned Pijukatel Feb 17, 2026

github-actions bot added this to the 134th sprint - Tooling team milestone Feb 17, 2026

github-actions bot added the t-tooling (Issues with this label are in the ownership of the tooling team.) and tested (Temporary label used only programmatically for some analytics.) labels Feb 17, 2026

Pijukatel requested a review from vdusek February 17, 2026 09:01

Adapt to new test structure

3b36459

Pijukatel force-pushed the alias-storage-configuration-mapping branch from d9c7349 to 3b36459 February 17, 2026 09:11

Pijukatel added 3 commits February 17, 2026 16:57

Add uni tests, WIP

72c2f35

TODO - how should it behave locally?

Finalize tests

b7604cb

Merge remote-tracking branch 'origin/master' into alias-storage-configuration-mapping

ec6e071

Pijukatel marked this pull request as ready for review February 18, 2026 12:47

Pijukatel requested a review from valekjo February 18, 2026 12:47

vdusek requested a review from Copilot February 18, 2026 13:03

Copilot started reviewing on behalf of vdusek February 18, 2026 13:04

Copilot AI reviewed Feb 18, 2026

vdusek requested changes Feb 18, 2026

Pijukatel and others added 4 commits February 19, 2026 08:44

Apply suggestions from code review

aa3b9ea

Co-authored-by: Vlada Dusek <v.dusek96@gmail.com> Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

Review comments

aa5624f

Remove unnecessary record handling

17288ca

Review comments

a7e645f

Pijukatel marked this pull request as draft February 19, 2026 09:28

Pijukatel added 3 commits February 19, 2026 11:02

Simplify and avoid unnecessary API calls

a996dca

Finalize

f738e58

Merge remote-tracking branch 'origin/master' into alias-storage-configuration-mapping

2657528

Pijukatel marked this pull request as ready for review February 19, 2026 10:43

Pijukatel requested a review from vdusek February 19, 2026 10:48

valekjo reviewed Feb 19, 2026

tests/e2e/test_schema_storages/actor_source/main.py (outdated)

src/apify/storage_clients/_apify/_alias_resolving.py

janbuchar self-requested a review February 19, 2026 15:28

vdusek mentioned this pull request Feb 19, 2026

Audit and unify usage of alias vs validation_alias in Pydantic models #807

Open

vdusek requested changes Feb 19, 2026

Pijukatel added 2 commits February 20, 2026 08:10

Review comments

bc7d13d

Merge remote-tracking branch 'origin/master' into alias-storage-configuration-mapping

b7a390b

Pijukatel force-pushed the alias-storage-configuration-mapping branch from 68d67b9 to 02f9244 February 20, 2026 12:16

Change to TypedDict

e2df853

Pijukatel force-pushed the alias-storage-configuration-mapping branch from 02f9244 to e2df853 February 20, 2026 12:57

Merge remote-tracking branch 'origin/master' into alias-storage-configuration-mapping

d15fadb

Pijukatel requested a review from vdusek February 20, 2026 13:03

janbuchar approved these changes Feb 20, 2026

vdusek requested changes Feb 23, 2026

Apply suggestions from code review

a7306a8

Co-authored-by: Vlada Dusek <v.dusek96@gmail.com>

Pijukatel requested a review from vdusek February 23, 2026 15:58

4 participants