refactor(common): decouple PhysicalTable and ResolvedTable from Dataset #1554

LNSD · 2026-01-14T14:31:03Z

Remove the Arc<Dataset> dependency from catalog types, storing only the
essential fields (HashReference, start_block, table name, network).

Replace ResolvedTable.dataset: Arc<Dataset> with individual fields
Remove Dataset::resolved_tables() method
Update PhysicalTable constructors to take Table instead of ResolvedTable
Store dataset_reference: HashReference directly in PhysicalTable
Update callers in dump, admin-api, and tests
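
A minimal before/after sketch of the change listed above, for readers without the diff at hand. Every type here (`HashReference`, `Table`, `Dataset`, `ResolvedTable`), the field names, and the `resolve_all` helper are hypothetical stand-ins written for illustration. They are not the crate's actual definitions.

```rust
use std::sync::Arc;

// Hypothetical stand-ins for the real catalog types (assumed shapes).
#[derive(Debug, Clone, PartialEq, Eq)]
struct HashReference(String);

#[derive(Debug, Clone)]
struct Table {
    name: String,
    network: String,
}

#[derive(Debug)]
struct Dataset {
    reference: HashReference,
    start_block: Option<u64>,
    tables: Vec<Table>,
}

// Before: the resolved table keeps the whole dataset alive through an Arc.
#[allow(dead_code)]
struct ResolvedTableBefore {
    table: Table,
    dataset: Arc<Dataset>,
}

// After: only the fields the catalog needs are copied out of the dataset.
#[derive(Debug, Clone)]
struct ResolvedTable {
    table: Table,
    dataset_reference: HashReference,
    dataset_start_block: Option<u64>,
}

impl ResolvedTable {
    fn new(
        table: Table,
        dataset_reference: HashReference,
        dataset_start_block: Option<u64>,
    ) -> Self {
        Self { table, dataset_reference, dataset_start_block }
    }
}

// Explicit construction at the call site, standing in for the removed
// Dataset::resolved_tables() convenience method.
fn resolve_all(dataset: &Dataset) -> Vec<ResolvedTable> {
    dataset
        .tables
        .iter()
        .cloned()
        .map(|table| ResolvedTable::new(table, dataset.reference.clone(), dataset.start_block))
        .collect()
}

fn main() {
    let dataset = Dataset {
        reference: HashReference("example-hash".to_string()),
        start_block: Some(100),
        tables: vec![
            Table { name: "blocks".to_string(), network: "mainnet".to_string() },
            Table { name: "logs".to_string(), network: "mainnet".to_string() },
        ],
    };
    let resolved = resolve_all(&dataset);
    assert_eq!(resolved.len(), 2);
    assert_eq!(resolved[0].dataset_reference, dataset.reference);
    println!("{resolved:?}");
}
```

In this sketch the trade-off is a few small clones per table in exchange for catalog types that no longer hold a reference to the full dataset.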

claude

Review Summary

This PR successfully decouples PhysicalTable and ResolvedTable from Arc<Dataset>, replacing the direct dataset reference with individual fields (HashReference, start_block, table name, network). This is a well-structured refactoring that reduces coupling and makes the data flow more explicit.

Highlights

Positive changes:

Clean separation of concerns - PhysicalTable and ResolvedTable now store only the data they need
Removal of Dataset::resolved_tables() method simplifies the API
Good use of BTreeSet for deduplicating dataset references in streaming_query.rs
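
For context on the BTreeSet point, a small sketch of the deduplication pattern, assuming `HashReference` implements `Ord`. The types and the `unique_dataset_references` helper are hypothetical and are not the code in streaming_query.rs.

```rust
use std::collections::BTreeSet;

// Hypothetical stand-in; Ord is required for use as a BTreeSet element.
#[derive(Debug, Clone, PartialEq, Eq, PartialOrd, Ord)]
struct HashReference(String);

// Hypothetical, reduced to the fields this example needs.
#[derive(Debug)]
struct ResolvedTable {
    table_name: String,
    dataset_reference: HashReference,
}

// Collecting into a BTreeSet drops duplicate references and yields them
// in a deterministic (sorted) order.
fn unique_dataset_references(tables: &[ResolvedTable]) -> BTreeSet<HashReference> {
    tables.iter().map(|t| t.dataset_reference.clone()).collect()
}

fn main() {
    let tables = vec![
        ResolvedTable {
            table_name: "blocks".to_string(),
            dataset_reference: HashReference("hash-b".to_string()),
        },
        ResolvedTable {
            table_name: "logs".to_string(),
            dataset_reference: HashReference("hash-b".to_string()),
        },
        ResolvedTable {
            table_name: "transfers".to_string(),
            dataset_reference: HashReference("hash-a".to_string()),
        },
    ];
    let refs = unique_dataset_references(&tables);
    assert_eq!(refs.len(), 2);
    for t in &tables {
        println!("{} -> {:?}", t.table_name, t.dataset_reference);
    }
}
```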

Key items to address

Breaking API change (restore.rs): The url field renamed to path in RestoredTableInfo is a breaking change for HTTP clients. Please confirm this is intentional.
Performance consideration (physical.rs): catalog_schema() and table_ref() now return owned types instead of references, causing allocations on each call. Consider caching if these are hot paths.
Data duplication (physical.rs): PhysicalTable now stores both table: Table and extracted fields (table_name, network). While acceptable given Table appears immutable, this could be documented.
Logging change (parquet_writer.rs): The trace span changed from compact to full table reference - verify this is intentional.
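
To illustrate the caching suggestion in the performance item, here is one possible approach using `std::sync::OnceLock` (Rust 1.70+). The `PhysicalTable` shape, the string-based table reference, and both method names are assumptions made for this sketch. The real `table_ref()` and `catalog_schema()` may return different types.

```rust
use std::sync::OnceLock;

// Hypothetical, simplified PhysicalTable; field names are assumptions.
struct PhysicalTable {
    dataset_reference: String,
    table_name: String,
    // Lazily computed on first access, then reused.
    table_ref_cache: OnceLock<String>,
}

impl PhysicalTable {
    fn new(dataset_reference: &str, table_name: &str) -> Self {
        Self {
            dataset_reference: dataset_reference.to_string(),
            table_name: table_name.to_string(),
            table_ref_cache: OnceLock::new(),
        }
    }

    // Owned return type: allocates a new String on every call.
    fn table_ref_owned(&self) -> String {
        format!("{}.{}", self.dataset_reference, self.table_name)
    }

    // Cached variant: builds the String once and hands out a borrow afterwards.
    fn table_ref_cached(&self) -> &str {
        self.table_ref_cache
            .get_or_init(|| self.table_ref_owned())
            .as_str()
    }
}

fn main() {
    let table = PhysicalTable::new("example-hash", "blocks");
    let first = table.table_ref_cached();
    let second = table.table_ref_cached();
    // Both borrows point at the same cached allocation.
    assert_eq!(first.as_ptr(), second.as_ptr());
    assert_eq!(first, table.table_ref_owned());
}
```

Whether this is worth the extra field depends on how often these accessors are called. If they are not on a hot path, returning owned values is the simpler choice.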

Minor suggestions

Use dataset_reference and dataset.reference() consistently; some files mix the two
Removal of network() and schema() accessors from ResolvedTable changes the caller interface

Overall, this is a solid refactoring. The inline comments have specific suggestions for the items above.

crates/services/admin-api/src/handlers/datasets/restore.rs

crates/core/dump/src/raw_dataset.rs

crates/core/dump/src/streaming_query.rs

tests/src/testlib/helpers.rs

crates/core/dump/src/parquet_writer.rs

Reduce memory overhead and break tight coupling by storing only essential dataset fields in catalog types instead of full `Arc<Dataset>` references.

- Eliminate `Arc<Dataset>` dependency in `ResolvedTable` and `PhysicalTable` to reduce memory footprint
- Store only required dataset metadata (reference, start block, network) in catalog types
- Remove convenience method for creating resolved tables, requiring explicit construction at call sites
- Simplify `PhysicalTable` by directly storing table definition instead of wrapping through `ResolvedTable`
- Move table restoration logic from method to free function for better composability

Signed-off-by: Lorenzo Delgado <lorenzo@edgeandnode.com>

LNSD requested a review from Theodus January 14, 2026 14:31

LNSD self-assigned this Jan 14, 2026

LNSD added data-plane claude-review labels Jan 14, 2026

LNSD requested a review from JohnSwan1503 January 14, 2026 14:32

claude bot reviewed Jan 14, 2026

LNSD force-pushed the lnsd/refactor-common-use-dataset-tables branch 3 times, most recently from 8944d17 to 317f1ac Compare January 15, 2026 19:48

JohnSwan1503 approved these changes Jan 15, 2026

LNSD force-pushed the lnsd/refactor-common-use-dataset-tables branch from 1436b0c to 9687e9e Compare January 15, 2026 23:15

LNSD merged commit b7dffbb into main Jan 15, 2026
8 checks passed

LNSD deleted the lnsd/refactor-common-use-dataset-tables branch January 15, 2026 23:18

3 participants
