perf: Zero-copy binary refactor and CLOCK replacer (#17)
📝 Walkthrough

This PR restructures benchmark setup to avoid repeated initialization within timed loops, converts operator table ownership from unique to shared pointers, replaces the list-based LRU eviction in `LRUReplacer` with CLOCK-based eviction, refactors `HeapTable` storage from delimiter-separated strings to a binary payload format, adds performance regression testing infrastructure, disables diagnostic logging, and introduces type-query methods for `Value`.
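The CLOCK-based eviction the walkthrough mentions can be sketched as follows. This is a minimal illustration under assumed names (`ClockReplacer`, `ref_bits_`, `victim`), not the PR's actual `LRUReplacer` implementation.

```cpp
#include <cstddef>
#include <optional>
#include <vector>

// Minimal CLOCK replacer sketch: each frame has a reference bit; the hand
// sweeps the ring, clearing set bits and evicting the first unset frame.
class ClockReplacer {
public:
    explicit ClockReplacer(std::size_t num_frames)
        : ref_bits_(num_frames, false), in_use_(num_frames, false), hand_(0) {}

    // Mark a frame as recently used (e.g. called on unpin).
    void record_access(std::size_t frame_id) {
        in_use_[frame_id] = true;
        ref_bits_[frame_id] = true;
    }

    // Remove a frame from the eviction candidates (e.g. called on pin).
    void pin(std::size_t frame_id) { in_use_[frame_id] = false; }

    // Sweep the clock hand; returns the evicted frame, if any.
    std::optional<std::size_t> victim() {
        for (std::size_t scanned = 0; scanned < 2 * ref_bits_.size(); ++scanned) {
            std::size_t f = hand_;
            hand_ = (hand_ + 1) % ref_bits_.size();
            if (!in_use_[f]) continue;
            if (ref_bits_[f]) {
                ref_bits_[f] = false;   // second chance
            } else {
                in_use_[f] = false;
                return f;               // evict this frame
            }
        }
        return std::nullopt;            // nothing evictable
    }

private:
    std::vector<bool> ref_bits_;
    std::vector<bool> in_use_;
    std::size_t hand_;
};
```

Compared with list-based LRU, CLOCK replaces the linked list and per-access reordering with a single reference bit per frame, trading exact recency order for O(1) bookkeeping on each access.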
Estimated code review effort: 🎯 4 (Complex) | ⏱️ ~45 minutes
🚥 Pre-merge checks: ✅ 2 passed | ❌ 1 failed (1 warning)
Actionable comments posted: 4
Caution
Some comments are outside the diff and can’t be posted inline due to platform limitations.
⚠️ Outside diff range comments (1)
benchmarks/execution_bench.cpp (1)
Lines 31-57: ⚠️ Potential issue | 🟠 Major. Destroy the storage stack before deleting the benchmark directory.
`remove_all(test_dir)` runs while `table`, `bpm`, and `disk_manager` are still alive. If `BufferPoolManager` flushes dirty pages in its destructor, it will target a path you already removed, and on platforms with mandatory file locking the cleanup can fail outright. Put the storage objects in an inner scope and remove the directory afterwards.

Suggested fix:
```diff
 static void BM_ExecutionSeqScan(benchmark::State& state) {
   std::string test_dir = "./bench_exec_scan_" + std::to_string(state.range(0));
   std::filesystem::remove_all(test_dir);
   std::filesystem::create_directories(test_dir);
-
-  StorageManager disk_manager(test_dir);
-  BufferPoolManager bpm(2000, disk_manager);
-
-  Schema schema;
-  schema.add_column("id", common::ValueType::TYPE_INT64);
-  schema.add_column("data", common::ValueType::TYPE_TEXT);
-
-  auto table = std::make_shared<HeapTable>("scan_table", bpm, schema);
-  table->create();
-  SetupBenchTable(*table, state.range(0));
-
-  for (auto _ : state) {
-    auto scan_op = std::make_unique<SeqScanOperator>(table);
-    scan_op->init();
-    scan_op->open();
-    Tuple tuple;
-    while (scan_op->next(tuple)) {
-      benchmark::DoNotOptimize(tuple);
+  {
+    StorageManager disk_manager(test_dir);
+    BufferPoolManager bpm(2000, disk_manager);
+
+    Schema schema;
+    schema.add_column("id", common::ValueType::TYPE_INT64);
+    schema.add_column("data", common::ValueType::TYPE_TEXT);
+
+    auto table = std::make_shared<HeapTable>("scan_table", bpm, schema);
+    table->create();
+    SetupBenchTable(*table, state.range(0));
+
+    for (auto _ : state) {
+      auto scan_op = std::make_unique<SeqScanOperator>(table);
+      scan_op->init();
+      scan_op->open();
+      Tuple tuple;
+      while (scan_op->next(tuple)) {
+        benchmark::DoNotOptimize(tuple);
+      }
     }
   }
-
+
   state.SetItemsProcessed(state.iterations() * state.range(0));
   std::filesystem::remove_all(test_dir);
 }
```

Apply the same scoping pattern to `BM_ExecutionHashJoin` (also applies to lines 62-100).
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `@benchmarks/execution_bench.cpp` around lines 31 - 57, The benchmark removes test_dir while storage objects are still alive; wrap creation of StorageManager, BufferPoolManager, Schema, HeapTable and related setup (the block that creates test_dir, StorageManager, BufferPoolManager, schema, table, SetupBenchTable, and the scan loop using SeqScanOperator) in an inner scope so those objects (StorageManager, BufferPoolManager, HeapTable, etc.) are destructed before calling std::filesystem::remove_all(test_dir); apply the same inner-scope pattern to the BM_ExecutionHashJoin benchmark to ensure destructors run before deleting the directory.
🤖 Prompt for all review comments with AI agents
Verify each finding against the current code and only fix it if needed.
Inline comments:
In `@include/executor/operator.hpp`:
- Around line 114-115: The call sites still construct HeapTable with
std::make_unique but the operator constructors now take std::shared_ptr; update
the three locations that construct a HeapTable for IndexScanOperator and
SeqScanOperator to use std::make_shared<storage::HeapTable>(...) instead of
std::make_unique<storage::HeapTable>(...), ensuring the produced shared_ptr is
passed to the IndexScanOperator and SeqScanOperator constructors (refer to the
IndexScanOperator and SeqScanOperator calls around the join setup in
query_executor.cpp).
In `@include/storage/buffer_pool_manager.hpp`:
- Around line 104-107: The public accessor get_storage_manager() exposes a
mutable StorageManager& allowing callers to bypass BufferPoolManager invariants
(page_table_, replacer_, latches) — remove the accessor or change its signature
to return a const StorageManager& (or provide a narrow const-safe facade) so
callers cannot call mutating methods like read_page/write_page/allocate_page
directly; update any call sites (if any) to use BufferPoolManager's controlled
APIs instead and ensure the class header only exposes non-mutating operations or
an opaque const interface to StorageManager.
In `@src/storage/heap_table.cpp`:
- Around line 281-314: The deserializer advances cursor through the buffer
(data) without verifying remaining buffer size, so add explicit bounds checks
before each read: ensure cursor < data_len before reading the type byte, ensure
cursor + 8 <= data_len before memcpy of the 8-byte numeric payload, and ensure
cursor + 4 <= data_len before reading the uint32_t length and then ensure cursor
+ len <= data_len before constructing the string; use the existing buffer-length
parameter available to this function (e.g. data_len/tuple_size) and return an
error or throw if any check fails; update the loop around
schema_.column_count(), the branches handling common::ValueType, and the
string-deserialization path that calls common::Value::make_text to use these
guards.
- Around line 120-137: The loop in heap_table.cpp serializes all numeric values
as double causing precision loss for large integers; change the numeric branch
to distinguish integer vs floating numeric types (use val.is_integer() or check
val.type() for the integer kind) and serialize integers as int64_t via
val.to_int64() (8 bytes) while keeping floating types as double via
val.to_float64(); keep writing the uint8_t type tag to payload so
deserialization (the code that reads the numeric back, currently casting double
to int64_t around line 27) can read the correct 8-byte representation and
reconstruct int64_t for integer tags and double for floating tags. Ensure you
update the corresponding deserialization logic to check the stored type tag and
memcpy/interpret 8 bytes as int64_t for integer types and as double for floating
types.
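The two heap-table findings above (type-tagged 8-byte numeric payloads and bounds-checked deserialization) can be combined into one sketch. This is illustrative only; the tag values and helper names (`Tag`, `serialize_int64`, `serialize_text`, `read_int64`) are assumptions, not the actual `heap_table.cpp` code.

```cpp
#include <cstdint>
#include <cstring>
#include <optional>
#include <string>
#include <vector>

// Illustrative type tags; the real common::ValueType enum differs.
enum class Tag : uint8_t { Int64 = 0, Float64 = 1, Text = 2 };

// Integers are written as a 1-byte tag plus a raw 8-byte int64_t payload,
// so large values are not squeezed through double and lose no precision.
void serialize_int64(std::vector<uint8_t>& out, int64_t v) {
    out.push_back(static_cast<uint8_t>(Tag::Int64));
    uint8_t buf[8];
    std::memcpy(buf, &v, 8);
    out.insert(out.end(), buf, buf + 8);
}

// Strings are written as a 1-byte tag, a uint32_t length, then the bytes.
void serialize_text(std::vector<uint8_t>& out, const std::string& s) {
    out.push_back(static_cast<uint8_t>(Tag::Text));
    uint32_t len = static_cast<uint32_t>(s.size());
    uint8_t buf[4];
    std::memcpy(buf, &len, 4);
    out.insert(out.end(), buf, buf + 4);
    out.insert(out.end(), s.begin(), s.end());
}

// Bounds-checked read of one int64 value; returns nullopt on truncation
// instead of walking the cursor past the end of the buffer.
std::optional<int64_t> read_int64(const uint8_t* data, size_t data_len, size_t& cursor) {
    if (cursor >= data_len) return std::nullopt;                      // tag byte
    if (data[cursor] != static_cast<uint8_t>(Tag::Int64)) return std::nullopt;
    ++cursor;
    if (cursor + 8 > data_len) return std::nullopt;                   // 8-byte payload
    int64_t v;
    std::memcpy(&v, data + cursor, 8);
    cursor += 8;
    return v;
}
```

The string path would need the same guards: check `cursor + 4 <= data_len` before reading the length, then `cursor + len <= data_len` before constructing the string.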
ℹ️ Review info
⚙️ Run configuration
Configuration used: defaults
Review profile: CHILL
Plan: Pro
Run ID: 1f5febec-0aef-4c5b-a527-8f1e70e486a8
📒 Files selected for processing (9)
- benchmarks/execution_bench.cpp
- include/executor/operator.hpp
- include/storage/buffer_pool_manager.hpp
- include/storage/heap_table.hpp
- include/storage/lru_replacer.hpp
- src/executor/operator.cpp
- src/storage/heap_table.cpp
- src/storage/lru_replacer.cpp
- tests/buffer_pool_tests.cpp
```cpp
explicit SeqScanOperator(std::shared_ptr<storage::HeapTable> table, Transaction* txn = nullptr,
                         LockManager* lock_manager = nullptr);
```
🧩 Analysis chain
🏁 Script executed:
```bash
#!/bin/bash
# Find remaining scan/index operator call sites that still create HeapTable with unique_ptr.
rg -n -C3 'make_unique<storage::HeapTable>|make_shared<storage::HeapTable>|SeqScanOperator|IndexScanOperator' src include
```

Repository: poyrazK/cloudSQL
Length of output: 9868
Three call sites still pass `std::unique_ptr<storage::HeapTable>` to operators expecting `std::shared_ptr`, so they no longer compile.
The signature change to SeqScanOperator and IndexScanOperator requires all callers to pass std::make_shared<storage::HeapTable> instead of std::make_unique. Three locations in src/executor/query_executor.cpp still need updating:
- Lines 789–793 (IndexScanOperator)
- Lines 805–807 (SeqScanOperator, first join table)
- Lines 851–853 (SeqScanOperator, second join table)
Replace all three instances of std::make_unique<storage::HeapTable>(...) with std::make_shared<storage::HeapTable>(...) to resolve the compilation errors.
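The ownership change can be illustrated with a minimal sketch. The types below are hypothetical stand-ins, not the real `storage::HeapTable` and `SeqScanOperator`; they only show why `make_unique` at the call site no longer matches a `shared_ptr` parameter.

```cpp
#include <memory>

// Stand-in for storage::HeapTable.
struct HeapTable { /* ... */ };

// Stand-in operator: after the refactor it shares ownership of the table,
// so several operators (e.g. both sides of a join) can reference one table.
struct SeqScanOperator {
    explicit SeqScanOperator(std::shared_ptr<HeapTable> table)
        : table_(std::move(table)) {}
    std::shared_ptr<HeapTable> table_;
};

void example() {
    // Before: std::make_unique<HeapTable>() cannot convert to shared_ptr
    // at this call site. After:
    auto table = std::make_shared<HeapTable>();
    SeqScanOperator scan(table);   // caller and operator now share ownership
}
```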
🧹 Nitpick comments (3)
.github/workflows/ci.yml (3)
Lines 150-153: Add retry logic and either use or remove ccache. Two issues:
- Missing retry logic for `apt-get install`, unlike the `build` job (line 45), which could cause intermittent CI failures.
- `ccache` is installed but never utilized: no cache action or `CCACHE_DIR` export. Either add caching or remove it from the install list.

♻️ Proposed fix:
```diff
       - name: Install dependencies
         run: |
           sudo apt-get update
-          sudo apt-get install -y cmake clang ninja-build ccache
+          sudo apt-get install -y cmake clang ninja-build || (sleep 10 && sudo apt-get update && sudo apt-get install -y cmake clang ninja-build)
```

Alternatively, if you want to use ccache, add a cache step similar to lines 47-53 and export `CCACHE_DIR` in the configure step.

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In @.github/workflows/ci.yml around lines 150 - 153, The "Install dependencies" step runs apt-get without retries and installs ccache but doesn't use it; update that step to mirror the build job's retry wrapper for apt-get install (retry loop with sleeps) to avoid transient CI failures, and either remove "ccache" from the apt-get list or add ccache usage: create a cache step for ccache (similar to the build job's cache lines) and ensure you export CCACHE_DIR in the configure/build steps so ccache is actually used.
Lines 155-162: Consider specifying the compiler explicitly for reproducible benchmarks. The `build` job uses a matrix to specify the compiler, but this job relies on the system default. For consistent benchmark comparisons across CI runs (especially if the runner image changes), consider pinning the compiler.

♻️ Optional: specify compiler
```diff
           cmake .. -G Ninja \
             -DCMAKE_BUILD_TYPE=Release \
+            -DCMAKE_CXX_COMPILER=clang++ \
             -DBUILD_BENCHMARKS=ON \
             -DBUILD_TESTS=OFF
```
Verify each finding against the current code and only fix it if needed. In @.github/workflows/ci.yml around lines 155 - 162, The CI CMake configure step currently relies on system defaults (cmake .. -G Ninja ... -DCMAKE_BUILD_TYPE=Release -DBUILD_BENCHMARKS=ON -DBUILD_TESTS=OFF); update that CMake invocation to pin compilers by adding -DCMAKE_C_COMPILER and -DCMAKE_CXX_COMPILER (using the matrix-provided compiler variables in the workflow) so the build job produces reproducible benchmarks—modify the configure command inside the "Configure CMake (Release)" step to pass those two CMake variables referencing the job/matrix compiler names.
Lines 169-174: Add a timeout and capture benchmark results as artifacts. The project tracks performance baselines (see `docs/performance/REPORT_V1.md`), making artifact capture valuable for regression detection. Without a timeout, a hung benchmark (e.g., a port binding failure in `network_bench`) will block CI for up to 6 hours. Google Benchmark v1.8.3 is in use and supports JSON output via built-in flags.

♻️ Proposed improvement:
```diff
       - name: Run Benchmarks
+        timeout-minutes: 15
         run: |
           cd build
-          ./storage_bench
-          ./execution_bench
-          ./network_bench
+          ./storage_bench --benchmark_out=storage_results.json --benchmark_out_format=json
+          ./execution_bench --benchmark_out=execution_results.json --benchmark_out_format=json
+          ./network_bench --benchmark_out=network_results.json --benchmark_out_format=json
+
+      - name: Upload Benchmark Results
+        uses: actions/upload-artifact@v4
+        with:
+          name: benchmark-results
+          path: build/*_results.json
```
Verify each finding against the current code and only fix it if needed. In @.github/workflows/ci.yml around lines 169 - 174, Wrap each benchmark invocation (storage_bench, execution_bench, network_bench) with a timeout (e.g., GNU timeout) to prevent CI hangs, pass Google Benchmark flags --benchmark_format=json and --benchmark_out=<name>.json to produce machine-readable results (use distinct names like storage_bench.json, execution_bench.json, network_bench.json), and add a subsequent GitHub Actions upload-artifact step to publish those JSON files as CI artifacts for baseline/regression tracking; ensure the timeout value is reasonable for your suites and apply the same pattern to all three binaries.
ℹ️ Review info
⚙️ Run configuration
Configuration used: defaults
Review profile: CHILL
Plan: Pro
Run ID: 25b6b0bd-9cf2-4678-9bd9-bd94a9424971
📒 Files selected for processing (3)
- .github/workflows/ci.yml
- src/network/rpc_client.cpp
- src/network/rpc_server.cpp
✅ Files skipped from review due to trivial changes (2)
- src/network/rpc_server.cpp
- src/network/rpc_client.cpp
Actionable comments posted: 2
🧹 Nitpick comments (1)
scripts/check_perf_regression.py (1)
Lines 1-3: Add shebang for executable script. The file is marked executable but lacks a shebang line. While the CI invokes it explicitly with `python3`, adding a shebang improves portability for local use.

Proposed fix:
```diff
+#!/usr/bin/env python3
 import json
 import sys
 import os
```
Verify each finding against the current code and only fix it if needed. In `@scripts/check_perf_regression.py` around lines 1 - 3, This script lacks a shebang; add a Unix shebang line (#!/usr/bin/env python3) as the very first line of the file before the import statements so the executable bit works for local execution, save the file, and ensure the file mode remains executable.
🤖 Prompt for all review comments with AI agents
Verify each finding against the current code and only fix it if needed.
Inline comments:
In `@scripts/check_perf_regression.py`:
- Around line 29-35: Guard against division by zero when computing change: check
the value of old_time (from base_map[name]) before computing change = (new_time
- old_time) / old_time; if old_time is zero or None, handle it explicitly (e.g.,
log or print a warning referencing name and old_time and skip computing
change/continue, or set change to math.inf or a sentinel) and then format the
output accordingly when printing name, old_time, new_time, and change; update
the block that reads base_map, old_time, new_time, change and the print
statement to implement this check and handling.
- Around line 10-17: The JSON load currently swallows all errors and returns
True on failure; change it so a failure to load current_file fails the check
(return False or re-raise) while a missing baseline_file is handled gracefully:
split the try/except into two blocks—one that opens/parses current_file and on
any exception prints the error and returns False (or raise), and a second that
tries to open/parses baseline_file but catches FileNotFoundError to set
baseline=None and only treats other exceptions as failures (print error + return
False). Reference the current_file and baseline_file loading logic in the
existing try/except and update the error messages accordingly.
ℹ️ Review info
⚙️ Run configuration
Configuration used: defaults
Review profile: CHILL
Plan: Pro
Run ID: 9b6541bc-b4a9-42d3-b0ed-69363064cfbf
📒 Files selected for processing (6)
- .github/workflows/ci.yml
- benchmarks/execution_bench.cpp
- include/common/value.hpp
- scripts/check_perf_regression.py
- src/executor/query_executor.cpp
- src/storage/heap_table.cpp
🚧 Files skipped from review as they are similar to previous changes (3)
- benchmarks/execution_bench.cpp
- .github/workflows/ci.yml
- src/storage/heap_table.cpp
This PR finalizes Phase 3 of the performance optimizations.

Changes:
- Added a `last_page_id_hint` to solve the O(N^2) free space search bottleneck.
- Replaced the string-based tuple format (`std::to_string`, `std::stringstream`, `std::getline`) with a zero-copy, direct memory-mapped binary layout on disk.