cpu-o3: btbtage two slot pre entry by CJ362ff · Pull Request #832 · OpenXiangShan/GEM5

CJ362ff · 2026-04-15T08:40:18Z

Summary by CodeRabbit

Refactor
- Per-branch slot tracking added; prediction, update and allocation logic now operate at slot granularity and sync legacy entry mirrors.
- Stage-1 tag computation updated to be block-based (position removed); allocation/weakening/useful-reset behavior centralized.
Tests
- Extensive slot-aware tests and helpers covering lookup, allocation, eviction, reuse, spill and reordering scenarios.
Documentation
- Added cross-reference to dual-slot entry semantics.
Chores
- Tracing and stats extended to record slot-level metadata and slot-miss metrics.

coderabbitai · 2026-04-15T08:40:37Z

Warning

Rate limit exceeded

@CJ362ff has exceeded the limit for the number of commits that can be reviewed per hour. Please wait 54 minutes and 46 seconds before requesting another review.

Your organization is not enrolled in usage-based pricing. Contact your admin to enable usage-based pricing to continue reviews beyond the rate limit, or try again in 54 minutes and 46 seconds.

⌛ How to resolve this issue?

After the wait time has elapsed, a review can be triggered using the @coderabbitai review command as a PR comment. Alternatively, push new commits to this PR.

We recommend that you space out your commits to avoid hitting the rate limit.

🚦 How do rate limits work?

CodeRabbit enforces hourly rate limits for each developer per organization.

Our paid plans have higher rate limits than the trial, open-source and free plans. In all cases, we re-allow further reviews after a brief timeout.

Please see our FAQ for further information.

ℹ️ Review info

⚙️ Run configuration

Configuration used: defaults

Review profile: CHILL

Plan: Pro

Run ID: 1e53f927-4a00-4a18-b1cd-09c8ab92e956

📥 Commits

Reviewing files that changed from the base of the PR and between a7399c2 and 1c0abaa.

📒 Files selected for processing (1)

src/cpu/pred/BranchPredictor.py

📝 Walkthrough

Walkthrough

Slot-aware TAGE: prediction, lookup, update, and allocation are rewritten to track per-entry slots (positioned by block-relative position). Tag computation no longer XORs branch position. Trace/schema extended with slot fields. Tests and helper APIs updated; several function signatures changed to return allocated slot information.

Changes

Cohort / File(s)	Summary
Core TAGE implementation `src/cpu/pred/btb/btb_tage.cc`	Introduces per-entry slots and slot-aware lookup/update/allocation flows; records main/alt way+slot; allocation now returns `allocated_slot`; many helpers refactored/added; added assert for duplicate same-tag slots.
Public headers & types `src/cpu/pred/btb/btb_tage.hh`	Adds `TageSlot`, converts `TageEntry` to contain slot array, updates `TageTableInfo` to carry `slot`+`slotInfo`; removes position param from `getTageTag` APIs; declares new slot/allocation helpers and stats; static `kNumSlotsPerEntry = 2`.
Upper-bound variant `src/cpu/pred/btb/btb_tage_ub.cc`	Adapts UpperBound helpers to use `slotInfo.counter`/`slotInfo` snapshots for strength/confidence checks and allocation decisions (no signature changes).
Trace & common schema `src/cpu/pred/btb/common.hh`, `src/cpu/pred/btb/microtage.cc`	Extends TAGE miss trace/schema to include `mainWay`, `mainSlot`, `altWay`, `altSlot`, `allocSlot` (and registers fields in `microtage`). Adjusts trace `set()` signature.
Tests `src/cpu/pred/btb/test/btb_tage.test.cc`	Massive test rewrite to exercise slot-aware behavior: new helpers, entry creation using slots, allocation/mispredict forcing, verification by slot position, and new eviction/spill/weaken scenarios.
Docs `src/cpu/pred/btb/docs/btb_tage.md`	Adds cross-reference to dual-slot entry design doc and minor newline change.

Estimated code review effort

🎯 4 (Complex) | ⏱️ ~45 minutes

Possibly related PRs

Tage tag align #675 — touches TAGE tag computation and stage-1 hashing similar to removal of position XOR in this change.
cpu-o3: update tage index #627 — modifies allocation and index/tag computation sites that overlap handleNewEntryAllocation changes.
Tage1 align #615 — changes TAGE tagging/indexing interfaces; relates to removal of position parameter and tag API updates.

Suggested labels

perf, align-kmhv3

Suggested reviewers

jensen-yan
Yakkhini

Poem

🐇
I hopped through tables, slots in tow,
two little burrows where branch bits go.
Tags now count blocks, not tiny places,
I nibble duplicates and tidy traces.
Hooray — a soft thump for cleaner predictor flows!

🚥 Pre-merge checks | ✅ 2 | ❌ 1

❌ Failed checks (1 warning)

Check name	Status	Explanation	Resolution
Docstring Coverage	⚠️ Warning	Docstring coverage is 35.82% which is insufficient. The required threshold is 80.00%.	Write docstrings for the functions missing them to satisfy the coverage threshold.

✅ Passed checks (2 passed)

Check name	Status	Explanation
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.
Title check	✅ Passed	The title 'cpu-o3: btbtage two slot pre entry' directly describes the main change: implementing a two-slot-per-entry architecture for the BTBTAGE predictor, which is the core focus across all modified files.

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

✨ Finishing Touches

🧪 Generate unit tests (beta)

Create PR with unit tests
Commit unit tests in branch compress-tage-entry-align

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

XiangShanRobot · 2026-04-15T08:43:25Z

[Generated by GEM5 Performance Robot]
commit: bf3f461
workflow: gem5 Align BTB Performance Test(0.3c)

Align BTB Performance

Overall Score

	PR	Master	Diff(%)
Score	18.73	18.71	+0.15 🟢

coderabbitai

Actionable comments posted: 2

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Inline comments:
In `@src/cpu/pred/btb/btb_tage.cc`:
- Around line 547-556: syncEntryLegacyMirror currently updates only
counter/useful and must also mirror the slot-0 PC per the compatibility
contract; update BTBTAGE::syncEntryLegacyMirror to set entry.pc =
entry.slots[0].pc when entry.slots[0].valid (and set entry.pc to a defined
invalid/sentinel value when not valid, e.g., 0 or the project's INVALID_PC) so
legacy readers observe the same PC as slot 0; update any tests that relied on
manually rewriting entry.pc to validate the helper.
- Around line 789-824: The code currently weakens a non-useful strong slot in
same_entry and then returns false, which stops the outer loop over tables and
prevents probing higher-history tables; change that final return so the
allocator continues probing later tables: after adjusting cand_slot.counter,
call sortEntrySlotsByPosition(same_entry), syncEntryLegacyMirror(same_entry),
noteAllocationFailure(), and then continue the outer for (unsigned ti =
start_table; ti < numPredictors; ++ti) loop instead of returning false (i.e.,
replace the final "return false;" with a control flow that continues to the next
ti), keeping the existing updates to tageStats and allocation tracking only
where appropriate.

🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

Push a commit to this branch (recommended)
Create a new PR with the fixes

ℹ️ Review info

⚙️ Run configuration

Configuration used: defaults

Review profile: CHILL

Plan: Pro

Run ID: 94de6458-6410-4e28-a096-98a0a5b5f8ce

📥 Commits

Reviewing files that changed from the base of the PR and between cac9b46 and bf3f461.

📒 Files selected for processing (7)

src/cpu/pred/btb/btb_tage.cc
src/cpu/pred/btb/btb_tage.hh
src/cpu/pred/btb/btb_tage_ub.cc
src/cpu/pred/btb/common.hh
src/cpu/pred/btb/docs/btb_tage.md
src/cpu/pred/btb/microtage.cc
src/cpu/pred/btb/test/btb_tage.test.cc

coderabbitai · 2026-04-15T08:48:34Z

+BTBTAGE::syncEntryLegacyMirror(TageEntry &entry)
+{
+    if (entry.slots[0].valid) {
+        entry.counter = entry.slots[0].counter;
+        entry.useful = entry.slots[0].useful;
+    } else {
+        entry.counter = 0;
+        entry.useful = false;
+    }
+}


⚠️ Potential issue | 🟠 Major

syncEntryLegacyMirror() no longer syncs the mirrored pc.

The compatibility contract in src/cpu/pred/btb/btb_tage.hh says counter/useful/pc still mirror slots[0], but this helper only refreshes counter/useful. After sortEntrySlotsByPosition() moves a different branch into slot 0, entry.pc stays on the old branch, so any remaining legacy reader will attribute the entry to the wrong PC. The new test helper in src/cpu/pred/btb/test/btb_tage.test.cc:108-113 having to rewrite entry.pc manually is a symptom of the same gap.

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed. In `@src/cpu/pred/btb/btb_tage.cc` around lines 547 - 556, syncEntryLegacyMirror currently updates only counter/useful and must also mirror the slot-0 PC per the compatibility contract; update BTBTAGE::syncEntryLegacyMirror to set entry.pc = entry.slots[0].pc when entry.slots[0].valid (and set entry.pc to a defined invalid/sentinel value when not valid, e.g., 0 or the project's INVALID_PC) so legacy readers observe the same PC as slot 0; update any tests that relied on manually rewriting entry.pc to validate the helper.

coderabbitai · 2026-04-15T08:48:34Z

+            int replace_slot = -1;
+            for (unsigned slot = 0; slot < same_entry.slots.size(); ++slot) {
+                const auto &cand_slot = same_entry.slots[slot];
+                if (!cand_slot.useful && isWeakishCounter(cand_slot.counter)) {
+                    replace_slot = static_cast<int>(slot);
+                    break;
+                }
+            }
+            if (replace_slot >= 0) {
+                // same-tag + full slots + replaceable slot: replace.
+                same_entry.slots[replace_slot] = TageSlot(true, position, initCounter, false);
+                sortEntrySlotsByPosition(same_entry);
+                syncEntryLegacyMirror(same_entry);
+                tageStats.updateAllocSuccess++;
+                allocated_table = ti;
+                allocated_index = newIndex;
+                allocated_way = same_tag_way;
+                allocated_slot = findSlotByPosition(same_entry, position);
+                return true;
+            }
+
+            // same-tag + full slots + no replaceable slot: weaken one non-useful strong slot.
+            for (auto &cand_slot : same_entry.slots) {
+                if (!cand_slot.useful && !isWeakishCounter(cand_slot.counter)) {
+                    if (cand_slot.counter > 0) {
+                        cand_slot.counter--;
+                    } else {
+                        cand_slot.counter++;
+                    }
+                    break;
+                }
+            }
+            sortEntrySlotsByPosition(same_entry);
+            syncEntryLegacyMirror(same_entry);
+            noteAllocationFailure();
+            return false;


⚠️ Potential issue | 🟠 Major

Keep probing higher tables after a full same-tag entry can't accept the new slot.

This path weakens the local slot and then returns immediately, which skips the rest of the for (unsigned ti = start_table; ti < numPredictors; ++ti) search. In practice, one congested shared entry in table ti can prevent allocation in every longer-history table above it, even though those later tables are still valid allocation targets.

Possible fix

// same-tag + full slots + no replaceable slot: weaken one non-useful strong slot. for (auto &cand_slot : same_entry.slots) { if (!cand_slot.useful && !isWeakishCounter(cand_slot.counter)) { if (cand_slot.counter > 0) { cand_slot.counter--; } else { cand_slot.counter++; } break; } } sortEntrySlotsByPosition(same_entry); syncEntryLegacyMirror(same_entry); noteAllocationFailure(); - return false; + continue;

📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change

int replace_slot = -1;

for (unsigned slot = 0; slot < same_entry.slots.size(); ++slot) {

const auto &cand_slot = same_entry.slots[slot];

if (!cand_slot.useful && isWeakishCounter(cand_slot.counter)) {

replace_slot = static_cast<int>(slot);

break;

}

}

if (replace_slot >= 0) {

// same-tag + full slots + replaceable slot: replace.

same_entry.slots[replace_slot] = TageSlot(true, position, initCounter, false);

sortEntrySlotsByPosition(same_entry);

syncEntryLegacyMirror(same_entry);

tageStats.updateAllocSuccess++;

allocated_table = ti;

allocated_index = newIndex;

allocated_way = same_tag_way;

allocated_slot = findSlotByPosition(same_entry, position);

return true;

}

// same-tag + full slots + no replaceable slot: weaken one non-useful strong slot.

for (auto &cand_slot : same_entry.slots) {

if (!cand_slot.useful && !isWeakishCounter(cand_slot.counter)) {

if (cand_slot.counter > 0) {

cand_slot.counter--;

} else {

cand_slot.counter++;

}

break;

}

}

sortEntrySlotsByPosition(same_entry);

syncEntryLegacyMirror(same_entry);

noteAllocationFailure();

return false;

int replace_slot = -1;

for (unsigned slot = 0; slot < same_entry.slots.size(); ++slot) {

const auto &cand_slot = same_entry.slots[slot];

if (!cand_slot.useful && isWeakishCounter(cand_slot.counter)) {

replace_slot = static_cast<int>(slot);

break;

}

}

if (replace_slot >= 0) {

// same-tag + full slots + replaceable slot: replace.

same_entry.slots[replace_slot] = TageSlot(true, position, initCounter, false);

sortEntrySlotsByPosition(same_entry);

syncEntryLegacyMirror(same_entry);

tageStats.updateAllocSuccess++;

allocated_table = ti;

allocated_index = newIndex;

allocated_way = same_tag_way;

allocated_slot = findSlotByPosition(same_entry, position);

return true;

}

// same-tag + full slots + no replaceable slot: weaken one non-useful strong slot.

for (auto &cand_slot : same_entry.slots) {

if (!cand_slot.useful && !isWeakishCounter(cand_slot.counter)) {

if (cand_slot.counter > 0) {

cand_slot.counter--;

} else {

cand_slot.counter++;

}

break;

}

}

sortEntrySlotsByPosition(same_entry);

syncEntryLegacyMirror(same_entry);

noteAllocationFailure();

continue;

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed. In `@src/cpu/pred/btb/btb_tage.cc` around lines 789 - 824, The code currently weakens a non-useful strong slot in same_entry and then returns false, which stops the outer loop over tables and prevents probing higher-history tables; change that final return so the allocator continues probing later tables: after adjusting cand_slot.counter, call sortEntrySlotsByPosition(same_entry), syncEntryLegacyMirror(same_entry), noteAllocationFailure(), and then continue the outer for (unsigned ti = start_table; ti < numPredictors; ++ti) loop instead of returning false (i.e., replace the final "return false;" with a control flow that continues to the next ti), keeping the existing updates to tageStats and allocation tracking only where appropriate.

coderabbitai

♻️ Duplicate comments (2)

src/cpu/pred/btb/btb_tage.cc (2)

588-598: ⚠️ Potential issue | 🟠 Major

syncEntryLegacyMirror() still leaves the mirrored pc stale.

After sortEntrySlotsByPosition() moves a different branch into slots[0], this helper refreshes only counter/useful. Any remaining legacy reader using entry.pc will now observe the wrong branch.

Suggested change

 void
 BTBTAGE::syncEntryLegacyMirror(TageEntry &entry)
 {
     if (entry.slots[0].valid) {
         entry.counter = entry.slots[0].counter;
         entry.useful = entry.slots[0].useful;
+        entry.pc = entry.ownerBlockBase +
+                   (static_cast<Addr>(entry.slots[0].position) << instShiftAmt);
     } else {
         entry.counter = 0;
         entry.useful = false;
+        entry.pc = 0;
     }
 }

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@src/cpu/pred/btb/btb_tage.cc` around lines 588 - 598, syncEntryLegacyMirror
updates only counter/useful after sortEntrySlotsByPosition may have moved a
different branch into slots[0], leaving entry.pc stale; modify
syncEntryLegacyMirror (operating on TageEntry &entry) to also refresh the
mirrored pc from entry.slots[0].pc when entry.slots[0].valid, and set entry.pc
to a safe default/invalid value when not valid so legacy readers see the correct
branch identity.

1012-1014: ⚠️ Potential issue | 🟠 Major

Keep probing higher tables after a blocked same-tag entry.

This return false exits the outer allocation loop immediately after weakening the current table. A congested table at ti can therefore prevent allocation in every longer-history table above it.

Suggested change

             weakenFirstNonUsefulStrongSlot(ti, newIndex);
             noteAllocationFailure();
-            return false;
+            continue;

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@src/cpu/pred/btb/btb_tage.cc` around lines 1012 - 1014, The early "return
false" after calling weakenFirstNonUsefulStrongSlot(ti, newIndex) and
noteAllocationFailure() prematurely exits the outer allocation loop and prevents
probing higher-history tables; change the control flow so that instead of
returning, the code continues probing upper tables (e.g., replace the return
false with a continue of the allocation loop or otherwise skip to the next table
index), ensuring weakenFirstNonUsefulStrongSlot and noteAllocationFailure are
still invoked but allocation attempts proceed for larger ti values.

🧹 Nitpick comments (1)

src/cpu/pred/btb/btb_tage.hh (1)

130-132: Make TageTableInfo::taken() return false on misses.

slotInfo defaults to counter == 0, so a default-constructed miss currently reports taken() == true. Most call sites guard on found, but baking that check into the helper avoids another subtle misuse later.

Suggested change

             bool taken() const {
-                return slotInfo.taken();
+                return found && slotInfo.valid && slotInfo.taken();
             }

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@src/cpu/pred/btb/btb_tage.hh` around lines 130 - 132, TageTableInfo::taken()
currently returns slotInfo.taken() even for misses (default
slotInfo.counter==0), so change taken() to return false when this entry isn't
valid by checking the `found` flag first (e.g., if (!found) return false;),
otherwise return slotInfo.taken(); update the TageTableInfo::taken()
implementation to reference `found` and `slotInfo.taken()` so callers no longer
need to guard on `found`.

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Duplicate comments:
In `@src/cpu/pred/btb/btb_tage.cc`:
- Around line 588-598: syncEntryLegacyMirror updates only counter/useful after
sortEntrySlotsByPosition may have moved a different branch into slots[0],
leaving entry.pc stale; modify syncEntryLegacyMirror (operating on TageEntry
&entry) to also refresh the mirrored pc from entry.slots[0].pc when
entry.slots[0].valid, and set entry.pc to a safe default/invalid value when not
valid so legacy readers see the correct branch identity.
- Around line 1012-1014: The early "return false" after calling
weakenFirstNonUsefulStrongSlot(ti, newIndex) and noteAllocationFailure()
prematurely exits the outer allocation loop and prevents probing higher-history
tables; change the control flow so that instead of returning, the code continues
probing upper tables (e.g., replace the return false with a continue of the
allocation loop or otherwise skip to the next table index), ensuring
weakenFirstNonUsefulStrongSlot and noteAllocationFailure are still invoked but
allocation attempts proceed for larger ti values.

---

Nitpick comments:
In `@src/cpu/pred/btb/btb_tage.hh`:
- Around line 130-132: TageTableInfo::taken() currently returns slotInfo.taken()
even for misses (default slotInfo.counter==0), so change taken() to return false
when this entry isn't valid by checking the `found` flag first (e.g., if
(!found) return false;), otherwise return slotInfo.taken(); update the
TageTableInfo::taken() implementation to reference `found` and
`slotInfo.taken()` so callers no longer need to guard on `found`.

ℹ️ Review info

⚙️ Run configuration

Configuration used: defaults

Review profile: CHILL

Plan: Pro

Run ID: cc0f0afd-4086-423a-afcb-4aec26aa3263

📥 Commits

Reviewing files that changed from the base of the PR and between bf3f461 and 75528f1.

📒 Files selected for processing (5)

src/cpu/pred/btb/btb_tage.cc
src/cpu/pred/btb/btb_tage.hh
src/cpu/pred/btb/common.hh
src/cpu/pred/btb/microtage.cc
src/cpu/pred/btb/test/btb_tage.test.cc

🚧 Files skipped from review as they are similar to previous changes (2)

src/cpu/pred/btb/microtage.cc
src/cpu/pred/btb/test/btb_tage.test.cc

XiangShanRobot · 2026-04-15T12:36:12Z

[Generated by GEM5 Performance Robot]
commit: 75528f1
workflow: gem5 Align BTB Performance Test(0.3c)

Align BTB Performance

Overall Score

	PR	Master	Diff(%)
Score	18.77	18.71	+0.37 🟢

[Generated by GEM5 Performance Robot]
commit: 75528f1
workflow: gem5 Align BTB Performance Test(0.3c)

Align BTB Performance

Overall Score

	PR	Previous Commit	Diff(%)
Score	18.77	18.73	+0.22 🟢

Change-Id: I54fe1b814015c69233d5944f89e67bfa177886fb

coderabbitai

🧹 Nitpick comments (2)

src/cpu/pred/btb/btb_tage.cc (1)
515-525: Document the "weakish" counter threshold.

The formula std::abs(counter * 2 + 1) <= 3 classifies counters -2, -1, 0, 1 as "weakish" (not saturated or near-saturated). Consider adding a brief comment explaining this threshold choice for future maintainers.
📝 Suggested documentation
 bool
 BTBTAGE::isWeakishCounter(short counter) const
 {
+    // "Weakish" = counters -2, -1, 0, 1 (not saturated or near-saturated)
+    // Formula maps [-2,1] to |val| <= 3, while [-4,-3] and [2,3] map to |val| >= 5
     return std::abs(counter * 2 + 1) <= 3;
 }
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@src/cpu/pred/btb/btb_tage.cc` around lines 515 - 525, Add a short inline
comment above BTBTAGE::isWeakishCounter explaining the threshold choice: clarify
that the expression std::abs(counter * 2 + 1) <= 3 is used to detect "weakish"
2-bit-ish signed counters (it evaluates true for counter values -2, -1, 0, 1
which are not saturated/strongly biased), and note that isSlotUnprotected treats
slots with such counters (and non-useful) as unprotected; update comments near
BTBTAGE::isWeakishCounter and BTBTAGE::isSlotUnprotected to document this
rationale for future maintainers.
src/cpu/pred/btb/test/btb_tage.test.cc (1)
964-1026: Consider adding test teardown to prevent memory leaks.

Multiple tests (e.g., this one at line 965) create new BTBTAGE instances without deleting the previous one assigned to tage. While acceptable for short-lived test processes, this pattern could accumulate significant memory in large test suites.
♻️ Suggested pattern to avoid leaks
 TEST_F(BTBTAGETest, DifferentTagWholeEntryEvictionRequiresAllSlotsUnprotected) {
+    delete tage;
     tage = new BTBTAGE(1, 2, 32);
Or consider using smart pointers in the test fixture:
std::unique_ptr<BTBTAGE> tage;
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@src/cpu/pred/btb/test/btb_tage.test.cc` around lines 964 - 1026, The test
DifferentTagWholeEntryEvictionRequiresAllSlotsUnprotected (and others)
repeatedly does tage = new BTBTAGE(...) without freeing the previous BTBTAGE,
causing memory leaks; fix by ensuring the fixture-owned tage is freed between
tests—either change the fixture member to std::unique_ptr<BTBTAGE> tage and
replace raw new with std::make_unique, or add a TearDown/cleanup that deletes
the existing tage before reassigning (or at test end); update all usages of the
tage pointer accordingly so the BTBTAGE instance is properly destroyed.

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Nitpick comments:
In `@src/cpu/pred/btb/btb_tage.cc`:
- Around line 515-525: Add a short inline comment above
BTBTAGE::isWeakishCounter explaining the threshold choice: clarify that the
expression std::abs(counter * 2 + 1) <= 3 is used to detect "weakish" 2-bit-ish
signed counters (it evaluates true for counter values -2, -1, 0, 1 which are not
saturated/strongly biased), and note that isSlotUnprotected treats slots with
such counters (and non-useful) as unprotected; update comments near
BTBTAGE::isWeakishCounter and BTBTAGE::isSlotUnprotected to document this
rationale for future maintainers.

In `@src/cpu/pred/btb/test/btb_tage.test.cc`:
- Around line 964-1026: The test
DifferentTagWholeEntryEvictionRequiresAllSlotsUnprotected (and others)
repeatedly does tage = new BTBTAGE(...) without freeing the previous BTBTAGE,
causing memory leaks; fix by ensuring the fixture-owned tage is freed between
tests—either change the fixture member to std::unique_ptr<BTBTAGE> tage and
replace raw new with std::make_unique, or add a TearDown/cleanup that deletes
the existing tage before reassigning (or at test end); update all usages of the
tage pointer accordingly so the BTBTAGE instance is properly destroyed.

ℹ️ Review info

⚙️ Run configuration

Configuration used: defaults

Review profile: CHILL

Plan: Pro

Run ID: 4b0f074f-9c0b-4eb2-aec9-58219c831da4

📥 Commits

Reviewing files that changed from the base of the PR and between 75528f1 and a7399c2.

📒 Files selected for processing (3)

src/cpu/pred/btb/btb_tage.cc
src/cpu/pred/btb/btb_tage.hh
src/cpu/pred/btb/test/btb_tage.test.cc

XiangShanRobot · 2026-04-16T11:04:27Z

[Generated by GEM5 Performance Robot]
commit: a7399c2
workflow: gem5 Align BTB Performance Test(0.3c)

Align BTB Performance

Overall Score

	PR	Master	Diff(%)
Score	18.77	18.71	+0.35 🟢

[Generated by GEM5 Performance Robot]
commit: a7399c2
workflow: gem5 Align BTB Performance Test(0.3c)

Align BTB Performance

Overall Score

	PR	Previous Commit	Diff(%)
Score	18.77	18.77	-0.01 🔴

Change-Id: I731abc26584eaeb7d275beeeac8264836e4c0a1f

XiangShanRobot · 2026-04-16T14:00:03Z

[Generated by GEM5 Performance Robot]
commit: 1c0abaa
workflow: gem5 Align BTB Performance Test(0.3c)

Align BTB Performance

Overall Score

	PR	Master	Diff(%)
Score	18.79	18.71	+0.46 🟢

[Generated by GEM5 Performance Robot]
commit: 1c0abaa
workflow: gem5 Align BTB Performance Test(0.3c)

Align BTB Performance

Overall Score

	PR	Previous Commit	Diff(%)
Score	18.79	18.77	+0.11 🟢

cpu-o3: btbtage two slot pre entry

bf3f461

coderabbitai Bot reviewed Apr 15, 2026

View reviewed changes

cpu-o3: tage compress allow same tag

75528f1

coderabbitai Bot reviewed Apr 15, 2026

View reviewed changes

cpu: Prefer same-tag spill before weak-slot replace

a7399c2

Change-Id: I54fe1b814015c69233d5944f89e67bfa177886fb

coderabbitai Bot reviewed Apr 16, 2026

View reviewed changes

cpu: Tune BTBTAGE table sizes for align run

1c0abaa

Change-Id: I731abc26584eaeb7d275beeeac8264836e4c0a1f

Conversation

CJ362ff commented Apr 15, 2026 • edited by coderabbitai Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary by CodeRabbit

Uh oh!

coderabbitai Bot commented Apr 15, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Rate limit exceeded

Walkthrough

Changes

Estimated code review effort

Possibly related PRs

Suggested labels

Suggested reviewers

Poem

❌ Failed checks (1 warning)

Uh oh!

XiangShanRobot commented Apr 15, 2026

Align BTB Performance

Overall Score

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot Apr 15, 2026

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot Apr 15, 2026

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

XiangShanRobot commented Apr 15, 2026

Align BTB Performance

Overall Score

Align BTB Performance

Overall Score

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

XiangShanRobot commented Apr 16, 2026

Align BTB Performance

Overall Score

Align BTB Performance

Overall Score

Uh oh!

XiangShanRobot commented Apr 16, 2026

Align BTB Performance

Overall Score

Align BTB Performance

Overall Score

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

CJ362ff commented Apr 15, 2026 •

edited by coderabbitai Bot

Loading

coderabbitai Bot commented Apr 15, 2026 •

edited

Loading