The research landscape — has the algorithm ceiling been reached?研究現況 —— 演算法天花板是不是已經到了?

What we studied about the state of the art in accelerating switch-level / event-driven logic simulation: where each idea actually peaked, what the last 20 years really delivered, and the honest conclusion about how much room is left.

我們對「開關級 / event-driven 邏輯模擬加速」現況的研究整理:每個想法實際在哪個年代成熟、近 20 年到底帶來了什麼、以及「還剩多少空間」的誠實結論。

The short version先講結論

For our regime — single-core, event-driven, switch-level, extremely low activity (~1.4 nodes resolved per event) — the algorithmic research space looks close to its ceiling. The two strategy families we rely on (structural reduction and common-case fast-pathing) matured between the 1980s and the mid-2000s, and the last ~20 years brought no disruptive algorithmic breakthrough in this specific niche. The gains since then are systems / data-oriented engineering, not new algorithms.

就我們這個情境而言 —— 單核、event-driven、開關級、極低 activity(平均每個 event 只解析 ~1.4 個節點)—— 演算法研究空間看起來已接近天花板。我們倚賴的兩條策略(結構化簡、common-case fast-path)在 1980s 到 2000s 中期就成熟了,近 20 年在這個特定利基沒有顛覆性的演算法突破。此後的進步是系統 / 資料導向的工程,不是新演算法。

So short-term, the realistic lever is not "find a smarter graph algorithm" — it's to validate the classic methods on a real, full netlist, and compose the combination that actually runs fastest on real hardware. That is exactly where this project puts its effort, and where it claims its (modest, honest) value.

所以短期內,務實的槓桿不是「找一個更聰明的圖論演算法」,而是把這些經典方法在一顆真實、完整的網表上驗證,並組合出在真實硬體上實際跑最快的搭配。這正是本專案投入的地方,也是它(謙遜而誠實的)價值所在。

One honest caveat about this very conclusion: "the ceiling is reached" is a claim we (and an external LLM we consulted) made about our own engine once before — and then R-1 beat it by +18% by measuring rather than assuming. So read "near the ceiling" as "no known algorithm left to copy from the literature", not "physically impossible to improve". The two are very different.

對這個結論本身的一個誠實但書:「天花板到了」這句話,我們(以及我們諮詢的外部 LLM)曾對自己的引擎講過一次 —— 然後 R-1實測而非假設,把它打破 +18%。所以「接近天花板」要理解成「文獻上沒有現成可抄的新演算法」,而不是「物理上不可能再快」。這兩者差很多。

The two families, in academic terms用學術詞彙講這兩條策略

We use two engineering names; here is what they map to in the literature, so an academic reader can connect them to existing work.

我們用兩個工程名稱;以下是它們在文獻中的對應,讓學術背景的讀者能接回既有研究。

Our name我們的稱呼Academic / EDA equivalent學術 / EDA 對應
lowering
(pre-simulation netlist simplification)(模擬前的網表化簡)
Logic-network structural reduction: constant propagation / folding, dead-gate (constant-driven) elimination, structural hashing (strashing), AIG rewriting & balancing, SAT-sweeping / functional node merging.邏輯網路結構化簡:常數傳播 / 摺疊、死閘(被常數驅動)消除、structural hashing(strashing)、AIG rewriting 與 balancing、SAT-sweeping / 函式等價節點合併。
fast-path
(run-time common-case specialization)(執行期 common-case 特化)
Event-driven common-case specialization: selective-trace, activity-directed simulation, event filtering / activity bypass — recognizing the cheap case (a single-node resolve / a group that won't grow) and skipping the general group-resolution machinery.event-driven common-case 特化:selective-trace、activity-directed simulation、event filtering / activity bypass —— 辨識便宜情況(只解析單節點 / group 不會擴張),跳過通用 group-resolution。

The underlying simulation model is Bryant's switch-level model; the strategy taxonomy (event-driven vs levelized vs oblivious / compiled) is textbook. We borrow compiler ("lowering") and systems ("fast-path") vocabulary — the equivalent EDA techniques are all classic.

底層模擬模型是 Bryant 的開關級模型;策略分類(event-driven 對 levelized 對 oblivious / compiled)是教科書級。我們借用編譯器(lowering)與系統(fast-path)的詞彙 —— 對應的 EDA 技術都是經典。

Where each idea actually peaked — a 40-year timeline每個想法實際在哪一年成熟 —— 40 年時間軸

The telling pattern: the load-bearing algorithmic ideas all date from 1981–2006. After that, the data-structure work (MIG, 2014) is real but doesn't help low-activity simulation, and everything else is engineering.

關鍵在於:撐起這領域的演算法想法,全部出自 1981–2006。之後的資料結構工作(MIG,2014)是真的,但對低-activity 模擬沒幫助,其餘都是工程。

Year年份Milestone里程碑Still the state of the art for us?對我們仍是最先進?
~1981–84Bryant — switch-level model & MOSSIM (the model we simulate)Bryant —— 開關級模型與 MOSSIM(我們模擬的就是這個模型) foundational地基
~1980sUlrich et al. — selective-trace / activity-directed event-driven simulationUlrich 等 —— selective-trace / activity-directed event-driven 模擬 our "fast-path" is this我們的 fast-path 就是這個
1987COSMOS (Bryant et al.) — symbolic switch-level analysis: statically extract channel-connected components (CCC) into Boolean / BDDCOSMOS(Bryant 等)—— 符號化開關級分析:把 channel-connected component(CCC)靜態提取成 Boolean / BDD~ the "static extraction" idea; we tested its modern form (IR/codegen) — slower for us「靜態提取」的想法;我們測過它的現代版(IR/codegen)—— 對我們更慢
2006Berkeley ABC — DAG-aware AIG rewriting (Mishchenko et al.) + SAT-sweepingBerkeley ABC —— DAG-aware AIG rewriting(Mishchenko 等)+ SAT-sweeping still the structural-reduction champion至今仍是結構化簡霸主
2014MIG — Majority-Inverter Graph (Amarù, Gaillardon, De Micheli, EPFL)MIG —— Majority-Inverter Graph(Amarù、Gaillardon、De Micheli,EPFL) great for synthesis/depth; hurts a bidirectional pass-transistor sim對合成/深度很好;對雙向 pass-transistor 模擬反而有害
2005–2025Everything that actually got faster: cache-conscious / data-oriented layout (SoA), flat double-buffered event arrays, SIMD bit-parallel, devirtualization真正變快的全部:cache-conscious / 資料導向佈局(SoA)、扁平雙緩衝事件陣列、SIMD bit-parallel、去虛擬化 engineering, not algorithm工程,不是演算法

What the last 20 years actually delivered (and what's hype)近 20 年真正帶來了什麼(以及什麼是炒作)

Real, effective — but engineering / systems, not new algorithms真實有效 —— 但是工程 / 系統,不是新演算法

Looks clever on paper, loses in practice (for low activity)紙上漂亮、實務上輸(對低 activity)

The important nuance: the field did advance 2005–2025 — but in the directions we deliberately excluded (parallelism, GPU, hardware emulation, compiled cycle-based simulation à la Verilator). Those win for big/high-activity/many-instance workloads. For one low-activity NES on one core, they don't — which is exactly why "no breakthrough" is true for us, not for the field at large.

重要的細微差別:這領域 2005–2025 確實有進步 —— 但都在我們刻意排除的方向(平行、GPU、硬體模擬、Verilator 式編譯 cycle-based 模擬)。那些對大型/高-activity/多實例工作負載會贏。對「單核、單顆、低-activity 的 NES」不會 —— 這正是為什麼「沒有突破」對我們成立,而非對整個領域成立。

How we checked this我們怎麼查證的

This summary isn't one person's opinion. It triangulates three sources: (1) our own ~3-week optimization campaign and its documented dead-end catalogue; (2) two independent queries to a frontier LLM (Gemini 3.1 Pro), framed in strict academic vocabulary and asked to honestly separate "genuinely effective" from "paper-elegant"; and (3) verification of the specific citations the LLM returned.

這份整理不是一個人的意見,而是三方交叉比對:(1) 我們自己約三週的優化實驗與其記錄的死路清單;(2) 兩次獨立詢問前沿 LLM(Gemini 3.1 Pro),用嚴格學術詞彙提問、並要求誠實區分「真正有效」與「紙上漂亮」;(3) 對 LLM 回傳的具體 cite 逐一查證。

On verification — important: the two LLM runs agreed on the conclusion and on the load-bearing citations (MIG 2014, ABC/AIG rewriting 2006, COSMOS 1987, selective-trace). We confirmed those independently. We also caught the LLM returning one citation that does not exist (a plausible-looking "cache-efficient simulator" paper that no database has) — so it was dropped. We only list references we could verify. (This is the same "measure / verify, don't assume" discipline the whole project runs on.)

關於查證 —— 重要:兩次 LLM 回答在結論與關鍵 cite(MIG 2014、ABC/AIG rewriting 2006、COSMOS 1987、selective-trace)上一致,我們也獨立確認了。同時我們也抓到 LLM 回傳了一個並不存在的 cite(一篇看起來很合理、但任何資料庫都查不到的「cache-efficient simulator」論文)—— 所以把它剔除了。本頁只列出我們能查證的參考。(這正是整個專案奉行的「實測 / 查證,不要假設」紀律。)

What this means for AprVisual這對 AprVisual 意味著什麼

It clarifies the project's positioning honestly:

它把專案的定位講清楚了:

Bottom line: short-term, the breakthrough space is essentially tapped; the realistic, valuable work is rigorous engineering validation and finding the best combination on real hardware. A genuine leap would require a different computational model — and we've already tested the obvious candidates (IR/codegen, GPU, parallel) and shown they lose for this low-activity, single-instance regime. So we keep the engine honest, fast, and verifiable, and we keep measuring.

底線:短期內,突破空間基本上見底了;務實而有價值的工作,是嚴謹的工程驗證與在真實硬體上找出最佳搭配。真正的躍進需要換一個計算模型 —— 而我們已經測過明顯的候選(IR/codegen、GPU、平行),證明它們在這個低-activity、單實例情境會輸。所以我們把引擎保持誠實、快速、可驗證,然後持續實測。

Verified references查證過的參考

See also AprVisual's own optimization notes and the simulator comparison for the measured numbers behind the "engineering, not algorithm" claim.

「工程而非演算法」這個說法背後的實測數字,另見 AprVisual 自己的優化筆記與模擬器比較