rich-data-research-page

富数据调研页 Skill

已上架研究与分析Codex 可配置

把多维度公开数据（10 余条实体 × N 维指标）做成一个可比较、可筛选、可分享、可对比的富页面调研，而不是一篇 Markdown。沉淀自 /cancers-overview 与 /ai-token-usage-research 的实际打法。

基础信息

触发场景: 当用户要求做一个"富页面 / 数据可视化 / 专题调研"，需要把多条同类实体（癌症 / 国家 / 行业 / 产品 / 公司）按多维指标对比、筛选和分享时使用。
输入要求: 一份实体清单（5–20 条，同一物种，如「10 种癌症」「10 国教育投入」「8 家云厂商」） / 每条实体的多维指标（至少 3 维：一个主量级 / 一个次量级 / 一个比率类指标） / 可选：可切换的数据口径（如全球 / 中国 / 不同年份） / 可选：每条实体的权重明细（如风险因子、收入构成）
产出格式: 一个独立 Next.js 路由（/<topic>-overview） / 一个数据 module（data.js / data.ts），实体数组 + 元数据常量 + region/version 切换 helper / 一个客户端富页面组件，包含：散点图 + 排行条 + 详情面板 + 对比表，所有筛选 URL 化 / 强免责声明 + 一手数据来源链接（顶部 + 底部）
验收标准: 所有实体一屏可见可比较；散点图清晰分出 4 象限并自带 tooltip；排行条按任意维度排序；至少 ≥3 项可两两对比；筛选状态可通过 URL 完整分享；数据口径标注清楚、不虚构。

内容

16 条富数据页 pattern

v0.3

01
实体 schema：把每条实体压成一个对象，至少含 4 类字段
元信息（id / nameZh / nameEn / color / category）+ 主量级指标（如发病率、营收）+ 次量级指标（如死亡率、利润）+ 比率类指标（如生存率、利润率）+ 多维分布数组（如年龄分布 10 档）+ 加权明细数组（如风险因子带 weight + category）+ 叙述字段（主因、预警、筛查建议）。一条实体一对象，所有视图共用同一份数据。
02
口径切换：每条实体可挂多 region/version 子块
在实体上加 cn: {} 或 v2024: {} 等子对象，存储该口径下需要替换的字段（通常是主/次量级和比率类，明细字段共用）。顶部一个 toggle + 一个 cancerView(c, region) helper 把视图层和数据层解耦。换 region 不动主结构。
03
四象限散点图：一张图把所有实体定位完
X = 主量级（对数刻度 log10，跨数量级时必备）× Y = 比率类指标（0–100% 线性）。气泡半径 = 次量级。气泡颜色 = 类别 / 性别 / 阵营。背景画两条隐线分四象限，每个象限角落写一行小字（如「常见·难治」），让对比一秒成立。pure SVG + viewBox + preserveAspectRatio="xMidYMid meet"，不引图表库。
04
气泡 tooltip：HTML overlay 用百分比定位匹配 SVG viewBox
不要用 SVG <title> 或 foreignObject。chart container 用 relative，tooltip 用 absolute + left/top 百分比（来自 cx/W、cy/H）。这样 SVG 缩放时 tooltip 自动跟随。右半区翻到左侧弹出（leftPct > 65 时 flip）避免出屏。
05
焦点模式：选中后其它实体自动灰化
维护一个 focusIds 数组（单选模式 = [openId]，对比模式 = compareIds）。散点的非 focus 气泡 opacity → 0.18，排行条的非 focus 行 opacity → 0.45。视觉焦点立即出现，不需要额外组件。hover 状态优先级高于 dim。
06
横向排行条：所有实体一屏，按选中维度叠色
每行：序号 + 色块 + 名字 + 性别/类别 chip + 主条（按当前 sort 维度宽度）+ 主数值 + 次维度 chip。按 incidence 排序时，把 mortality 作为同一条上的深色叠加层（width = mortality/incidence × pct），一条条同时读出"多少人得 + 多少人死"。比卡片密度高一个数量级。
07
对比模式：≤3 项 side-by-side 表格
顶部一个「单选 / 对比」mode toggle。对比模式下点击 = 加入 compareIds（上限 3）。详情区换成横向表格：行 = 指标（年新发 / 年死亡 / 5y 生存 / 致死 / 性别 / 主因 / 筛查 / Top 4 风险因子）× 列 = 实体。每列带 × 按钮可移除。tone 着色（绿 / 红）让差异一眼看出。
08
筛选条：搜索 + 类别 pills + 排序 pills
搜索框（同时匹配中英名 + 主因）+ 类别多选 pills（点击切换 activeCategories 数组）+ 排序 pills（4 个，按主/次/比率/反向比率）+ 性别/阵营三档 pills。所有筛选用 useMemo 链式串起来，重置按钮一键归零。不用 <select>，全部用 pill button 视觉一致。
09
URL 状态：所有筛选可分享
mount 时 URL → state（一次性 reading），所有 state 变化时 state → URL（用 history.replaceState 不污染历史）。URL 参数：region / q / gender / sort / cats（逗号分隔）/ mode / compare（逗号分隔）/ open。分享一个具体对比直接复制地址栏即可。
10
免责与数据口径：顶部 + 底部双重保险
顶部 header 里一句话点出来源（带 inline 链接）+ 一句 strong 红字免责（如「不构成医学建议」）。底部 footer 用 disc 列表展开完整口径说明：每个数据字段的来源、年份、计算方式、与其它口径的差异、风险因子权重的局限（如不是 PAF）。一手链接（IARC / SEER / NCC）必须可点。涉及健康/金钱/政策的页面强制要有。
11
复合实体：每条是 X × Y 配对而不是单一物种
当实体是「平台 × 框架」「公司 × 产品」「国家 × 行业」这种二元配对时，schema 用 sideA / sideB 两个字段（如 platform / framework）而不是塞进一个 name。卡片头：色块 + sideA × sideB；散点上文字标 sideA × sideB 中间用 × 隔开。排序、筛选、搜索三处都要让两侧都能被命中。
12
主观打分：必须暴露评分 rubric
指标里有 lock_in / backlash / 整合度 / 影响力这种 0-100 主观分时，强制三件事：(1) 字段名旁边 chip 标「主观打分」与「实测」区分；(2) 顶部或 footer 写明评分 rubric（如「0=完全可移植；100=离不开该平台」）；(3) 数值取整到 5 或 10，不要 73.4% 这种假精确。compare table 里同样标。
13
Status 作为一级筛选 + 默认配色维度
当实体处在不同生命周期阶段（active / historical / forming / neutral / deprecated）时，status 用三件事承载：(1) 顶部一级筛选 chip（和性别同等地位）；(2) 散点气泡的默认颜色编码（而不是次要 category）；(3) compare table 里有专门一行用 tone-colored 单元格高亮。color = category 和 color = status 二选一，不要叠。
14
逐实体核实徽章：不确定的事实不要藏在 footer
当某些实体的关键数据来自传闻 / 内部消息 / 估算（而非官方公布）时，schema 加 verified: boolean。verified=false 的实体卡片右上角显示「估算」「待核实」「rumor」徽章，detail panel 顶部也显示，compare table 的实体列头同样标。不要让读者读完整页才在 footer 发现「这些都是估算」—— 在每个数字旁边就要看到。
15
最近信号 line：时间敏感话题的必备字段
每条实体加一个 latest_signal 字段：一句话带年份的最新动向（如「2025-Q2 Fluid Compute 公布」「2024-09 VoidZero 成立」）。detail panel 在 title 下方显示；compare table 单独一行；事件类、行业演变类、新闻驱动类页面强制要有。让读者知道这页数据"截至什么时候"，单点比统一在 footer 写 update date 强。
16
新事实重塑叙事支点时：从核心 thesis 往下重做，不要只加一条数据
当新事件 / 新事实改变研报的核心叙事结构（不是细节修正，是判断框架被推翻），不能只在数据里 append 一条。判断标准：如果新事实让"先发者是谁 / 谁在防御谁 / 这是什么类型的博弈"任意一条命题反转，就是支点级。处理：① title + thesis + eventBadge 必须重写；② 时间线节点重排（不只是 append）；③ 关键章节（动机 / 影响 / AI 分析）需要重新论证因果链；④ 数据加一条同时要看是不是引入了新 entity type，如果是就加 pairType / category 字段并相应着色；⑤ 所有入口文案、SHARE_COPY、OG image 一次性同步。沉淀自 /platform-framework-pairs 从「双巨头」改写成「三极割据」的教训：漏报 Anthropic × Bun 这件事不是 +1，是把整个 thesis 推翻了。

配置到本地 Codex / 给智能体阅读 / 分享给同事

下载 / 复制 Skill 文件

~/.codex/skills/rich-data-research-page

SKILL.md

---
name: rich-data-research-page
description: Use when the user asks to build a rich data research page or topic dashboard — a single interactive page that compares 5–20 same-kind entities (cancers, countries, companies, products) across multiple metrics with scatter chart, ranking bars, filter, focus, compare, and shareable URL state. Distilled from /cancers-overview and /ai-token-usage-research on 2aran.com.
---

# Rich Data Research Page Skill

Use this skill when the user wants a topic-research deliverable that is **not a Markdown article** but a **single interactive page** showing multi-dimensional data for a set of comparable entities.

Wrong fit: a 5000-word write-up. Right fit: 10 cancers × 6 metrics, 8 cloud providers × pricing tiers, 12 countries × education spending, 15 LLMs × benchmark scores.

## When to use

Trigger when the user says any of:

- "做一个富页面 / 数据可视化 / 专题调研"
- "把这些 X 做成一个能筛选 / 对比 / 分享的页面"
- "我希望可视化做的美观，有各种数据支撑，关键成因分析…，最好还能筛选，可以分享"

Skip if: the user wants a single timeline, a long-form essay, a chat tool, or a personal log.

## Inputs you must collect

1. The entity list — 5 to 20 of the same kind (cancers, countries, companies, products, AI models, etc.).
2. At least three metrics per entity:
   - **Primary magnitude** (e.g., annual incidence, revenue, GDP) — used for X axis and main bar
   - **Secondary magnitude** (e.g., mortality, cost) — used for bubble radius and overlay bar
   - **Rate / share** (e.g., 5-year survival %, margin %) — used for Y axis and tone coloring
3. Optional but high-value:
   - Multi-region / multi-version variants (e.g., global vs China, 2022 vs 2023)
   - Multi-bucket distribution (e.g., 10 age buckets, 6 revenue source breakdowns)
   - Weighted breakdown items with category tags (e.g., risk factors with category: lifestyle/genetic/virus/…)
   - Narrative fields (warnings, screening guidance, primary cause)
4. Disclaimers and authoritative sources — at minimum 2 first-party data citations.

If the user provides incomplete data, ask precisely which metric is missing rather than guessing.

## File layout (Next.js App Router)

```
app/(site)/<topic>-overview/
  page.jsx                  # static metadata + dynamic = 'force-static'
  data.js                   # entities array + helpers + sources
  <Topic>OverviewClient.jsx # 'use client' rich page
```

Wire it into siteNav under "实验室" with a New tag.

## The 10 patterns

### 1. Entity schema

Every entity is one object with these field groups:

```js
{
  id, nameZh, nameEn, color, category,            // meta
  incidence, mortality, survival5y,                // primary / secondary / rate
  genderRatio: { male, female },                   // share split
  ageDistribution: [10 numbers summing ≈ 100],     // multi-bucket
  riskFactors: [{ name, weight, category }],       // weighted breakdown
  warnings: [...], screening: '...',               // narrative
  cn: { incidence, mortality, survival5y },        // optional region variant
}
```

One entity = one object. All views share the same data.

### 2. Region / version toggle

Put variant blocks (`cn`, `v2024`, …) on each entity. Add a top-level toggle and a view helper:

```js
export function cancerView(c, region = 'global') {
  if (region === 'cn' && c.cn) return { ...c, ...c.cn }
  return c
}
```

Switching region must not touch chart structure — only the swapped fields change.

### 3. Quadrant scatter (the money shot)

Pure SVG, no chart library. X = log10(primary), Y = rate (0–100% inverted), r = secondary scaled to 6–24px, fill = category color, stroke darker on focus.

Background: two faint rectangles split by the median Y, with corner labels like "常见·难治" / "常见·可控" / "少见·难治" / "少见·可控". This makes the four quadrants legible at a glance.

```jsx
const xPos = (v) => padL + ((Math.log10(v) - xMin) / (xMax - xMin)) * innerW
const yPos = (v) => padT + (1 - v / 100) * innerH
```

Use `viewBox` + `preserveAspectRatio="xMidYMid meet"` — never set width/height in px. Adjust xMin/xMax when region changes so all bubbles spread out.

### 4. Tooltip overlay

Don't use SVG `<title>` or `<foreignObject>`. Wrap the SVG in a `relative` container, render an absolutely-positioned HTML div whose `left` / `top` are computed from bubble coords as **percentages of the chart viewBox**. This survives SVG scaling.

Flip right-side tooltips to the left when `leftPct > 65` so they don't overflow.

### 5. Focus dimming

Maintain one `focusIds` array (= [openId] in single mode, = compareIds in compare mode). Non-focused bubbles → `opacity 0.18`. Non-focused ranking rows → `opacity 0.45`. Hover state always wins.

This removes the need for a separate "highlight" mechanism — the eye is led automatically.

### 6. Horizontal ranking bars

Below the scatter. All entities, one row each, in selected sort order.

Per row: index + color square + name + gender/category chip + main bar + optional darker overlay (secondary metric) + main number + 2 mini chips (rate + reverse rate).

When sort = incidence, overlay mortality on the same row at `width = mortality/incidence × pct`. The viewer reads both magnitudes simultaneously.

### 7. Compare mode

Top-level toggle: 单选详情 / 两两对比. In compare mode, clicking a row/bubble toggles inclusion in `compareIds` (cap 3).

Replace the single detail panel with a horizontal table:
- columns = entities
- rows = metric labels (年新发 / 年死亡 / 5y / 致死 / 性别 / 主因 / 筛查 / Top 4 风险因子)
- per cell tone color (good / bad) by threshold
- each column header has × to remove

This is the most-requested feature on data dashboards. Don't ship the page without it.

### 8. Filter strip

- Search input — match nameZh + nameEn + primary cause
- Category pills (multi-select, click to toggle)
- Sort pills (4 options: primary / secondary / rate / inverse rate)
- Gender / camp toggle (3 options: all / A-dominant / B-dominant)
- Reset button only appears when any filter is active

All pills, no `<select>` — consistent rhythm.

### 9. URL state

On mount: read URL → setState (one-shot). On state change: state → URL via `history.replaceState` (so back button isn't polluted).

Parameters: `region`, `q`, `gender`, `sort`, `cats` (comma), `mode`, `compare` (comma), `open`. Default values are omitted from URL to keep it short.

### 10. Disclaimers + sources, top and bottom

**Top**: one sentence in the header naming the data sources with inline links + one short strong-red disclaimer ("不构成医学建议" / "不构成投资建议" / "数据为公开估算").

**Bottom**: a disc list expanding every source, year, scope, methodology caveat, and the limits of each metric (e.g., risk factor weights are illustrative, not PAF). First-party links must be clickable.

For health / money / policy topics, this is non-negotiable.

## Reuse checklist for a new topic

When building `/<topic>-overview` for a new dataset:

1. Pick a topic with 5–20 same-kind entities and 3+ comparable metrics.
2. Source the data — at least 2 first-party links (org statistics, regulator reports, peer-reviewed papers, official company filings).
3. Build `data.js` with the entity schema above. Add a region/version variant if the data has multiple credible cuts.
4. Copy the structure of `/cancers-overview/CancersOverviewClient.jsx` and rename:
   - Replace metric field names in the accessor functions (incidence → revenue, survival5y → margin, …).
   - Adjust scatter X axis log range to fit the data spread.
   - Update tone thresholds (good / bad) per metric.
   - Rewrite the four-quadrant corner labels for the new domain.
   - Rewrite filter copy and source links.
5. Update siteNav with the new route under "实验室", `tag: 'New'`.
6. Verify URL state round-trips for every filter combination.
7. Verify mobile: SVG should scale; ranking bars should remain readable; compare table should horizontally scroll.

## Output constraints

- Never invent statistics. If a number is uncertain, use the most reputable source available and cite it inline.
- Never imply medical / financial / legal advice. Frame everything as data visualization of public sources.
- Do not introduce a chart library (recharts, nivo, chart.js). The cost in bundle size and styling rigidity is not worth it — pure SVG covers every pattern here.
- Do not split into multiple routes per entity. The strength is one-page comparison.
- Do not skip the compare mode or the share URL — they are the two highest-ROI features.

### 11. Compound entities — X × Y pairs, not single objects

When each row is a pairing — platform × framework, company × product, country × sector — split into `sideA` / `sideB` fields instead of a single `name`. Card header shows both with a × separator. Search must hit both sides. Sort options may target either side.

### 12. Subjective scoring — expose the rubric

When metrics include 0–100 subjective scores (lock-in, backlash, integration quality), three rules:

1. Label the field with a "subjective" chip distinct from measured metrics.
2. State the rubric somewhere visible: "0 = fully portable; 100 = cannot leave platform".
3. Round to nearest 5 or 10 — never show 73.4%. Fake precision is worse than honest estimate.

### 13. Status as first-class filter + default color

When entities sit in different lifecycle states (active / historical / forming / neutral / deprecated), status takes over:

- Top-level filter chip, on par with the gender / camp toggle.
- Default color encoding for bubbles in the scatter (instead of category).
- Its own row in the compare table with tone-colored cells.

Don't stack `color = category` and `color = status` — pick one.

### 14. Per-entity verification badge

When some entity facts come from rumors, leaks, or estimates (vs. confirmed sources), add `verified: boolean`. For `verified=false`:

- Show a "estimate" / "unverified" / "rumor" badge top-right of the card.
- Show the same badge at the top of the detail panel.
- Surface in the compare table column header.

Don't bury this in the footer. The reader needs to know which numbers are solid before they read the numbers.

### 15. Latest-signal line

Every entity carries a `latest_signal`: one sentence with a year/quarter naming the most recent state change ("2025-Q2 Fluid Compute announced", "2024-09 VoidZero founded").

- Detail panel: under the title.
- Compare table: dedicated row.
- Required for event-driven or rapidly evolving topics.

A single global "last updated" date in the footer is not enough when entities evolve at different rates.

### 16. New facts that reshape the thesis: rewrite top-down, don't just append a row

When a new event or fact changes the **core narrative spine** of the research (not a detail correction — a verdict shift), do NOT just append one more entity to the data array.

Test for thesis-level disruption: if the new fact flips ANY of these claims, you're at a thesis-level change, not an entity-level one:
- who moved first
- who is defending against whom
- what type of game this even is

When the test fires:

1. **Rewrite** title + thesis + eventBadge. The old framing was wrong; band-aiding it confuses readers.
2. **Rebuild** the signal timeline. Don't append the new event at the end — re-order so the new pivot reads correctly.
3. **Re-argue** the load-bearing sections (motivation / impact / AI analysis). The causal chain you wrote before assumed the old framing; check each step.
4. **Check if a new entity type appeared.** A new "kind" of entity (e.g. AI-company × runtime is not the same as deployment-platform × framework) means a new `pairType` / `category` field is needed, with its own color and filter chip.
5. **Sync every surface** in one commit: SHARE_COPY (title / lead / full), OG image, page header eyebrow, hero strip, siteNav label, works summary. Mixed-framing pages — half old narrative, half new — are worse than either consistent version.

Distilled from /platform-framework-pairs rewriting from "双巨头割据" to "三极割据" after surfacing Anthropic × Bun (2025-12-02). Adding Bun wasn't +1; it inverted the question from "which deployment platform owns which framework" to "AI companies are bypassing the deployment-platform layer entirely". The old framing positioned Cloudflare as "对冲 Vercel"; the new one positions it as "回应 Anthropic". Every load-bearing sentence had to be checked.

## Version Log

- v0.3（2026-06-05）：加入 #16 新事实重塑叙事支点时要从核心 thesis 往下重做。沉淀自 /platform-framework-pairs 从「双巨头」改写成「三极割据」的实战教训。
- v0.2（2026-06-04）：加入 5 条新 pattern（11–15），覆盖复合实体、主观打分透明、Status 一级筛选、逐实体核实徽章、最近信号 line。沉淀自 /platform-framework-pairs 的建设过程。
- v0.1（2026-06-04）：初版，从 /cancers-overview 提炼 10 条 pattern 与完整施工清单。

agents/openai.yaml

interface:
  display_name: "Rich Data Research Page"
  short_description: "把多维度数据做成可比较 / 筛选 / 对比 / 分享的富页面"
  default_prompt: "Use $rich-data-research-page to build an interactive research dashboard for the given entities."

policy:
  allow_implicit_invocation: true

← 返回 Skill 中心

--- name: rich-data-research-page description: Use when the user asks to build a rich data research page or topic dashboard — a single interactive page that compares 5–20 same-kind entities (cancers, countries, companies, products) across multiple metrics with scatter chart, ranking bars, filter, focus, compare, and shareable URL state. Distilled from /cancers-overview and /ai-token-usage-research on 2aran.com. --- # Rich Data Research Page Skill Use this skill when the user wants a topic-research deliverable that is **not a Markdown article** but a **single interactive page** showing multi-dimensional data for a set of comparable entities. Wrong fit: a 5000-word write-up. Right fit: 10 cancers × 6 metrics, 8 cloud providers × pricing tiers, 12 countries × education spending, 15 LLMs × benchmark scores. ## When to use Trigger when the user says any of: - "做一个富页面 / 数据可视化 / 专题调研" - "把这些 X 做成一个能筛选 / 对比 / 分享的页面" - "我希望可视化做的美观，有各种数据支撑，关键成因分析…，最好还能筛选，可以分享" Skip if: the user wants a single timeline, a long-form essay, a chat tool, or a personal log. ## Inputs you must collect 1. The entity list — 5 to 20 of the same kind (cancers, countries, companies, products, AI models, etc.). 2. At least three metrics per entity: - **Primary magnitude** (e.g., annual incidence, revenue, GDP) — used for X axis and main bar - **Secondary magnitude** (e.g., mortality, cost) — used for bubble radius and overlay bar - **Rate / share** (e.g., 5-year survival %, margin %) — used for Y axis and tone coloring 3. Optional but high-value: - Multi-region / multi-version variants (e.g., global vs China, 2022 vs 2023) - Multi-bucket distribution (e.g., 10 age buckets, 6 revenue source breakdowns) - Weighted breakdown items with category tags (e.g., risk factors with category: lifestyle/genetic/virus/…) - Narrative fields (warnings, screening guidance, primary cause) 4. Disclaimers and authoritative sources — at minimum 2 first-party data citations. If the user provides incomplete data, ask precisely which metric is missing rather than guessing. ## File layout (Next.js App Router) ``` app/(site)/<topic>-overview/ page.jsx # static metadata + dynamic = 'force-static' data.js # entities array + helpers + sources <Topic>OverviewClient.jsx # 'use client' rich page ``` Wire it into siteNav under "实验室" with a New tag. ## The 10 patterns ### 1. Entity schema Every entity is one object with these field groups: ```js { id, nameZh, nameEn, color, category, // meta incidence, mortality, survival5y, // primary / secondary / rate genderRatio: { male, female }, // share split ageDistribution: [10 numbers summing ≈ 100], // multi-bucket riskFactors: [{ name, weight, category }], // weighted breakdown warnings: [...], screening: '...', // narrative cn: { incidence, mortality, survival5y }, // optional region variant } ``` One entity = one object. All views share the same data. ### 2. Region / version toggle Put variant blocks (`cn`, `v2024`, …) on each entity. Add a top-level toggle and a view helper: ```js export function cancerView(c, region = 'global') { if (region === 'cn' && c.cn) return { ...c, ...c.cn } return c } ``` Switching region must not touch chart structure — only the swapped fields change. ### 3. Quadrant scatter (the money shot) Pure SVG, no chart library. X = log10(primary), Y = rate (0–100% inverted), r = secondary scaled to 6–24px, fill = category color, stroke darker on focus. Background: two faint rectangles split by the median Y, with corner labels like "常见·难治" / "常见·可控" / "少见·难治" / "少见·可控". This makes the four quadrants legible at a glance. ```jsx const xPos = (v) => padL + ((Math.log10(v) - xMin) / (xMax - xMin)) * innerW const yPos = (v) => padT + (1 - v / 100) * innerH ``` Use `viewBox` + `preserveAspectRatio="xMidYMid meet"` — never set width/height in px. Adjust xMin/xMax when region changes so all bubbles spread out. ### 4. Tooltip overlay Don't use SVG `<title>` or `<foreignObject>`. Wrap the SVG in a `relative` container, render an absolutely-positioned HTML div whose `left` / `top` are computed from bubble coords as **percentages of the chart viewBox**. This survives SVG scaling. Flip right-side tooltips to the left when `leftPct > 65` so they don't overflow. ### 5. Focus dimming Maintain one `focusIds` array (= [openId] in single mode, = compareIds in compare mode). Non-focused bubbles → `opacity 0.18`. Non-focused ranking rows → `opacity 0.45`. Hover state always wins. This removes the need for a separate "highlight" mechanism — the eye is led automatically. ### 6. Horizontal ranking bars Below the scatter. All entities, one row each, in selected sort order. Per row: index + color square + name + gender/category chip + main bar + optional darker overlay (secondary metric) + main number + 2 mini chips (rate + reverse rate). When sort = incidence, overlay mortality on the same row at `width = mortality/incidence × pct`. The viewer reads both magnitudes simultaneously. ### 7. Compare mode Top-level toggle: 单选详情 / 两两对比. In compare mode, clicking a row/bubble toggles inclusion in `compareIds` (cap 3). Replace the single detail panel with a horizontal table: - columns = entities - rows = metric labels (年新发 / 年死亡 / 5y / 致死 / 性别 / 主因 / 筛查 / Top 4 风险因子) - per cell tone color (good / bad) by threshold - each column header has × to remove This is the most-requested feature on data dashboards. Don't ship the page without it. ### 8. Filter strip - Search input — match nameZh + nameEn + primary cause - Category pills (multi-select, click to toggle) - Sort pills (4 options: primary / secondary / rate / inverse rate) - Gender / camp toggle (3 options: all / A-dominant / B-dominant) - Reset button only appears when any filter is active All pills, no `<select>` — consistent rhythm. ### 9. URL state On mount: read URL → setState (one-shot). On state change: state → URL via `history.replaceState` (so back button isn't polluted). Parameters: `region`, `q`, `gender`, `sort`, `cats` (comma), `mode`, `compare` (comma), `open`. Default values are omitted from URL to keep it short. ### 10. Disclaimers + sources, top and bottom **Top**: one sentence in the header naming the data sources with inline links + one short strong-red disclaimer ("不构成医学建议" / "不构成投资建议" / "数据为公开估算"). **Bottom**: a disc list expanding every source, year, scope, methodology caveat, and the limits of each metric (e.g., risk factor weights are illustrative, not PAF). First-party links must be clickable. For health / money / policy topics, this is non-negotiable. ## Reuse checklist for a new topic When building `/<topic>-overview` for a new dataset: 1. Pick a topic with 5–20 same-kind entities and 3+ comparable metrics. 2. Source the data — at least 2 first-party links (org statistics, regulator reports, peer-reviewed papers, official company filings). 3. Build `data.js` with the entity schema above. Add a region/version variant if the data has multiple credible cuts. 4. Copy the structure of `/cancers-overview/CancersOverviewClient.jsx` and rename: - Replace metric field names in the accessor functions (incidence → revenue, survival5y → margin, …). - Adjust scatter X axis log range to fit the data spread. - Update tone thresholds (good / bad) per metric. - Rewrite the four-quadrant corner labels for the new domain. - Rewrite filter copy and source links. 5. Update siteNav with the new route under "实验室", `tag: 'New'`. 6. Verify URL state round-trips for every filter combination. 7. Verify mobile: SVG should scale; ranking bars should remain readable; compare table should horizontally scroll. ## Output constraints - Never invent statistics. If a number is uncertain, use the most reputable source available and cite it inline. - Never imply medical / financial / legal advice. Frame everything as data visualization of public sources. - Do not introduce a chart library (recharts, nivo, chart.js). The cost in bundle size and styling rigidity is not worth it — pure SVG covers every pattern here. - Do not split into multiple routes per entity. The strength is one-page comparison. - Do not skip the compare mode or the share URL — they are the two highest-ROI features. ### 11. Compound entities — X × Y pairs, not single objects When each row is a pairing — platform × framework, company × product, country × sector — split into `sideA` / `sideB` fields instead of a single `name`. Card header shows both with a × separator. Search must hit both sides. Sort options may target either side. ### 12. Subjective scoring — expose the rubric When metrics include 0–100 subjective scores (lock-in, backlash, integration quality), three rules: 1. Label the field with a "subjective" chip distinct from measured metrics. 2. State the rubric somewhere visible: "0 = fully portable; 100 = cannot leave platform". 3. Round to nearest 5 or 10 — never show 73.4%. Fake precision is worse than honest estimate. ### 13. Status as first-class filter + default color When entities sit in different lifecycle states (active / historical / forming / neutral / deprecated), status takes over: - Top-level filter chip, on par with the gender / camp toggle. - Default color encoding for bubbles in the scatter (instead of category). - Its own row in the compare table with tone-colored cells. Don't stack `color = category` and `color = status` — pick one. ### 14. Per-entity verification badge When some entity facts come from rumors, leaks, or estimates (vs. confirmed sources), add `verified: boolean`. For `verified=false`: - Show a "estimate" / "unverified" / "rumor" badge top-right of the card. - Show the same badge at the top of the detail panel. - Surface in the compare table column header. Don't bury this in the footer. The reader needs to know which numbers are solid before they read the numbers. ### 15. Latest-signal line Every entity carries a `latest_signal`: one sentence with a year/quarter naming the most recent state change ("2025-Q2 Fluid Compute announced", "2024-09 VoidZero founded"). - Detail panel: under the title. - Compare table: dedicated row. - Required for event-driven or rapidly evolving topics. A single global "last updated" date in the footer is not enough when entities evolve at different rates. ### 16. New facts that reshape the thesis: rewrite top-down, don't just append a row When a new event or fact changes the **core narrative spine** of the research (not a detail correction — a verdict shift), do NOT just append one more entity to the data array. Test for thesis-level disruption: if the new fact flips ANY of these claims, you're at a thesis-level change, not an entity-level one: - who moved first - who is defending against whom - what type of game this even is When the test fires: 1. **Rewrite** title + thesis + eventBadge. The old framing was wrong; band-aiding it confuses readers. 2. **Rebuild** the signal timeline. Don't append the new event at the end — re-order so the new pivot reads correctly. 3. **Re-argue** the load-bearing sections (motivation / impact / AI analysis). The causal chain you wrote before assumed the old framing; check each step. 4. **Check if a new entity type appeared.** A new "kind" of entity (e.g. AI-company × runtime is not the same as deployment-platform × framework) means a new `pairType` / `category` field is needed, with its own color and filter chip. 5. **Sync every surface** in one commit: SHARE_COPY (title / lead / full), OG image, page header eyebrow, hero strip, siteNav label, works summary. Mixed-framing pages — half old narrative, half new — are worse than either consistent version. Distilled from /platform-framework-pairs rewriting from "双巨头割据" to "三极割据" after surfacing Anthropic × Bun (2025-12-02). Adding Bun wasn't +1; it inverted the question from "which deployment platform owns which framework" to "AI companies are bypassing the deployment-platform layer entirely". The old framing positioned Cloudflare as "对冲 Vercel"; the new one positions it as "回应 Anthropic". Every load-bearing sentence had to be checked. ## Version Log - v0.3（2026-06-05）：加入 #16 新事实重塑叙事支点时要从核心 thesis 往下重做。沉淀自 /platform-framework-pairs 从「双巨头」改写成「三极割据」的实战教训。 - v0.2（2026-06-04）：加入 5 条新 pattern（11–15），覆盖复合实体、主观打分透明、Status 一级筛选、逐实体核实徽章、最近信号 line。沉淀自 /platform-framework-pairs 的建设过程。 - v0.1（2026-06-04）：初版，从 /cancers-overview 提炼 10 条 pattern 与完整施工清单。

interface: display_name: "Rich Data Research Page" short_description: "把多维度数据做成可比较 / 筛选 / 对比 / 分享的富页面" default_prompt: "Use $rich-data-research-page to build an interactive research dashboard for the given entities." policy: allow_implicit_invocation: true

16 条富数据页 pattern

实体 schema：把每条实体压成一个对象，至少含 4 类字段

口径切换：每条实体可挂多 region/version 子块

四象限散点图：一张图把所有实体定位完

气泡 tooltip：HTML overlay 用百分比定位匹配 SVG viewBox

焦点模式：选中后其它实体自动灰化

横向排行条：所有实体一屏，按选中维度叠色

对比模式：≤3 项 side-by-side 表格

筛选条：搜索 + 类别 pills + 排序 pills

URL 状态：所有筛选可分享

免责与数据口径：顶部 + 底部双重保险

复合实体：每条是 X × Y 配对而不是单一物种

主观打分：必须暴露评分 rubric

Status 作为一级筛选 + 默认配色维度

逐实体核实徽章：不确定的事实不要藏在 footer

最近信号 line：时间敏感话题的必备字段

新事实重塑叙事支点时：从核心 thesis 往下重做，不要只加一条数据

下载 / 复制 Skill 文件

SKILL.md

agents/openai.yaml

16 条富数据页 pattern

实体 schema：把每条实体压成一个对象，至少含 4 类字段

口径切换：每条实体可挂多 region/version 子块

四象限散点图：一张图把所有实体定位完

气泡 tooltip：HTML overlay 用百分比定位匹配 SVG viewBox

焦点模式：选中后其它实体自动灰化

横向排行条：所有实体一屏，按选中维度叠色

对比模式：≤3 项 side-by-side 表格

筛选条：搜索 + 类别 pills + 排序 pills

URL 状态：所有筛选可分享

免责与数据口径：顶部 + 底部双重保险

复合实体：每条是 X × Y 配对而不是单一物种

主观打分：必须暴露评分 rubric

Status 作为一级筛选 + 默认配色维度

逐实体核实徽章：不确定的事实不要藏在 footer

最近信号 line：时间敏感话题的必备字段

新事实重塑叙事支点时：从核心 thesis 往下重做，不要只加一条数据

下载 / 复制 Skill 文件

SKILL.md

agents/openai.yaml