Evidence-ledger draft

RoPE × Implicit Positional Bias: An Evidence Ledger on Interaction Effects — Claim ledger

CSV-backed claim ledger tying paper claims to paper IDs and evidence status.

paper_idclaimclaim_statusevidence_statussource_depthsource_quotepage_or_sectiontaxonomy_fitaudit_status
2603.17771v2The abstract reports: Attention sinks and massive activations are recurring and closely related phenomena in Transformer models.preliminary-linkedhas evidence rowabstractAttention sinks and massive activations are recurring and closely related phenomena in Transformer models.abstractin-scope: taxonomy category matchneeds work; preliminary / abstract-derived; report=audit_report.md
2504.20966v4The abstract reports: We introduce softpick, a rectified, not sum-to-one, drop-in replacement for softmax in transformer attention mechanisms that eliminates attention sink and massive activations.preliminary-linkedhas evidence rowabstractWe introduce softpick, a rectified, not sum-to-one, drop-in replacement for softmax in transformer attention mechanisms that eliminates attention sink and massive activations.abstractin-scope: taxonomy category matchneeds work; preliminary / abstract-derived; report=audit_report.md
2605.00968v1The abstract reports: Positional encoding plays a pivotal role in determin?ing the extrapolation and generalization performance of wireless foundation models for channel state information (CSI) modeling, latent characterization, and task-specific prediction.preliminary-linkedhas evidence rowabstractPositional encoding plays a pivotal role in determin?ing the extrapolation and generalization performance of wireless foundation models for channel state information (CSI) modeling, latent characterization, and task-specific prediction.abstractin-scope: taxonomy category matchneeds work; preliminary / abstract-derived; report=audit_report.md
2402.17762v2The abstract reports: We observe an empirical phenomenon in Large Language Models (LLMs) -- very few activations exhibit significantly larger values than others (e.g., 100,000 times larger).preliminary-linkedhas evidence rowabstractWe observe an empirical phenomenon in Large Language Models (LLMs) -- very few activations exhibit significantly larger values than others (e.g., 100,000 times larger).abstractin-scope: taxonomy category matchneeds work; preliminary / abstract-derived; report=audit_report.md
2509.05218v2The abstract reports: Positional encoding mechanisms enable Transformers to model sequential structure and long-range dependencies in text.preliminary-linkedhas evidence rowabstractPositional encoding mechanisms enable Transformers to model sequential structure and long-range dependencies in text.abstractin-scope: taxonomy category matchneeds work; preliminary / abstract-derived; report=audit_report.md