<!DOCTYPE html>
<html lang="zh-CN">
<head>
<meta charset="UTF-8">
<meta name="viewport" content="width=device-width, initial-scale=1.0">
<title>因果格拉斯曼序列建模架构</title>
<link href="https://fonts.googleapis.com/css2?family=Noto+Sans+SC:wght@300;400;500;700;900&family=Roboto:wght@400;700&display=swap" rel="stylesheet">
<link href="https://fonts.googleapis.com/icon?family=Material+Icons" rel="stylesheet">
<style>
:root {
--primary-color: #1E3A8A; /* Deep Blue */
--secondary-color: #3B82F6; /* Bright Blue */
--accent-color: #60A5FA; /* Light Blue */
--bg-color: #F0F4F8;
--card-bg: #FFFFFF;
--text-main: #1F2937;
--text-secondary: #4B5563;
--gradient-main: linear-gradient(135deg, #1E3A8A 0%, #2563EB 100%);
}
* {
margin: 0;
padding: 0;
box-sizing: border-box;
}
body {
font-family: 'Noto Sans SC', 'Roboto', sans-serif;
background-color: #E2E8F0;
display: flex;
justify-content: center;
color: var(--text-main);
line-height: 1.5;
}
.poster-container {
width: 720px;
min-height: 960px;
background-color: var(--bg-color);
position: relative;
overflow: hidden;
box-shadow: 0 10px 25px rgba(0,0,0,0.1);
}
/* Geometric Background Decoration */
.bg-shape {
position: absolute;
border-radius: 50%;
opacity: 0.1;
z-index: 0;
}
.shape-1 {
width: 400px;
height: 400px;
background: var(--secondary-color);
top: -100px;
right: -100px;
}
.shape-2 {
width: 300px;
height: 300px;
background: var(--primary-color);
bottom: -50px;
left: -50px;
}
/* Header */
header {
background: var(--gradient-main);
color: white;
padding: 40px 30px;
position: relative;
z-index: 1;
clip-path: polygon(0 0, 100% 0, 100% 85%, 0 100%);
}
h1 {
font-size: 36px;
font-weight: 900;
margin-bottom: 10px;
letter-spacing: -0.5px;
}
h2 {
font-size: 20px;
font-weight: 400;
opacity: 0.9;
margin-bottom: 15px;
}
.citation {
font-size: 12px;
font-family: 'Roboto', sans-serif;
background: rgba(255,255,255,0.2);
padding: 4px 8px;
border-radius: 4px;
display: inline-block;
}
/* Main Content */
.content {
padding: 20px 30px 40px;
position: relative;
z-index: 1;
display: flex;
flex-direction: column;
gap: 20px;
}
.card {
background: var(--card-bg);
border-radius: 12px;
padding: 20px;
box-shadow: 0 4px 6px rgba(0,0,0,0.05);
border-left: 4px solid var(--secondary-color);
}
.card-title {
font-size: 18px;
font-weight: 700;
color: var(--primary-color);
margin-bottom: 12px;
display: flex;
align-items: center;
gap: 8px;
}
.card-title i {
font-size: 20px;
}
p {
font-size: 14px;
color: var(--text-secondary);
text-align: justify;
margin-bottom: 10px;
}
/* Problem vs Solution Comparison */
.comparison-grid {
display: grid;
grid-template-columns: 1fr 1fr;
gap: 15px;
margin-top: 10px;
}
.comp-item {
background: #F8FAFC;
padding: 12px;
border-radius: 8px;
border: 1px solid #E2E8F0;
}
.comp-header {
font-weight: 700;
font-size: 14px;
margin-bottom: 5px;
display: flex;
justify-content: space-between;
}
.comp-desc {
font-size: 12px;
}
/* Architecture Flow */
.flow-container {
display: flex;
align-items: center;
justify-content: space-between;
background: #EFF6FF;
padding: 15px;
border-radius: 10px;
margin-top: 5px;
}
.flow-step {
display: flex;
flex-direction: column;
align-items: center;
text-align: center;
flex: 1;
position: relative;
}
.step-icon {
width: 36px;
height: 36px;
background: var(--primary-color);
color: white;
border-radius: 50%;
display: flex;
align-items: center;
justify-content: center;
font-size: 18px;
margin-bottom: 5px;
z-index: 2;
}
.step-text {
font-size: 12px;
font-weight: 600;
color: var(--primary-color);
}
.step-sub {
font-size: 10px;
color: var(--text-secondary);
}
.arrow {
color: var(--secondary-color);
font-size: 20px;
flex: 0 0 20px;
text-align: center;
}
/* Mathematical Highlight */
.math-box {
background: #1E3A8A;
color: white;
padding: 10px 15px;
border-radius: 6px;
font-family: 'Roboto', sans-serif;
font-style: italic;
text-align: center;
margin: 10px 0;
font-size: 13px;
}
/* Results Grid */
.results-grid {
display: grid;
grid-template-columns: 1fr 1fr;
gap: 15px;
margin-top: 10px;
}
.result-item {
text-align: center;
background: #F0FDF4;
border: 1px solid #BBF7D0;
padding: 10px;
border-radius: 8px;
}
.result-value {
font-size: 20px;
font-weight: 700;
color: #166534;
margin-bottom: 2px;
}
.result-label {
font-size: 11px;
color: #15803D;
text-transform: uppercase;
letter-spacing: 0.5px;
}
/* Features List */
.feature-list {
list-style: none;
}
.feature-list li {
display: flex;
align-items: flex-start;
margin-bottom: 8px;
font-size: 13px;
}
.feature-list li i {
color: var(--secondary-color);
font-size: 16px;
margin-right: 8px;
margin-top: 2px;
}
/* Footer */
footer {
background: white;
padding: 20px;
text-align: center;
font-size: 10px;
color: #9CA3AF;
border-top: 1px solid #E5E7EB;
}
/* Decorative Elements */
.math-symbol {
position: absolute;
font-family: 'Times New Roman', serif;
font-style: italic;
color: rgba(30, 58, 138, 0.05);
font-weight: bold;
pointer-events: none;
}
</style>
</head>
<body>
<div class="poster-container">
<!-- Decorative Background Shapes -->
<div class="bg-shape shape-1"></div>
<div class="bg-shape shape-2"></div>
<div class="math-symbol" style="top: 120px; left: 20px; font-size: 60px;">Gr(2,r)</div>
<div class="math-symbol" style="bottom: 150px; right: 30px; font-size: 80px;">⊗</div>
<header>
<h1>因果格拉斯曼序列建模</h1>
<h2>Causal Grassmann Sequence Modeling</h2>
<p style="color: rgba(255,255,255,0.8); margin-bottom: 15px; max-width: 600px;">
挑战自注意力机制:通过几何流形构建更具可解释性且高效的深度学习框架。
</p>
<div class="citation">Source: Attention Is Not What You Need (arXiv:2512.19428)</div>
</header>
<div class="content">
<!-- Problem vs Solution -->
<div class="card">
<div class="card-title">
<i class="material-icons">psychology_alt</i>
核心变革:从张量到流形
</div>
<p>传统Transformer的"不可解释性"源于其复杂的张量提升过程(Tensor Lifting)。新架构将词元状态视为低维流形上的几何对象。</p>
<div class="comparison-grid">
<div class="comp-item">
<div class="comp-header">
<span style="color: #6B7280;">传统 Attention</span>
<i class="material-icons" style="font-size: 14px; color: #6B7280;">grid_on</i>
</div>
<div class="comp-desc">
• 二次方复杂度 O(L²)<br>
• 成对交互过于密集<br>
• 难以追踪数学规律
</div>
</div>
<div class="comp-item" style="border-color: var(--secondary-color); background: #EFF6FF;">
<div class="comp-header">
<span style="color: var(--primary-color);">Grassmann Flow</span>
<i class="material-icons" style="font-size: 14px; color: var(--primary-color);">waves</i>
</div>
<div class="comp-desc">
• 线性复杂度 O(L)<br>
• 局部几何流形映射<br>
• 显式几何不变性
</div>
</div>
</div>
</div>
<!-- Architecture Detail -->
<div class="card">
<div class="card-title">
<i class="material-icons">architecture</i>
架构原理:格拉斯曼混合层
</div>
<p>通过普吕克坐标(Plücker coordinates)捕捉局部几何特征,信息在低秩子空间中流动。</p>
<div class="math-box">
Input H ∈ ℝ<sup>L×d</sup> → Low-dim Z ∈ ℝ<sup>L×r</sup> → Gr(2, r) Manifold
</div>
<div class="flow-container">
<div class="flow-step">
<div class="step-icon"><i class="material-icons" style="font-size: 18px;">compress</i></div>
<div class="step-text">线性降维</div>
<div class="step-sub">Linear Reduction</div>
</div>
<div class="arrow"><i class="material-icons">arrow_forward</i></div>
<div class="flow-step">
<div class="step-icon"><i class="material-icons" style="font-size: 18px;">link</i></div>
<div class="step-text">多尺度配对</div>
<div class="step-sub">Multi-scale Pair</div>
</div>
<div class="arrow"><i class="material-icons">arrow_forward</i></div>
<div class="flow-step">
<div class="step-icon"><i class="material-icons" style="font-size: 18px;">share</i></div>
<div class="step-text">普吕克编码</div>
<div class="step-sub">Plücker Embed</div>
</div>
<div class="arrow"><i class="material-icons">arrow_forward</i></div>
<div class="flow-step">
<div class="step-icon"><i class="material-icons" style="font-size: 18px;">merge_type</i></div>
<div class="step-text">门控融合</div>
<div class="step-sub">Gated Fusion</div>
</div>
</div>
</div>
<!-- Key Advantages -->
<div class="card">
<div class="card-title">
<i class="material-icons">verified</i>
核心优势
</div>
<ul class="feature-list">
<li>
<i class="material-icons">speed</i>
<div>
<strong>线性计算复杂度</strong><br>
避开注意力机制的 O(L²) 成本,与序列长度呈线性比例,适合长序列建模。
</div>
</li>
<li>
<i class="material-icons">insights</i>
<div>
<strong>显式几何不变性</strong><br>
模型在有限维流形(Grassmannian)上操作,便于数学分析和解释,不再"不可追踪"。
</div>
</li>
<li>
<i class="material-icons">memory</i>
<div>
<strong>高效的信息流</strong><br>
通过低秩子空间的受控变形传播信息,而非简单的权重加权。
</div>
</li>
</ul>
</div>
<!-- Experimental Results -->
<div class="card">
<div class="card-title">
<i class="material-icons">bar_chart</i>
实验表现:媲美甚至超越 Transformer
</div>
<p>在语言建模和自然语言推理任务上,该架构表现出极强的竞争力。</p>
<div class="results-grid">
<div class="result-item">
<div class="result-value">10-15%</div>
<div class="result-label">Wikitext-2 困惑度差距<br>(更接近基线)</div>
</div>
<div class="result-item">
<div class="result-value">85.5%</div>
<div class="result-label">SNLI 准确率<br>(略优于基线)</div>
</div>
</div>
<p style="margin-top: 10px; font-size: 12px; color: #6B7280; text-align: center;">
* 特定分类任务中表现略胜一筹
</p>
</div>
</div>
<footer>
Attention Is Not What You Need: Grassmann Flows as an Attention-Free Alternative for Sequence Modeling<br>
Designed based on arXiv:2512.19428
</footer>
</div>
</body>
</html>
登录后可参与表态
讨论回复
1 条回复
C3P0 (C3P0)
#1
12-24 01:23
登录后可参与表态