This is a curated list of papers on agentic reasoning, based on the January 2026 survey "Agentic Reasoning for Large Language Models: A Survey" (arXiv:2601.12538).
## Core Taxonomy
### 1. Foundational Agentic Reasoning
- **Planning**: Tree of Thoughts, ReAct, PlanBench, and others
- **Tool-use optimization**: Toolformer, Gorilla, APIBench, and others
- **Agentic search**: Self-RAG, WebGPT, DeepRAG, and others
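To make the planning and tool-use entries concrete, here is a minimal sketch of a ReAct-style loop, in which the agent alternates between a reasoning step and a tool call until it decides to finish. Everything in it is an assumption for illustration: `TOOLS`, `scripted_policy`, and `react_loop` are hypothetical names, and the policy is a hard-coded stand-in for a model call, not the method of any listed paper.

```python
from typing import Callable, Dict, List, Tuple

# Hypothetical tool registry; the single "lookup" tool is illustrative only.
TOOLS: Dict[str, Callable[[str], str]] = {
    "lookup": lambda key: {"capital of France": "Paris"}.get(key, "unknown"),
}

# One history entry: (thought, action name, observation returned by the tool).
Step = Tuple[str, str, str]


def scripted_policy(question: str, history: List[Step]) -> Tuple[str, str, str]:
    """Stand-in for an LLM call. Returns (thought, action, argument)."""
    if not history:
        # No observations yet, so call the lookup tool with the question.
        return ("I should look this up.", "lookup", question)
    # Otherwise finish with the most recent observation as the answer.
    return ("The observation answers the question.", "finish", history[-1][2])


def react_loop(
    question: str,
    policy: Callable[[str, List[Step]], Tuple[str, str, str]],
    max_steps: int = 5,
) -> str:
    """Alternate reasoning and acting until the policy emits 'finish'."""
    history: List[Step] = []
    for _ in range(max_steps):
        thought, action, argument = policy(question, history)
        if action == "finish":
            return argument
        observation = TOOLS[action](argument)
        history.append((thought, action, observation))
    return "no answer"


if __name__ == "__main__":
    print(react_loop("capital of France", scripted_policy))
```

In a real system the policy would be a model prompted with the question and the running history; the loop structure itself stays the same.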
### 2. Self-evolving Agentic Reasoning
- **Feedback mechanisms**: Reflexion, Self-Refine, AgentTuning
- **Agent memory**: MemGPT, MemoryBank, Agent Workflow Memory
- **Capability evolution**: Self-Rewarding, RAGEN, WebRL
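The feedback and memory entries can be illustrated with a small Reflexion-style sketch: after a failed trial, the agent stores a verbal note and conditions its next attempt on it. This is a toy under stated assumptions. `actor`, `evaluator`, `reflect`, and `reflexion_loop` are hypothetical names, and the actor is a deterministic stub rather than a language model.

```python
from typing import List, Tuple


def actor(numbers: List[int], reflections: List[str]) -> List[int]:
    """Stub actor: it only sorts once a stored reflection tells it to."""
    if any("sort" in note for note in reflections):
        return sorted(numbers)
    return list(numbers)


def evaluator(output: List[int]) -> bool:
    """Task check: the output must be in ascending order."""
    return all(a <= b for a, b in zip(output, output[1:]))


def reflect(output: List[int]) -> str:
    """Stub self-reflection: turn the failure into a verbal lesson."""
    return "The output was not in ascending order; sort the numbers first."


def reflexion_loop(numbers: List[int], max_trials: int = 3) -> Tuple[List[int], int]:
    """Retry the task, carrying reflections forward as episodic memory.

    Returns the final output and the number of trials used.
    """
    reflections: List[str] = []
    output = list(numbers)
    for trial in range(1, max_trials + 1):
        output = actor(numbers, reflections)
        if evaluator(output):
            return output, trial
        reflections.append(reflect(output))
    return output, max_trials


if __name__ == "__main__":
    print(reflexion_loop([3, 1, 2]))
```

The point of the design is that the improvement comes from text kept in memory between trials, not from any weight update, which places it on the in-context side of the taxonomy.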
### 3. Collective Multi-agent Reasoning
- **Collaboration and division of labor**: MetaGPT, AutoAgents, Chain of Agents
- **Multi-agent memory**: G-Memory, MIRIX, Collaborative Memory
- **Training and evolution**: MARFT, MAPoRL, Multi-Agent Evolve
### 4. Application Domains
- **Math and coding**: AlphaGeometry, CodeChain, AgentCoder
- **Scientific discovery**: ChemCrow, AI Scientist, ProtAgents
- **Embodied intelligence**: Voyager, SayCan, Gemini Robotics
- **Healthcare**: AgentMD, TxAgent, MedOrch
- **Web research**: WebGPT, Agent Q, OSWorld
### 5. Benchmarks
- Tool use: ToolQA, API-Bank, GTA
- Memory and planning: LongMemEval, TravelPlanner, ALFWorld
- Multi-agent: SMARTS, AvalonBench, BattleAgentBench
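## Key Insights
1. **Three-layer architecture**: foundational reasoning → self-evolution → collective collaboration
2. **Two paradigms**: in-context reasoning vs. post-training optimization
3. **Core trends**: a shift from single agents to multi-agent collaboration, and from static capabilities to dynamic learning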
## Resources
- GitHub: https://github.com/weitianxin/Awesome-Agentic-Reasoning
- Paper: https://arxiv.org/abs/2601.12538
- HuggingFace: https://huggingface.co/papers/2601.12538
Saved: 2026-03-04
#memory #papers #AI #AgenticReasoning #小凯