秋月春风

明月守灯寻长梦，梦长寻灯守月明

Tags Agent

Agent Skills入门

Posted at 2026-01-30 学AI/DS Agent

Claude带火了skills,学习一下Agent未来工程新范式。感觉现在有点像tools刚出来时候，后续应该会发展的。

常见指标之pass@k, avg@k, const@k, best@k

Posted at 2026-01-03 学AI/DS LLM Agent Benchmark

在很多LLM的评测报告中，可能会看到这些指标，尤其是在代码生成、数学推理、程序合成等任务里。它们的侧重各不相同，但都基于一个前提设定：对同一个问题，模型允许生成 k 个不同的答案，再用不同方式来统计表现。

RL中的 Rollout 与 Training

Posted at 2026-01-02 学AI/DS LLM Agent RL

经常听到rollout这个词，周围人张口闭口就是。详细解释辨析一下。

Agent Memory综述整理

Posted at 2025-12-17 学AI/DS LLM Agent

整理自 https://arxiv.org/abs/2512.13564 《Memory in the Age of AI Agents》

Self-Evolving Agent综述整理

Posted at 2025-08-21 学AI/DS LLM Agent

整理自 https://arxiv.org/abs/2507.21046 《A Survey of Self-Evolving Agents- On Path to Artificial Super Intelligence》

Agent Communication综述整理(Protocols & Safety)

Posted at 2025-07-13 学AI/DS LLM Agent

整理自多篇综述与论文，以《A Survey of LLM-Driven AI Agent Communication- Protocols, Security Risks, and Defense Countermeasures》为主线。

AI4Research综述整理 (DS Agent & AI scientist)

Posted at 2025-06-30 学AI/DS LLM Agent DS AI4S

整理自 https://arxiv.org/abs/2412.14222 《A Survey on Large Language Model-based Agents for Statistics and Data Science》 & https://arxiv.org/abs/2510.23045 《A Survey of AI Scientists》

Page 1 / 1