欣淇
发布于 2026-05-18 / 0 阅读
0
0

🤖 Agents Best Practices:735★ 的 Agent 技能包,把你的 Codex / Claude Code 变成 Agent 架构师

🤖 Agents Best Practices:735★ 的 Agent 技能包,把你的 Codex / Claude Code 变成 Agent 架构师

一个 provider-neutral 的 Agent Skill,专治「Agent 跑起来容易,跑好难」

这玩意儿解决什么问题?

老实说,现在搞 Agent 开发的兄弟们普遍状态是:用框架跑个 demo 五分钟,想上生产发现全是坑——工具权限乱给、上下文一长就失忆、没有预算控制、出了问题不知道谁干的。

说的就是这玩意儿。我不是说某个框架不好,而是说整个行业缺一份 「Agent 运行时该长什么样」的参考标准

agents-best-practices 就是干这个的。735★ 刚发布几天,已经冲上 GitHub Trending。它不是又一个框架,而是一个 provider-neutral 的 Agent Skill,装到 Codex 或 Claude Code 里,你的 Coding Agent 就变成了一个 Agent 架构师。

最骚的操作

安装极其简单,一行命令搞定:

npx skills add DenisSergeevitch/agents-best-practices -g

或者手动装:

# Codex
mkdir -p "${CODEX_HOME:-$HOME/.codex}/skills"
git clone https://github.com/DenisSergeevitch/agents-best-practices.git \
  "${CODEX_HOME:-$HOME/.codex}/skills/agents-best-practices"

# Claude Code
mkdir -p "$HOME/.claude/skills"
git clone https://github.com/DenisSergeevitch/agents-best-practices.git \
  "$HOME/.claude/skills/agents-best-practices"

装完之后,你直接跟你的 AI 说「帮我设计一个风控 Agent」,它就会自动走这个 Skill 的流程,生成一个完整的 MVP 蓝图,包括工具定义、权限分级、预算控制、观测链路。

核心哲学

这个项目最牛逼的地方在于它的核心理念:Agent 不是一个提示词,而是一个运行时。

"The model proposes actions; the harness validates, authorizes, executes, records, and returns observations."

实际开发中拆成七层:

instructions  context builder  model call  
tool proposal  validation  permission decision  
execution or approval pause  observation  
next step or final answer

每层都有明确的边界和检查点。别问我怎么知道的——踩过的坑都是泪。

实际用起来咋样?

比如你让 Agent 设计一个「账户续约风险分析 Agent」:

You > 帮我搭一个账户续约风险评估 Agent。它能读 CRM、工单、用量数据,
      然后生成续约建议。

Agent > 好的,用 approval-gated Level 2 架构。
       MVP 只干一件事:生成风险评估报告 + 草稿建议。

       Core loop:
         user/task -> context builder -> model call -> typed tool call
         -> schema validation -> permission check -> execution or pause
         -> structured observation -> next step or final brief

       Minimal tools:
         - read_account_profile      read_private_data
         - list_support_tickets      read_private_data
         - fetch_usage_summary       read_private_data
         - draft_customer_email      draft_external_message
         - request_approval          approval_gate

       Launch gate:
         20 个历史账户走一遍 trace review,
         无未经审批的外部发送,
         人工审批通过率 ≥ 80%。

看到没?它不会丢给你一个「用 LangChain 搭一个 Agent」这种废话,而是从运行时设计出发,把工具权限、审批流、验证标准全都写清楚了。

有什么坑?

实话讲,这个项目 不是给新手用的。它假设你已经知道 Agent 是什么、为什么要用、基本概念都懂。它不会教你 from scratch 搭 Agent,而是教你 搭一个能上生产的 Agent

另外,735★ 还算早期,参考文档的覆盖还不太全(比如多 Agent 编排的篇幅很少),但核心的 MVP 蓝图、工具权限、安全评估三份文档写得非常扎实。

总结

  • Provider-neutral,Codex 和 Claude Code 都能用,不锁厂商
  • 从运行时设计出发,不是提示词工程
  • 工具权限分 risk class,窄接口 + 审批门控,不乱暴露 write_database 这种炸弹
  • 带了完整的 launch gate 和 checklists

如果你已经在搞 Agent 开发,装一个试试。反正一行命令的事。


🤖 Agents Best Practices: 735★ Agent Skill That Turns Your Codex/Claude Code Into an Agent Architect

A provider-neutral Agent Skill for building production-grade agentic harnesses

The Pain Point

Honestly, the state of agent development is: 5 minutes to run a demo, 5 months to get it production-ready. Tools with no permission boundaries, context that forgets everything after compaction, no budget controls, no observability.

agents-best-practices fixes this. At 735★ and climbing GitHub Trending, it's not another framework — it's a provider-neutral Agent Skill that turns your coding agent into an agent architect.

Installation

One command:

npx skills add DenisSergeevitch/agents-best-practices -g

Or manually:

# Codex
mkdir -p "${CODEX_HOME:-$HOME/.codex}/skills"
git clone https://github.com/DenisSergeevitch/agents-best-practices.git \
  "${CODEX_HOME:-$HOME/.codex}/skills/agents-best-practices"

# Claude Code
mkdir -p "$HOME/.claude/skills"
git clone https://github.com/DenisSergeevitch/agents-best-practices.git \
  "$HOME/.claude/skills/agents-best-practices"

Once installed, ask your agent to "design a risk-assessment agent" and it'll walk through the full MVP blueprint flow — tools, permissions, budgets, observability, launch gates.

Core Philosophy

The central insight: An agent is not a prompt — it's a runtime.

"The model proposes actions; the harness validates, authorizes, executes, records, and returns observations."

Seven layers in production:

instructions  context builder  model call 
tool proposal  validation  permission decision 
execution or approval pause  observation 
next step or final answer

Each layer has clear boundaries and checkpoints. Learned this the hard way.

In Practice

Ask your agent to design an account renewal risk agent:

You > Build an agent for account renewal risk. It should read CRM,
      support tickets, and usage data, then draft renewal actions.

Agent > Using an approval-gated Level 2 harness.
       MVP produces a renewal-risk brief plus draft next actions.

       Core loop:
         user/task → context builder → model call → typed tool call
         → schema validation → permission check → execution or pause
         → structured observation → next step or final brief

       Tools (risk-classified):
         - read_account_profile      read_private_data
         - list_support_tickets      read_private_data
         - fetch_usage_summary       read_private_data
         - draft_customer_email      draft_external_message
         - request_approval          approval_gate

       Launch gate:
         20 historical accounts with trace review,
         zero unapproved external sends,
         ≥80% human acceptance on draft actions.

Notice: no "use LangChain" hand-waving. Real runtime design with permission boundaries, approval flows, and verification criteria.

Caveats

This isn't for beginners. It assumes you know the basics and want to build production-grade agents. At 735★ it's still early — multi-agent orchestration coverage is thin — but the core references (MVP blueprint, tools & permissions, security & evals) are solid.

TL;DR

  • Provider-neutral: works with Codex and Claude Code, no vendor lock-in
  • Runtime-first design, not prompt engineering
  • Risk-classified tools with narrow interfaces and approval gates
  • Complete launch gates and checklists

If you're building production agents, npx skills add it. One command.


评论