OpenAI Codex高效使用指南

深度长文 2026年3月9日

如果你是@OpenAI Codex或编程代理的新手，这个指南将帮助你更快获得更好的结果。它涵盖了使Codex在CLI、IDE扩展和Codex应用中更有效的核心习惯，从提示和规划到验证、MCP、技能和自动化。

Codex在你把它当作一个持续配置和改进的队友，而不仅仅是一次性助手时效果最佳。

一个有用的思维模型是：从正确的任务上下文开始，使用AGENTS.md作为持久的指导，根据你的工作流程配置Codex，使用MCP连接外部系统，将重复工作转化为技能，并自动化稳定的工作流程。

1. 通过清晰的上下文和提示，做好Codex的首次强力使用准备

即使你的提示不是完美的，Codex已经足够强大，可以有用。你通常可以在最少设置的情况下交给它一个难题，仍然获得强有力的结果。清晰的提示不是获取价值的必需，但它确实让结果更可靠，特别是在大型代码库或重要任务中。

如果你在大型或复杂的代码库中工作，最大的解锁通常是为Codex提供正确的任务上下文和清晰的结构，说明你想完成什么。

一个好的默认做法是在提示中包括四个内容：

目标：你试图改变或构建什么？

上下文：哪些文件、文件夹、文档、示例或错误对这个任务重要？你可以 @ 提及某些文件作为上下文。

约束：Codex应遵循哪些标准、架构、安全要求或惯例？

完成条件：任务完成前应满足什么条件，比如测试通过、行为改变或错误被修复？

这能帮助Codex保持范围，减少假设，产出更易于审查和验证的工作成果。

根据任务的难度选择推理级别，并测试哪种设置最适合你的工作流程。不同的用户和任务适合不同的设置。

低级别适用于更快速、范围明确的任务

中级或高级适用于更复杂的变更或调试

超高级适用于冗长、有代理性、重推理的任务

新手提示：大多数人通过从一些基础的成功开始最快上手，比如向Codex提问代码库相关问题或使用它进行小范围修复。强烈推荐在Codex应用中使用语音输入以加快迭代速度。

2. 通过让Codex先制定计划减少困难任务中的错误

如果任务复杂、含糊或难以清晰描述，先让Codex制定计划再开始编码。

这里有几种有效的方法：

使用计划模式：对大多数用户来说，这是最简单且最有效的选项。计划模式让Codex收集上下文信息、提出澄清问题，并在实施前制定更强的计划。通过输入 /plan 或 Shift+Tab 切换。

让Codex采访你：如果你对想要的内容有大致想法，但不确定如何准确描述，可以让Codex先提问你。告诉它质疑你的假设，并把模糊的想法转化为具体内容再写代码。

使用PLANS.md模板：对于更高级的工作流程，你可以配置Codex遵循PLANS.md或执行计划模板，用于需长时间运行或多步骤的工作。更多细节，请查看我们的执行计划指南。

3. 使用AGENTS.md使成功的指导可复用

一旦提示模式有效，下一步就是停止手动重复它。这就是AGENTS.md的作用。

把AGENTS.md看作代理的README。它是一个简单的开放格式，会自动加载到上下文中，是你和团队在代码仓库中编码如何使用Codex的最佳位置。

一个好的AGENTS.md通常包括：

仓库布局和重要目录

如何运行项目

构建、测试和代码检查命令

工程规范和PR期望

限制和禁止规则

完成的定义以及如何验证工作

CLI中的/init斜杠命令是快速启动命令，用于在当前目录搭建一个初始的AGENTS.md。这是一个很好的起点，但你应该编辑结果，使其符合你团队实际的构建、测试、代码审核和发布流程。

你可以在多个层级创建AGENTS.md文件：位于~/.codex的全局AGENTS.md用于个人默认设置，仓库级文件用于共享标准，以及子目录中更具体的规则文件。如果在当前目录附近有更具体的文件，该文件的指导优先。

保持实用。简短且准确的AGENTS.md比长篇模糊规则文件更有用。从基础开始，然后只在发现反复错误后添加新规则。

如果 AGENTS.md 文件变得过大，请保持主文件简洁，并针对规划、代码审查或架构等任务引用专门的 Markdown 文件。

提示：当 Codex 连续犯同样的错误时，询问它进行回顾并更新 AGENTS.md。指导保持实用，并基于实际摩擦。

4. 通过配置 Codex 使其更符合你的工作流程，从而获得更一致的行为

配置是令 Codex 在不同会话和环境中表现更一致的主要方式之一。例如，你可以设置模型选择、推理力度、沙盒模式、审批策略、配置文件和 MCP 设置的默认值。

一个好的起点模式是：

将个人默认设置保存在 ~/.codex/config.toml（在 Codex 应用中的设置 → 配置 → 打开 config.toml）

将仓库特定行为保存在 .codex/config.toml 中

仅在单次情况使用命令行覆盖（如果你使用 CLI）

config.toml 是定义持久偏好设置的地方，比如 MCP 服务器、配置文件、多代理设置和实验功能。你可以直接编辑它，也可以让 Codex 帮你更新。

Codex 自带操作级别的沙箱功能，有两个关键设置可控。审批模式决定何时 Codex 在执行命令前请求你的许可，沙箱模式决定 Codex 是否能在目录中读写以及代理可以访问哪些文件。

如果你是完全的新手，建议默认权限保守开始。默认保持审批和沙箱严格，只在信任的仓库或特定流程中，根据需求再放宽权限。

注意 CLI、IDE 及 Codex 应用共用相同的配置层。详情请查看我们的示例配置文档页面。

提示：尽早为你的真实环境配置 Codex。许多质量问题实际上是配置问题，比如错误的工作目录、缺少写权限、错误的模型默认设置，或者缺少工具和连接器。

5. 通过让 Codex 进行测试、验证和审查工作来提升可靠性

不要仅仅停留在让 Codex 做出更改。需要时让它创建测试，运行相关检查，验证结果，并在你接受之前审查工作。

Codex 可以为你完成这个循环，但前提是它知道“正确”的标准是什么。这个指导可以来自提示或 AGENTS.md。

这可能包括：

编写或更新针对更改的测试

运行正确的测试套件

检查代码风格、格式或类型检查

确认最终行为符合请求

审查差异以发现错误、回归或风险模式

提示：在 Codex 应用中切换差异面板，可以直接在本地审查更改。点击具体行即可提供反馈，反馈将作为上下文传递给下一次 Codex 交互。

这里一个有用的选项是使用斜杠命令 /review，它为你提供了几种不同的代码审查方式：

针对基础分支进行PR风格的审查

审查未提交的更改

审查一次提交

使用自定义审查指令

如果你和你的团队有一个code_review.md文件，并且该文件在AGENTS.md中有引用，Codex在审查时也可以遵循这些指导。这是希望团队在多个代码库和贡献者之间保持审查行为一致性的强有力模式。

Codex不仅仅是生成代码。通过合适的指令，它还能帮助测试、验证和审查代码。

如果你使用GitHub云端服务，可以轻松地设置Codex为你的PR运行代码审查。我们在OpenAI使Codex审查100%的PR。你可以选择启用自动审查，或者在@Codex时让它被动审查。

6. 通过MCP将外部工具和实时上下文引入Codex

当Codex所需的上下文存在于代码库之外时，使用MCP。它让Codex连接你已经使用的工具和系统，这样你就不必不断地复制粘贴实时信息到提示中。

模型上下文协议（Model Context Protocol，简称MCP）是一种用于将Codex连接到外部工具和系统的开放标准。

以下情况使用MCP：

所需的上下文存在于代码库之外

数据经常变化

您希望Codex使用工具而不是依赖粘贴的指令

您需要跨用户或项目实现可重复的集成

Codex支持带有OAuth的STDIO和可流式HTTP服务器。

在Codex应用程序中，前往设置 → MCP服务器以查看自定义和推荐服务器。通常，Codex可以帮助您安装所需的服务器。您只需提出请求。您还可以在命令行界面使用codex mcp add命令添加带有名称、URL和任何附加信息的自定义服务器。

提示：只有当工具能解锁真实工作流程时才添加工具。不要一开始就接入你使用的所有工具。先从一两个明显能消除你经常做的手动环节的工具开始，然后再逐步扩展。

7. 将重复的工作流程转化为可复用的技能

一旦工作流程变得可重复，就不要再依赖冗长的提示或反复往返。使用技能（Skill）将指令打包成SKILL.md文件、上下文和支持逻辑，Codex将持续应用。技能能在CLI、IDE扩展和Codex应用中使用。

保持每个技能的范围紧密聚焦于一项工作。先从2到3个具体用例开始，定义清晰的输入和输出，并撰写描述，明确说明技能做什么及何时使用。包括用户实际可能说出的触发短语类型。

不要试图一开始就覆盖所有边缘情况。先从一个代表性任务开始，使其运行良好，然后将该工作流程转化为技能并逐步改进。只在能显著提升可靠性时才包含脚本或额外资源。

一个好的经验法则是：如果你不断重复使用相同的提示或反复纠正同一工作流程，它可能就应该成为一个技能。

技能对于以下重复性工作尤其有用：

日志分类

发布说明草拟

根据清单审查PR

迁移规划

遥测或事件总结

标准调试流程

$skill-creator 技能是搭建技能第一个版本的最佳起点，而使用 $skill-installer 技能可以将其安装到本地。技能中最重要的部分之一是描述。它应清晰说明技能的功能及使用时机。

提示：个人技能存储在 $HOME/.agents/skills，团队共享技能可以提交到仓库内的 .agents/skills 文件夹。这对新队友入职特别有帮助。

8. 通过自动化节省重复工作时间

一旦工作流程稳定，你可以安排 Codex 在后台为你运行它。在 Codex 应用中，自动化允许你选择项目、提示词、频率和执行环境来处理重复任务。

当任务变得重复时，你可以轻松在 Codex 应用的自动化标签页创建自动化。你可以选择运行的项目、运行的提示词（可以调用技能），以及执行频率。你还可以选择自动化是在专用的 git 工作树中运行，还是在本地环境中运行。了解更多关于 git 工作树的信息。

合适的候选包括：

总结最近的提交

扫描可能的漏洞

起草发布说明

检查持续集成失败

制作站立会议总结

定期运行可重复的分析工作流程

一个有用的规则是技能定义方法，自动化定义时间表。如果一个工作流程仍然需要大量指导，先将其转变为技能。一旦它变得可预测，自动化就成为倍增器。

提示：使用自动化进行反思和维护，而不仅仅是执行。回顾最近的会话，总结重复出现的摩擦，并随着时间推移改进提示、指令或工作流程设置。

9. 通过会话控制在较长时间的工作中保持有序

Codex 会话不仅仅是聊天记录。它们是积累了上下文、决策和行动的工作线程，因此良好的管理对质量有重要影响。

在 Codex 应用界面中管理多个线程最简单，可以固定线程和创建工作树。但如果你使用命令行界面，这些斜杠命令尤其有用：

/experimental 用于切换实验性功能并添加到你的 config.toml

/resume 恢复保存的对话

/fork 创建一个新线程，同时保留原始记录

/compact 当线程变长时，您想要早期上下文的总结版本时使用。注意，Codex 会自动为您压缩对话

/agent 当您运行多个代理并希望在活动代理线程之间切换时使用

/theme 选择语法高亮主题

/apps 直接在 Codex 中使用 ChatGPT 应用

/status 检查当前会话状态

每个连贯的工作单元保持一个线程。如果工作仍然是同一问题的一部分，保持在同一个线程通常更好，因为它保留了推理轨迹。只有当工作真正分支时才分叉。

提示：使用 Codex 的多代理工作流程，将有限的工作从主线程中卸载出去。让主代理专注于核心问题，使用子代理处理探索、测试或分诊等任务。

10. 常见错误及避免方法

首次使用 Codex 时要避免的一些常见错误：

在提示中堆砌持久规则，而不是将它们移入 AGENTS.md 或技能文件

不让代理查看其工作内容，不提供如何最佳运行构建和测试命令的详细信息

跳过多步骤和复杂任务的规划

在了解工作流程之前，给予Codex对你电脑的全部权限

在不使用git工作树的情况下，对同一文件运行多个活动线程

在手动操作还不可靠时，将重复任务转变为自动化

将Codex与自己的工作并行使用，而不是逐步监视其操作

入门清单

给Codex设定正确的目标、背景、限制条件和完成标准

对于复杂任务，先让Codex进行规划

创建一个初始的AGENTS.md

告诉Codex如何构建、测试、验证和评审

设置与你工作流匹配的配置默认值

为高价值的外部工具添加MCP

将重复的工作流转化为技能

当工作流稳定后使用自动化

你越是把你的工作流、标准和上下文转化为Codex可用的内容，就越能看到智能代理的真正能力。今天就开始吧！

显示英文原文 / Show English Original

If you’re new to @OpenAI Codex or coding agents in general, this guide will help you get better results faster. It covers the core habits that make Codex more effective across the CLI, IDE extensions, and the Codex app, from prompting and planning to validation, MCP, skills, and automations. Codex works best when you treat it less like a one-off assistant and more like a teammate that you configure and improve over time. A useful mental model is: start with the right task context, use AGENTS.md for durable guidance, configure Codex to match your workflow, connect external systems with MCP, turn repeated work into skills, and automate stable workflows. 1. Set Codex up for a strong first use with clear context and prompting Codex is already strong enough to be useful even when your prompt is not perfect. You can often hand it a hard problem with minimal setup and still get a strong result. Clear prompting is not required to get value, but it does make results more reliable, especially in larger codebases or higher-stakes tasks. If you work in a large or complex repository, the biggest unlock is usually giving Codex the right task context and a clear structure for what you want done. A good default is to include four things in your prompt: Goal: What are you trying to change or build?

Context: Which files, folders, docs, examples, or errors matter for this task? You can @ mention certain files as context. Constraints: What standards, architecture, safety requirements, or conventions should Codex follow? Done when: What should be true before the task is complete, such as tests passing, behavior changing, or a bug being fixed? This helps Codex stay scoped, make fewer assumptions, and produce work that is easier to review and validate. Choose a reasoning level based on how hard the task is and test what works best for your workflow. Different users and tasks benefit from different settings. Low for faster, well-scoped tasks Medium or High for more complex changes or debugging Extra High for long, agentic, reasoning-heavy tasks

Tip for new users: Most people get up to speed fastest by starting with a few basic wins, like asking Codex questions about the codebase or using it to make a small, scoped fix. Highly recommend using speech dictation in the Codex app to speed up iterations. 2. Reduce mistakes on hard tasks by having Codex plan first If the task is complex, ambiguous, or hard to describe clearly, ask Codex to plan before it starts coding. There are a few good ways to do this: Use Plan mode: For most users, this is the easiest and most effective option. Plan mode lets Codex gather context, ask clarifying questions, and build a stronger plan before implementation. Toggle with /plan or Shift+Tab. Ask Codex to interview you: If you have a rough idea of what you want but are not sure how to describe it well, ask Codex to question you first. Tell it to challenge your assumptions and turn the fuzzy idea into something concrete before writing code. Use a PLANS.md template: For more advanced workflows, you can configure Codex to follow a PLANS.md or execution-plan template for longer-running or multi-step work. For more detail, check out our execution plans guide. 3. Make successful guidance reusable with AGENTS.md

Once a prompting pattern works, the next step is to stop repeating it manually. That is where AGENTS.md comes in. Think of AGENTS.md as a README for agents. It is a simple, open format that gets loaded into context automatically and is the best place to encode how you and your team want Codex to work in a repository. A good AGENTS.md usually covers: Repo layout and important directories How to run the project Build, test, and lint commands Engineering conventions and PR expectations Constraints and do-not rules

What done means and how to verify work The /init slash command in the CLI is the quick-start command to scaffold a starter AGENTS.md in the current directory. It is a great starting point, but you should edit the result to match how your team actually builds, tests, reviews, and ships code. You can create AGENTS.md files at multiple levels: a global AGENTS.md for personal defaults that sits in ~/.codex, a repo-level file for shared standards, and more specific files in subdirectories for local rules. If there’s a more specific file closer to your current directory, that guidance wins. Keep it practical. A short, accurate AGENTS.md is more useful than a long file full of vague rules. Start with the basics, then add new rules only after you notice repeated mistakes. If AGENTS.md starts getting too large, keep the main file concise and reference task-specific markdown files for things like planning, code review, or architecture. Tip: When Codex makes the same mistake twice, ask it for a retrospective and update AGENTS.md. Guidance stays practical and based on real friction. 4. Get more consistent behavior by configuring Codex to match your workflow Configuration is one of the main ways to make Codex behave more consistently across sessions and surfaces. For example, you can set defaults for model choice, reasoning effort, sandbox mode, approval policy, profiles, and MCP setup.

A good starting pattern is: Keep personal defaults in ~/.codex/config.toml (Settings → Configuration → Open config.toml from the Codex app) Keep repo-specific behavior in .codex/config.toml Use command-line overrides only for one-off situations (if you use the CLI) Config.toml is where you define durable preferences such as MCP servers, profiles, multi-agent setup, and experimental features. You can edit it directly or ask Codex to update it for you. Codex ships with operating level sandboxing and has two key knobs that you can control. Approval mode determines when Codex asks for your permission to run a command and sandbox mode determines if Codex can read or write in the directory and what files the agent can access. If you are completely new to coding agents, the recommendation is to start conservative with default permissions. Keep approval and sandboxing tight by default, then loosen permissions only for trusted repos or specific workflows once the need is clear. Note that the CLI, IDE, and Codex app all share the same configuration layers. Learn more on our sample configuration documentation page.

Tip: Configure Codex for your real environment early. Many quality issues are really setup issues, like the wrong working directory, missing write access, wrong model defaults, or missing tools and connectors. 5. Improve reliability by having Codex test, validate, and review the work Do not stop at asking Codex to make a change. Ask it to create tests when needed, run the relevant checks, validate the result, and review the work before you accept it. Codex can do this loop for you, but only if it knows what “good” looks like. That guidance can come from either the prompt or AGENTS.md. That can include: Writing or updating tests for the change Running the right test suites Checking lint, formatting, or type checks

Confirming the final behavior matches the request Reviewing the diff for bugs, regressions, or risky patterns Tip: Toggle the diff panel in the Codex app to directly review changes locally. Click on a specific row to provide feedback that gets fed as context to the next Codex turn. A useful option here is the slash command /review, which gives you several different ways to review code: Review against a base branch for PR-style review Review uncommitted changes Review a commit Use custom review instructions

If you and your team have a code_review.md file that is referenced in AGENTS.md, Codex can follow that guidance during review as well. This is a strong pattern for teams that want review behavior to stay consistent across repositories and contributors. Codex should not just generate code. With the right instructions, it can also help test it, validate it, and review it. If you use GitHub Cloud, you can easily set up Codex to run code reviews for your PRs. We have Codex review 100% of PRs at OpenAI. You have the option to enable automatic reviews or have Codex reactively review when you @Codex. 6. Bring external tools and live context into Codex with MCPs Use MCPs when the context Codex needs lives outside the repo. It lets Codex connect to the tools and systems you already use, so you do not have to keep copying and pasting live information into prompts. Model Context Protocol, or MCP, is an open standard for connecting Codex to external tools and systems. Use MCP when: The needed context lives outside the repo

The data changes frequently You want Codex to use a tool rather than rely on pasted instructions You need a repeatable integration across users or projects Codex supports both STDIO and Streamable HTTP servers with OAuth. In the Codex App, head to Settings → MCP servers to see custom and recommended servers. Often, Codex can help you install the needed servers. All you need to do is ask. You can also use the codex mcp add command in the CLI to add your custom servers with a name, URL, and any additional information. Tip: Add tools only when they unlock a real workflow. Do not start by wiring in every tool you use. Start with one or two tools that clearly remove a manual loop you already do often, then expand from there. 7. Turn repeated workflows into reusable skills Once a workflow becomes repeatable, stop relying on long prompts or repeated back-and-forth. Use a Skill to package the instructions in a SKILL.md file, context, and supporting logic Codex should apply consistently. Skills works across the CLI, IDE extension, and Codex app.

Keep each skill tightly scoped to one job. Start with 2 to 3 concrete use cases, define clear inputs and outputs, and write the description so it clearly says what the skill does and when to use it. Include the kinds of trigger phrases a user would actually say. Do not try to cover every edge case up front. Start with one representative task, get it working well, then turn that workflow into a skill and improve from there. Include scripts or extra assets only when they meaningfully improve reliability. A good rule of thumb: if you keep reusing the same prompt or correcting the same workflow, it should probably become a skill. Skills are especially useful for recurring jobs like: Log triage Release note drafting PR review against a checklist Migration planning

Telemetry or incident summaries Standard debugging flows The $skill-creator skill is the best place to start to scaffold the first version of a skill and to use the $skill-installer skill to install it locally. One of the most important parts of a skill is the description. It should clearly say what the skill does and when to use it. Tip: Personal skills are stored in $HOME/.agents/skills, and shared team skills can be checked into .agents/skills inside a repository. This is especially helpful for onboarding new teammates. 8. Save time on recurring work with automations Once a workflow is stable, you can schedule Codex to run it in the background for you. In the Codex app, automations let you choose the project, prompt, cadence, and execution environment for a recurring task. Once a task becomes repetitive for you, you can easily create an automation in the Automations tab on the Codex app. You can choose which project it runs in, the prompt it runs (you can invoke skills), and the cadence it will run. You can also choose whether the automation runs in a dedicated git worktree or in your local environment. Learn more about git worktrees. Good candidates include:

Summarizing recent commits Scanning for likely bugs Drafting release notes Checking CI failures Producing standup summaries Running repeatable analysis workflows on a schedule A useful rule is that skills define the method, automations define the schedule. If a workflow still needs a lot of steering, turn it into a skill first. Once it is predictable, automation becomes a force multiplier. Tip: Use automations for reflection and maintenance, not just execution. Review recent sessions, summarize repeated friction, and improve prompts, instructions, or workflow setup over time.

9. Stay organized across longer-running work with session controls Codex sessions are not just chat history. They are working threads that accumulate context, decisions, and actions over time, so managing them well has a big impact on quality. Managing multiple threads is easiest in the Codex app UI with the ability to pin threads and create worktrees. But if you are using the CLI, these slash commands are especially useful: /experimental to toggle experimental features and add to your config.toml /resume to resume a saved conversation /fork to create a new thread while preserving the original transcript /compact when the thread is getting long and you want a summarized version of earlier context. Note that Codex does automatically compact conversations for you /agent when you are running multiple agents and want to switch between the active agent thread

/theme to choose a syntax highlighting theme /apps to use ChatGPT apps directly in Codex /status to inspect the current session state Keep one thread per coherent unit of work. If the work is still part of the same problem, staying in the same thread is often better because it preserves the reasoning trail. Fork only when the work truly branches. Tip: Use Codex’s multi-agent workflows to offload bounded work from the main thread. Keep the main agent focused on the core problem, and use subagents for tasks like exploration, tests, or triage. 10. Common mistakes to avoid A few common mistakes to avoid when first using Codex: Overloading the prompt with durable rules instead of moving them into AGENTS.md or a skill

Not letting the agent see its work by not giving details on how to best run build and test commands Skipping planning on multi-step and complex tasks Giving Codex full permission to your computer before the workflow is understood Running multiple live threads on the same files without using git worktrees Turning a recurring task into an automation before it is reliable manually Use Codex in parallel with your own work instead of treating it as something you have to watch step by step Getting started checklist Give Codex the right goal, context, constraints, and done-when

For hard tasks, ask Codex to plan first Create a starter AGENTS.md Tell Codex how to build, test, validate, and review Set configuration defaults that match your workflow Add MCP for high-value external tools Turn repeated workflows into skills Use automations once a workflow is stable The more you turn your workflow, standards, and context into something Codex can use, the more you’ll see what the agent can really do. Start today!

来源 Source

https://x.com/i/article/2030089070479548416