Run metric-driven iterative optimization loops. Define a measurable goal, build measurement scaffolding, then run parallel experiments that try many approaches, measure each against hard gates and/or LLM-as-judge quality scores, keep improvements, and converge toward the best solution. Use when optimizing clustering quality, search relevance, build performance, prompt quality, or any measurable outcome that benefits from systematic experimentation. Inspired by Karpathy's autoresearch, generalized for multi-file code changes and non-ML domains.
Run metric-driven iterative optimization. Define a goal, build measurement scaffolding, then run parallel experiments that converge toward the best solution.
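The loop described above (generate candidate approaches, reject any that fail a hard gate, score the rest, keep improvements) can be sketched in Python. This is a minimal illustration of the control flow only, not the skill's actual implementation; the function names and the toy objective are assumptions.

```python
import random

def optimize(candidates, measure, gate, rounds=3):
    """Keep the best candidate that passes the hard gate.

    `candidates` yields a batch of experiments per round (in a real
    run these execute in parallel); `measure` returns a score where
    higher is better; `gate` is a pass/fail check applied first.
    """
    best, best_score = None, float("-inf")
    for _ in range(rounds):
        for cand in candidates():
            if not gate(cand):
                continue  # hard gate: reject outright, never score
            score = measure(cand)
            if score > best_score:
                best, best_score = cand, score  # keep improvements
    return best, best_score

# Toy usage: "experiments" are numbers; the goal is to maximize
# -(x - 3)^2, so the loop should converge toward x = 3.
result, score = optimize(
    candidates=lambda: [random.uniform(0, 6) for _ in range(8)],
    measure=lambda x: -(x - 3) ** 2,
    gate=lambda x: 0 <= x <= 6,
)
```

The same shape applies whether a candidate is a clustering parameter set, a prompt variant, or a multi-file code change; only `measure` and `gate` change.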
Use the platform's blocking question tool when available (AskUserQuestion in Claude Code, request_user_input in Codex, ask_user in Gemini). Otherwise, present numbered options in chat and wait for the user's reply before proceeding.
<optimization_input> #$ARGUMENTS </optimization_input>
If the input above is empty, ask: "What would you like to optimize? Describe the goal, or provide a path to an optimization spec YAML file."
Reference the spec schema for validation:
references/optimize-spec-schema.yaml
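That schema file is authoritative. Purely as an illustration of the kind of structure a spec carries and how validation against required fields might work, here is a hypothetical sketch; every field name below is an assumption, not taken from the real schema:

```python
# Hypothetical spec; the real schema in
# references/optimize-spec-schema.yaml is authoritative.
spec = {
    "goal": "Improve search relevance NDCG@10",
    "metric": {"command": "python eval.py", "direction": "maximize"},
    "hard_gates": ["tests pass", "latency_p95_ms <= 200"],
    "budget": {"max_experiments": 20},
}

REQUIRED = {"goal", "metric", "budget"}

def validate_spec(spec):
    # Fail fast on missing top-level fields before any experiment runs.
    missing = REQUIRED - spec.keys()
    if missing:
        raise ValueError(f"spec missing fields: {sorted(missing)}")
    return True
```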
Reference the experiment log schema for state management:
references/experiment-log-schema.yaml
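Again, the schema file defines the real format. As a sketch of why an append-only log makes the loop resumable, here is a hypothetical JSON-lines version; the entry fields (`id`, `score`, `kept`) are illustrative assumptions:

```python
import io
import json

def log_experiment(fh, entry):
    # One JSON object per line keeps the log append-only and resumable.
    fh.write(json.dumps(entry) + "\n")

def best_so_far(fh):
    # Rebuild loop state from the log alone.
    entries = [json.loads(line) for line in fh]
    kept = [e for e in entries if e["kept"]]
    return max(kept, key=lambda e: e["score"]) if kept else None

buf = io.StringIO()  # stands in for the on-disk log file
log_experiment(buf, {"id": 1, "score": 0.61, "kept": True})
log_experiment(buf, {"id": 2, "score": 0.58, "kept": False})
buf.seek(0)
```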
For a first run, optimize for signal and safety, not maximum throughput:
Use references/example-hard-spec.yaml when the metric is objective and cheap to measure.
Use references/example-judge-spec.yaml only when actual quality requires semantic judgment.
Install the skill:
npx skills add EveryInc/compound-engineering-plugin --skill ce-optimize
Clarity: how clear and easy to understand the SKILL.md instructions are, rated from 1 to 5. A score of 1 means the SKILL.md content is hard to understand and quite ambiguous.
Actionability: how directly an agent can act on the SKILL.md instructions, rated from 1 to 5. A score of 1 means the SKILL.md is hard to act on; an agent would not know what to do.
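Rubric scores like these can feed an LLM-as-judge gate. A minimal aggregation sketch, where both the per-criterion floor and the average threshold are illustrative assumptions rather than values from the skill:

```python
def judge_gate(scores, min_each=3, min_avg=4.0):
    """Pass only if no criterion is weak and the overall average is high.

    `scores` maps rubric name -> 1-5 rating from the LLM judge.
    """
    vals = list(scores.values())
    return min(vals) >= min_each and sum(vals) / len(vals) >= min_avg
```

Requiring a per-criterion floor in addition to an average stops one strong score from masking an instruction set that is clear but impossible to act on (or vice versa).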