experimentation-analytics

How to read experiment results without fooling yourself. Confidence intervals, p-values, multiple testing, sequential testing, CUPED, heterogeneous treatment effects, ratio metrics, network effects, dashboard reconciliation, and the interpretation failures that produce confidently wrong shipping decisions.

Best for Experimentation teamsWorks with GitHubHigh risk

#experimentation #a/b testing #statistics #data analysis #confidence intervals #p-values #multiple testing #sequential testing #cuped #ratio metrics #network effects #dashboard reconciliation

⌘source

author: @rampstackco
repo: rampstackco/claude-skills
language: Python

✦overview.md

Key Features

·Prevents misreading experiment results
·Covers p-values, CUPED, ratio metrics
·Addresses multiple testing & sequential testing
·Explains network effects & dashboard reconciliation

Use Cases

→A data scientist validating experiment results before shipping
→A product manager interpreting a results panel with multiple metrics
→A platform team reconciling discrepancies between experiment dashboards

Best for

✓Experimentation teams
✓Data scientists
✓Product analytics

Not ideal for

!Setting up experiments
!Designing experiments

FAQs

skills/experimentation-analytics/SKILL.md

name

experimentation-analytics

description

category:product

catalog_summary:Read result panels without fooling yourself: confidence intervals, p-values, multiple testing, sequential testing, CUPED, ratio metrics, network effects, dashboard reconciliation

display_order:6

Experimentation Analytics

A data-team-mentor's playbook for interpreting experiment results without fooling yourself.

The result panel is the moment-of-truth for an experiment. The numbers on it determine whether you ship, kill, or iterate. They also expose every shortcut taken in the design phase: an underpowered test produces wide confidence intervals; a peeked test produces a too-narrow p-value; a ratio metric without delta-method correction produces overconfident lift estimates. Most ship-the-wrong-thing decisions trace back to misreading the result panel.

This skill is the discipline that prevents misreading. It assumes the experiment was designed well (see the experiment-design skill). It assumes the platform's results panel is technically correct (most modern platforms are; some older ones are not). It assumes you can read a number off a screen. The hard part is knowing what each number actually means and what it does not, and that is what is here.

When to use this skill: any time you are reading an experiment result panel and about to make a ship, kill, or iterate decision.

What this skill is for

...

$install

1-click copy

npx skills add rampstackco/claude-skills --skill experimentation-analytics

Safety assessment

★

Clarity score

How clear and easy to understand the SKILL.md instructions are, rated from 1 to 5.

3/ 5

good

Mostly clear, but there are still a few confusing or poorly structured parts.

◎

Actionability score

How directly an agent can act on the SKILL.md instructions, rated from 1 to 5.

2/ 5

low

Some hints are present, but an agent still has to guess many steps.

~community cookbook

May 7, 2026

◧ Compare

experimentation-analytics

Best for Experimentation teamsWorks with GitHubHigh risk

Experimentation Analytics

A data-team-mentor's playbook for interpreting experiment results without fooling yourself.

When to use this skill: any time you are reading an experiment result panel and about to make a ship, kill, or iterate decision.

What this skill is for

experimentation-analytics

Key Features

Use Cases

Best for

Not ideal for

FAQs

Does this skill cover experiment design?

What common mistakes does this skill address?

Experimentation Analytics

What this skill is for

Safety assessment

Clarity score

Actionability score

~community cookbook

~you might also like

game-sprite-pipeline

n8n-architect

dispatching-parallel-agents

shopify-development

reversa-reviewer

n8n-architect

AI Skill Finder

experimentation-analytics

Key Features

Use Cases

Best for

Not ideal for

FAQs

Does this skill cover experiment design?

What common mistakes does this skill address?

Experimentation Analytics

What this skill is for

Safety assessment

Clarity score

Actionability score

~community cookbook

~you might also like

game-sprite-pipeline

n8n-architect

dispatching-parallel-agents

shopify-development

reversa-reviewer

n8n-architect