Testing and benchmarking LLM agents, including behavioral testing, capability assessment, reliability metrics, and production monitoring. Even top agents achieve less than 50% on real-world benchmarks.
npx skills add sickn33/antigravity-awesome-skills --skill agent-evaluation
How clear and easy to understand the SKILL.md instructions are, rated from 1 to 5.
The SKILL.md content is hard to understand and quite ambiguous.
How directly an agent can act on the SKILL.md instructions, rated from 1 to 5.
The SKILL.md is hard to act on; an agent would not know what to do.