sn-da-image-caption

Use this skill when image files (.png, .jpg, .jpeg, .gif, .webp, .bmp) are the primary input and the user needs to understand, extract data from, or analyze image content. Provides a pre-configured caption script (scripts/caption.py) that converts images to text descriptions via a vision model — no API key setup needed. Covers: (1) captioning charts/tables/screenshots/diagrams via scripts/caption.py, (2) parsing caption text into structured DataFrames, (3) re-creating visualizations from extracted data, (4) exporting to Excel/CSV. Trigger when user uploads images and wants: data extraction, table OCR, chart analysis, UI description, or diagram understanding. Do NOT trigger for image editing (resize, crop, filter) or image generation.

Best for Data extraction from imagesWorks with GitHubLow risk

#image caption #data extraction #ocr #vision model #chart analysis #table extraction #screenshot #diagram #csv #excel

⌘source

author: @OpenSenseNova
repo: OpenSenseNova/SenseNova-Skills
language: Python

✦overview.md

Key Features

·Captures images via a pre-configured script
·No API key setup needed
·Parses captions into DataFrames
·Exports to Excel/CSV
·Re-creates visualizations from extracted data

Use Cases

→Extract data from a chart screenshot
→Convert a table image into a CSV file
→Describe a UI screenshot for documentation
→Analyze a diagram and create a structured summary

Best for

✓Data extraction from images
✓Table/Chart OCR
✓UI description

Not ideal for

!Image editing
!Image generation

FAQs

skills/sn-da-image-caption/SKILL.md

name

sn-da-image-caption

description

Image Caption Analysis — 图片描述与数据提取

Overview

Analyze, extract data from, or understand image files (.png, .jpg, .jpeg, .gif, .webp, .bmp). The core workflow:

Run scripts/caption.py to get a text description of the image
Parse the description into structured data (DataFrame, etc.)
Analyze, visualize, or export

scripts/caption.py — Image Caption

The script converts images to text descriptions via a vision model. Set VISION_API_KEY and VISION_API_BASE environment variables before running.

Usage

# Basic — get text description
python3 scripts/caption.py /mnt/data/image.png

# Custom prompt — guide what to extract
python3 scripts/caption.py /mnt/data/chart.png --prompt "提取所有数值，Markdown 表格格式"

# JSON output — includes detected type, usage stats, cache info
python3 scripts/caption.py /mnt/data/image.png --json

# Batch — process all images in a directory
python3 scripts/caption.py /mnt/data/images/ --batch --output /mnt/data/captions.json

# Override model (optional)
python3 scripts/caption.py /mnt/data/image.png --model gemini-3.1-flash-lite-preview

Options

Option	Description

...

$install

1-click copy

npx skills add OpenSenseNova/SenseNova-Skills --skill sn-da-image-caption

Safety assessment

★

Clarity score

How clear and easy to understand the SKILL.md instructions are, rated from 1 to 5.

4/ 5

very good

Clear and well structured, with only minor parts that might need a second read.

◎

Actionability score

How directly an agent can act on the SKILL.md instructions, rated from 1 to 5.

4/ 5

high

Mostly actionable with clear steps; only a few small gaps remain.

~community cookbook

~you might also like

view all →

duplicate-value-coloring

★70

testing#excel

[✓]from @OpenSenseNova

[✓]

对比Excel多表中的特定系数并对异常值进行颜色标记。

April 30, 2026

◧ Compare

numeric-format-normalization

★70

testing#excel

[✓]from @OpenSenseNova

[✓]

对 Excel 数据进行数值格式标准化与清洗，支持大规模数据的 Parquet 转换流程，并完成关键指标的合计核对与结果文件导出。

April 30, 2026

◧ Compare

bar-chart-visualization

★70

testing#excel

[✓]from @OpenSenseNova

[✓]

读取多工作表Excel文件，自动处理合并单元格与数据清洗，进行交叉分组统计并生成带总计行的结果表，最后绘制支持中英文字体的美化柱状图，适用于多维度数据汇总与可视化分析。

April 30, 2026

◧ Compare

percentage-calculation

★70

testing#excel

[✓]from @OpenSenseNova

[✓]

根据文件行数动态切换大文件处理策略（Parquet转换），通过逐行扫描或列匹配提取关键指标并计算占比、均值等统计量，最终输出结构化Excel报告及可视化图表。

April 30, 2026

◧ Compare

kpi-metric-analysis

★70

testing#analysis

[✓]from @OpenSenseNova

[✓]

根据数据量自动选择读取策略（大文件转Parquet），提取关键指标进行单位一致性验证与排序分析，并输出可下载的结果表格。

April 30, 2026

◧ Compare

category-statistics

★70

documentation#python

[✓]from @OpenSenseNova

[✓]

提取指定类别列并统计各类别数量与占比，生成高分辨率的柱状图、饼图等组合可视化报告，适用于分类数据的分布情况分析。

April 30, 2026

◧ Compare

sn-da-image-caption

Best for Data extraction from imagesWorks with GitHubLow risk

Image Caption Analysis — 图片描述与数据提取

Overview

Analyze, extract data from, or understand image files (.png, .jpg, .jpeg, .gif, .webp, .bmp). The core workflow:

Run scripts/caption.py to get a text description of the image

Parse the description into structured data (DataFrame, etc.)

Analyze, visualize, or export

scripts/caption.py — Image Caption

The script converts images to text descriptions via a vision model. Set VISION_API_KEY and VISION_API_BASE environment variables before running.

Usage

# Basic — get text description python3 scripts/caption.py /mnt/data/image.png # Custom prompt — guide what to extract python3 scripts/caption.py /mnt/data/chart.png --prompt "提取所有数值，Markdown 表格格式" # JSON output — includes detected type, usage stats, cache info python3 scripts/caption.py /mnt/data/image.png --json # Batch — process all images in a directory python3 scripts/caption.py /mnt/data/images/ --batch --output /mnt/data/captions.json # Override model (optional) python3 scripts/caption.py /mnt/data/image.png --model gemini-3.1-flash-lite-preview

Options

Option

Description

sn-da-image-caption

Key Features

Use Cases

Best for

Not ideal for

FAQs

What image formats are supported?

Do I need to set up an API key?

Can I customize what to extract?

How do I get started?

Image Caption Analysis — 图片描述与数据提取

Overview

scripts/caption.py — Image Caption

Usage

Options

Safety assessment

Clarity score

Actionability score

~community cookbook

~you might also like

duplicate-value-coloring

numeric-format-normalization

bar-chart-visualization

percentage-calculation

kpi-metric-analysis

category-statistics

AI Skill Finder

sn-da-image-caption

Key Features

Use Cases

Best for

Not ideal for

FAQs

What image formats are supported?

Do I need to set up an API key?

Can I customize what to extract?

How do I get started?

Image Caption Analysis — 图片描述与数据提取

Overview

scripts/caption.py — Image Caption

Usage

Options

Safety assessment

Clarity score

Actionability score

~community cookbook

~you might also like

duplicate-value-coloring

numeric-format-normalization

bar-chart-visualization

percentage-calculation

kpi-metric-analysis

category-statistics