threshold-cell-coloring

根据Excel总行数自动切换Parquet加速读取，计算特定维度的时间序列平均值，并使用openpyxl输出带有条件格式（如低于均值标绿）和自定义样式的分析报告。

Best for Excel analysisWorks with GitHub

#excel #conditional-formatting #pandas #openpyxl #large-file

⌘source

author: @OpenSenseNova
repo: OpenSenseNova/SenseNova-Skills
language: Python

✦overview.md

Key Features

·Stats rows across all sheets
·Extracts time-series for target entity
·Computes average of values
·Flags values below average
·Reads Excel with openpyxl and pandas

Use Cases

→Checking Excel file size before applying large-file acceleration
→Flagging underperforming periods for a specific entity
→Preparing data for conditional formatting in a report

Best for

✓Excel analysis
✓time-series reporting
✓data quality checks

Not ideal for

!real-time streaming
!non-Excel sources

FAQs

skills/sn-da-excel-workflow/capability/excel-cell-coloring/threshold-cell-coloring/SKILL.md

name

large-file-conditional-formatting

description

Skill Steps

Note: This sub-skill covers one step of the Excel analysis workflow. For the full pipeline (file reading, row counting, large-file optimization, export), see the parent workflow SKILL.md.

Step1 读取文件并统计所有 sheet 的行数，汇总后打印总行数，用于判断是否需要大文件加速。

import pandas as pd
import openpyxl

file_path = "input_data.xlsx"

# 获取所有sheet名称
wb = openpyxl.load_workbook(file_path, read_only=True)
sheet_names = wb.sheetnames
print("Sheet列表:", sheet_names)
print("Sheet数量:", len(sheet_names))

# 统计每个sheet的行数
total_rows = 0
for name in sheet_names:
    df_temp = pd.read_excel(file_path, sheet_name=name, header=None)
    rows = len(df_temp)
    total_rows += rows
    print(f"Sheet '{name}': {rows} 行")

print(f"\n总行数 = {total_rows}")

Step2 提取目标实体的时间序列数据，计算平均值，并构建包含比较结果的结构化 DataFrame。

target_entity = 'Target_Entity' # 占位示例，如 'US'

# 提取目标行数据 (假设第0列为实体名称)
target_row = df[df[0] == target_entity]

# 提取时间标签和对应数值 (假设第6行为表头，1:10列为数据)
time_labels = df.iloc[6, 1:10].tolist()
target_values = target_row.iloc[0, 1:10].tolist()
target_values_numeric = [float(v) for v in target_values]

# 计算平均值
avg_value = sum(target_values_numeric) / len(target_values_numeric)

...

$install

1-click copy

npx skills add OpenSenseNova/SenseNova-Skills --skill threshold-cell-coloring

Safety assessment

★

Clarity score

How clear and easy to understand the SKILL.md instructions are, rated from 1 to 5.

2/ 5

fair

The main idea is there, but the wording is messy and easy to misinterpret.

◎

Actionability score

How directly an agent can act on the SKILL.md instructions, rated from 1 to 5.

2/ 5

low

Some hints are present, but an agent still has to guess many steps.

~community cookbook

~you might also like

view all →

duplicate-value-coloring

★70

testing#excel

[✓]from @OpenSenseNova

[✓]

对比Excel多表中的特定系数并对异常值进行颜色标记。

April 30, 2026

◧ Compare

sn-search-academic

★70

documentation#arxiv

[✓]from @OpenSenseNova

[✓]

搜索学术论文和百科知识：ArXiv 预印本、Semantic Scholar（含引用数）、PubMed 生医文献、Wikipedia 百科。支持按章节读取 ArXiv HTML 全文和 PMC 开放获取全文，适合学术调研和深度阅读。

April 30, 2026

◧ Compare

numeric-format-normalization

★70

testing#excel

[✓]from @OpenSenseNova

[✓]

对 Excel 数据进行数值格式标准化与清洗，支持大规模数据的 Parquet 转换流程，并完成关键指标的合计核对与结果文件导出。

April 30, 2026

◧ Compare

bar-chart-visualization

★70

testing#excel

[✓]from @OpenSenseNova

[✓]

读取多工作表Excel文件，自动处理合并单元格与数据清洗，进行交叉分组统计并生成带总计行的结果表，最后绘制支持中英文字体的美化柱状图，适用于多维度数据汇总与可视化分析。

April 30, 2026

◧ Compare

percentage-calculation

★70

testing#excel

[✓]from @OpenSenseNova

[✓]

根据文件行数动态切换大文件处理策略（Parquet转换），通过逐行扫描或列匹配提取关键指标并计算占比、均值等统计量，最终输出结构化Excel报告及可视化图表。

April 30, 2026

◧ Compare

kpi-metric-analysis

★70

testing#analysis

[✓]from @OpenSenseNova

[✓]

根据数据量自动选择读取策略（大文件转Parquet），提取关键指标进行单位一致性验证与排序分析，并输出可下载的结果表格。

April 30, 2026

◧ Compare

import pandas as pd import openpyxl file_path = "input_data.xlsx" # 获取所有sheet名称 wb = openpyxl.load_workbook(file_path, read_only=True) sheet_names = wb.sheetnames print("Sheet列表:", sheet_names) print("Sheet数量:", len(sheet_names)) # 统计每个sheet的行数 total_rows = 0 for name in sheet_names: df_temp = pd.read_excel(file_path, sheet_name=name, header=None) rows = len(df_temp) total_rows += rows print(f"Sheet '{name}': {rows} 行") print(f"\n总行数 = {total_rows}")

target_entity = 'Target_Entity' # 占位示例，如 'US' # 提取目标行数据 (假设第0列为实体名称) target_row = df[df[0] == target_entity] # 提取时间标签和对应数值 (假设第6行为表头，1:10列为数据) time_labels = df.iloc[6, 1:10].tolist() target_values = target_row.iloc[0, 1:10].tolist() target_values_numeric = [float(v) for v in target_values] # 计算平均值 avg_value = sum(target_values_numeric) / len(target_values_numeric)

threshold-cell-coloring

Key Features

Use Cases

Best for

Not ideal for

FAQs

Does this skill handle multiple sheets?

How does it determine if a value is below average?

Skill Steps

Safety assessment

Clarity score

Actionability score

~community cookbook

~you might also like

duplicate-value-coloring

sn-search-academic

numeric-format-normalization

bar-chart-visualization

percentage-calculation

kpi-metric-analysis

AI Skill Finder

threshold-cell-coloring

Key Features

Use Cases

Best for

Not ideal for

FAQs

Does this skill handle multiple sheets?

How does it determine if a value is below average?

Skill Steps

Safety assessment

Clarity score

Actionability score

~community cookbook

~you might also like

duplicate-value-coloring

sn-search-academic

numeric-format-normalization

bar-chart-visualization

percentage-calculation

kpi-metric-analysis