docs-xlsx
Excel Spreadsheet Parsing
Parse Excel spreadsheets (.xlsx) into CSV files, JSON metadata, and formula artifacts using multi-method extraction.
Usage
Run the parsing script directly:
./scripts/parse_xlsx.py <path_to_file.xlsx> <output_dir>
Example:
./scripts/parse_xlsx.py ~/data/workbook.xlsx ./parsed/
The script runs four extraction methods on each workbook:
- pandas (basic) - CSV exports + schema
- pandas (detailed) - Data types + statistics
- openpyxl (structure) - Workbook structure JSON
- openpyxl (formulas) - Formula extraction
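As a rough illustration of the pandas (basic) method, the sketch below writes one CSV per sheet plus a small schema file. It is not the code in parse_xlsx.py: the function name `export_sheets_basic` and the keys stored in metadata.json are assumptions made for this example.

```python
import json
from pathlib import Path

import pandas as pd


def export_sheets_basic(xlsx_path, out_dir):
    """Write one CSV per sheet and a metadata.json with a simple schema.

    Hypothetical helper: the metadata keys are illustrative only.
    """
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)

    # sheet_name=None makes pandas return a dict of {sheet name: DataFrame}.
    sheets = pd.read_excel(xlsx_path, sheet_name=None, engine="openpyxl")

    metadata = {}
    for name, frame in sheets.items():
        frame.to_csv(out_dir / f"{name}.csv", index=False)
        metadata[name] = {
            "rows": int(frame.shape[0]),
            "columns": [str(col) for col in frame.columns],
            "dtypes": {str(col): str(dtype) for col, dtype in frame.dtypes.items()},
        }

    (out_dir / "metadata.json").write_text(json.dumps(metadata, indent=2))
    return metadata
```

Reading all sheets in one call keeps the per-sheet loop simple, at the cost of holding the whole workbook in memory.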
Output Structure
output_dir/
├── file.xlsx/
│   ├── parsing_summary.json
│   ├── pandas_basic/
│   │   ├── Sheet1.csv
│   │   ├── Sheet2.csv
│   │   └── metadata.json
│   ├── pandas_detailed/
│   │   ├── Sheet1.csv
│   │   ├── Sheet2.csv
│   │   ├── statistics.json
│   │   └── data_types.json
│   ├── openpyxl_structure/
│   │   └── workbook.json
│   └── openpyxl_formulas/
│       └── formulas.json
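To check a run against this layout, a short sketch like the following can list the files it implies. The helper `expected_outputs` is hypothetical, and it assumes the top-level directory is named after the input file, as `file.xlsx/` in the tree suggests.

```python
from pathlib import Path

# Fixed files per method, copied from the tree above.
# The per-sheet CSV names depend on the workbook.
METHOD_FILES = {
    "pandas_basic": ["metadata.json"],
    "pandas_detailed": ["statistics.json", "data_types.json"],
    "openpyxl_structure": ["workbook.json"],
    "openpyxl_formulas": ["formulas.json"],
}


def expected_outputs(xlsx_path, output_dir, sheet_names):
    """Return the paths the documented layout implies for one workbook."""
    # Assumption: the per-workbook directory reuses the input file name.
    root = Path(output_dir) / Path(xlsx_path).name
    paths = [root / "parsing_summary.json"]
    for method, files in METHOD_FILES.items():
        if method.startswith("pandas"):
            # Only the pandas methods export one CSV per sheet.
            paths += [root / method / f"{sheet}.csv" for sheet in sheet_names]
        paths += [root / method / name for name in files]
    return paths
```

Comparing this list with `Path.exists()` after a run gives a quick check that every method produced its files.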
Script Features
- Exports each sheet as a separate CSV file
- Captures data types and statistics
- Extracts formulas with cell references
- Handles multiple sheets per workbook
- Provides schema information for each sheet
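Formula extraction with cell references can be sketched with openpyxl as below. This is a minimal illustration, not the script's own code: the function name `extract_formulas` and the returned shape (sheet title mapped to {cell reference: formula text}) are assumptions, and array formulas are not handled.

```python
from openpyxl import load_workbook


def extract_formulas(xlsx_path):
    """Map each sheet title to {cell reference: formula text}.

    Hypothetical helper; only plain string formulas are collected.
    """
    # data_only=False (the default) keeps formula text rather than cached values.
    workbook = load_workbook(xlsx_path, data_only=False)

    formulas = {}
    for sheet in workbook.worksheets:
        found = {}
        for row in sheet.iter_rows():
            for cell in row:
                # openpyxl marks formula cells with data_type "f".
                if cell.data_type == "f" and isinstance(cell.value, str):
                    found[cell.coordinate] = cell.value
        formulas[sheet.title] = found
    return formulas
```

Loading with `data_only=True` instead would return the last values Excel cached, which is why formulas and values usually need separate passes.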