pptx
PPTX Skill
Choosing an approach
| Objective | Technique | Reference |
|---|---|---|
| Extract text or data | python -m markitdown presentation.pptx |
Also: slides.py thumbnail for visual grid |
| Modify an existing file or template | Unpack to XML, edit, repack | See EDITING.md |
| Generate a deck from scratch | JavaScript with pptxgenjs |
See CREATING.md |
Pre-installed sandbox packages: markitdown[pptx], Pillow, pptxgenjs (Node), react-icons + react + react-dom + sharp (icon rendering), LibreOffice (soffice), Poppler (pdftoppm).
Math and Equations
Render equations with Unicode math symbols only. Do not use OMML or generate equation images — LibreOffice cannot display either during visual QA.
More from kortix-ai/suna
logo-creator
Create professional logos through an intelligent, iterative design process. Use this skill when the user wants to create a logo, icon, favicon, brand mark, wordmark, or any visual brand identity mark. Triggers on: 'create a logo', 'design a logo', 'make me a logo', 'logo for my brand', 'I need a logo', 'brand mark', 'wordmark', 'logomark', 'icon design', 'favicon'. This is NOT a one-shot image generator — it researches, strategizes, generates symbols with AI, visually inspects every output, then programmatically composes them with real Google Fonts typography into complete logo systems (logomark, wordmark, combination marks in multiple layouts).
2agent-browser
Browser automation CLI for AI agents. Use when the user needs to interact with websites, including navigating pages, filling forms, clicking buttons, taking screenshots, extracting data, testing web apps, or automating any browser task. Triggers include requests to "open a website", "fill out a form", "click a button", "take a screenshot", "scrape data from a page", "test this web app", "login to a site", "automate browser actions", or any task requiring programmatic web interaction.
2webapp
Use for fullstack web apps built with the bundled Express, Vite, React, Tailwind, and Drizzle template.
1computer-use
Control desktop applications on the user's machine using agent-click. Use when you need to click buttons, type text, read screens, scroll, drag files, move/resize windows, open/quit apps, interact with UI elements, or automate desktop workflows. Triggers: 'click on', 'open app', 'type into', 'scroll down', 'drag file', 'take screenshot', 'read the screen', 'interact with UI', 'desktop automation', 'computer use', 'agent-click'. Built on agent-click (https://github.com/kortix-ai/agent-click, https://www.agent-click.dev/) — an open-source computer use CLI by Kortix. Right now only works on macOS.
1presentations
Create, manage, validate, preview, and export HTML presentation slides (1920x1080). Load this skill when you need to build a slide deck, export to PDF/PPTX, or preview slides in a browser.
1