arxiv-package
arXiv Submission Packager
Purpose
Assemble a TeX/LaTeX project into a clean, correctly structured .tar.gz or
.zip archive ready for arXiv upload. Handles file selection, artifact
exclusion, directory structure, 00README generation, and validation of the
final package.
Companion skills:
arxiv-preflight— validate before packagingarxiv-figures— optimize figures before packaging
Workflow
1. Inventory
Scan the project directory. Classify every file:
INCLUDE — Required
- Main
.texfile(s) - All
\input/\includetargets (recursively trace) - All
\includegraphicstargets .bblfiles (pre-processed bibliography).indfiles (pre-processed index).gls/.nlsfiles (glossary/nomenclature)- Custom
.sty,.cls,.bstfiles not in TeX Live 00README.XXX(if present)anc/directory contents (ancillary files)
EXCLUDE — Build Artifacts
.aux,.log,.toc,.lot,.lof,.out,.nav,.snm,.vrb.dvi,.ps,.pdf(except figure PDFs and ancillary PDFs).synctex,.synctex.gz,.fls,.fdb_latexmk.blg,.ilg,.glg,.nlg(log files from bib/index/glossary processing)*.bak,*~,*.swp,*.swo(editor backups).DS_Store,Thumbs.db,desktop.ini(OS artifacts)
EXCLUDE — Should Not Submit
- Journal templates / style guides (unless custom
.styneeded for compilation) - Referee letters, cover letters
- Build scripts (
Makefile,latexmkrc) — not needed by arXiv .git/,.svn/, version control directories- IDE configs (
.vscode/,.idea/,.texpadtmp/)
FLAG — Needs Decision
.bibfiles (include alongside.bblor omit — arXiv processes both)- Multiple
.texfiles with\documentclass(needs00README.XXXwithtoplevelfile) - Large data files (ancillary or exclude)
2. Generate 00README.XXX
Create or update the 00README.XXX file when needed:
# Auto-generated by arxiv-package
# Main TeX file (only if ambiguous — multiple \documentclass files)
main.tex toplevelfile
# Files to exclude from processing
cover-letter.pdf ignore
notes.txt ignore
# Optional directives
# nostamp
# nohypertex
Only generate if:
- Multiple
.texfiles contain\documentclass(declaretoplevelfile) - Files need explicit
ignoredirectives - User requests
nostampor other directives
3. Validate Structure
Before creating the archive:
- All
\input/\includetargets resolve to files in the package - All
\includegraphicstargets resolve to files in the package - All
\bibliographytargets have corresponding.bblfiles - No absolute paths in any TeX source
- No filenames with spaces or special characters
anc/directory contains no.texfiles- No hidden files (will be deleted by arXiv upon announcement)
- Total uncompressed size is reasonable (flag >50MB — may need exception request)
4. Create Archive
# From project root, create tar.gz excluding unwanted files
tar czf submission.tar.gz \
--exclude='*.aux' --exclude='*.log' --exclude='*.toc' \
--exclude='*.lot' --exclude='*.lof' --exclude='*.out' \
--exclude='*.nav' --exclude='*.snm' --exclude='*.vrb' \
--exclude='*.dvi' --exclude='*.synctex*' \
--exclude='*.fls' --exclude='*.fdb_latexmk' \
--exclude='*.blg' --exclude='*.ilg' --exclude='*.glg' \
--exclude='*.bak' --exclude='*~' --exclude='*.swp' \
--exclude='.git' --exclude='.svn' \
--exclude='.DS_Store' --exclude='Thumbs.db' \
--exclude='.vscode' --exclude='.idea' \
[list of files/directories to include]
# Or create zip
zip -r submission.zip [files] \
-x '*.aux' -x '*.log' -x '*.toc' ...
Prefer explicit file inclusion over directory-wide inclusion with exclusions. This prevents accidental inclusion of sensitive or unnecessary files.
5. Verify Archive
After creation:
- List archive contents — verify no unexpected files
- Extract to temporary directory
- Attempt compilation in the extracted directory (if TeX toolchain available)
- Report archive size (compressed and uncompressed)
- Compare file list against the include/exclude inventory
6. Report
# arXiv Package Report
**Archive:** submission.tar.gz
**Compressed size:** [size]
**Uncompressed size:** [size]
**File count:** [count]
## Contents
| File | Size | Type |
|------|------|------|
| main.tex | 45 KB | Source |
| references.bbl | 12 KB | Bibliography |
| fig1.pdf | 380 KB | Figure |
| anc/data.csv | 1.2 MB | Ancillary |
## Excluded
| File | Reason |
|------|--------|
| main.aux | Build artifact |
| main.log | Build artifact |
| referee-response.pdf | Not for submission |
## 00README.XXX
[Contents if generated]
## Verification
- [ ] Archive extracts cleanly
- [ ] All source references resolve
- [ ] Compilation succeeds (if tested)
- [ ] No sensitive files included
Archive Format Notes
.tar.gzpreferred over.zip(standard in academic TeX workflows)- arXiv accepts both
.tar(gzipped) and.zip - Do not nest archives (no
.tar.gzinside a.zip) - Files should be at root level or in logical subdirectories — no wrapper directory unless the project uses subdirectories for organization
Core Principles
- Explicit inclusion over blanket exclusion. Build the file list from what's needed, not from everything minus what's not needed. This catches files that shouldn't be submitted but aren't in the exclusion list.
- Verify after packaging. The archive is the submission. Test it, not the source directory.
- Preserve directory structure. If the project uses subdirectories for figures or sections, maintain that structure. TeX paths depend on it.
- No sensitive files. Check for
.env, credentials, personal notes, referee correspondence before packaging.
More from mathews-tom/armory
architecture-diagram
Generate layered architecture diagrams as self-contained HTML with inline SVG icons, CSS Grid containers, and connection overlays. Triggers on: "architecture diagram", "infra diagram", "system diagram", "deployment diagram", "topology", "draw architecture". NOT for architecture reviews, use architecture-reviewer.
61architecture-reviewer
Architecture reviews across 7 dimensions (structural, scalability, enterprise readiness, performance, security, ops, data) with scored reports. Triggers on: "review architecture", "critique design", "audit system", "assess scalability", "enterprise readiness", "technical due diligence". NOT for diagrams, use architecture-diagram.
59concept-to-video
Turn concepts into animated explainer videos using Manim (Python) with MP4/GIF output, audio overlay, multi-scene composition. Triggers on: "create a video", "animate this", "make an explainer", "manim animation", "motion graphic". NOT for React video, use remotion-video.
57youtube-analysis
Extract YouTube transcripts and produce structured concept analysis with multi-level summaries, key concepts, takeaways. Uses youtube-transcript-api with yt-dlp fallback. Triggers on: "analyze youtube video", "youtube transcript", "summarize this video", "extract concepts from video", "video key points", or any youtube.com/youtu.be URL.
57humanize
Detects and removes AI-generated writing patterns while preserving meaning and facts. Triggers on: "humanize text", "make this sound human", "remove AI patterns", "rewrite to sound natural", "make this less AI", "de-slop this", "not sound like ChatGPT", "human pass".
56concept-to-image
Turn concepts into static HTML visuals exported as PNG or SVG files via HTML/CSS/SVG. Triggers on: "create an image of", "export as PNG", "save as SVG", "concept to image", "screenshot this HTML". NOT for interactive HTML, use static-web-artifacts-builder.
56