foundation-models
FoundationModels — On-Device LLM
Apple's framework for on-device generative AI. No cloud, no API keys, full privacy.
Critical Constraints
- ❌ DO NOT use
response.output→ ✅ Useresponse.contentto access generated values - ❌ DO NOT skip availability check → ✅ Always check
SystemLanguageModel.default.availabilityfirst - ❌ DO NOT exceed 4,096 tokens per session → ✅ Break large tasks into multiple sessions
- ❌ DO NOT send concurrent requests on same session → ✅ Check
session.isRespondingfirst - ❌ DO NOT confuse with OpenAI/Anthropic APIs → ✅ This is Apple's native framework, different API surface
Availability Check (Required)
import FoundationModels
let model = SystemLanguageModel.default
switch model.availability {
case .available:
// Show AI features
case .unavailable(.deviceNotEligible):
// Device doesn't support Apple Intelligence
case .unavailable(.appleIntelligenceNotEnabled):
// User needs to enable in Settings
case .unavailable(.modelNotReady):
// Model downloading or not ready
case .unavailable(let other):
// Other reason
}
Basic Session & Response
let session = LanguageModelSession()
let response = try await session.respond(to: "What's a good month to visit Paris?")
print(response.content) // ← ALWAYS .content, never .output
Session with Instructions
let instructions = """
You are a cooking assistant.
Provide recipe suggestions based on ingredients.
Keep suggestions brief and practical.
"""
let session = LanguageModelSession(instructions: instructions)
let response = try await session.respond(to: "I have chicken, rice, and broccoli")
print(response.content)
Guided Generation (@Generable)
Receive structured Swift data instead of raw strings.
@Generable(description: "Profile information about a cat")
struct CatProfile {
var name: String
@Guide(description: "The age of the cat", .range(0...20))
var age: Int
@Guide(description: "One sentence personality profile")
var profile: String
}
let session = LanguageModelSession()
let response = try await session.respond(
to: "Generate a cute rescue cat",
generating: CatProfile.self
)
print(response.content.name) // ← .content, not .output
print(response.content.age)
print(response.content.profile)
Collection with Count Constraint
@Generable
struct CookbookSuggestions {
@Guide(description: "Cookbook Suggestions", .count(3))
var suggestions: [String]
}
let response = try await session.respond(
to: "What's a good name for a cooking app?",
generating: CookbookSuggestions.self
)
print(response.content.suggestions)
Snapshot Streaming
Stream partially-generated structured output. @Generable produces a PartiallyGenerated type with optional properties.
@Generable
struct TripIdeas {
@Guide(description: "Ideas for upcoming trips")
var ideas: [String]
}
let session = LanguageModelSession()
let stream = session.streamResponse(
to: "What are some exciting trip ideas?",
generating: TripIdeas.self
)
for try await partial in stream {
// partial.ideas is [String]? — fills in as tokens generate
print(partial)
}
SwiftUI Integration with Streaming
struct StreamingView: View {
@State private var partial: TripIdeas.PartiallyGenerated?
var body: some View {
VStack {
if let ideas = partial?.ideas {
ForEach(ideas, id: \.self) { Text($0) }
}
}
.task { await streamIdeas() }
}
func streamIdeas() async {
let session = LanguageModelSession()
let stream = session.streamResponse(
to: "Trip ideas for 2025",
generating: TripIdeas.self
)
do {
for try await snapshot in stream {
partial = snapshot
}
} catch { print(error) }
}
}
Tool Calling
struct RecipeSearchTool: Tool {
struct Arguments: Codable {
var searchTerm: String
var numberOfResults: Int
}
func call(arguments: Arguments) async throws -> ToolOutput {
let recipes = await searchRecipes(term: arguments.searchTerm, limit: arguments.numberOfResults)
return .string(recipes.map { "- \($0.name): \($0.description)" }.joined(separator: "\n"))
}
}
let session = LanguageModelSession(tools: [RecipeSearchTool()])
let response = try await session.respond(to: "Find me some pasta recipes")
// Error handling
do {
let answer = try await session.respond("Find a recipe for tomato soup.")
} catch let error as LanguageModelSession.ToolCallError {
print(error.tool.name)
print(error.underlyingError)
}
Generation Options
let options = GenerationOptions(temperature: 2.0) // Higher = more creative
let response = try await session.respond(to: prompt, options: options)
Session Transcript
let transcript = session.transcript // View model actions during session
Context Limits
- 4,096 tokens per session (~12K-16K English characters)
- Instructions + prompts + outputs all count
- For large data: break into chunks across multiple sessions
- Error on overflow:
LanguageModelSession.GenerationError.exceededContextWindowSize
Common Mistakes & Fixes
| Mistake | Fix |
|---|---|
response.output |
response.content — always use content |
| Skipping availability check | Always check SystemLanguageModel.default.availability |
| Sending request while session is busy | Check session.isResponding first |
| Expecting cloud model quality | On-device model is smaller; keep prompts focused and simple |
| Trying to use on Simulator | Requires Apple Silicon device with Apple Intelligence enabled |
References
More from makgunay/claude-swift-skills
macos-app-structure
macOS application architecture patterns covering App protocol (@main), Scene types (WindowGroup, Window, Settings, MenuBarExtra), multi-window management, NSApplicationDelegateAdaptor for AppKit lifecycle hooks, Info.plist configuration (LSUIElement for menu bar apps, NSAccessibilityUsageDescription), entitlements for sandbox/hardened runtime, and project structure conventions. Use when scaffolding a new macOS app, configuring scenes and windows, setting up menu bar apps, or resolving macOS-specific lifecycle issues. Corrects the common LLM mistake of generating iOS-only app structures.
31macos-permissions
macOS permission handling for Accessibility (AXIsProcessTrusted), Screen Recording, Full Disk Access, input monitoring, camera, microphone, location, and contacts. Covers TCC (Transparency Consent and Control) database, graceful degradation when permissions are denied, permission prompting patterns, opening System Settings to the correct pane, detecting permission changes, and the privacy manifest (PrivacyInfo.xcprivacy) requirement. Use when implementing features that require system permissions, building permission onboarding flows, or handling denied permissions gracefully.
17tech-stack-validator
Validates and recommends technology stacks for native macOS/iOS app projects against PRD and architecture requirements. Reads the PRD and architecture documents (or gathers requirements interactively), then systematically checks every technology choice for: OS version availability, framework capability gaps, performance feasibility, distribution compatibility (sandbox vs direct), API deprecation risks, dependency conflicts, and timeline realism for the team size. Produces a structured validation report with go/no-go verdicts, risk flags, and alternative recommendations. Use when user says things like 'validate my tech stack', 'check if this architecture works', 'what should I build this with', 'is SwiftData the right choice', 'can I ship this on the App Store', 'review my architecture', or before starting implementation of any PRD. Also use when migrating between tech stacks or evaluating whether to adopt a new framework.
9swift-lang
Swift 6.2 language patterns and critical corrections for LLM code generation. Covers the concurrency paradigm shift (default MainActor isolation, @concurrent for background work, isolated conformances, nonisolated opt-out), @Observable vs ObservableObject, structured concurrency, modern error handling, property wrappers, InlineArray, Span, and value generics. Use whenever generating Swift code, especially async/concurrent code, class/struct design, or performance-critical paths. This skill prevents the most common LLM mistakes in Swift code generation.
8cross-platform
Patterns for sharing code between macOS and iOS in SwiftUI apps. Covers project structure (70% shared / 15% macOS / 15% iOS), platform abstraction via protocols and #if os() conditional compilation, adaptive navigation (NavigationSplitView on Mac/iPad → NavigationStack on iPhone), shared components with platform styling, iOS-specific extensions (custom keyboard extension, interactive widgets, share extension, action extension, Control Center widget, lock screen widget), App Groups for data sharing with extensions, CloudKit sync monitoring, JSON export/import, schema versioning and migration, URL scheme deep linking, and the full macOS→iOS migration checklist. Use when building apps that target both macOS and iOS, when adding iOS support to a macOS app, when building widgets or keyboard extensions, or when setting up iCloud sync with SwiftData.
8macos-distribution
macOS app distribution covering code signing (Developer ID, App Store certificates), notarization (notarytool), DMG/pkg creation, App Store submission workflow, sandboxing entitlements, StoreKit for in-app purchases and subscriptions (SubscriptionOfferView, appTransactionID, Transaction.currentEntitlements, subscriptionStatusTask), StoreKit testing with local configuration files, PrivacyInfo.xcprivacy manifest, and dual distribution strategies (App Store + direct). Use when preparing an app for distribution, implementing purchases/subscriptions, configuring signing, or troubleshooting App Store rejection.
8