lovart-skill
English | 简体中文 | 繁體中文 | 日本語
AI Agent Skills for Lovart — generate images, videos, and audio from your AI coding assistant.
This skill connects OpenClaw (and other AI coding assistants) to Lovart's Agent OpenAPI, enabling:
- 🖼️ Image generation — posters, logos, illustrations, banners, mockups, etc.
- 🎬 Video generation — clips, animations, product videos
- 🎵 Audio generation — BGM, songs, sound effects
- ✂️ Image/video editing — upscale, reframe, style transfer
- 🧊 3D generation — 3D models from text or images
- 📁 Project & thread management — multi-project support with local state persistence
npx skills add lovartai/lovart-skillThen set your environment variables:
export LOVART_ACCESS_KEY="ak_xxx"
export LOVART_SECRET_KEY="sk_xxx"Get your AK/SK from the Lovart platform (Avatar menu -> AK/SK Management).
🎉 That's it! The skill files will be added to your project. Your AI Agent will auto-discover and invoke them — no manual script execution needed.
# Generate an image
python3 agent_skill.py chat --prompt "a cyberpunk cat in neon city" --json --download
# Generate a video
python3 agent_skill.py chat --prompt "ocean waves crashing on rocks, cinematic" --json --download
# Generate BGM
python3 agent_skill.py chat --prompt "lofi hip-hop, chill, study vibes" --json --download| Command | Description |
|---|---|
chat |
Send prompt, wait for completion, return all results at once. Main command. |
watch |
Send prompt and stream artifacts as they complete (NDJSON, incremental delivery) |
send |
Send prompt without waiting (returns thread_id immediately) |
confirm |
Confirm a pending high-cost operation (e.g. video), then wait |
result |
Get results for a thread |
status |
Check thread status |
| Command | Description |
|---|---|
projects |
List all projects |
project-add |
Add and switch to a project |
project-switch |
Switch active project (supports prefix match) |
project-rename |
Rename a project |
project-remove |
Remove a project and its threads |
create-project |
Create a new empty project on the server |
| Command | Description |
|---|---|
config |
View/update local settings (~/.lovart/state.json) |
threads |
List saved conversation threads |
set-mode |
Switch between fast (credits) / unlimited (queue) mode |
query-mode |
Check current generation mode |
| Command | Description |
|---|---|
upload |
Upload a local file to CDN (returns URL) |
upload-artifact |
Upload a URL artifact to a project |
download |
Download artifacts from URLs |
# Use an existing project
python3 agent_skill.py chat --project-id PROJECT_ID --prompt "draw a cat" --json --download
# Continue a conversation (thread reuse preserves context)
python3 agent_skill.py chat --thread-id THREAD_ID --prompt "make it blue" --json --download
# Stream artifacts as they complete (NDJSON, for multi-image/video requests)
python3 agent_skill.py watch --prompt "generate 4 variations of a cyberpunk cat"
# Edit with reference image
python3 agent_skill.py upload --file photo.jpg
python3 agent_skill.py chat --prompt "change the style to watercolor" --attachments "CDN_URL" --json --download
# Prefer a specific model
python3 agent_skill.py chat --prompt "draw a cat" \
--prefer-models '{"IMAGE":["generate_image_midjourney"]}' --json --download
# Force a specific tool (e.g. upscale instead of re-generate)
python3 agent_skill.py chat --prompt "upscale this image" \
--include-tools upscale_image --attachments "IMAGE_URL" --json --download
# Thinking mode — deep structured reasoning for complex requests
python3 agent_skill.py chat --prompt "design a brand identity for a coffee startup" \
--mode thinking --json --download
# Project management
python3 agent_skill.py projects
python3 agent_skill.py project-add --project-id NEW_ID --name "My Brand Kit"
python3 agent_skill.py project-switch --project-id NEW_ID
python3 agent_skill.py threadsYou can control which model the Agent uses in three ways:
- In the prompt (simple) —
"generate ocean waves video using kling" --prefer-models(soft preference) —'{"IMAGE":["generate_image_midjourney"]}'--include-tools(hard constraint) —upscale_image
Available models:
| Category | Tool name | Display name | Premium |
|---|---|---|---|
| IMAGE | generate_image_gpt_image_2 |
GPT Image 2 Auto | |
| IMAGE | generate_image_gpt_image_2_low |
GPT Image 2 Low | |
| IMAGE | generate_image_gpt_image_2_medium |
GPT Image 2 Medium | |
| IMAGE | generate_image_gpt_image_2_high |
GPT Image 2 High | |
| IMAGE | generate_image_nano_banana_pro |
Nano Banana Pro | |
| IMAGE | generate_image_nano_banana_2 |
Nano Banana 2 | |
| IMAGE | generate_image_gpt_image_1_5 |
GPT Image 1.5 | |
| IMAGE | generate_image_seedream_v5 |
Seedream 5.0 Lite | |
| IMAGE | generate_image_luma_uni_1 |
Luma uni-1 | |
| IMAGE | generate_image_luma_uni_1_max |
Luma uni-1-max | |
| IMAGE | generate_image_flux_2_max |
Flux.2 Max | |
| IMAGE | generate_image_flux_2_pro |
Flux.2 Pro | |
| IMAGE | generate_image_seedream_v4_5 |
Seedream 4.5 | |
| IMAGE | generate_image_nano_banana |
Nano Banana | |
| IMAGE | generate_image_seedream_v4 |
Seedream 4 | |
| IMAGE | generate_image_midjourney |
Midjourney | |
| VIDEO | generate_video_seedance_v2_0 |
Seedance 2.0 | ⭐ Premium |
| VIDEO | generate_video_seedance_v2_0_fast |
Seedance 2.0 Fast | ⭐ Premium |
| VIDEO | generate_video_kling_v3 |
Kling 3.0 | ⭐ Premium |
| VIDEO | generate_video_kling_v3_omni |
Kling 3.0 Omni | ⭐ Premium |
| VIDEO | generate_video_seedance_pro_v1_5 |
Seedance 1.5 Pro | |
| VIDEO | generate_video_kling_v2_6 |
Kling 2.6 | ⭐ Premium |
| VIDEO | generate_video_wan_v2_6 |
Wan 2.6 | |
| VIDEO | generate_video_veo3_1 |
Veo 3.1 | ⭐ Premium |
| VIDEO | generate_video_veo3_1_fast |
Veo 3.1 Fast | ⭐ Premium |
| VIDEO | generate_video_kling_omni_v1 |
Kling O1 | ⭐ Premium |
| VIDEO | generate_video_hailuo_v2_3 |
Hailuo 2.3 | |
| VIDEO | generate_video_veo3 |
Veo 3 | ⭐ Premium |
| VIDEO | generate_video_vidu_q2 |
Vidu Q2 | |
| 3D | generate_3d_tripo |
Tripo |
Control how the agent thinks per request via --mode:
fast(default) — lightweight single-pass response. Faster, cheaper, suitable for simple one-shot generations.thinking— deep structured reasoning with planning and multi-step analysis. Use for complex brand systems, multi-asset campaigns, anything that benefits from deliberate planning. Slower but higher quality.
# Quick, single-shot (default)
python3 agent_skill.py chat --prompt "draw a cat"
# Deliberate, plan-first reasoning
python3 agent_skill.py chat --prompt "design a full brand identity" --mode thinkingMode is locked to the thread on its first message. To switch modes, start a new thread (omit --thread-id). Mirrors the Lovart web UI toggle.
Separate from reasoning mode. This is a persistent account-level billing setting:
# Fast — costs credits, no queue
python3 agent_skill.py set-mode --fast
# Unlimited — free, may queue
python3 agent_skill.py set-mode --unlimited
# Check current
python3 agent_skill.py query-modeThe API enforces per-account request frequency limits, split into two tiers based on the endpoint you hit:
| Tier | Endpoints | Per minute | Per hour |
|---|---|---|---|
| Chat (write) | /chat, /chat/confirm |
60 | 600 |
| Query (read) | /chat/status, /chat/result, /project/*, /mode/*, everything else |
300 | 3000 |
The stricter Chat tier protects generation. The Query tier is much looser so polling for status/results doesn't eat into your generation budget.
Exceeding a limit returns HTTP 429 with Retry-After: 60.
This is separate from generation concurrency — each thread can only run one generation task at a time. If a task is already running in a thread, new requests to that thread are rejected with HTTP 409 until it finishes. You can run tasks in different threads concurrently.
The skill auto-retries on transient network errors (3 attempts with backoff), but rate limit and billing errors are returned immediately.
Settings and thread history are persisted at ~/.lovart/state.json:
{
"active_project": "abc123...",
"projects": {
"abc123...": {"name": "My Project", "created_at": "..."}
},
"threads": [
{"id": "xxx", "project_id": "abc123...", "topic": "cyberpunk cat", "updated_at": "..."}
]
}npx skills add lovartai/lovart-skillThis skill is designed as a first-class OpenClaw skill. After installation, the AI Agent will auto-discover and invoke it — no extra configuration needed beyond setting the env vars.
The skill also works with Claude Code, Cursor, and any assistant that can invoke Python scripts. See SKILL.md for the full integration contract.
lovart-skill/
├── README.md
├── README_CN.md
├── README_TW.md
├── README_JA.md
└── skills/
└── lovart-skill/
├── SKILL.md # Skill contract (OpenClaw spec)
└── agent_skill.py # Python client (zero dependencies)
- Local state file: The skill reads/writes
~/.lovart/state.jsonto persist your active project and recent thread IDs. No other files are accessed. - Outbound calls: Only talks to the Lovart API (
https://lgw.lovart.ai) and Lovart CDN (for downloading your own generated artifacts). No third-party services. - API keys: AK/SK are read from env vars (
LOVART_ACCESS_KEY/LOVART_SECRET_KEY) and signed with HMAC-SHA256 per request. Keys are never logged or persisted to disk. - TLS: SSL certificate verification is enabled by default. Set
LOVART_INSECURE_SSL=1to disable (only if you're behind a corporate proxy/VPN that intercepts TLS). - Source code:
skills/lovart-skill/agent_skill.pyis ~900 lines of pure Python standard library — you're encouraged to read it before installing.
User -> OpenClaw / Claude Code / other AI assistant
-> agent_skill.py (this skill)
-> Lovart OpenAPI (AK/SK HMAC-SHA256 auth)
-> Lovart AI Agent (model selection, orchestration)
-> Generated images / videos / audio
Contributions are welcome! Feel free to:
- Open an issue to report bugs or suggest features
- Submit a pull request to fix issues or add improvements