Mountsea AI API Introduction

Discover all you can do with our Mountsea AI API! Our comprehensive AI platform empowers developers to integrate advanced AI capabilities into their applications — including video generation, image creation, music production, and multi-model LLM chat.

Overview

Mountsea AI provides a unified API gateway to access the world’s leading AI models across video, image, music, and language domains. We offer six core services:

Google (Gemini)

Video & image generation powered by Veo 2 / Veo 3 / Veo 3.1 and Nano Banana models

Sora2

OpenAI’s Sora-2 / Sora-2-Pro text-to-video generation with character roles

XAI (Grok)

Image & video generation powered by xAI’s Grok models

Suno

Full-suite AI music generation, voice persona, custom model training, and audio processing via Suno AI

ElevenLabs

AI music generation with composition plans, video scoring, and stem separation via ElevenLabs

Producer

AI music generation powered by Google DeepMind’s Lyria 3 Pro model

Chat

Multi-protocol AI chat gateway supporting OpenAI, Claude, and Gemini APIs

Why Mountsea AI Stands Out

All-in-One API Platform

A single API key gives you access to multiple AI services across different domains — no need to manage separate accounts and credentials for each AI provider.

Multi-Protocol LLM Gateway

Our Chat service supports OpenAI Compatible API, Anthropic (Claude) API, OpenAI Responses API, and Google Gemini Native API. Use official SDKs directly — just change the base_url.

Comprehensive AI Model Coverage

Access top-tier AI models from leading providers:

Domain	Models
Video	Veo 2, Veo 3, Veo 3.1, Sora-2, Sora-2-Pro, Grok Imagine Video
Image	Nano Banana Fast/Pro/2, Grok Imagine Image
Music	Suno (chirp-v35 ~ chirp-v55, custom models), ElevenLabs music_v1, Lyria 3 Pro
LLM	GPT-5.1, GPT-5.2, Claude 4.5, Claude Opus 4.6, Claude Sonnet 4.6, Gemini 2.5/3/3.1

Budget-Friendly Excellence

Enjoy premium AI tools at competitive prices. Our platform delivers high-quality, scalable solutions while keeping your projects cost-effective.

Available Services

🎬 Google (Gemini) — Video & Image Generation

Video Generation — Create videos from text prompts (text2video), images (img2video), or reference ingredients (ingredients2video) using Veo 2 / Veo 3 / Veo 3.1 models
Video Upsampling — Upscale to 1080p or 4K resolution, or generate GIF previews
Video Reshoot — Re-generate video with different camera motions (pan, zoom, dolly, etc.)
Video Object Editing — Insert or remove objects from videos using masks
Prompt Expansion — Expand simple prompts into detailed cinematic descriptions
Image Generation — Create and edit images using Nano Banana models (Fast / Pro / 2) with support for multiple aspect ratios and up to 4K resolution

Explore Gemini Documentation →

🎥 Sora2 — OpenAI Video Generation

Text-to-Video Generation — Create high-quality videos from text prompts using Sora-2 or Sora-2-Pro models
Image-to-Video — Use reference images to guide video generation
Character Roles — Create custom characters from short video clips, then reference them in prompts with @character_name
Style Presets — Choose from styles like anime, retro, comic, vintage, and more
Flexible Video Formats — Landscape/portrait orientation, variable durations (10s/15s/25s), optional watermark removal

Explore Sora2 Documentation →

🖼️ XAI (Grok) — Image & Video Generation

Text-to-Image — Generate high-quality images from text prompts using grok-imagine-image, with customizable aspect ratios (1:1, 2:3, 3:2, 9:16, 16:9)
Image-to-Image — Provide a reference image to guide generation — output size follows the reference
Text-to-Video — Generate videos from text prompts using grok-imagine-video, with control over duration (6s/10s/15s), aspect ratio, and resolution (480P/720P)
Image-to-Video — Use a reference image to guide video generation — aspect ratio and resolution follow the reference automatically
Async Task System — All generation requests return a taskId for polling

Explore XAI Documentation →

🎵 Suno — Music Generation

Music Generation — Create, extend, cover, mashup, sample, and generate inspiration-based tracks using 15 task types via a unified /generate endpoint, with models from chirp-v35 to chirp-v55
Sound Effects — Generate one-shot or looped sound effects from text descriptions
Lyrics Generation — Generate original lyrics or mashup lyrics from two songs
Voice Persona — Create verified voice personas from your own recordings through a single-task two-phase voice verification flow (init → await phrase → complete)
Custom Models — Train personalized music models on your own audio (6+ training clips), then use chirp-custom:<uuid> for generation
Audio Processing — Concat clips, remaster tracks (V4.5+/V5 models), adjust playback speed with pitch preservation
Stem Separation — Separate tracks into vocals + instrumental (two-track) or all individual stems
Audio Export — Export to MP4 (with visualizer), lossless WAV, or MDI (MIDI) format
Audio Analysis — Get synchronized lyrics timeline, downbeat detection, and enhanced style tags
Vocal Persona — Extract vocal characteristics from clips and create reusable personas for consistent vocal style

Explore Suno Documentation →

🎶 ElevenLabs — AI Music Generation

Text-to-Music — Generate music from simple text prompts or structured composition plans with section-level style and lyrics control
Composition Plan — AI-generated structured plans with sections, global/local styles, and lyrics — free, no credits consumed
Video to Music — Automatically generate background music that matches your video content (up to 10 videos, 600s total)
Stem Separation — Split audio into 2 tracks (vocals + instrumental) or 6 individual stems
Inpainting — Edit specific sections of existing songs (enterprise only)
Multiple Output Formats — MP3, PCM, Opus with configurable sample rates and bitrates

Available models: music_v1 Explore ElevenLabs Documentation →

🎧 Producer — AI Music Generation

Create Music — Generate original tracks from sound prompts, lyrics, and images
Image-Guided Generation — Use images to influence the mood and style of generated music
Instrumental Mode — Generate without vocals
Stem Separation — Separate audio into individual stems (vocals, drums, bass, etc.)
Multi-Format Export — Download as MP3/M4A/WAV audio or generate video with preset visualizers

Available models: Lyria 3 Pro Explore Producer Documentation →

💬 Chat — Multi-Protocol AI Gateway

A unified gateway supporting multiple API protocols:

Protocol	Base URL	Description
OpenAI Compatible	`https://api.mountsea.ai/chat`	Drop-in replacement for OpenAI Chat Completions API
OpenAI Responses	`https://api.mountsea.ai/chat`	OpenAI Responses API format
Anthropic (Claude)	`https://api.mountsea.ai/chat/claude`	Claude Code & Anthropic SDK compatible
Gemini Native	`https://api.mountsea.ai/chat/gemini`	Google Gemini Native API format

✅ Use official SDKs (OpenAI, Anthropic, Google GenAI) directly
✅ Full streaming, function calling, and tool support
✅ Compatible with Claude Code, Cursor, Cherry Studio, and other AI tools
✅ Access GPT-5.1/5.2, Claude 4.5/Opus/Sonnet, Gemini 2.5/3/3.1 models

Explore Chat Documentation →

How to Get Started

Get Your API Key

Choose Your Service

Select the service that fits your needs — video, image, music, or chat.

Make Your First API Call

Use Authorization: Bearer your-api-key in your request header and call the appropriate endpoint.

Track & Download Results

For async tasks (video/music), poll the task status endpoint until complete, then download your content.

Check out our Quick Start Guide for step-by-step examples.

API Base URL

All API requests are made to:

https://api.mountsea.ai

Exception: The Claude (Anthropic) compatible API uses https://api.mountsea.ai/chat/claude as the base URL.

Get Started Today

Ready to integrate AI capabilities into your applications?

📖 Check out our Quick Start Guide
🎬 Explore Gemini Video & Image API
🎥 Explore Sora2 Video API
🖼️ Explore XAI (Grok) Image & Video API
🎵 Explore Suno Music API
🎶 Explore ElevenLabs Music API
🎧 Explore Producer Music API
💬 Explore Chat LLM Gateway
📞 Need help? Contact us

Transform your creative projects with the power of AI. Start building amazing applications today!

Getting started

​Mountsea AI API Introduction

​Overview

Google (Gemini)

Sora2

XAI (Grok)

Suno

ElevenLabs

Producer

Chat

​Why Mountsea AI Stands Out

​All-in-One API Platform

​Multi-Protocol LLM Gateway

​Comprehensive AI Model Coverage

​Budget-Friendly Excellence

​Available Services

​🎬 Google (Gemini) — Video & Image Generation

​🎥 Sora2 — OpenAI Video Generation

​🖼️ XAI (Grok) — Image & Video Generation

​🎵 Suno — Music Generation

​🎶 ElevenLabs — AI Music Generation

​🎧 Producer — AI Music Generation

​💬 Chat — Multi-Protocol AI Gateway

​How to Get Started

​API Base URL

​Get Started Today

Mountsea AI API Introduction

Overview

Why Mountsea AI Stands Out

All-in-One API Platform

Multi-Protocol LLM Gateway

Comprehensive AI Model Coverage

Budget-Friendly Excellence

Available Services

🎬 Google (Gemini) — Video & Image Generation

🎥 Sora2 — OpenAI Video Generation

🖼️ XAI (Grok) — Image & Video Generation

🎵 Suno — Music Generation

🎶 ElevenLabs — AI Music Generation

🎧 Producer — AI Music Generation

💬 Chat — Multi-Protocol AI Gateway

How to Get Started

API Base URL

Get Started Today