Mountsea AI API Introduction

Discover all you can do with our Mountsea AI API! Our comprehensive AI platform empowers developers to integrate advanced AI capabilities into their applications, including video generation, image creation, music production, and multi-model LLM chat.

Overview

Mountsea AI provides a unified API gateway to access the world's leading AI models across video, image, music, and language domains. We offer seven core services:

Google (Gemini)

Video & image generation powered by Veo 2 / Veo 3 / Veo 3.1 and Nano Banana models

Sora2

OpenAI's Sora-2 / Sora-2-Pro text-to-video generation with character roles

XAI (Grok)

Image & video generation powered by xAI's Grok models

Suno

Full-suite AI music generation, voice persona, custom model training, and audio processing via Suno AI

ElevenLabs

AI music generation with composition plans, video scoring, and stem separation via ElevenLabs

Producer

AI music generation powered by Google DeepMind's Lyria 3 Pro model

Chat

Multi-protocol AI chat gateway supporting OpenAI, Claude, and Gemini APIs

Why Mountsea AI Stands Out

All-in-One API Platform

A single API key gives you access to multiple AI services across different domains, with no need to manage separate accounts and credentials for each AI provider.

Multi-Protocol LLM Gateway

Our Chat service supports the OpenAI Compatible API, Anthropic (Claude) API, OpenAI Responses API, and Google Gemini Native API. Use the official SDKs directly; just change the base_url.

Comprehensive AI Model Coverage

Access top-tier AI models from leading providers:
| Domain | Models |
| --- | --- |
| Video | Veo 2, Veo 3, Veo 3.1, Sora-2, Sora-2-Pro, Grok Imagine Video |
| Image | Nano Banana Fast/Pro/2, Grok Imagine Image |
| Music | Suno (chirp-v35 to chirp-v55, custom models), ElevenLabs music_v1, Lyria 3 Pro |
| LLM | GPT-5.1, GPT-5.2, Claude 4.5, Claude Opus 4.6, Claude Sonnet 4.6, Gemini 2.5/3/3.1 |

Budget-Friendly Excellence

Enjoy premium AI tools at competitive prices. Our platform delivers high-quality, scalable solutions while keeping your projects cost-effective.

Available Services

🎬 Google (Gemini): Video & Image Generation

Powered by Google's cutting-edge AI, this service provides:
  • Video Generation: Create videos from text prompts (text2video), images (img2video), or reference ingredients (ingredients2video) using Veo 2 / Veo 3 / Veo 3.1 models
  • Video Upsampling: Upscale to 1080p or 4K resolution, or generate GIF previews
  • Video Reshoot: Re-generate video with different camera motions (pan, zoom, dolly, etc.)
  • Video Object Editing: Insert or remove objects from videos using masks
  • Prompt Expansion: Expand simple prompts into detailed cinematic descriptions
  • Image Generation: Create and edit images using Nano Banana models (Fast / Pro / 2) with support for multiple aspect ratios and up to 4K resolution
Explore Gemini Documentation →

🎥 Sora2: OpenAI Video Generation

Powered by OpenAI's Sora, this service provides:
  • Text-to-Video Generation: Create high-quality videos from text prompts using Sora-2 or Sora-2-Pro models
  • Image-to-Video: Use reference images to guide video generation
  • Character Roles: Create custom characters from short video clips, then reference them in prompts with @character_name
  • Style Presets: Choose from styles like anime, retro, comic, vintage, and more
  • Flexible Video Formats: Landscape/portrait orientation, variable durations (10s/15s/25s), optional watermark removal
Explore Sora2 Documentation →
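
The @character_name convention can be checked on the client side before a prompt is submitted. The following is a minimal sketch, not part of the Mountsea API: it assumes character names consist of letters, digits, and underscores (the allowed characters are not stated here), and `find_character_mentions` and `unknown_characters` are hypothetical helper names.

```python
import re

# Assumption: a character name is made of letters, digits, and underscores.
# The lookbehind skips things like email addresses ("me@example.com").
_MENTION = re.compile(r"(?<![\w@])@([A-Za-z0-9_]+)")


def find_character_mentions(prompt: str) -> list[str]:
    """Return @-referenced names in order of first appearance, without duplicates."""
    seen: list[str] = []
    for name in _MENTION.findall(prompt):
        if name not in seen:
            seen.append(name)
    return seen


def unknown_characters(prompt: str, known: set[str]) -> list[str]:
    """Return mentioned names that are not in your own set of created characters."""
    return [name for name in find_character_mentions(prompt) if name not in known]
```

Calling `unknown_characters(prompt, known_names)` before sending a request gives an early warning when a prompt references a character that has not been created yet.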

🖼️ XAI (Grok): Image & Video Generation

Powered by xAI's Grok models, this service provides:
  • Text-to-Image: Generate high-quality images from text prompts using grok-imagine-image, with customizable aspect ratios (1:1, 2:3, 3:2, 9:16, 16:9)
  • Image-to-Image: Provide a reference image to guide generation; output size follows the reference
  • Text-to-Video: Generate videos from text prompts using grok-imagine-video, with control over duration (6s/10s/15s), aspect ratio, and resolution (480P/720P)
  • Image-to-Video: Use a reference image to guide video generation; aspect ratio and resolution follow the reference automatically
  • Async Task System: All generation requests return a taskId for polling
Explore XAI Documentation →
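
The option values listed above can be validated locally before a request is made. This is a minimal sketch under stated assumptions: the model names, aspect ratios, durations, and resolutions come from the list above, but the payload field names (`prompt`, `aspectRatio`, `duration`, `resolution`) and the default values are hypothetical, so check the XAI documentation for the real request schema.

```python
# Values taken from the feature list above.
IMAGE_ASPECT_RATIOS = {"1:1", "2:3", "3:2", "9:16", "16:9"}
VIDEO_DURATIONS_S = {6, 10, 15}
VIDEO_RESOLUTIONS = {"480P", "720P"}


def grok_image_payload(prompt: str, aspect_ratio: str = "1:1") -> dict:
    """Build a text-to-image payload (field names are assumptions)."""
    if aspect_ratio not in IMAGE_ASPECT_RATIOS:
        raise ValueError(f"unsupported aspect ratio: {aspect_ratio!r}")
    return {
        "model": "grok-imagine-image",
        "prompt": prompt,
        "aspectRatio": aspect_ratio,  # hypothetical field name
    }


def grok_video_payload(prompt: str, duration_s: int = 6, resolution: str = "720P") -> dict:
    """Build a text-to-video payload (field names are assumptions)."""
    if duration_s not in VIDEO_DURATIONS_S:
        raise ValueError(f"unsupported duration: {duration_s!r}")
    if resolution not in VIDEO_RESOLUTIONS:
        raise ValueError(f"unsupported resolution: {resolution!r}")
    return {
        "model": "grok-imagine-video",
        "prompt": prompt,
        "duration": duration_s,    # hypothetical field name
        "resolution": resolution,  # hypothetical field name
    }
```

Rejecting an unsupported value locally avoids spending a round trip on a request that would likely be refused.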

🎵 Suno: Music Generation

Powered by Suno AI, this service provides a full suite of music creation and processing tools:
  • Music Generation: Create, extend, cover, mashup, sample, and generate inspiration-based tracks using 15 task types via a unified /generate endpoint, with models from chirp-v35 to chirp-v55
  • Sound Effects: Generate one-shot or looped sound effects from text descriptions
  • Lyrics Generation: Generate original lyrics or mashup lyrics from two songs
  • Voice Persona: Create verified voice personas from your own recordings through a single-task two-phase voice verification flow (init → await phrase → complete)
  • Custom Models: Train personalized music models on your own audio (6+ training clips), then use chirp-custom:<uuid> for generation
  • Audio Processing: Concat clips, remaster tracks (V4.5+/V5 models), adjust playback speed with pitch preservation
  • Stem Separation: Separate tracks into vocals + instrumental (two-track) or all individual stems
  • Audio Export: Export to MP4 (with visualizer), lossless WAV, or MDI (MIDI) format
  • Audio Analysis: Get synchronized lyrics timeline, downbeat detection, and enhanced style tags
  • Vocal Persona: Extract vocal characteristics from clips and create reusable personas for consistent vocal style
Explore Suno Documentation →
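
The chirp-custom:<uuid> model name can be assembled and checked with the standard library. This is a minimal sketch, assuming the identifier is a UUID in the usual hyphenated lowercase form; `custom_model_name` and `is_custom_model` are hypothetical helpers, not Mountsea API functions.

```python
import uuid

CUSTOM_PREFIX = "chirp-custom:"


def custom_model_name(model_id) -> str:
    """Format a custom model name; raises ValueError if model_id is not a UUID."""
    # uuid.UUID normalizes the value to lowercase hyphenated form.
    return f"{CUSTOM_PREFIX}{uuid.UUID(str(model_id))}"


def is_custom_model(model: str) -> bool:
    """Return True if model looks like chirp-custom:<uuid>."""
    if not model.startswith(CUSTOM_PREFIX):
        return False
    try:
        uuid.UUID(model[len(CUSTOM_PREFIX):])
    except ValueError:
        return False
    return True
```

Note that `uuid.UUID` also accepts a few alternative spellings (for example, hex without hyphens), so `is_custom_model` is a sanity check rather than a strict format validator.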

🎶 ElevenLabs: AI Music Generation

Powered by ElevenLabs' music_v1 model, this service provides:
  • Text-to-Music: Generate music from simple text prompts or structured composition plans with section-level style and lyrics control
  • Composition Plan: AI-generated structured plans with sections, global/local styles, and lyrics (free, no credits consumed)
  • Video to Music: Automatically generate background music that matches your video content (up to 10 videos, 600s total)
  • Stem Separation: Split audio into 2 tracks (vocals + instrumental) or 6 individual stems
  • Inpainting: Edit specific sections of existing songs (enterprise only)
  • Multiple Output Formats: MP3, PCM, Opus with configurable sample rates and bitrates
Available models: music_v1
Explore ElevenLabs Documentation →
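
The Video to Music limits (up to 10 videos, 600s total) are easy to check before uploading. This is a minimal sketch: `check_video_batch` is a hypothetical helper, and it assumes that the 600-second limit is inclusive and that an empty batch is invalid, neither of which is stated above.

```python
MAX_VIDEOS = 10           # from the feature list above
MAX_TOTAL_SECONDS = 600   # from the feature list above


def check_video_batch(durations_s: list[float]) -> float:
    """Validate a batch of video durations and return the total length in seconds."""
    if not durations_s:
        raise ValueError("at least one video is required")  # assumption
    if len(durations_s) > MAX_VIDEOS:
        raise ValueError(f"too many videos: {len(durations_s)} > {MAX_VIDEOS}")
    total = sum(durations_s)
    if total > MAX_TOTAL_SECONDS:  # assumption: exactly 600s is allowed
        raise ValueError(f"total duration {total}s exceeds {MAX_TOTAL_SECONDS}s")
    return total
```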

🎧 Producer: AI Music Generation

Powered by Google DeepMind's Lyria 3 Pro model, this service provides high-quality music generation:
  • Create Music: Generate original tracks from sound prompts, lyrics, and images
  • Image-Guided Generation: Use images to influence the mood and style of generated music
  • Instrumental Mode: Generate without vocals
  • Stem Separation: Separate audio into individual stems (vocals, drums, bass, etc.)
  • Multi-Format Export: Download as MP3/M4A/WAV audio or generate video with preset visualizers
Available models: Lyria 3 Pro
Explore Producer Documentation →

💬 Chat: Multi-Protocol AI Gateway

A unified gateway supporting multiple API protocols:
| Protocol | Base URL | Description |
| --- | --- | --- |
| OpenAI Compatible | https://api.mountsea.ai/chat | Drop-in replacement for OpenAI Chat Completions API |
| OpenAI Responses | https://api.mountsea.ai/chat | OpenAI Responses API format |
| Anthropic (Claude) | https://api.mountsea.ai/chat/claude | Claude Code & Anthropic SDK compatible |
| Gemini Native | https://api.mountsea.ai/chat/gemini | Google Gemini Native API format |

  • ✅ Use official SDKs (OpenAI, Anthropic, Google GenAI) directly
  • ✅ Full streaming, function calling, and tool support
  • ✅ Compatible with Claude Code, Cursor, Cherry Studio, and other AI tools
  • ✅ Access GPT-5.1/5.2, Claude 4.5/Opus/Sonnet, Gemini 2.5/3/3.1 models
Explore Chat Documentation →
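
Because each protocol has its own base URL, a small lookup keeps client configuration in one place. This is a minimal sketch: the URLs come from the table above, while the protocol keys (`openai`, `openai-responses`, `anthropic`, `gemini`) and the `client_settings` helper are hypothetical. Whether an SDK needs an extra path suffix such as a version segment is not stated here, so confirm that in the Chat documentation.

```python
# Base URLs from the protocol table; the dictionary keys are illustrative labels.
CHAT_BASE_URLS = {
    "openai": "https://api.mountsea.ai/chat",
    "openai-responses": "https://api.mountsea.ai/chat",
    "anthropic": "https://api.mountsea.ai/chat/claude",
    "gemini": "https://api.mountsea.ai/chat/gemini",
}


def client_settings(protocol: str, api_key: str) -> dict[str, str]:
    """Return keyword arguments suitable for an SDK client constructor."""
    try:
        base_url = CHAT_BASE_URLS[protocol]
    except KeyError:
        raise ValueError(f"unknown protocol: {protocol!r}") from None
    return {"base_url": base_url, "api_key": api_key}


# Usage with an official SDK (not executed here; requires the SDK to be installed):
#   from openai import OpenAI
#   client = OpenAI(**client_settings("openai", "your-api-key"))
#
#   from anthropic import Anthropic
#   client = Anthropic(**client_settings("anthropic", "your-api-key"))
```

The OpenAI and Anthropic Python SDK constructors both accept `base_url` and `api_key` keyword arguments, which is why the helper returns them under those names.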

How to Get Started

1. Get Your API Key

Sign up at shanhaiapi.com, go to API Key Management (API 密钥管理), and create a new API key.

2. Choose Your Service

Select the service that fits your needs: video, image, music, or chat.

3. Make Your First API Call

Use Authorization: Bearer your-api-key in your request header and call the appropriate endpoint.

4. Track & Download Results

For async tasks (video/music), poll the task status endpoint until complete, then download your content.

Check out our Quick Start Guide for step-by-step examples.
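
The polling in step 4 can be sketched as a small loop around whatever function fetches the task status. This is a minimal sketch under stated assumptions: the `status` field and the `completed` / `failed` values are hypothetical (the real task schema is in each service's documentation), and `fetch_status` is a callable you supply that performs the authenticated status request and returns the decoded JSON.

```python
import time
from typing import Callable


def poll_until_done(
    fetch_status: Callable[[], dict],
    *,
    interval_s: float = 5.0,
    timeout_s: float = 600.0,
    done_states: tuple = ("completed",),   # assumption: adjust to the real schema
    failed_states: tuple = ("failed",),    # assumption: adjust to the real schema
    sleep: Callable[[float], None] = time.sleep,
    clock: Callable[[], float] = time.monotonic,
) -> dict:
    """Call fetch_status repeatedly until the task finishes, fails, or times out."""
    deadline = clock() + timeout_s
    while True:
        task = fetch_status()
        status = task.get("status")
        if status in done_states:
            return task
        if status in failed_states:
            raise RuntimeError(f"task failed: {task}")
        if clock() >= deadline:
            raise TimeoutError(f"task not finished after {timeout_s}s")
        sleep(interval_s)
```

Passing `sleep` and `clock` as parameters keeps the loop testable without real waiting; in normal use the defaults are fine.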

API Base URL

All API requests are made to:
https://api.mountsea.ai
Exception: The Claude (Anthropic) compatible API uses https://api.mountsea.ai/chat/claude as the base URL.

Get Started Today

Ready to integrate AI capabilities into your applications?
Transform your creative projects with the power of AI. Start building amazing applications today!