Build with Lyria 3, our newest music generation model

Source: https://blog.google/innovation-and-ai/technology/developers-tools/lyria-3-developers/
Summary:
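Google has released Lyria 3, its new generation of music generation models, now in public preview for developers worldwide. The models are available through the Gemini API and a new audio experience in Google AI Studio. They are designed to combine deep musical awareness with structural coherence, so developers can build music apps that generate high-fidelity tracks with vocals and complete song structure.
The release includes two variants. Lyria 3 Pro focuses on complete songs of roughly three minutes and has professional-grade structural awareness. Lyria 3 Clip is optimized for speed and high-volume requests and generates high-quality 30-second clips, suited to rapid prototyping, background loops and social media content. Both support expressive vocals, singing in multiple languages, and music across genres including pop, funk and Motown.
Lyria 3 introduces granular controls: through natural language prompts, developers can set tempo, time-align lyrics, and even supply an image to influence the mood and style of the music. To illustrate use cases, Google built several examples in AI Studio, including generating synchronized background music for videos and creating a personalized alarm song from calendar and weather information.
Developers can currently try Lyria 3 in Google AI Studio with a paid API key, in two modes: Text and Composer. All generated audio carries a SynthID digital watermark for transparency and traceability. Google says the tools were developed in partnership with industry experts, with the aim of using AI to augment human creativity.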
English source:
Build with Lyria 3, our newest music generation model
Lyria 3 and Lyria 3 Pro, our music generation models, are rolling out now to developers in public preview through the Gemini API and a new audio experience in Google AI Studio.
Lyria 3 is designed to combine deep musical awareness with structural coherence. This allows developers to build apps that offer high-fidelity compositions, including vocals, verses and choruses, that maintain musical consistency from the first note to the last.
Studio quality and speed
Developers can now choose between two distinct model variants designed to meet specific production and latency requirements:
- Lyria 3 Pro (lyria-3-pro-preview): Our premier model for full-length song generation creates tracks up to approximately three minutes long. Its professional-grade structural awareness makes it the standard for studio-quality, premium output.
- Lyria 3 Clip (lyria-3-clip-preview): Optimized for speed and high-volume requests, this variant generates high-quality 30-second clips. It is the ideal choice for rapid prototyping, background loops and social media assets.
Both models support realistic vocals that convey expressive nuance, plus improved clarity for more natural sounds. Developers can also explore global languages and genres. Generate vocals in different languages, and create music spanning genres from pop to funk to Motown.
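The choice between the two variants can be sketched as a small helper. This is only an illustration: the model IDs come from the list above, while `choose_model`, its parameters, and the 30-second and 180-second thresholds are assumptions derived from the quoted clip length and the "approximately three minutes" figure, not values read from the API.

```python
# Model IDs as named in the announcement.
PRO_MODEL = "lyria-3-pro-preview"
CLIP_MODEL = "lyria-3-clip-preview"

# Assumed limits, taken from the durations quoted in the text.
CLIP_MAX_SECONDS = 30
PRO_MAX_SECONDS = 180  # "approximately three minutes"


def choose_model(target_seconds: float, need_full_song: bool = False) -> str:
    """Pick a model ID for a requested track length (hypothetical helper)."""
    if target_seconds <= 0:
        raise ValueError("target_seconds must be positive")
    if target_seconds > PRO_MAX_SECONDS:
        raise ValueError("request exceeds the ~3 minute length quoted for Lyria 3 Pro")
    # Short requests go to the faster Clip variant unless the caller
    # explicitly wants full-song structure.
    if target_seconds <= CLIP_MAX_SECONDS and not need_full_song:
        return CLIP_MODEL
    return PRO_MODEL


if __name__ == "__main__":
    print(choose_model(20))
    print(choose_model(150))
```

Routing short requests to the Clip variant by default follows the latency framing in the announcement; an application with different priorities could invert that default.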
Precision control and multimodal input
Lyria 3 introduces granular controls that allow you to direct the model with precision through natural language prompts:
- Tempo conditioning: Set a specific tempo (e.g., fast, slow) with high accuracy to ensure the music fits your application’s rhythm.
- Time-aligned lyrics: You can outline the progression of a song in your prompt and control when lyrics start and end within a track.
- Multimodal image-to-music input: Beyond text, Lyria 3 supports multimodal inputs. You can provide an image to influence the mood, style and atmosphere of the audio.
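Because these controls are expressed through the prompt itself, an application will likely assemble that prompt from structured data. The sketch below shows one way to do so for tempo and time-aligned lyrics. The `LyricCue` type, the `build_prompt` function, and the bracketed timestamp layout are all hypothetical; the announcement does not specify the wording the model expects, so treat the format as a placeholder to adapt after consulting the prompt guide.

```python
from dataclasses import dataclass


@dataclass
class LyricCue:
    start_s: int  # when the line should begin, in seconds
    end_s: int    # when the line should end, in seconds
    text: str


def fmt(seconds: int) -> str:
    """Format a second count as m:ss."""
    minutes, secs = divmod(seconds, 60)
    return f"{minutes}:{secs:02d}"


def build_prompt(style: str, tempo: str, cues: list[LyricCue]) -> str:
    """Assemble a natural language prompt with tempo and timed lyrics.

    The layout is an assumption made for illustration only.
    """
    lines = [f"Style: {style}", f"Tempo: {tempo}", "Lyrics:"]
    previous_end = 0
    for cue in sorted(cues, key=lambda c: c.start_s):
        # Reject empty or overlapping windows before they reach the model.
        if cue.end_s <= cue.start_s or cue.start_s < previous_end:
            raise ValueError(f"invalid or overlapping cue: {cue}")
        lines.append(f"[{fmt(cue.start_s)} - {fmt(cue.end_s)}] {cue.text}")
        previous_end = cue.end_s
    return "\n".join(lines)


if __name__ == "__main__":
    print(build_prompt(
        "warm Motown groove",
        "slow",
        [LyricCue(0, 10, "Good morning, city lights"),
         LyricCue(70, 85, "We made it through the night")],
    ))
```

Validating the cue windows locally keeps malformed timing out of the prompt, which is cheaper than discovering the problem in generated audio.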
Lyria 3 in action
To show how you could incorporate this model into an application, we built some examples in Google AI Studio:
- Background music for videos: This demo app allows users to upload a video that is analyzed by Gemini 3 Flash to generate a descriptive prompt for a custom soundtrack. Lyria then uses this prompt to compose a matching instrumental that serves as synchronized background music for the video.
- Alarm clock: This demo app wakes you up each morning with a new song that covers relevant information like the weather, your location, the time and date, and events on your calendar.
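The video demo described above is a two-step pipeline: a vision-capable model turns the video into a descriptive prompt, and the music model turns that prompt into audio. A minimal sketch of that shape follows. The two model calls are passed in as plain callables because the announcement does not show the actual API calls; `score_video`, `describe_video`, and `compose_music` are hypothetical names, and the stubs stand in for real requests.

```python
from typing import Callable


def score_video(
    video: bytes,
    describe_video: Callable[[bytes], str],
    compose_music: Callable[[str], bytes],
) -> tuple[str, bytes]:
    """Chain the two steps of the demo: video -> prompt -> audio."""
    description = describe_video(video).strip()
    if not description:
        raise ValueError("video analysis returned an empty description")
    # The demo produces an instrumental, so the prompt asks for no vocals.
    prompt = f"Instrumental background music, no vocals. {description}"
    return prompt, compose_music(prompt)


if __name__ == "__main__":
    # Stubs standing in for the real model calls.
    def fake_describe(video: bytes) -> str:
        return "Slow sunrise over a calm harbor, gentle and hopeful."

    def fake_compose(prompt: str) -> bytes:
        return prompt.encode("utf-8")

    prompt, audio = score_video(b"\x00\x01", fake_describe, fake_compose)
    print(prompt)
    print(len(audio), "bytes")
```

Injecting the model calls keeps the pipeline testable offline and lets either step be swapped without touching the orchestration.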
Try Lyria 3 in Google AI Studio
To help you start experimenting immediately, we are also launching a new music generation experience in AI Studio. Using a paid API key, this dedicated workspace provides a first-class environment to create with Lyria 3 and explore its advanced features like image to music.
Inside the playground, you can explore two powerful creation modes for music:
- Text mode: Describe the music you want to hear using natural language, including parameters such as tempo or key.
- Composer mode: Build your song section by section from intro to verses, to bridges and more. This mode gives you granular control to set timing, intensity and descriptions for each part individually.
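Composer mode's section-by-section structure maps naturally onto a list of records, each with its own timing, intensity, and description. The sketch below models that idea outside the playground. The `Section` type, the `render_song_plan` function, the free-text intensity values, and the 180-second default cap are illustrative assumptions, not the format AI Studio uses.

```python
from dataclasses import dataclass


@dataclass
class Section:
    name: str         # e.g. "intro", "verse", "bridge"
    seconds: int      # length of this section
    intensity: str    # assumed free-text scale, e.g. "low", "medium", "high"
    description: str


def fmt(seconds: int) -> str:
    """Format a second count as m:ss."""
    minutes, secs = divmod(seconds, 60)
    return f"{minutes}:{secs:02d}"


def render_song_plan(sections: list[Section], max_seconds: int = 180) -> str:
    """Lay sections end to end and render one line per section."""
    if not sections:
        raise ValueError("at least one section is required")
    if any(s.seconds <= 0 for s in sections):
        raise ValueError("every section needs a positive length")
    total = sum(s.seconds for s in sections)
    if total > max_seconds:
        raise ValueError(f"plan is {total}s, over the {max_seconds}s cap")
    lines = []
    start = 0
    for s in sections:
        end = start + s.seconds
        lines.append(f"{fmt(start)}-{fmt(end)} {s.name} ({s.intensity}): {s.description}")
        start = end
    return "\n".join(lines)


if __name__ == "__main__":
    print(render_song_plan([
        Section("intro", 15, "low", "solo piano, sparse"),
        Section("verse", 45, "medium", "drums and bass enter"),
        Section("bridge", 20, "high", "key lift, full band"),
    ]))
```

Computing start and end times from section lengths avoids gaps or overlaps that hand-written timestamps tend to introduce.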
Start composing today
Lyria 3 Clip and Lyria 3 Pro are now available in public preview for developers globally.
We have been developing our music generation tools in close partnership with industry experts to ensure AI serves as an additive force for human creativity. Additionally, every track generated by Lyria 3 includes a SynthID digital watermark. This technology maintains transparency and trust by allowing anyone to identify and verify audio generated by Google AI, even after the audio has been modified.
- Try it in Google AI Studio: Use the model selection dropdown to select Lyria 3 (30s) or Lyria 3 Pro (Full Song) and start experimenting in the playground.
- Explore the documentation: Visit the Music Generation Guide for prompt guides, API references and code snippets to jumpstart your integration.
- Start coding with the cookbook: Check the cookbook guide to get started with the API.
- Try the demo applications: Lyria Studio, Lyria Rhythm, Alarm Clock, Background music for Videos
Article title: Build with Lyria 3, our newest music generation model
Article link: https://news.qimuai.cn/?post=3655