
Amphion (/?m?fa??n/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development. – Amphion/models/tts/maskgct at main · open-mmlab/Amphion
数据统计
数据评估
关于MaskGCT特别声明
本站AI工具库提供的MaskGCT都来源于网络,不保证外部链接的准确性和完整性,同时,对于该外部链接的指向,不由AI工具库实际控制,在2025年2月14日 下午9:42收录时,该网页上的内容,都属于合规合法,后期网页的内容如出现违规,可以直接联系网站管理员进行删除,AI工具库不承担任何责任。
相关导航

MiniMax 成立于 2021 年 12 月,是领先的通用人工智能科技公司,致力于与用户共创智能。MiniMax 自主研发多模态、万亿参数的 MoE 大模型,并基于大模型推出海螺AI、星野等原生应用。MiniMax API 开放平台提供安全、灵活、可靠的 API 服务,助力企业和开发者快速搭建 AI 应用。

GOT-OCR2.0
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model - Ucas-HaoranWei/GOT-OCR2.0

StoryMaker
StoryMaker: Towards consistent characters in text-to-image generation - RedAIGC/StoryMaker

Ovis1.6
A novel Multimodal Large Language Model (MLLM) architecture, designed to structurally align visual and textual embeddings. - AIDC-AI/Ovis

Swarm
Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team. - openai/swarm

UniEdit
Inserting anybody's identity into anywhere for customized image/video/3d generation.

Motionshop
Motionshop, a framework to replace the characters in video with 3D avatars

MMMLU
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
暂无评论...