Support MOSS-TTSD v0.7 fused with codec by CloudRipple · Pull Request #6 · OpenMOSS/sglang

CloudRipple · 2026-03-06T06:38:45Z

Motivation

This pull request introduces support for multi-channel audio generation models, specifically adding configuration and runtime changes for Moss-TTSD-With-Codec. Key enhancements include new model configuration, improved handling of multi-channel input/output, and sampler logic updates to support multi-channel generation. The changes are grouped below by theme:

Modifications

Multi-channel audio model support:

Added MossTTSDWithCodecConfig in python/sglang/srt/configs/moss_ttsd_with_codec.py and registered it in python/sglang/srt/configs/__init__.py to support Moss-TTSD-With-Codec audio generation models. [1] [2] [3]
Introduced _init_channels method and related logic in ModelConfig to normalize and handle multi-channel metadata from model configs (channels or n_vq). [1] [2]
Added is_audio_gen_model utility and detection logic for audio generation models, updating model type checks and health endpoints. [1] [2] [3]

Input/output handling for multi-channel models:

Updated generate API in engine.py to accept nested lists for input_ids and propagate multi_channel flag. [1] [2]
Enhanced replay logic in NPU graph runner to handle lists of outputs for multi-channel inference.

Sampler logic improvements:

Introduced MultiChannelSampler class in sampler.py to handle sampling for multi-channel logits, and updated create_sampler to select the appropriate sampler based on multi_channel flag. [1] [2]
Modified LogitsProcessor and its buffer copying logic to handle per-channel vocab sizes and output slicing for multi-channel models. [1] [2]

Miscellaneous:

Minor import fix in sampler.py and added tempfile import in detokenizer_manager.py. [1] [2]

Accuracy Tests

Checklist

Format your code according to the Format code with pre-commit.
Add unit tests according to the Run and add unit tests.
Update documentation according to Write documentations.
Provide accuracy and speed benchmark results according to Test the accuracy and Benchmark the speed.

Support MOSS-TTSD v0.7 fused with codec

136ca94

gaoyang07 mentioned this pull request Mar 6, 2026

[SGLang support] MOSS-TTSD v0.7 OpenMOSS/MOSS-TTSD#111

Open

CloudRipple marked this pull request as draft March 7, 2026 06:38

CloudRipple marked this pull request as ready for review March 12, 2026 13:17

CloudRipple merged commit 25462a2 into main Mar 12, 2026
54 of 63 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support MOSS-TTSD v0.7 fused with codec#6

Support MOSS-TTSD v0.7 fused with codec#6
CloudRipple merged 1 commit intomainfrom
moss-ttsd-v0.7-with-xy

CloudRipple commented Mar 6, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

CloudRipple commented Mar 6, 2026

Motivation

Modifications

Accuracy Tests

Checklist

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant