Popular repositories Loading
-
-
delayed-streams-modeling
delayed-streams-modeling PublicKyutai's Speech-To-Text and Text-To-Speech models based on the Delayed Streams Modeling framework.
-
Repositories
- flashy Public
Framework for writing deep learning training loops. Lightweight, and retaining full freedom to design as you see fits. It handles checkpointing, logging, distributed, compatibility with Dora, and more!
kyutai-labs/flashy’s past year of commit activity - moshi Public
Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.
kyutai-labs/moshi’s past year of commit activity - ovie Public
Official implementation and models for OVIE (One View Is Enough! Monocular Training for In-the-Wild Novel View Generation)
kyutai-labs/ovie’s past year of commit activity - dactory Public
kyutai-labs/dactory’s past year of commit activity - casa Public
A vision-language model with an improved cross-attention mechanism for scalable streaming inference
kyutai-labs/casa’s past year of commit activity
Top languages
Loading…
Most used topics
Loading…