A laboratory for training dual autoregressive audio seq2seq models

EndlessReform/dual-ar

DualAR Transformer Laboratory

This repo is a personal laboratory for training fully autoregressive text-audio multimodal models with the DualAR Transformer architecture. This architecture is most widely used as the neural codec seq2seq backbone for:

Models trained here will be compatible with my DualAR fish-speech.rs inference engine.

Please do not expect anything here to be usable yet. Full documentation will come once an early artifact is good enough to release.
