Side View of 8th Gen Kenworth

MM-Interleaved: Interleaved Image-Text Generative Modeling via Multi-modal Feature Synchronizer

The model is pretrained on a mixture of publicly available datasets, achieving superior zero-shot performance on various evaluation benchmarks of multi-modal comprehension and generation. It can be ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results

Feedback

Trending now