💥 Introducing MiniCPM-o 2.6: An 8B size, GPT-4o level Omni Model runs on device
✨ Highlights:
~Match GPT-4o-202405 in vision, audio and multimodal live streaming
~End-to-end real-time bilingual audio conversation ~Voice cloning & emotion control
~Advanced OCR & video understanding
~Offline iPad-compatible multimodal live streaming
🔗 Try it out:
GitHub:https://t.co/gtRJoHOlfd
HF:https://t.co/IY9KgoOqSI
Demo:https://t.co/IzZuyz0qB1