Smartground's Collections

Multimode

updated Apr 3, 2025

  • microsoft/Phi-4-multimodal-instruct

    Automatic Speech Recognition • 6B • Updated Dec 10, 2025 • 161k • 1.56k

  • ByteDance/Sa2VA-8B

    Image-Text-to-Text • 8B • Updated Sep 8, 2025 • 751 • 65