27 6 29

Jonathan Yankovich PRO

tensiondriven

AI & ML interests

None yet

Recent Activity

updated a Space about 15 hours ago

tensiondriven/ultravox-quantizer

published a Space about 15 hours ago

tensiondriven/ultravox-quantizer

upvoted an article 21 days ago

Qwen Image Edit 2511 Free and Open Source Crushes Qwen Image Edit 2509 and Challenges Nano Banana Pro

View all activity

Organizations

None yet

updated a Space about 15 hours ago

Ultravox Quantizer

🚀

published a Space about 15 hours ago

Ultravox Quantizer

🚀

upvoted an article 21 days ago

Article

Qwen Image Edit 2511 Free and Open Source Crushes Qwen Image Edit 2509 and Challenges Nano Banana Pro

22 days ago

•

liked a model 3 months ago

Qwen/Qwen3-Omni-30B-A3B-Thinking

Any-to-Any • 32B • Updated Sep 22, 2025 • 10.6k • 245

liked a model 5 months ago

OpenGVLab/InternVL3_5-241B-A28B

Image-Text-to-Text • 241B • Updated Aug 29, 2025 • 404 • 133

liked 2 models 6 months ago

bosonai/higgs-audio-v2-generation-3B-base

Text-to-Speech • 6B • Updated Jul 28, 2025 • 227k • 651

DavidAU/Qwen3-128k-30B-A3B-NEO-MAX-Imatrix-gguf

Text Generation • 31B • Updated 30 days ago • 19.6k • 30

liked 3 models 7 months ago

liked 2 Spaces 8 months ago

ICEdit

🖼

664

Universal Image Editing is worth a single LoRA

Dia 1.6B

👯

1.74k

Generate realistic dialogue from a script, using Dia!

reacted to merve's post with 🔥 8 months ago

Post

5087

A ton of impactful models and datasets in open AI past week, let's summarize the best 🤩 merve/releases-apr-21-and-may-2-6819dcc84da4190620f448a3

💬 Qwen made it rain! They released Qwen3: new dense and MoE models ranging from 0.6B to 235B 🤯 as well as Qwen2.5-Omni, any-to-any model in 3B and 7B!
> Microsoft AI released Phi4 reasoning models (that also come in mini and plus sizes)
> NVIDIA released new CoT reasoning datasets
🖼️ > ByteDance released UI-TARS-1.5, native multimodal UI parsing agentic model
> Meta released EdgeTAM, an on-device object tracking model (SAM2 variant)
🗣️ NVIDIA released parakeet-tdt-0.6b-v2, a smol 600M automatic speech recognition model
> Nari released Dia, a 1.6B text-to-speech model
> Moonshot AI released Kimi Audio, a new audio understanding, generation, conversation model
👩🏻‍💻 JetBrains released Melium models in base and SFT for coding
> Tesslate released UIGEN-T2-7B, a new text-to-frontend-code model 🤩