Article Qwen Image Edit 2511 Free and Open Source Crushes Qwen Image Edit 2509 and Challenges Nano Banana Pro 22 days ago • 2
bosonai/higgs-audio-v2-generation-3B-base Text-to-Speech • 6B • Updated Jul 28, 2025 • 227k • 651
DavidAU/Qwen3-128k-30B-A3B-NEO-MAX-Imatrix-gguf Text Generation • 31B • Updated 30 days ago • 19.6k • 30
ArtusDev/L3.3-Electra-R1-70b_EXL3_3.5bpw_H8 Text Generation • 17B • Updated May 13, 2025 • 2 • 1
Dia 1.6B • Running on Zero • Featured • 1.74k • Generate realistic dialogue from a script, using Dia!
Post 5087
A ton of impactful models and datasets in open AI past week, let's summarize the best 🤩 merve/releases-apr-21-and-may-2-6819dcc84da4190620f448a3
💬 Qwen made it rain! They released Qwen3: new dense and MoE models ranging from 0.6B to 235B 🤯 as well as Qwen2.5-Omni, any-to-any model in 3B and 7B!
> Microsoft AI released Phi4 reasoning models (that also come in mini and plus sizes)
> NVIDIA released new CoT reasoning datasets
🖼️ > ByteDance released UI-TARS-1.5, native multimodal UI parsing agentic model
> Meta released EdgeTAM, an on-device object tracking model (SAM2 variant)
🗣️ NVIDIA released parakeet-tdt-0.6b-v2, a smol 600M automatic speech recognition model
> Nari released Dia, a 1.6B text-to-speech model
> Moonshot AI released Kimi Audio, a new audio understanding, generation, conversation model
👩🏻‍💻 JetBrains released Mellum models in base and SFT for coding
> Tesslate released UIGEN-T2-7B, a new text-to-frontend-code model 🤩