-
MagicBrush: A Manually Annotated Dataset for Instruction-Guided Image Editing
Paper • 2306.10012 • Published • 36 -
ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment
Paper • 2403.05135 • Published • 45 -
CogVideoX: Text-to-Video Diffusion Models with An Expert Transformer
Paper • 2408.06072 • Published • 39 -
haoningwu/StoryGen
Updated • 4
Mwangi PRO
Benson
AI & ML interests
None yet
Recent Activity
liked
a dataset
about 17 hours ago
SparkAudio/voxbox
upvoted
an
article
about 18 hours ago
How to make NeuTTS-air generate over 200 seconds of audio in a single second.
upvoted
an
article
about 19 hours ago
LLM based Audio models
Organizations
None yet