Umar Azam

UmarAzam

Umar-Azam

AI & ML interests

Robotics and Simulations

Recent Activity

liked a Space 9 days ago

ResembleAI/chatterbox-turbo-demo

upvoted a paper 14 days ago

DynamicVerse: A Physically-Aware Multimodal Framework for 4D World Modeling

liked a model 20 days ago

microsoft/Fara-7B

Organizations

None yet

liked a Space 9 days ago

Chatterbox Turbo Demo

Space • ⚡ • 359

upvoted a paper 14 days ago

DynamicVerse: A Physically-Aware Multimodal Framework for 4D World Modeling

Paper • 2512.03000 • Published 22 days ago • 35

liked a model 20 days ago

microsoft/Fara-7B

Image-Text-to-Text • 8B • Updated 13 days ago • 234k • 448

upvoted a paper 27 days ago

Monet: Reasoning in Latent Visual Space Beyond Images and Language

Paper • 2511.21395 • Published 29 days ago • 15

upvoted a paper 29 days ago

VLA-4D: Embedding 4D Awareness into Vision-Language-Action Models for SpatioTemporally Coherent Robotic Manipulation

Paper • 2511.17199 • Published Nov 21 • 7

upvoted 3 papers about 1 month ago

liked a model about 2 months ago

yonigozlan/EdgeTAM-hf

Mask Generation • 13.9M • Updated Nov 6 • 6.3k • 67

upvoted a paper about 2 months ago

Kinematify: Open-Vocabulary Synthesis of High-DoF Articulated Objects

Paper • 2511.01294 • Published Nov 3 • 13

liked a model about 2 months ago

apple/FastVLM-0.5B

Text Generation • 0.8B • Updated Sep 3 • 5.96k • 361

liked a Space about 2 months ago

Robot Learning: A Tutorial

Space • 📝 • 276 • Read and explore a tutorial on robot learning

upvoted a paper 2 months ago

Open-o3 Video: Grounded Video Reasoning with Explicit Spatio-Temporal Evidence

Paper • 2510.20579 • Published Oct 23 • 55

upvoted 2 articles 2 months ago

Building the Open Agent Ecosystem Together: Introducing OpenEnv

Article • Oct 23 • 137

Open-source DeepResearch – Freeing our search agents

Article • Feb 4 • 1.31k

liked 3 models 2 months ago

PaddlePaddle/PaddleOCR-VL

Image-Text-to-Text • 1.0B • Updated 14 days ago • 18.5k • 1.42k

prithivMLmods/Gliese-OCR-7B-Post1.0

Image-Text-to-Text • 8B • Updated Nov 16 • 126 • 13

ModernVBERT/colmodernvbert

Visual Document Retrieval • Updated Oct 2 • 3.41k • 25

upvoted 2 articles 3 months ago

ScreenEnv: Deploy your full stack Desktop Agent

Article • Jul 10

Smol2Operator: Post-Training GUI Agents for Computer Use

Article • Sep 23 • 134
