Gabriel Mongaras PRO
gmongaras
AI & ML interests
None yet
Recent Activity
updated
a collection
about 15 hours ago
Stuff I'm going to read
liked
a Space
16 days ago
microsoft/TRELLIS.2
Organizations
datasets
-
gmongaras/CC12M_and_Imagenet21K_Recap_Highqual_512
Viewer • Updated • 19.8M • 3.58k • 1 -
gmongaras/CC12M_and_Imagenet21K_Recap_Highqual_256
Viewer • Updated • 19.8M • 4.48k -
gmongaras/CC12M_and_Imagenet21K_Recap_Highqual
Viewer • Updated • 19.8M • 5.83k • 5 -
gmongaras/CC12M_and_Imagenet21K_Recap
Viewer • Updated • 22.7M • 5.23k • 7
Reddit Models
Some terrible Reddit models I am training just to see what happens. Never again will I hear "As an AI language model"
BERT_512
Stable Diffusion 3 Checkpoints
Collection of checkpoints from the stable diffusion 3 model I am training (https://github.com/gmongaras/Stable-Diffusion-3-From-Scratch)
-
gmongaras/datav3_attempt5_8GPU_SoftFlash_RoPE2d_2AccSteps_13batchsize_stage3
Updated -
gmongaras/datav3_attempt5_8GPU_SoftFlash_RoPE2d_2AccSteps_40batchsize_stage2
Updated -
gmongaras/datav3_attempt5_8GPU_SoftFlash_RoPE2d_2AccSteps_140batchsize_stage1
Updated -
gmongaras/datav3_attempt4_8GPU_SoftFlash_RoPE2dV2_2AccSteps_stage2
Updated
Cosine Attention (Cottention)
Models for the paper Cottention: Linear Transformers With Cosine Attention https://arxiv.org/abs/2409.18747
Squad Models
Models trained on squad data
Subtitle Data
Stuff I'm going to read
Stable Diffusion 3 Checkpoints
Collection of checkpoints from the stable diffusion 3 model I am training (https://github.com/gmongaras/Stable-Diffusion-3-From-Scratch)
-
gmongaras/datav3_attempt5_8GPU_SoftFlash_RoPE2d_2AccSteps_13batchsize_stage3
Updated -
gmongaras/datav3_attempt5_8GPU_SoftFlash_RoPE2d_2AccSteps_40batchsize_stage2
Updated -
gmongaras/datav3_attempt5_8GPU_SoftFlash_RoPE2d_2AccSteps_140batchsize_stage1
Updated -
gmongaras/datav3_attempt4_8GPU_SoftFlash_RoPE2dV2_2AccSteps_stage2
Updated
datasets
-
gmongaras/CC12M_and_Imagenet21K_Recap_Highqual_512
Viewer • Updated • 19.8M • 3.58k • 1 -
gmongaras/CC12M_and_Imagenet21K_Recap_Highqual_256
Viewer • Updated • 19.8M • 4.48k -
gmongaras/CC12M_and_Imagenet21K_Recap_Highqual
Viewer • Updated • 19.8M • 5.83k • 5 -
gmongaras/CC12M_and_Imagenet21K_Recap
Viewer • Updated • 22.7M • 5.23k • 7
Cosine Attention (Cottention)
Models for the paper Cottention: Linear Transformers With Cosine Attention https://arxiv.org/abs/2409.18747
Reddit Models
Some terrible Reddit models I am training just to see what happens. Never again will I hear "As an AI language model"
Squad Models
Models trained on squad data
BERT_512
Subtitle Data