4:02STARS (WACV'26) Self-supervised Tuning for 3D Action Recognition in Skeleton Sequences.Soroush Mehraban178 views·2 months ago
5:01FastHMR (WACV'26)Accelerating Human Mesh Recovery via Token & Layer Merging with Diffusion DecodingSoroush Mehraban144 views·2 months ago
14:38LightlyTrain - Train Better Models, Faster - No Labels NeededSoroush Mehraban729 views·1 year ago
14:22Variational Score Distillation (VSD) Helps Create Amazing 3D Scenes From Text PromptsSoroush Mehraban631 views·1 year ago
11:39Null-text Inversion for Editing Real Images using Guided Diffusion ModelsSoroush Mehraban1.1K views·1 year ago
30:57Denoising Diffusion Null-Space Model (DDNM) - Method ExplainedSoroush Mehraban860 views·1 year ago
21:44Autoregressive Image Generation without Vector QuantizationSoroush Mehraban2.3K views·1 year ago
10:46GLIGEN (CVPR2023) Open-Set Grounded Text-to-Image GenerationSoroush Mehraban908 views·1 year ago
9:09The Entropy Enigma Success and Failure of Entropy MinimizationSoroush Mehraban797 views·1 year ago
9:44VPD (ICCV2023) Unleashing Text-to-Image Diffusion Models for Visual PerceptionSoroush Mehraban398 views·1 year ago
30:13TokenHMR (CVPR2024) Advancing Human Mesh Recovery witha Tokenized Pose RepresentationSoroush Mehraban779 views·1 year ago
22:26SHViT (CVPR2024) Single-Head Vision Transformer with Memory Efficient Macro DesignSoroush Mehraban1.5K views·1 year ago
22:17InstaFlow One Step is Enough for High-Quality Diffusion-Based Text-to-Image GenerationSoroush Mehraban1.3K views·2 years ago
28:39GaLore Memory-Efficient LLM Training by Gradient Low-Rank ProjectionSoroush Mehraban2.0K views·2 years ago
9:13MotionAGFormer (WACV2024) Enhancing 3D Human Pose Estimation with a Transformer-GCNFormer NetworkSoroush Mehraban1.6K views·2 years ago
8:25ST-GCN Spatial Temporal Graph Convolutional Networks for Skeleton-Based Action RecognitionSoroush Mehraban8.9K views·2 years ago
13:08Graph Convolutional Networks (GCN) From CNN point of viewSoroush Mehraban16.7K views·2 years ago