Soroush Mehraban

6.37K subscribers

46 videos

View on YouTube

Latest Videos

STARS (WACV'26) Self-supervised Tuning for 3D Action Recognition in Skeleton Sequences.

STARS (WACV'26) Self-supervised Tuning for 3D Action Recognition in Skeleton Sequences.

Soroush Mehraban

178 views·2 months ago

FastHMR (WACV'26)Accelerating Human Mesh Recovery via Token & Layer Merging with Diffusion Decoding

FastHMR (WACV'26)Accelerating Human Mesh Recovery via Token & Layer Merging with Diffusion Decoding

Soroush Mehraban

144 views·2 months ago

TRELLIS One Latent for Any 3D Asset

TRELLIS One Latent for Any 3D Asset

Soroush Mehraban

461 views·4 months ago

LightlyTrain - Train Better Models, Faster - No Labels Needed

LightlyTrain - Train Better Models, Faster - No Labels Needed

Soroush Mehraban

729 views·1 year ago

One-step Diffusion with Distribution Matching Distillation

One-step Diffusion with Distribution Matching Distillation

Soroush Mehraban

2.2K views·1 year ago

Variational Score Distillation (VSD) Helps Create Amazing 3D Scenes From Text Prompts

Variational Score Distillation (VSD) Helps Create Amazing 3D Scenes From Text Prompts

Soroush Mehraban

631 views·1 year ago

Dream-in-4D Paper Explained!

Dream-in-4D Paper Explained!

Soroush Mehraban

393 views·1 year ago

FreeU - Paper Explained

FreeU - Paper Explained

Soroush Mehraban

776 views·1 year ago

AnimateDiff - Paper explained!

AnimateDiff - Paper explained!

Soroush Mehraban

713 views·1 year ago

DreamFusion Text-to-3D using 2D Diffusion

DreamFusion Text-to-3D using 2D Diffusion

Soroush Mehraban

1.6K views·1 year ago

Null-text Inversion for Editing Real Images using Guided Diffusion Models

Null-text Inversion for Editing Real Images using Guided Diffusion Models

Soroush Mehraban

1.1K views·1 year ago

Prompt-to-Prompt (P2P) image Editing - Method Explained

Prompt-to-Prompt (P2P) image Editing - Method Explained

Soroush Mehraban

821 views·1 year ago

Denoising Diffusion Null-Space Model (DDNM) - Method Explained

Denoising Diffusion Null-Space Model (DDNM) - Method Explained

Soroush Mehraban

860 views·1 year ago

Autoregressive Image Generation without Vector Quantization

Autoregressive Image Generation without Vector Quantization

Soroush Mehraban

2.3K views·1 year ago

Diffusion Models (DDPM & DDIM) - Easily explained!

Diffusion Models (DDPM & DDIM) - Easily explained!

Soroush Mehraban

29.0K views·1 year ago

GLIGEN (CVPR2023) Open-Set Grounded Text-to-Image Generation

GLIGEN (CVPR2023) Open-Set Grounded Text-to-Image Generation

Soroush Mehraban

908 views·1 year ago

The Entropy Enigma Success and Failure of Entropy Minimization

The Entropy Enigma Success and Failure of Entropy Minimization

Soroush Mehraban

797 views·1 year ago

Tent Fully Test-time Adaptation by Entropy Minimization

Tent Fully Test-time Adaptation by Entropy Minimization

Soroush Mehraban

908 views·1 year ago

VPD (ICCV2023) Unleashing Text-to-Image Diffusion Models for Visual Perception

VPD (ICCV2023) Unleashing Text-to-Image Diffusion Models for Visual Perception

Soroush Mehraban

398 views·1 year ago

TokenHMR (CVPR2024) Advancing Human Mesh Recovery witha Tokenized Pose Representation

TokenHMR (CVPR2024) Advancing Human Mesh Recovery witha Tokenized Pose Representation

Soroush Mehraban

779 views·1 year ago

SHViT (CVPR2024) Single-Head Vision Transformer with Memory Efficient Macro Design

SHViT (CVPR2024) Single-Head Vision Transformer with Memory Efficient Macro Design

Soroush Mehraban

1.5K views·1 year ago

InstaFlow One Step is Enough for High-Quality Diffusion-Based Text-to-Image Generation

InstaFlow One Step is Enough for High-Quality Diffusion-Based Text-to-Image Generation

Soroush Mehraban

1.3K views·2 years ago

FastV An Image is Worth 12 Tokens After Layer 2

FastV An Image is Worth 12 Tokens After Layer 2

Soroush Mehraban

870 views·2 years ago

GaLore Memory-Efficient LLM Training by Gradient Low-Rank Projection

GaLore Memory-Efficient LLM Training by Gradient Low-Rank Projection

Soroush Mehraban

2.0K views·2 years ago

PoseGPT (ChatPose) Chatting about 3D Human Pose

PoseGPT (ChatPose) Chatting about 3D Human Pose

Soroush Mehraban

1.2K views·2 years ago

MotionAGFormer (WACV2024) Enhancing 3D Human Pose Estimation with a Transformer-GCNFormer Network

MotionAGFormer (WACV2024) Enhancing 3D Human Pose Estimation with a Transformer-GCNFormer Network

Soroush Mehraban

1.6K views·2 years ago

HD-GCN (ICCV2023) Skeleton-Based Action Recognition

HD-GCN (ICCV2023) Skeleton-Based Action Recognition

Soroush Mehraban

3.2K views·2 years ago

ST-GCN Spatial Temporal Graph Convolutional Networks for Skeleton-Based Action Recognition

ST-GCN Spatial Temporal Graph Convolutional Networks for Skeleton-Based Action Recognition

Soroush Mehraban

8.9K views·2 years ago

Graph Convolutional Networks (GCN) From CNN point of view

Graph Convolutional Networks (GCN) From CNN point of view

Soroush Mehraban

16.7K views·2 years ago

DINO Self-Supervised Vision Transformers

DINO Self-Supervised Vision Transformers

Soroush Mehraban

10.1K views·2 years ago

Load More Videos

Soroush Mehraban - NatokHD | NatokHD