vufinder/vggt-1b
Reconstruct 3D scene geometry from images or video. Accepts one or more images or a video and returns per-frame depth ma...
Found 16 models (showing 1-16)
Reconstruct 3D scene geometry from images or video. Accepts one or more images or a video and returns per-frame depth ma...
Reconstruct 3D point clouds and scene geometry from single images, image sets, or video. Takes images or a video as inpu...
Estimate depth and reconstruct 3D point clouds from single images or videos. Accepts one or more images or a video and r...
Estimate monocular depth from a single input image and output a dense depth map. Combine relative and metric depth; sele...
Estimate per-pixel depth (disparity) from a single image. Output a colorized or grayscale depth map image, with optional...
Estimate monocular depth from a single image, returning colorized and grayscale depth maps. Choose Small, Base, or Large...
Estimate per-pixel depth from a single image and return a dense grayscale relative depth map. Leverage MiDaS with DPT ba...
Estimate per-pixel metric depth from a single image. Takes an RGB image (optionally a known focal length) and returns a...
Estimate temporally consistent per-pixel depth from an input video. Accepts an image or a video and outputs a depth-map...
Estimate per-pixel depth from a single image. Accepts an image and returns a dense grayscale depth map (relative monocul...
Estimate human pose from an input image and output an OpenPose-style pose map image. Optionally include facial landmarks...
Estimate dense depth from a stereo pair of rectified images. Takes left and right images and outputs a per-pixel dispari...
Estimate per-pixel depth from a single image and output a grayscale depth map. Perform monocular depth estimation for AR...
Estimate 3D geometry from a single RGB image. Takes an image input and outputs a depth map (EXR + preview), surface norm...
Generate ControlNet conditioning maps from an input image. Run multiple preprocessors simultaneously and return images f...
Generates images from text prompts with exceptional text rendering capabilities, particularly excelling at complex multi...