Lists (32)
Sort Name ascending (A-Z)
3d-nerf
3D structured light
3dpose-shape
6dof
agent
body measurement
cam-pose
clothes_avator
comic
dataset
face_simlation
facesawp
🔮 Future ideas
game
RL, FPSground-gs
hand
human parsing
linux software
metahuman
mvs-splat
network
network dealone view image 3d object
robs
ros
sam
slam
speed-pose
stereo
TODO
track
tts
video-generate
Starred repositories
[CVPR 2023] Iterative Geometry Encoding Volume for Stereo Matching
[ICCV 2019] Depth Hints are complementary depth suggestions which improve monocular depth estimation algorithms trained from stereo pairs
Official MegEngine implementation of CREStereo(CVPR 2022 Oral).
Non-official Pytorch implementation of the CREStereo(CVPR 2022 Oral).
Continuous 3D Label Stereo Matching using Local Expansion Moves (TPAMI 2018)
A Pytorch implementation of Pyramid Stereo Matching Network
Pyramid Stereo Matching Network (CVPR2018)
[CVPR2020] Learning multiview 3D point cloud registration
Pytorch implementation of ICRA 2020 paper "360° Stereo Depth Estimation with Learnable Cost Volume"
IGEV++: Iterative Multi-range Geometry Encoding Volumes for Stereo Matching
Hierarchical Deep Stereo Matching on High Resolution Images, CVPR 2019.
UmeTrack Unified multi-view end-to-end hand tracking for VR
Rankings include: Align3R BetterDepth Buffer Anytime ChronoDepth CUT3R Deep3D Depth Any Video Depth Anything Depth Pro DepthCrafter Diffusion E2E FT FutureDepth GRIN L4P Metric3D MoGe MonST3R NVDS …
Official Implementation of paper "MonST3R: A Simple Approach for Estimating Geometry in the Presence of Motion"
Tooling for the Common Objects In 3D dataset.
Official implementation of "ASAP: Aligning Simulation and Real-World Physics for Learning Agile Humanoid Whole-Body Skills"
[3DV 2024] Official repo of "TeCH: Text-guided Reconstruction of Lifelike Clothed Humans"
[CVPR'23, Highlight] ECON: Explicit Clothed humans Optimized via Normal integration
A curated list of papers & resources linked to 3D reconstruction from images.
Open source impl of **MV-DUSt3R+ Single-Stage Scene Reconstruction from Sparse Views In 2 Seconds** from Meta Reality Labs. Project page https://mv-dust3rp.github.io/
[NeurIPS'24] WildGaussians: 3D Gaussian Splatting In the Wild
✨✨VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction
VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and cloud.