← back
x.comZhuang LiuSun, May 31, 2026, 9:30 PM PDT
score 16.1
102likes4RT1reply

Standard vision models handle 3D tasks without custom architecture

Original: Excited to share VLM³ - standard VLMs go surprisingly far in 3D!

Source: x.com

Writing ELI5 summary…