2025.10.15 - #51 - OKVIS2-X, Open-YOLO 3D, CoT-VLA, π0.5, RND1, SuperDec #53

@changh95

Interesting papers

OKVIS2-X: Open Keyframe-based Visual-Inertial SLAM Configurable with Dense Depth or LiDAR, and GNSS

Open-YOLO 3D: Towards Fast and Accurate Open-Vocabulary 3D Instance Segmentation

  • https://github.com/aminebdj/OpenYOLO3D
  • https://arxiv.org/pdf/2406.02548
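  • Many earlier models did 3D instance segmentation by extracting SAM or CLIP features on every frame and fusing them through multi-view reconstruction. These are very heavy computations, so inference is slow.
  • Using only two components, a 2D object detector and a 3D network, it runs 16x faster than the previous SOTA.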
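
To make the "2D object detector + 3D network" idea concrete, here is a minimal sketch of how a class label could be assigned to one 3D instance proposal from per-frame 2D boxes. It is not taken from the OpenYOLO3D repository. The function names (`project`, `label_instance`), the frame dictionary layout, the pinhole camera model, and the simple majority vote are all assumptions made for illustration, and visibility/occlusion handling is left out.

```python
from collections import Counter

def project(point, intrinsics, world_to_cam):
    """Project a 3D world point to pixel coordinates (pinhole model).

    intrinsics: (fx, fy, cx, cy); world_to_cam: 3x4 [R|t] as nested lists.
    Returns None when the point lies behind the camera.
    """
    fx, fy, cx, cy = intrinsics
    xc, yc, zc = (
        r[0] * point[0] + r[1] * point[1] + r[2] * point[2] + r[3]
        for r in world_to_cam
    )
    if zc <= 0:
        return None
    return fx * xc / zc + cx, fy * yc / zc + cy

def label_instance(points, frames):
    """Vote for the label of one 3D instance from 2D detections.

    points: 3D points of a class-agnostic instance proposal
            (hypothetically produced by the 3D network).
    frames: list of dicts with "intrinsics", "world_to_cam" and "boxes",
            where each box is (x0, y0, x1, y1, label) from a 2D detector.
    """
    votes = Counter()
    for frame in frames:
        for p in points:
            uv = project(p, frame["intrinsics"], frame["world_to_cam"])
            if uv is None:
                continue
            u, v = uv
            # Assumption: the first box containing the pixel wins.
            for x0, y0, x1, y1, label in frame["boxes"]:
                if x0 <= u <= x1 and y0 <= v <= y1:
                    votes[label] += 1
                    break
    return votes.most_common(1)[0][0] if votes else None

if __name__ == "__main__":
    identity = [[1, 0, 0, 0], [0, 1, 0, 0], [0, 0, 1, 0]]
    frame = {
        "intrinsics": (100.0, 100.0, 50.0, 50.0),
        "world_to_cam": identity,
        "boxes": [(40, 40, 60, 60, "chair")],
    }
    print(label_instance([(0.0, 0.0, 2.0), (0.1, 0.0, 2.0)], [frame]))
```

The point of the sketch is that the per-frame cost is a box lookup rather than a SAM or CLIP forward pass, which is where the notes above place the speedup.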