ZERO: Multi-modal Prompt-based Visual Grounding
Published in arXiv, 2025
Zero-shot multi-prompt object detection model for production-ready visual grounding across industrial domains.
Recommended citation: Sangbum Choi and Kyeongryeol Go. (2025). "ZERO: Multi-modal Prompt-based Visual Grounding." arXiv. https://arxiv.org/abs/2507.04270