Browsing: Video-language-action model