HOI-DETR is a transformer-based framework for detecting hands, hand-held objects, and their interactions in images and video. Built on the Co-DETR architecture, it adds a lightweight interaction ...
Here, we include a demo, running our optimization on an example from BEHAVE, where we provide the initialization from VisTracker. Please follow this for a minimal demo. Later, we provide instructions ...