-
ILabel: Revealing Objects in Neural Fields [Abstract]
-
Weakly Supervised Referring Expression Grounding via Dynamic Self-Knowledge Distillation [Abstract]
-
EventTransAct: A Video Transformer-Based Framework for Event-Camera Based Action Recognition [Abstract]
-
Virtual Ski Training System That Allows Beginners to Acquire Ski Skills Based on Physical and Visual Feedbacks [Abstract]
-
Attention-Based VR Facial Animation with Visual Mouth Camera Guidance for Immersive Telepresence Avatars [Abstract]
-
Test-Time Adaptation for Point Cloud Upsampling Using Meta-Learning [Abstract]
-
Revisiting Event-Based Video Frame Interpolation [Abstract]
-
Revisiting Deformable Convolution for Depth Completion [Abstract]
-
Long-Distance Gesture Recognition Using Dynamic Neural Networks [Abstract]
-
Neural Implicit Vision-Language Feature Fields [Abstract]
-
Language Guided Robotic Grasping with Fine-Grained Instructions [Abstract]
-
Whole Shape Estimation of Transparent Object from Its Contour Using Statistical Shape Model [Abstract]