🏢 Southeast University
Number it: Temporal Grounding Videos like Flipping Manga
·2758 words·13 mins
AI Generated
🤗 Daily Papers
Computer Vision
Video Understanding
🏢 Southeast University
NumPro improves video temporal grounding in Vid-LLMs by adding frame numbers, making temporal localization as easy as flipping through manga.
Navigating the Unknown: A Chat-Based Collaborative Interface for Personalized Exploratory Tasks
·6756 words·32 mins
AI Generated
🤗 Daily Papers
AI Applications
Human-AI Interaction
🏢 Southeast University
Collaborative Assistant for Personalized Exploration (CARE) enhances LLM chatbots for exploratory tasks by combining a multi-agent framework with a structured interface, delivering tailored solutions …