Skip to main content

🏢 Southeast University

Number it: Temporal Grounding Videos like Flipping Manga
·2758 words·13 mins
AI Generated 🤗 Daily Papers Computer Vision Video Understanding 🏢 Southeast University
Boosting video temporal grounding, NumPro empowers Vid-LLMs by adding frame numbers, making temporal localization as easy as flipping through manga.
Navigating the Unknown: A Chat-Based Collaborative Interface for Personalized Exploratory Tasks
·6756 words·32 mins
AI Generated 🤗 Daily Papers AI Applications Human-AI Interaction 🏢 Southeast University
Collaborative Assistant for Personalized Exploration (CARE) enhances LLM chatbots for exploratory tasks by combining a multi-agent framework with a structured interface, delivering tailored solutions …