🏢 UCLA

Feature4X: Bridging Any Monocular Video to 4D Agentic AI with Versatile Gaussian Feature Fields
·4642 words·22 mins
AI Generated 🤗 Daily Papers Computer Vision 3D Vision 🏢 UCLA
Feature4X: 4D Agentic AI from Monocular Video with Gaussian Feature Fields
Reflect-DiT: Inference-Time Scaling for Text-to-Image Diffusion Transformers via In-Context Reflection
·3299 words·16 mins
AI Generated 🤗 Daily Papers Computer Vision Image Generation 🏢 UCLA
Reflect-DiT: Scaling Text-to-Image Diffusion Transformers via In-Context Reflection