🏢 UCLA
Feature4X: Bridging Any Monocular Video to 4D Agentic AI with Versatile Gaussian Feature Fields
·4642 words·22 mins·
loading
·
loading
AI Generated
🤗 Daily Papers
Computer Vision
3D Vision
🏢 UCLA
Feature4X: 4D Agentic AI from Monocular Video w/ Gaussian Feature Fields
Reflect-DiT: Inference-Time Scaling for Text-to-Image Diffusion Transformers via In-Context Reflection
·3299 words·16 mins·
loading
·
loading
AI Generated
🤗 Daily Papers
Computer Vision
Image Generation
🏢 UCLA
Reflect-DiT: Scaling Text-to-Image Diffusion Transformers via In-Context Reflection!