
🏢 Huazhong University of Science and Technology

Exploring the Evolution of Physics Cognition in Video Generation: A Survey
·3260 words·16 mins
AI Generated 🤗 Daily Papers Computer Vision Video Understanding 🏢 Huazhong University of Science and Technology
This survey explores the evolution of physics cognition in video generation, addressing the gap between visual realism and physical accuracy.
Free4D: Tuning-free 4D Scene Generation with Spatial-Temporal Consistency
·2359 words·12 mins
AI Generated 🤗 Daily Papers Computer Vision 3D Vision 🏢 Huazhong University of Science and Technology
Free4D: Tuning-free 4D scene generation with spatial-temporal consistency.
Wikipedia in the Era of LLMs: Evolution and Risks
·3967 words·19 mins
AI Generated 🤗 Daily Papers Natural Language Processing Large Language Models 🏢 Huazhong University of Science and Technology
LLMs modestly affect Wikipedia, subtly altering content and potentially skewing NLP benchmarks.
CrowdSelect: Synthetic Instruction Data Selection with Multi-LLM Wisdom
·8404 words·40 mins
AI Generated 🤗 Daily Papers Natural Language Processing Large Language Models 🏢 Huazhong University of Science and Technology
CrowdSelect improves instruction tuning by selecting synthetic data using multi-LLM wisdom, enhancing model performance across diverse tasks.
Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models
·3436 words·17 mins
AI Generated 🤗 Daily Papers Computer Vision Image Generation 🏢 Huazhong University of Science and Technology
LightningDiT resolves the optimization dilemma in latent diffusion models by aligning latent space with pre-trained vision models, achieving state-of-the-art ImageNet 256x256 generation with over 21x …