
🏢 Huazhong University of Science and Technology

Exploring the Evolution of Physics Cognition in Video Generation: A Survey

27 March 2025·3260 words·16 mins

AI Generated · 🤗 Daily Papers · Computer Vision · Video Understanding · 🏢 Huazhong University of Science and Technology

This survey explores the evolution of physics cognition in video generation, addressing the gap between visual realism and physical accuracy.

Free4D: Tuning-free 4D Scene Generation with Spatial-Temporal Consistency

26 March 2025·2359 words·12 mins

AI Generated · 🤗 Daily Papers · Computer Vision · 3D Vision · 🏢 Huazhong University of Science and Technology

Free4D: Tuning-free 4D scene generation with spatial-temporal consistency.

Wikipedia in the Era of LLMs: Evolution and Risks

4 March 2025·3967 words·19 mins

AI Generated · 🤗 Daily Papers · Natural Language Processing · Large Language Models · 🏢 Huazhong University of Science and Technology

LLMs modestly affect Wikipedia, subtly altering content and potentially skewing NLP benchmarks.

CrowdSelect: Synthetic Instruction Data Selection with Multi-LLM Wisdom

3 March 2025·8404 words·40 mins

AI Generated · 🤗 Daily Papers · Natural Language Processing · Large Language Models · 🏢 Huazhong University of Science and Technology

CrowdSelect improves instruction tuning by selecting synthetic data using multi-LLM wisdom, enhancing model performance across diverse tasks.

Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models

2 January 2025·3436 words·17 mins

AI Generated · 🤗 Daily Papers · Computer Vision · Image Generation · 🏢 Huazhong University of Science and Technology

LightningDiT resolves the optimization dilemma in latent diffusion models by aligning latent space with pre-trained vision models, achieving state-of-the-art ImageNet 256x256 generation with over 21x …