🏢 Midea Group
EDT: An Efficient Diffusion Transformer Framework Inspired by Human-like Sketching
3842 words · 19 mins
AI Generated
Computer Vision
Image Generation
The Efficient Diffusion Transformer (EDT) framework significantly speeds up and improves image generation by leveraging a lightweight architecture, human-like sketching-inspired Attention Modulation M…
Any2Policy: Learning Visuomotor Policy with Any-Modality
1938 words · 10 mins
AI Generated
Multimodal Learning
Embodied AI
Any2Policy: a unified multi-modal system enabling robots to perform tasks using diverse instruction and observation modalities (text, image, audio, video, point cloud).