Skip to main content

🏢 Midea Group

EDT: An Efficient Diffusion Transformer Framework Inspired by Human-like Sketching
·3842 words·19 mins· loading · loading
AI Generated Computer Vision Image Generation 🏢 Midea Group
The Efficient Diffusion Transformer (EDT) framework significantly speeds up and improves image generation by leveraging a lightweight architecture, human-like sketching-inspired Attention Modulation M…
Any2Policy: Learning Visuomotor Policy with Any-Modality
·1938 words·10 mins· loading · loading
AI Generated Multimodal Learning Embodied AI 🏢 Midea Group
Any2Policy: a unified multi-modal system enabling robots to perform tasks using diverse instruction and observation modalities (text, image, audio, video, point cloud).