🏢 Brown University
The GAN is dead; long live the GAN! A Modern GAN Baseline
·3072 words·15 mins·
loading
·
loading
Computer Vision
Image Generation
🏢 Brown University
R3GAN, a minimalist GAN baseline, surpasses state-of-the-art models by using a novel regularized relativistic GAN loss and modern architectures, proving GANs can be trained efficiently without relying…
Text-Aware Diffusion for Policy Learning
·3340 words·16 mins·
loading
·
loading
Multimodal Learning
Vision-Language Models
🏢 Brown University
Text-Aware Diffusion for Policy Learning (TADPoLe) uses pretrained diffusion models for zero-shot reward generation, enabling natural language-driven policy learning without manual reward design.
Talking Heads: Understanding Inter-Layer Communication in Transformer Language Models
·3768 words·18 mins·
loading
·
loading
Natural Language Processing
Large Language Models
🏢 Brown University
Transformer Language Models’ (LMs) sensitivity to seemingly arbitrary prompt changes is explained by identifying low-rank communication channels between layers. By decomposing attention heads, resear…
RTify: Aligning Deep Neural Networks with Human Behavioral Decisions
·1884 words·9 mins·
loading
·
loading
Computer Vision
Image Classification
🏢 Brown University
RTify: A novel framework aligns deep neural networks’ dynamics with human reaction times for improved visual decision-making models.
Learning to Edit Visual Programs with Self-Supervision
·2121 words·10 mins·
loading
·
loading
Computer Vision
Visual Question Answering
🏢 Brown University
AI learns to edit visual programs more accurately using a self-supervised method that combines one-shot program generation with iterative local edits, significantly boosting performance, especially wi…
Feint Behaviors and Strategies: Formalization, Implementation and Evaluation
·2257 words·11 mins·
loading
·
loading
AI Generated
AI Applications
Gaming
🏢 Brown University
This paper introduces a novel formalization of feint behaviors in multi-player games, improving AI performance and game diversity via a unified MARL implementation.
Beyond the Doors of Perception: Vision Transformers Represent Relations Between Objects
·9001 words·43 mins·
loading
·
loading
AI Generated
Computer Vision
Visual Question Answering
🏢 Brown University
Vision transformers surprisingly struggle with visual relations; this study reveals ViTs use distinct perceptual and relational processing stages to solve same/different tasks, highlighting a previous…