Skip to main content

🏢 Brown University

The GAN is dead; long live the GAN! A Modern GAN Baseline
·3072 words·15 mins· loading · loading
Computer Vision Image Generation 🏢 Brown University
R3GAN, a minimalist GAN baseline, surpasses state-of-the-art models by using a novel regularized relativistic GAN loss and modern architectures, proving GANs can be trained efficiently without relying…
Text-Aware Diffusion for Policy Learning
·3340 words·16 mins· loading · loading
Multimodal Learning Vision-Language Models 🏢 Brown University
Text-Aware Diffusion for Policy Learning (TADPoLe) uses pretrained diffusion models for zero-shot reward generation, enabling natural language-driven policy learning without manual reward design.
Talking Heads: Understanding Inter-Layer Communication in Transformer Language Models
·3768 words·18 mins· loading · loading
Natural Language Processing Large Language Models 🏢 Brown University
Transformer Language Models’ (LMs) sensitivity to seemingly arbitrary prompt changes is explained by identifying low-rank communication channels between layers. By decomposing attention heads, resear…
RTify: Aligning Deep Neural Networks with Human Behavioral Decisions
·1884 words·9 mins· loading · loading
Computer Vision Image Classification 🏢 Brown University
RTify: A novel framework aligns deep neural networks’ dynamics with human reaction times for improved visual decision-making models.
Learning to Edit Visual Programs with Self-Supervision
·2121 words·10 mins· loading · loading
Computer Vision Visual Question Answering 🏢 Brown University
AI learns to edit visual programs more accurately using a self-supervised method that combines one-shot program generation with iterative local edits, significantly boosting performance, especially wi…
Feint Behaviors and Strategies: Formalization, Implementation and Evaluation
·2257 words·11 mins· loading · loading
AI Generated AI Applications Gaming 🏢 Brown University
This paper introduces a novel formalization of feint behaviors in multi-player games, improving AI performance and game diversity via a unified MARL implementation.
Beyond the Doors of Perception: Vision Transformers Represent Relations Between Objects
·9001 words·43 mins· loading · loading
AI Generated Computer Vision Visual Question Answering 🏢 Brown University
Vision transformers surprisingly struggle with visual relations; this study reveals ViTs use distinct perceptual and relational processing stages to solve same/different tasks, highlighting a previous…