🏢 Google

Subject-driven Text-to-Image Generation via Preference-based Reinforcement Learning
·3022 words·15 mins
Multimodal Learning Vision-Language Models 🏢 Google
The λ-Harmonic reward function and Reward Preference Optimization (RPO) improve subject-driven text-to-image generation by enabling faster training and state-of-the-art results with a simpler setup.
Factorized Diffusion Architectures for Unsupervised Image Generation and Segmentation
·4968 words·24 mins
AI Generated Computer Vision Image Generation 🏢 Google
This paper presents a novel neural network architecture that simultaneously learns to generate and segment images in an unsupervised manner, achieving accurate results across multiple datasets without…
Efficient Minimum Bayes Risk Decoding using Low-Rank Matrix Completion Algorithms
·3719 words·18 mins
AI Generated Natural Language Processing Machine Translation 🏢 Google
Low-rank matrix completion algorithms enable a fast approximation of Minimum Bayes Risk (MBR) decoding, drastically reducing computational cost without sacrificing translation quality.
Differentially Private Set Representations
·1424 words·7 mins
AI Generated AI Theory Privacy 🏢 Google
Differentially private set representations achieve optimal privacy-utility tradeoffs with exponentially smaller error than prior histogram methods.
Aligner-Encoders: Self-Attention Transformers Can Be Self-Transducers
·2029 words·10 mins
Speech Recognition 🏢 Google
Transformers can now perform self-alignment, enabling simpler, faster speech recognition models.