🏢 National University of Singapore
Monomial Matrix Group Equivariant Neural Functional Networks
·2706 words·13 mins·
loading
·
loading
Machine Learning
Deep Learning
🏢 National University of Singapore
Monomial-NFNs boost neural network efficiency by leveraging scaling/sign-flipping symmetries, resulting in fewer trainable parameters and competitive performance.
MomentumSMoE: Integrating Momentum into Sparse Mixture of Experts
·2790 words·14 mins·
loading
·
loading
Natural Language Processing
Large Language Models
🏢 National University of Singapore
MomentumSMoE boosts Sparse Mixture of Experts’ (SMoE) performance by integrating momentum, resulting in more stable training and robust models.
MixEval: Deriving Wisdom of the Crowd from LLM Benchmark Mixtures
·3103 words·15 mins·
loading
·
loading
AI Generated
Natural Language Processing
Large Language Models
🏢 National University of Singapore
MixEval revolutionizes LLM benchmarking by blending real-world user queries with existing datasets, creating a cost-effective, unbiased, and dynamic evaluation method.
Memory-Efficient Gradient Unrolling for Large-Scale Bi-level Optimization
·3095 words·15 mins·
loading
·
loading
AI Generated
Machine Learning
Meta Learning
🏢 National University of Singapore
FG²U: a novel memory-efficient algorithm for unbiased stochastic approximation of meta-gradients in large-scale bi-level optimization, showing superior performance across diverse tasks.
Localized Zeroth-Order Prompt Optimization
·3110 words·15 mins·
loading
·
loading
Large Language Models
🏢 National University of Singapore
Localized Zeroth-Order Prompt Optimization (ZOPO) efficiently finds high-performing local optima for prompt optimization in black-box LLMs, outperforming existing global optimization methods.
Learning-to-Cache: Accelerating Diffusion Transformer via Layer Caching
·2463 words·12 mins·
loading
·
loading
Computer Vision
Image Generation
🏢 National University of Singapore
Learning-to-Cache (L2C) dramatically accelerates diffusion transformers by intelligently caching layer computations, achieving significant speedups with minimal performance loss.
Learning Macroscopic Dynamics from Partial Microscopic Observations
·1980 words·10 mins·
loading
·
loading
Machine Learning
Deep Learning
🏢 National University of Singapore
Learn macroscopic dynamics efficiently using only partial microscopic force computations! This novel method leverages sparsity assumptions and stochastic estimation for accurate, cost-effective modeli…
Implicit Curriculum in Procgen Made Explicit
·1613 words·8 mins·
loading
·
loading
Reinforcement Learning
🏢 National University of Singapore
C-Procgen reveals implicit curriculum in Procgen’s multi-level training, showing learning shifts gradually from easy to hard contexts.
G-Retriever: Retrieval-Augmented Generation for Textual Graph Understanding and Question Answering
·3454 words·17 mins·
loading
·
loading
AI Generated
Natural Language Processing
Question Answering
🏢 National University of Singapore
G-Retriever: a novel RAG approach enables conversational interaction with textual graphs, improving graph understanding and question answering efficiency while mitigating hallucination.
FreeSplat: Generalizable 3D Gaussian Splatting Towards Free View Synthesis of Indoor Scenes
·2183 words·11 mins·
loading
·
loading
Computer Vision
3D Vision
🏢 National University of Singapore
FreeSplat achieves state-of-the-art novel view synthesis by accurately localizing 3D Gaussians from long image sequences, overcoming limitations of prior methods confined to narrow-range interpolation…
Federated Transformer: Multi-Party Vertical Federated Learning on Practical Fuzzily Linked Data
·3432 words·17 mins·
loading
·
loading
AI Generated
Machine Learning
Federated Learning
🏢 National University of Singapore
Federated Transformer (FeT) revolutionizes multi-party fuzzy vertical federated learning by encoding fuzzy identifiers and using a transformer architecture, achieving up to 46% accuracy improvement an…
EZ-HOI: VLM Adaptation via Guided Prompt Learning for Zero-Shot HOI Detection
·2156 words·11 mins·
loading
·
loading
Multimodal Learning
Vision-Language Models
🏢 National University of Singapore
EZ-HOI: Efficient Zero-Shot HOI detection adapts Vision-Language Models (VLMs) for Human-Object Interaction (HOI) tasks using a novel prompt learning framework, achieving state-of-the-art performance …
Exocentric-to-Egocentric Video Generation
·2698 words·13 mins·
loading
·
loading
AI Generated
Computer Vision
Video Understanding
🏢 National University of Singapore
Exo2Ego-V generates realistic egocentric videos from sparse exocentric views, significantly outperforming state-of-the-art methods on a challenging benchmark.
End-to-End Video Semantic Segmentation in Adverse Weather using Fusion Blocks and Temporal-Spatial Teacher-Student Learning
·2581 words·13 mins·
loading
·
loading
AI Generated
Computer Vision
Video Understanding
🏢 National University of Singapore
Optical-flow-free video semantic segmentation excels in adverse weather by merging adjacent frame information via a fusion block and a novel temporal-spatial teacher-student learning strategy.
Dynamic Tuning Towards Parameter and Inference Efficiency for ViT Adaptation
·3048 words·15 mins·
loading
·
loading
Computer Vision
Image Classification
🏢 National University of Singapore
Dynamic Tuning (DyT) significantly boosts Vision Transformer (ViT) adaptation by dynamically skipping less important tokens during inference, achieving superior performance with 71% fewer FLOPs than e…
DOGS: Distributed-Oriented Gaussian Splatting for Large-Scale 3D Reconstruction Via Gaussian Consensus
·3216 words·16 mins·
loading
·
loading
AI Generated
Computer Vision
3D Vision
🏢 National University of Singapore
DOGS: Distributed-Oriented Gaussian Splatting accelerates large-scale 3D reconstruction by distributing the training of 3D Gaussian Splatting models across multiple machines, achieving 6x faster train…
DETAIL: Task DEmonsTration Attribution for Interpretable In-context Learning
·3087 words·15 mins·
loading
·
loading
Natural Language Processing
Large Language Models
🏢 National University of Singapore
DETAIL: A novel attribution method reveals the impact of individual demonstrations in in-context learning, boosting interpretability and improving transformer-based model performance.
Cross-Scale Self-Supervised Blind Image Deblurring via Implicit Neural Representation
·3186 words·15 mins·
loading
·
loading
Computer Vision
Image Generation
🏢 National University of Singapore
Self-supervised blind image deblurring (BID) breakthrough! A novel cross-scale consistency loss and progressive training scheme using implicit neural representations achieves superior performance wit…
Can Simple Averaging Defeat Modern Watermarks?
·3146 words·15 mins·
loading
·
loading
Computer Vision
Image Generation
🏢 National University of Singapore
Simple averaging of watermarked images reveals hidden patterns, enabling watermark removal and forgery, thus highlighting the vulnerability of content-agnostic watermarking methods.
Brain-JEPA: Brain Dynamics Foundation Model with Gradient Positioning and Spatiotemporal Masking
·2427 words·12 mins·
loading
·
loading
Self-Supervised Learning
🏢 National University of Singapore
Brain-JEPA: a novel brain dynamics foundation model leverages fMRI data via innovative gradient positioning and spatiotemporal masking to achieve state-of-the-art performance in diverse brain activity…