Posters

GaussianCube: A Structured and Explicit Radiance Representation for 3D Generative Modeling

26 September 2024·2946 words·14 mins

Computer Vision 3D Vision 🏢 Tsinghua University

GaussianCube revolutionizes 3D generative modeling with a structured, explicit radiance representation, achieving state-of-the-art results using significantly fewer parameters.

Gaussian Process Bandits for Top-k Recommendations

26 September 2024·1799 words·9 mins

Machine Learning Reinforcement Learning 🏢 University of Massachusetts Amherst

GP-TopK: A novel contextual bandit algorithm uses Gaussian processes with a Kendall kernel for efficient & accurate top-k recommendations, even with limited feedback.

Gaussian Graph Network: Learning Efficient and Generalizable Gaussian Representations from Multi-view Images

26 September 2024·2277 words·11 mins

Computer Vision 3D Vision 🏢 Tsinghua University

Gaussian Graph Network (GGN) revolutionizes novel view synthesis by efficiently generating generalizable Gaussian representations from multi-view images, achieving superior rendering quality with fewe…

Gaussian Approximation and Multiplier Bootstrap for Polyak-Ruppert Averaged Linear Stochastic Approximation with Applications to TD Learning

26 September 2024·1382 words·7 mins

Machine Learning Reinforcement Learning 🏢 HSE University

This paper delivers non-asymptotic accuracy bounds for confidence intervals in linear stochastic approximation, leveraging a novel multiplier bootstrap method.

Gated Slot Attention for Efficient Linear-Time Sequence Modeling

26 September 2024·2081 words·10 mins

AI Generated Natural Language Processing Large Language Models 🏢 Soochow University

Gated Slot Attention (GSA) enhances linear Transformers for efficient, real-time sequence modeling. GSA uses a two-layer gated linear attention structure linked by softmax, enabling improved memory ca…

Gated Inference Network: Inference and Learning State-Space Models

26 September 2024·3839 words·19 mins

Machine Learning Representation Learning 🏢 Seoul National University

GIN, a novel approximate Bayesian inference algorithm, efficiently handles nonlinear state-space models with high-dimensional, noisy observations by disentangling observation and dynamics. Achieving l…

GarmentLab: A Unified Simulation and Benchmark for Garment Manipulation

26 September 2024·2482 words·12 mins

AI Applications Robotics 🏢 Peking University

GarmentLab: A new benchmark and simulation platform tackles garment manipulation challenges by offering realistic simulations, diverse assets, and tasks bridging the sim-to-real gap for more robust AI…

GAMap: Zero-Shot Object Goal Navigation with Multi-Scale Geometric-Affordance Guidance

26 September 2024·2182 words·11 mins

Multimodal Learning Vision-Language Models 🏢 New York University Abu Dhabi

GAMap: Zero-shot object goal navigation excels by using multi-scale geometric-affordance guidance, significantly boosting robot success rates in unseen environments.

GACL: Exemplar-Free Generalized Analytic Continual Learning

26 September 2024·1993 words·10 mins

Machine Learning Continual Learning 🏢 South China University of Technology

GACL, a novel exemplar-free technique for generalized analytic continual learning, achieves superior performance by analytically solving the weight-invariant property for handling real-world data.

G3: An Effective and Adaptive Framework for Worldwide Geolocalization Using Large Multi-Modality Models

26 September 2024·2323 words·11 mins

Multimodal Learning Vision-Language Models 🏢 City University of Hong Kong

G3: A novel framework leverages Retrieval-Augmented Generation to achieve highly accurate worldwide image geolocalization, overcoming limitations of existing methods.

G2D: From Global to Dense Radiography Representation Learning via Vision-Language Pre-training

26 September 2024·2099 words·10 mins

Multimodal Learning Vision-Language Models 🏢 University of Oxford

G2D: a novel medical VLP framework achieves superior performance in medical image analysis by simultaneously learning global and dense visual features using image-text pairs without extra annotations.

G-Retriever: Retrieval-Augmented Generation for Textual Graph Understanding and Question Answering

26 September 2024·3454 words·17 mins

AI Generated Natural Language Processing Question Answering 🏢 National University of Singapore

G-Retriever: a novel RAG approach enables conversational interaction with textual graphs, improving graph understanding and question answering efficiency while mitigating hallucination.

FuseMoE: Mixture-of-Experts Transformers for Fleximodal Fusion

26 September 2024·2650 words·13 mins

Multimodal Learning Multimodal Understanding 🏢 Department of Computer Science, Johns Hopkins University

FuseMoE, a novel mixture-of-experts transformer, efficiently fuses diverse and incomplete multimodal data, achieving superior predictive performance via a unique Laplace gating function.

FUSE: Fast Unified Simulation and Estimation for PDEs

26 September 2024·7308 words·35 mins

AI Generated AI Applications Healthcare 🏢 ETH Zurich

FUSE, a novel framework, efficiently predicts continuous fields & estimates discrete parameters in PDEs, significantly improving accuracy and robustness.

Fundamental Limits of Prompt Compression: A Rate-Distortion Framework for Black-Box Language Models

26 September 2024·4898 words·23 mins

AI Generated Natural Language Processing Large Language Models 🏢 University of Texas at Austin

This paper introduces a rate-distortion framework for prompt compression in LLMs, bridging the gap between existing methods and optimal performance. By formulating prompt compression as a linear progr…

Fundamental Convergence Analysis of Sharpness-Aware Minimization

26 September 2024·2234 words·11 mins

AI Theory Optimization 🏢 Ho Chi Minh City University of Education

This research establishes fundamental convergence properties for the widely-used SAM optimization algorithm, significantly advancing our theoretical understanding and practical applications.

Functionally Constrained Algorithm Solves Convex Simple Bilevel Problem

26 September 2024·310 words·2 mins

AI Theory Optimization 🏢 Tsinghua University

New algorithms solve convex simple bilevel problems by reformulating them as functionally constrained problems, achieving near-optimal convergence rates.

Functional Gradient Flows for Constrained Sampling

26 September 2024·3022 words·15 mins

AI Generated Machine Learning Deep Learning 🏢 Peking University

Constrained sampling solved! New functional gradient flow method (CFG) efficiently samples from constrained probability distributions via a novel boundary condition for gradient flows, achieving prov…

Fully Explicit Dynamic Gaussian Splatting

26 September 2024·3268 words·16 mins

AI Generated Computer Vision 3D Vision 🏢 School of Electrical Engineering and Computer Science

Ex4DGS achieves real-time high-quality dynamic scene rendering using explicit 4D Gaussian representations and keyframe interpolation.

Full-Distance Evasion of Pedestrian Detectors in the Physical World

26 September 2024·2691 words·13 mins

Computer Vision Object Detection 🏢 Tsinghua University

Researchers developed Full Distance Attack (FDA) to generate adversarial patterns effective against pedestrian detectors across all distances, resolving the appearance gap issue between simulated and …