Skip to main content

Posters

2024

GaussianCube: A Structured and Explicit Radiance Representation for 3D Generative Modeling
·2946 words·14 mins· loading · loading
Computer Vision 3D Vision 🏢 Tsinghua University
GaussianCube revolutionizes 3D generative modeling with a structured, explicit radiance representation, achieving state-of-the-art results using significantly fewer parameters.
Gaussian Process Bandits for Top-k Recommendations
·1799 words·9 mins· loading · loading
Machine Learning Reinforcement Learning 🏢 University of Massachusetts Amherst
GP-TopK: A novel contextual bandit algorithm uses Gaussian processes with a Kendall kernel for efficient & accurate top-k recommendations, even with limited feedback.
Gaussian Graph Network: Learning Efficient and Generalizable Gaussian Representations from Multi-view Images
·2277 words·11 mins· loading · loading
Computer Vision 3D Vision 🏢 Tsinghua University
Gaussian Graph Network (GGN) revolutionizes novel view synthesis by efficiently generating generalizable Gaussian representations from multi-view images, achieving superior rendering quality with fewe…
Gaussian Approximation and Multiplier Bootstrap for Polyak-Ruppert Averaged Linear Stochastic Approximation with Applications to TD Learning
·1382 words·7 mins· loading · loading
Machine Learning Reinforcement Learning 🏢 HSE University
This paper delivers non-asymptotic accuracy bounds for confidence intervals in linear stochastic approximation, leveraging a novel multiplier bootstrap method.
Gated Slot Attention for Efficient Linear-Time Sequence Modeling
·2081 words·10 mins· loading · loading
AI Generated Natural Language Processing Large Language Models 🏢 Soochow University
Gated Slot Attention (GSA) enhances linear Transformers for efficient, real-time sequence modeling. GSA uses a two-layer gated linear attention structure linked by softmax, enabling improved memory ca…
Gated Inference Network: Inference and Learning State-Space Models
·3839 words·19 mins· loading · loading
Machine Learning Representation Learning 🏢 Seoul National University
GIN, a novel approximate Bayesian inference algorithm, efficiently handles nonlinear state-space models with high-dimensional, noisy observations by disentangling observation and dynamics. Achieving l…
GarmentLab: A Unified Simulation and Benchmark for Garment Manipulation
·2482 words·12 mins· loading · loading
AI Applications Robotics 🏢 Peking University
GarmentLab: A new benchmark and simulation platform tackles garment manipulation challenges by offering realistic simulations, diverse assets, and tasks bridging the sim-to-real gap for more robust AI…
GAMap: Zero-Shot Object Goal Navigation with Multi-Scale Geometric-Affordance Guidance
·2182 words·11 mins· loading · loading
Multimodal Learning Vision-Language Models 🏢 New York University Abu Dhabi
GAMap: Zero-shot object goal navigation excels by using multi-scale geometric-affordance guidance, significantly boosting robot success rates in unseen environments.
GACL: Exemplar-Free Generalized Analytic Continual Learning
·1993 words·10 mins· loading · loading
Machine Learning Continual Learning 🏢 South China University of Technology
GACL: a novel exemplar-free technique for generalized analytic continual learning, achieves superior performance by analytically solving the weight-invariant property for handling real-world data.
G3: An Effective and Adaptive Framework for Worldwide Geolocalization Using Large Multi-Modality Models
·2323 words·11 mins· loading · loading
Multimodal Learning Vision-Language Models 🏢 City University of Hong Kong
G3: A novel framework leverages Retrieval-Augmented Generation to achieve highly accurate worldwide image geolocalization, overcoming limitations of existing methods.
G2D: From Global to Dense Radiography Representation Learning via Vision-Language Pre-training
·2099 words·10 mins· loading · loading
Multimodal Learning Vision-Language Models 🏢 University of Oxford
G2D: a novel medical VLP framework achieves superior performance in medical image analysis by simultaneously learning global and dense visual features using image-text pairs without extra annotations.
G-Retriever: Retrieval-Augmented Generation for Textual Graph Understanding and Question Answering
·3454 words·17 mins· loading · loading
AI Generated Natural Language Processing Question Answering 🏢 National University of Singapore
G-Retriever: a novel RAG approach enables conversational interaction with textual graphs, improving graph understanding and question answering efficiency while mitigating hallucination.
FuseMoE: Mixture-of-Experts Transformers for Fleximodal Fusion
·2650 words·13 mins· loading · loading
Multimodal Learning Multimodal Understanding 🏢 Department of Computer Science Johns Hopkins University
FuseMoE, a novel mixture-of-experts transformer, efficiently fuses diverse and incomplete multimodal data, achieving superior predictive performance via a unique Laplace gating function.
FUSE: Fast Unified Simulation and Estimation for PDEs
·7308 words·35 mins· loading · loading
AI Generated AI Applications Healthcare 🏢 ETH Zurich
FUSE, a novel framework, efficiently predicts continuous fields & estimates discrete parameters in PDEs, significantly improving accuracy and robustness.
Fundamental Limits of Prompt Compression: A Rate-Distortion Framework for Black-Box Language Models
·4898 words·23 mins· loading · loading
AI Generated Natural Language Processing Large Language Models 🏢 University of Texas at Austin
This paper introduces a rate-distortion framework for prompt compression in LLMs, bridging the gap between existing methods and optimal performance. By formulating prompt compression as a linear progr…
Fundamental Convergence Analysis of Sharpness-Aware Minimization
·2234 words·11 mins· loading · loading
AI Theory Optimization 🏢 Ho Chi Minh City University of Education
This research establishes fundamental convergence properties for the widely-used SAM optimization algorithm, significantly advancing our theoretical understanding and practical applications.
Functionally Constrained Algorithm Solves Convex Simple Bilevel Problem
·310 words·2 mins· loading · loading
AI Theory Optimization 🏢 Tsinghua University
Near-optimal algorithms solve convex simple bilevel problems by reformulating them into functionally constrained problems, achieving near-optimal convergence rates.
Functional Gradient Flows for Constrained Sampling
·3022 words·15 mins· loading · loading
AI Generated Machine Learning Deep Learning 🏢 Peking University
Constrained sampling solved! New functional gradient flow method (CFG) efficiently samples from constrained probability distributions via a novel boundary condition for gradient flows, achieving prov…
Fully Explicit Dynamic Gaussian Splatting
·3268 words·16 mins· loading · loading
AI Generated Computer Vision 3D Vision 🏢 School of Electrical Engineering and Computer Science
Ex4DGS achieves real-time high-quality dynamic scene rendering using explicit 4D Gaussian representations and keyframe interpolation.
Full-Distance Evasion of Pedestrian Detectors in the Physical World
·2691 words·13 mins· loading · loading
Computer Vision Object Detection 🏢 Tsinghua University
Researchers developed Full Distance Attack (FDA) to generate adversarial patterns effective against pedestrian detectors across all distances, resolving the appearance gap issue between simulated and …