Posters
2024
GaussianCube: A Structured and Explicit Radiance Representation for 3D Generative Modeling
·2946 words·14 mins·
loading
·
loading
Computer Vision
3D Vision
🏢 Tsinghua University
GaussianCube revolutionizes 3D generative modeling with a structured, explicit radiance representation, achieving state-of-the-art results using significantly fewer parameters.
Gaussian Process Bandits for Top-k Recommendations
·1799 words·9 mins·
loading
·
loading
Machine Learning
Reinforcement Learning
🏢 University of Massachusetts Amherst
GP-TopK: A novel contextual bandit algorithm uses Gaussian processes with a Kendall kernel for efficient & accurate top-k recommendations, even with limited feedback.
Gaussian Graph Network: Learning Efficient and Generalizable Gaussian Representations from Multi-view Images
·2277 words·11 mins·
loading
·
loading
Computer Vision
3D Vision
🏢 Tsinghua University
Gaussian Graph Network (GGN) revolutionizes novel view synthesis by efficiently generating generalizable Gaussian representations from multi-view images, achieving superior rendering quality with fewe…
Gaussian Approximation and Multiplier Bootstrap for Polyak-Ruppert Averaged Linear Stochastic Approximation with Applications to TD Learning
·1382 words·7 mins·
loading
·
loading
Machine Learning
Reinforcement Learning
🏢 HSE University
This paper delivers non-asymptotic accuracy bounds for confidence intervals in linear stochastic approximation, leveraging a novel multiplier bootstrap method.
Gated Slot Attention for Efficient Linear-Time Sequence Modeling
·2081 words·10 mins·
loading
·
loading
AI Generated
Natural Language Processing
Large Language Models
🏢 Soochow University
Gated Slot Attention (GSA) enhances linear Transformers for efficient, real-time sequence modeling. GSA uses a two-layer gated linear attention structure linked by softmax, enabling improved memory ca…
Gated Inference Network: Inference and Learning State-Space Models
·3839 words·19 mins·
loading
·
loading
Machine Learning
Representation Learning
🏢 Seoul National University
GIN, a novel approximate Bayesian inference algorithm, efficiently handles nonlinear state-space models with high-dimensional, noisy observations by disentangling observation and dynamics. Achieving l…
GarmentLab: A Unified Simulation and Benchmark for Garment Manipulation
·2482 words·12 mins·
loading
·
loading
AI Applications
Robotics
🏢 Peking University
GarmentLab: A new benchmark and simulation platform tackles garment manipulation challenges by offering realistic simulations, diverse assets, and tasks bridging the sim-to-real gap for more robust AI…
GAMap: Zero-Shot Object Goal Navigation with Multi-Scale Geometric-Affordance Guidance
·2182 words·11 mins·
loading
·
loading
Multimodal Learning
Vision-Language Models
🏢 New York University Abu Dhabi
GAMap: Zero-shot object goal navigation excels by using multi-scale geometric-affordance guidance, significantly boosting robot success rates in unseen environments.
GACL: Exemplar-Free Generalized Analytic Continual Learning
·1993 words·10 mins·
loading
·
loading
Machine Learning
Continual Learning
🏢 South China University of Technology
GACL: a novel exemplar-free technique for generalized analytic continual learning, achieves superior performance by analytically solving the weight-invariant property for handling real-world data.
G3: An Effective and Adaptive Framework for Worldwide Geolocalization Using Large Multi-Modality Models
·2323 words·11 mins·
loading
·
loading
Multimodal Learning
Vision-Language Models
🏢 City University of Hong Kong
G3: A novel framework leverages Retrieval-Augmented Generation to achieve highly accurate worldwide image geolocalization, overcoming limitations of existing methods.
G2D: From Global to Dense Radiography Representation Learning via Vision-Language Pre-training
·2099 words·10 mins·
loading
·
loading
Multimodal Learning
Vision-Language Models
🏢 University of Oxford
G2D: a novel medical VLP framework achieves superior performance in medical image analysis by simultaneously learning global and dense visual features using image-text pairs without extra annotations.
G-Retriever: Retrieval-Augmented Generation for Textual Graph Understanding and Question Answering
·3454 words·17 mins·
loading
·
loading
AI Generated
Natural Language Processing
Question Answering
🏢 National University of Singapore
G-Retriever: a novel RAG approach enables conversational interaction with textual graphs, improving graph understanding and question answering efficiency while mitigating hallucination.
FuseMoE: Mixture-of-Experts Transformers for Fleximodal Fusion
·2650 words·13 mins·
loading
·
loading
Multimodal Learning
Multimodal Understanding
🏢 Department of Computer Science Johns Hopkins University
FuseMoE, a novel mixture-of-experts transformer, efficiently fuses diverse and incomplete multimodal data, achieving superior predictive performance via a unique Laplace gating function.
FUSE: Fast Unified Simulation and Estimation for PDEs
·7308 words·35 mins·
loading
·
loading
AI Generated
AI Applications
Healthcare
🏢 ETH Zurich
FUSE, a novel framework, efficiently predicts continuous fields & estimates discrete parameters in PDEs, significantly improving accuracy and robustness.
Fundamental Limits of Prompt Compression: A Rate-Distortion Framework for Black-Box Language Models
·4898 words·23 mins·
loading
·
loading
AI Generated
Natural Language Processing
Large Language Models
🏢 University of Texas at Austin
This paper introduces a rate-distortion framework for prompt compression in LLMs, bridging the gap between existing methods and optimal performance. By formulating prompt compression as a linear progr…
Fundamental Convergence Analysis of Sharpness-Aware Minimization
·2234 words·11 mins·
loading
·
loading
AI Theory
Optimization
🏢 Ho Chi Minh City University of Education
This research establishes fundamental convergence properties for the widely-used SAM optimization algorithm, significantly advancing our theoretical understanding and practical applications.
Functionally Constrained Algorithm Solves Convex Simple Bilevel Problem
·310 words·2 mins·
loading
·
loading
AI Theory
Optimization
🏢 Tsinghua University
Near-optimal algorithms solve convex simple bilevel problems by reformulating them into functionally constrained problems, achieving near-optimal convergence rates.
Functional Gradient Flows for Constrained Sampling
·3022 words·15 mins·
loading
·
loading
AI Generated
Machine Learning
Deep Learning
🏢 Peking University
Constrained sampling solved! New functional gradient flow method (CFG) efficiently samples from constrained probability distributions via a novel boundary condition for gradient flows, achieving prov…
Fully Explicit Dynamic Gaussian Splatting
·3268 words·16 mins·
loading
·
loading
AI Generated
Computer Vision
3D Vision
🏢 School of Electrical Engineering and Computer Science
Ex4DGS achieves real-time high-quality dynamic scene rendering using explicit 4D Gaussian representations and keyframe interpolation.
Full-Distance Evasion of Pedestrian Detectors in the Physical World
·2691 words·13 mins·
loading
·
loading
Computer Vision
Object Detection
🏢 Tsinghua University
Researchers developed Full Distance Attack (FDA) to generate adversarial patterns effective against pedestrian detectors across all distances, resolving the appearance gap issue between simulated and …