Posters
2024
Homology Consistency Constrained Efficient Tuning for Vision-Language Models
·1675 words·8 mins·
loading
·
loading
Multimodal Learning
Vision-Language Models
π’ University of Science and Technology of China
Constraining vision-language model tuning via persistent homology ensures consistent image-text alignment, improving few-shot learning and domain generalization.
Hollowed Net for On-Device Personalization of Text-to-Image Diffusion Models
·2415 words·12 mins·
loading
·
loading
Computer Vision
Image Generation
π’ Qualcomm AI Research
Hollowed Net efficiently personalizes text-to-image diffusion models on-device by temporarily removing deep U-Net layers during training, drastically reducing memory usage without sacrificing performa…
HOI-Swap: Swapping Objects in Videos with Hand-Object Interaction Awareness
·4260 words·20 mins·
loading
·
loading
AI Generated
Computer Vision
Video Understanding
π’ University of Texas at Austin
HOI-Swap: a novel diffusion model flawlessly swaps objects in videos while intelligently preserving natural hand interactions, producing high-quality edits.
HLM-Cite: Hybrid Language Model Workflow for Text-based Scientific Citation Prediction
·2361 words·12 mins·
loading
·
loading
AI Generated
Natural Language Processing
Large Language Models
π’ Tsinghua University
HLM-Cite: A hybrid language model workflow boosts scientific citation prediction accuracy by 17.6% and scales to 100K candidate papers, surpassing existing methods.
Historical Test-time Prompt Tuning for Vision Foundation Models
·2286 words·11 mins·
loading
·
loading
Computer Vision
Image Segmentation
π’ Nanyang Technological University
HisTPT: Historical Test-Time Prompt Tuning memorizes past learning, enabling robust online prompt adaptation for vision models, overcoming performance degradation in continuously changing data streams…
HippoRAG: Neurobiologically Inspired Long-Term Memory for Large Language Models
·3025 words·15 mins·
loading
·
loading
Natural Language Processing
Large Language Models
π’ Ohio State University
HippoRAG, a neurobiologically inspired framework, dramatically improves LLM long-term memory and multi-hop question answering by synergistically orchestrating LLMs, knowledge graphs, and the Personali…
Higher-Rank Irreducible Cartesian Tensors for Equivariant Message Passing
·3519 words·17 mins·
loading
·
loading
AI Generated
Machine Learning
Deep Learning
π’ NEC Laboratories Europe
Higher-rank irreducible Cartesian tensors boost accuracy and efficiency in equivariant message-passing neural networks for atomistic simulations.
Higher-Order Causal Message Passing for Experimentation with Complex Interference
·1660 words·8 mins·
loading
·
loading
AI Theory
Causality
π’ Stanford University
Higher-Order Causal Message Passing (HO-CMP) accurately estimates treatment effects in complex systems with unknown interference by using observed data to learn the system’s dynamics over time.
High-Resolution Image Harmonization with Adaptive-Interval Color Transformation
·3030 words·15 mins·
loading
·
loading
Computer Vision
Image Generation
π’ Harbin Institute of Technology
AICT: Adaptive-Interval Color Transformation harmonizes high-resolution images by predicting pixel-wise color changes, adaptively adjusting sampling intervals to capture local variations, and using a …
High-probability complexity bounds for stochastic non-convex minimax optimization
·1500 words·8 mins·
loading
·
loading
AI Theory
Optimization
π’ UniversitΓ© CΓ΄te D'Azur
First high-probability complexity guarantees for solving stochastic nonconvex minimax problems using a single-loop method are established.
High-dimensional (Group) Adversarial Training in Linear Regression
·1556 words·8 mins·
loading
·
loading
AI Generated
Machine Learning
Optimization
π’ Georgia Institute of Technology
Adversarial training achieves minimax-optimal prediction error in high-dimensional linear regression under lβ-perturbation, improving upon existing methods.
High Rank Path Development: an approach to learning the filtration of stochastic processes
·2124 words·10 mins·
loading
·
loading
AI Applications
Finance
π’ Institute of Mathematical Sciences
High-Rank PCF-GAN uses a novel metric (HRPCFD) based on high-rank path development to learn filtration of stochastic processes, outperforming state-of-the-art methods in hypothesis testing and time-se…
Hierarchical Visual Feature Aggregation for OCR-Free Document Understanding
·2062 words·10 mins·
loading
·
loading
Multimodal Learning
Vision-Language Models
π’ ECE & 2IPAI, Seoul National University
This paper introduces HVFA, a novel OCR-free document understanding framework using MLLMs and multi-scale visual features, achieving superior performance across various document understanding tasks.
Hierarchical Uncertainty Exploration via Feedforward Posterior Trees
·5486 words·26 mins·
loading
·
loading
AI Generated
Computer Vision
Image Generation
π’ Technion-Israel Institute of Technology
Visualizing high-dimensional posterior distributions is challenging. This paper introduces ‘Posterior Trees,’ a novel method using tree-structured neural network predictions for hierarchical uncertai…
Hierarchical Selective Classification
·2174 words·11 mins·
loading
·
loading
Computer Vision
Image Classification
π’ Technion
Hierarchical Selective Classification (HSC) improves deep learning model reliability for risk-sensitive tasks by leveraging hierarchical class relationships to provide more informative predictions eve…
Hierarchical Programmatic Option Framework
·5774 words·28 mins·
loading
·
loading
AI Generated
Machine Learning
Reinforcement Learning
π’ National Taiwan University
Hierarchical Programmatic Option framework (HIPO) uses human-readable programs as options in reinforcement learning to solve long, repetitive tasks with improved interpretability and generalization.
Hierarchical Hybrid Sliced Wasserstein: A Scalable Metric for Heterogeneous Joint Distributions
·2222 words·11 mins·
loading
·
loading
Machine Learning
Deep Learning
π’ University of Texas at Austin
Hierarchical Hybrid Sliced Wasserstein (H2SW) solves the challenge of comparing complex, heterogeneous joint distributions by introducing novel slicing operators, leading to a scalable and statistical…
Hierarchical Federated Learning with Multi-Timescale Gradient Correction
·2189 words·11 mins·
loading
·
loading
Machine Learning
Federated Learning
π’ Purdue University
MTGC tackles multi-timescale model drift in hierarchical federated learning.
HiCoM: Hierarchical Coherent Motion for Dynamic Streamable Scenes with 3D Gaussian Splatting
·2356 words·12 mins·
loading
·
loading
Computer Vision
3D Vision
π’ Peking University
HiCoM, a novel framework, achieves high-fidelity streamable dynamic scene reconstruction by using a hierarchical coherent motion mechanism and parallel processing to significantly reduce training time…
HiCo: Hierarchical Controllable Diffusion Model for Layout-to-image Generation
·3034 words·15 mins·
loading
·
loading
Computer Vision
Image Generation
π’ 360 AI Research
HiCo: Hierarchical Controllable Diffusion Model achieves superior layout-to-image generation by disentangling spatial layouts through a multi-branch network structure, resulting in high-quality images…