Posters

LACIE: Listener-Aware Finetuning for Calibration in Large Language Models

26 September 2024·2396 words·12 mins· loading · loading

Natural Language Processing Large Language Models 🏢 UNC Chapel Hill

LACIE: Listener-aware finetuning improves LLM confidence calibration, reducing incorrect answers accepted by human listeners by 47% while maintaining correct answer acceptance.

Label Noise: Ignorance Is Bliss

26 September 2024·3504 words·17 mins· loading · loading

AI Generated Machine Learning Semi-Supervised Learning 🏢 University of Michigan

Ignorance is bliss: A new framework shows ignoring label noise in multi-class classification can achieve state-of-the-art performance, especially when using self-supervised feature extraction.

Label Delay in Online Continual Learning

26 September 2024·4705 words·23 mins· loading · loading

AI Generated Machine Learning Continual Learning 🏢 University of Oxford

Bridging the accuracy gap in online continual learning caused by label delays, a new framework with Importance Weighted Memory Sampling prioritizes relevant memory samples, significantly outperforming…

L4GM: Large 4D Gaussian Reconstruction Model

26 September 2024·2618 words·13 mins· loading · loading

Computer Vision 3D Vision 🏢 University of Toronto

L4GM: The first 4D model generating high-quality animated 3D objects from single-view videos in a single feed-forward pass.

L-TTA: Lightweight Test-Time Adaptation Using a Versatile Stem Layer

26 September 2024·2871 words·14 mins· loading · loading

AI Generated Computer Vision Image Classification 🏢 Seoul National University of Science and Technology

L-TTA: A lightweight test-time adaptation method using a versatile stem layer minimizes channel-wise uncertainty for rapid and memory-efficient adaptation to new domains.

KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization

26 September 2024·5270 words·25 mins· loading · loading

AI Generated Natural Language Processing Large Language Models 🏢 UC Berkeley

KVQuant achieves <0.1 perplexity degradation with 3-bit quantization in LLMs by using per-channel key quantization, pre-RoPE quantization, and non-uniform quantization, enabling 10M context length inf…

KV Cache is 1 Bit Per Channel: Efficient Large Language Model Inference with Coupled Quantization

26 September 2024·3037 words·15 mins· loading · loading

Natural Language Processing Large Language Models 🏢 Dept. of Computer Science, Rice University

Boost LLM inference speed 1.4-3.5x by using Coupled Quantization (CQ) to compress KV cache down to 1 bit per channel, while preserving model accuracy.

Kraken: Inherently Parallel Transformers For Efficient Multi-Device Inference

26 September 2024·2061 words·10 mins· loading · loading

Natural Language Processing Large Language Models 🏢 Princeton University

Kraken: A new Transformer architecture boosts multi-device inference speed by 35.6% by cleverly overlapping communication with computation.

KptLLM: Unveiling the Power of Large Language Model for Keypoint Comprehension

26 September 2024·1673 words·8 mins· loading · loading

Natural Language Processing Large Language Models 🏢 University of Hong Kong

KptLLM: A novel multimodal model leverages LLMs for superior keypoint comprehension, outperforming existing methods in various benchmarks.

KOALA: Empirical Lessons Toward Memory-Efficient and Fast Diffusion Models for Text-to-Image Synthesis

26 September 2024·5238 words·25 mins· loading · loading

Computer Vision Image Generation 🏢 Electronics and Telecommunications Research Institute

KOALA: New efficient text-to-image diffusion models achieving 4x speed and 69% size reduction of SDXL, generating 1024px images on consumer GPUs with 8GB VRAM.

Knowledge-Empowered Dynamic Graph Network for Irregularly Sampled Medical Time Series

26 September 2024·3904 words·19 mins· loading · loading

AI Generated AI Applications Healthcare 🏢 South China University of Technology

KEDGN, a novel graph neural network, leverages medical knowledge to model variable-specific temporal dependencies and dynamic inter-variable correlations in irregularly sampled medical time series, si…

Knowledge Graph Completion by Intermediate Variables Regularization

26 September 2024·2107 words·10 mins· loading · loading

AI Generated Machine Learning Deep Learning 🏢 Fudan University

Novel intermediate variables regularization boosts knowledge graph completion!

Knowledge Composition using Task Vectors with Learned Anisotropic Scaling

26 September 2024·4960 words·24 mins· loading · loading

AI Generated Computer Vision Few-Shot Learning 🏢 Australian Institute for Machine Learning

aTLAS: a novel parameter-efficient fine-tuning method using learned anisotropic scaling of task vectors for enhanced knowledge composition and transfer.

Knowledge Circuits in Pretrained Transformers

26 September 2024·3083 words·15 mins· loading · loading

Natural Language Processing Large Language Models 🏢 Zhejiang University

Researchers unveil ‘knowledge circuits’ within LLMs, revealing how knowledge is collaboratively encoded and utilized, leading to improved LLM design and interpretations of model behavior.

KnowGPT: Knowledge Graph based Prompting for Large Language Models

26 September 2024·1971 words·10 mins· loading · loading

Natural Language Processing Question Answering 🏢 Hong Kong Polytechnic University

KnowGPT: A novel framework boosts Large Language Model accuracy by intelligently integrating knowledge graphs, significantly reducing factual errors and achieving near-human performance on benchmark d…

KG-FIT: Knowledge Graph Fine-Tuning Upon Open-World Knowledge

26 September 2024·3104 words·15 mins· loading · loading

Natural Language Processing Large Language Models 🏢 University of Illinois at Urbana-Champaign

KG-FIT boosts knowledge graph embedding by smartly integrating open-world knowledge from LLMs, achieving significant performance gains.

KFNN: K-Free Nearest Neighbor For Crowdsourcing

26 September 2024·1335 words·7 mins· loading · loading

AI Applications Healthcare 🏢 China University of Geosciences

KFNN dynamically determines optimal neighborhood sizes for label integration in crowdsourcing, significantly boosting accuracy and robustness, especially with limited noisy labels.

Key-Grid: Unsupervised 3D Keypoints Detection using Grid Heatmap Features

26 September 2024·2426 words·12 mins· loading · loading

Computer Vision 3D Vision 🏢 Peking University

Key-Grid: An unsupervised 3D keypoint detector achieving state-of-the-art semantic consistency and accuracy for both rigid and deformable objects using novel grid heatmap features.

Kernel-Based Function Approximation for Average Reward Reinforcement Learning: An Optimist No-Regret Algorithm

26 September 2024·311 words·2 mins· loading · loading

Machine Learning Reinforcement Learning 🏢 MediaTek Research

Novel optimistic RL algorithm using kernel methods achieves no-regret performance in the challenging infinite-horizon average-reward setting.

Kernel PCA for Out-of-Distribution Detection

26 September 2024·2628 words·13 mins· loading · loading

AI Generated Machine Learning Deep Learning 🏢 Shanghai Jiao Tong University

Boosting Out-of-Distribution Detection with Kernel PCA!