Natural Language Processing

Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies
·2496 words·12 mins
Natural Language Processing Large Language Models 🏢 Stanford University
Boosting LLM performance: this research shows that larger language models need bigger vocabularies to reach optimal training efficiency and downstream performance.
Scaling Laws for Reward Model Overoptimization in Direct Alignment Algorithms
·2949 words·14 mins
Natural Language Processing Large Language Models 🏢 Stanford University
Direct Alignment Algorithms (DAAs) for LLM alignment suffer from over-optimization even without explicit reward models; this paper demonstrates this empirically and proposes scaling laws to understand and predict it.
SaulLM-54B & SaulLM-141B: Scaling Up Domain Adaptation for the Legal Domain
·2626 words·13 mins
AI Generated Natural Language Processing Large Language Models 🏢 CINES
SaulLM-54B & SaulLM-141B achieve state-of-the-art performance on legal tasks by scaling up model size, employing a specialized instruction-following protocol, and aligning model outputs with human preferences.
SafeWorld: Geo-Diverse Safety Alignment
·3977 words·19 mins
Natural Language Processing Large Language Models 🏢 UC Los Angeles
SafeWorld: a new benchmark that reveals, and helps fix, LLMs’ struggles with geo-diverse safety standards.
S²FT: Efficient, Scalable and Generalizable LLM Fine-tuning by Structured Sparsity
·1908 words·9 mins
Natural Language Processing Large Language Models 🏢 Carnegie Mellon University
S2FT: Structured Sparse Fine-Tuning achieves state-of-the-art LLM fine-tuning performance, training efficiency, and inference scalability by selecting sparsely and computing densely.
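To make the “select sparsely, compute densely” idea concrete, here is a minimal PyTorch sketch. It is not the paper’s selection rule: the row subset is chosen at random and the `row_fraction` parameter is purely illustrative; S2FT selects structured components (e.g. attention heads and FFN channels) by more principled criteria.

```python
import torch
import torch.nn as nn

def apply_structured_sparse_ft(linear: nn.Linear, row_fraction: float = 0.05):
    """Restrict gradient updates to a (here: random) subset of output rows.

    The forward pass stays dense; only the selected rows are trained,
    which is the "select sparsely, compute densely" pattern.
    """
    n_rows = linear.weight.shape[0]
    selected = torch.randperm(n_rows)[: max(1, int(row_fraction * n_rows))]
    mask = torch.zeros(n_rows, 1)
    mask[selected] = 1.0

    # Zero out gradients for all non-selected rows after backward.
    linear.weight.register_hook(lambda grad: grad * mask.to(grad.device))
    return selected

layer = nn.Linear(4096, 4096)
rows = apply_structured_sparse_ft(layer, row_fraction=0.05)
out = layer(torch.randn(2, 4096))
out.sum().backward()
print(f"trainable rows: {rows.numel()} / {layer.weight.shape[0]}")
```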
S-STE: Continuous Pruning Function for Efficient 2:4 Sparse Pre-training
·2718 words·13 mins
AI Generated Natural Language Processing Large Language Models 🏢 Tsinghua University
S-STE achieves efficient 2:4 sparse pre-training by introducing a novel continuous pruning function, overcoming the limitations of previous methods and leading to improved accuracy and speed.
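For context, the hard 2:4 projection that S-STE’s continuous pruning function replaces looks roughly like the sketch below (a generic reference implementation, not code from the paper): in every group of four consecutive weights, the two largest-magnitude entries are kept and the rest are zeroed.

```python
import torch

def prune_2_to_4(w: torch.Tensor) -> torch.Tensor:
    """Hard 2:4 pruning: in every group of 4 weights, keep the 2 with the
    largest magnitude and zero the rest.

    S-STE replaces this hard, discontinuous projection with a continuous
    pruning function; the hard version is shown here only for reference.
    """
    groups = w.reshape(-1, 4)
    idx = groups.abs().topk(2, dim=-1).indices            # 2 largest per group
    mask = torch.zeros_like(groups).scatter_(1, idx, 1.0)
    return (groups * mask).reshape(w.shape)

w = torch.randn(8, 8)
print(prune_2_to_4(w))  # exactly two nonzeros in every group of four
```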
RSA: Resolving Scale Ambiguities in Monocular Depth Estimators through Language Descriptions
·2064 words·10 mins
Natural Language Processing Vision-Language Models 🏢 Yale University
RSA: Language unlocks metric depth from single images!
RouterDC: Query-Based Router by Dual Contrastive Learning for Assembling Large Language Models
·3132 words·15 mins
AI Generated Natural Language Processing Large Language Models 🏢 Hong Kong University of Science and Technology
RouterDC: a query-based router trained via dual contrastive learning assembles multiple LLMs, significantly outperforming individual LLMs and existing routing methods on both in- and out-of-distribution tasks.
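A minimal sketch of the sample-LLM half of a dual contrastive objective, assuming an InfoNCE-style loss that pulls a query embedding toward a learnable embedding of the best-performing LLM; the second (sample-sample) contrastive term and RouterDC’s actual positive/negative selection are omitted, and all names here are illustrative.

```python
import torch
import torch.nn.functional as F

def query_llm_contrastive_loss(q: torch.Tensor,
                               llm_emb: torch.Tensor,
                               pos_idx: int,
                               tau: float = 0.07) -> torch.Tensor:
    """InfoNCE-style loss pulling a query embedding toward the embedding of
    the LLM that answered it best, and away from the other LLM embeddings.

    q: (d,) query embedding; llm_emb: (num_llms, d) learnable LLM embeddings.
    """
    sims = F.cosine_similarity(q.unsqueeze(0), llm_emb) / tau  # (num_llms,)
    return F.cross_entropy(sims.unsqueeze(0), torch.tensor([pos_idx]))

q = torch.randn(256)                                # encoded query
llm_emb = torch.randn(7, 256, requires_grad=True)   # 7 candidate LLMs
loss = query_llm_contrastive_loss(q, llm_emb, pos_idx=2)
loss.backward()
print(loss.item())
```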
RL on Incorrect Synthetic Data Scales the Efficiency of LLM Math Reasoning by Eight-Fold
·2612 words·13 mins
AI Generated Natural Language Processing Large Language Models 🏢 Carnegie Mellon University
Leveraging model-generated synthetic data for LLM finetuning, with both positive examples and strategically constructed negative (incorrect) examples, yields an eight-fold increase in the efficiency of math reasoning.
Risk-Averse Fine-tuning of Large Language Models
·3716 words·18 mins
Natural Language Processing Large Language Models 🏢 Amazon
Risk-Averse RLHF fine-tunes LLMs to minimize toxic outputs while maintaining performance.
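A minimal sketch of a CVaR-style risk measure of the kind used for risk-averse fine-tuning, assuming per-generation reward-model scores; the `alpha` value and the random rewards are illustrative. Optimizing the mean of the worst tail, rather than the overall mean, penalizes rare toxic generations.

```python
import numpy as np

def cvar(rewards: np.ndarray, alpha: float = 0.1) -> float:
    """Conditional Value at Risk: mean reward over the worst alpha-fraction.

    Optimizing CVaR instead of the plain mean focuses training on the tail
    of bad (e.g. toxic) generations rather than the average case.
    """
    k = max(1, int(alpha * len(rewards)))
    worst = np.sort(rewards)[:k]      # the k lowest-reward generations
    return float(worst.mean())

rewards = np.random.randn(1000)       # stand-in for reward-model scores
print(f"mean reward:     {rewards.mean():.3f}")
print(f"CVaR(alpha=0.1): {cvar(rewards, 0.1):.3f}")  # lower than the mean
```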
Richelieu: Self-Evolving LLM-Based Agents for AI Diplomacy
·2475 words·12 mins
AI Generated Natural Language Processing Large Language Models 🏢 Peking University
Richelieu: a self-evolving LLM-based AI agent masters Diplomacy, a complex game of strategic planning and negotiation, without human data, using self-play for continuous improvement.
Reversing the Forget-Retain Objectives: An Efficient LLM Unlearning Framework from Logit Difference
·2947 words·14 mins
Natural Language Processing Large Language Models 🏢 UC Santa Barbara
Reverse the forget-retain objectives for efficient LLM unlearning!
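A minimal sketch of the logit-difference idea, assuming the unlearned distribution is obtained by subtracting a scaled assistant-model logit from the base model’s logit; the assistant is trained with the reversed objectives, and `alpha` here is an illustrative weight, not the paper’s setting.

```python
import torch

def unlearned_logits(base_logits: torch.Tensor,
                     assistant_logits: torch.Tensor,
                     alpha: float = 1.0) -> torch.Tensor:
    """Combine base and assistant logits by subtraction.

    The assistant is trained with the *reversed* objectives (remember the
    forget set, forget the retain set), so subtracting its logits suppresses
    exactly the knowledge to be removed.
    """
    return base_logits - alpha * assistant_logits

vocab = 32000
base = torch.randn(1, vocab)        # logits from the full model
assistant = torch.randn(1, vocab)   # logits from the reversed-objective model
probs = torch.softmax(unlearned_logits(base, assistant, alpha=0.8), dim=-1)
print(probs.shape)  # torch.Size([1, 32000])
```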
Rethinking Memory and Communication Costs for Efficient Data Parallel Training of Large Language Models
·2992 words·15 mins
Natural Language Processing Large Language Models 🏢 Ant Group
PaRO boosts LLM training speed by up to 266% through refined model state partitioning and optimized communication.
Rethinking LLM Memorization through the Lens of Adversarial Compression
·2014 words·10 mins
Natural Language Processing Large Language Models 🏢 Carnegie Mellon University
Researchers propose Adversarial Compression Ratio (ACR) to assess LLM memorization, offering an adversarial, flexible, and computationally efficient method for monitoring data misuse and compliance.
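A minimal sketch of the ratio itself, assuming token counts for the target string and for the shortest adversarial prompt found by some search procedure (the search, which is the hard part, is not shown): a string counts as memorized when the prompt compresses it, i.e. when ACR exceeds 1.

```python
def adversarial_compression_ratio(target_tokens: list[int],
                                  prompt_tokens: list[int]) -> float:
    """ACR = len(target) / len(shortest eliciting prompt).

    A string counts as memorized when the shortest adversarial prompt that
    makes the model reproduce it is shorter than the string itself.
    """
    return len(target_tokens) / len(prompt_tokens)

# Illustrative token counts only; finding the shortest prompt requires an
# adversarial search (e.g. GCG-style optimization) not shown here.
acr = adversarial_compression_ratio(target_tokens=list(range(120)),
                                    prompt_tokens=list(range(30)))
print(f"ACR = {acr:.1f} -> memorized" if acr > 1
      else f"ACR = {acr:.1f} -> not memorized")
```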
ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search
·2799 words·14 mins
AI Generated Natural Language Processing Large Language Models 🏢 Tsinghua University
ReST-MCTS*: A novel LLM self-training method using process reward guided tree search, outperforming existing methods by generating higher-quality reasoning traces for improved model accuracy.
Repurposing Language Models into Embedding Models: Finding the Compute-Optimal Recipe
·5026 words·24 mins
AI Generated Natural Language Processing Large Language Models 🏢 University of Cambridge
This research unveils a compute-optimal recipe for fine-tuning language models into high-quality text embedding models, offering practical guidance and scaling laws for resource-constrained settings.
Representation Noising: A Defence Mechanism Against Harmful Finetuning
·3502 words·17 mins
Natural Language Processing Large Language Models 🏢 Dalhousie University
RepNoise: a novel defense against harmful fine-tuning of LLMs by removing information about harmful representations, generalizing across different harmful tasks, and maintaining LLM capabilities.
ReMoDetect: Reward Models Recognize Aligned LLM's Generations
·4799 words·23 mins
AI Generated Natural Language Processing Large Language Models 🏢 Korea Advanced Institute of Science and Technology
ReMoDetect leverages reward models to identify and classify LLM-generated text. By using continual preference fine-tuning and incorporating human/LLM mixed text, ReMoDetect achieves state-of-the-art detection performance.
Reinforcing LLM Agents via Policy Optimization with Action Decomposition
·2925 words·14 mins
Natural Language Processing Large Language Models 🏢 Shanghai Jiao Tong University
POAD enhances LLM agents by decomposing language agent optimization to the token level, achieving finer-grained credit assignment and improved learning efficiency and generalization.
Regularizing Hidden States Enables Learning Generalizable Reward Model for LLMs
·3598 words·17 mins
AI Generated Natural Language Processing Large Language Models 🏢 University of Illinois Urbana-Champaign
Regularizing hidden states improves reward model generalization in RLHF for LLMs, boosting accuracy and mitigating over-optimization.