Natural Language Processing

Query and Conquer: Execution-Guided SQL Generation

31 March 2025·2511 words·12 mins· loading · loading

AI Generated 🤗 Daily Papers Natural Language Processing Text Generation 🏢 Snowflake AI Research

Execution-guided SQL generation enhances accuracy in text-to-SQL tasks by using execution results to select the most semantically consistent query, improving performance and reducing inference costs.

Entropy-Based Adaptive Weighting for Self-Training

31 March 2025·1608 words·8 mins· loading · loading

AI Generated 🤗 Daily Papers Natural Language Processing Large Language Models 🏢 University of California Los Angeles

EAST: Prioritizing uncertainty in self-training refines reasoning of Large Language Models.

Discovering Knowledge Deficiencies of Language Models on Massive Knowledge Base

30 March 2025·2941 words·14 mins· loading · loading

AI Generated 🤗 Daily Papers Natural Language Processing Large Language Models 🏢 University of Southern California

SEA: Stochastic Error Ascent efficiently discovers LLM knowledge gaps, outperforming existing methods in error detection with reduced cost.

Efficient Inference for Large Reasoning Models: A Survey

29 March 2025·857 words·5 mins· loading · loading

AI Generated 🤗 Daily Papers Natural Language Processing Large Language Models 🏢 National University of Singapore

Survey on efficient inference methods for Large Reasoning Models, focusing on mitigating token inefficiency while preserving quality.

ResearchBench: Benchmarking LLMs in Scientific Discovery via Inspiration-Based Task Decomposition

27 March 2025·3118 words·15 mins· loading · loading

AI Generated 🤗 Daily Papers Natural Language Processing Large Language Models 🏢 Shanghai Artificial Intelligence Laboratory

ResearchBench: Benchmarking LLMs for Scientific Discovery via Inspiration-Based Task Decomposition.

ReFeed: Multi-dimensional Summarization Refinement with Reflective Reasoning on Feedback

27 March 2025·7449 words·35 mins· loading · loading

AI Generated 🤗 Daily Papers Natural Language Processing Text Summarization 🏢 Korea Advanced Institute of Science and Technology (KAIST)

ReFeed enhances multi-dimensional summarization by using reflective reasoning on feedback, mitigating trade-offs between dimensions and improving robustness.

ReaRAG: Knowledge-guided Reasoning Enhances Factuality of Large Reasoning Models with Iterative Retrieval Augmented Generation

27 March 2025·6982 words·33 mins· loading · loading

AI Generated 🤗 Daily Papers Natural Language Processing Question Answering 🏢 Tsinghua University

ReaRAG enhances factuality in large reasoning models (LRMs) by integrating knowledge-guided reasoning with iterative retrieval augmented generation.

Large Language Model Agent: A Survey on Methodology, Applications and Challenges

27 March 2025·2979 words·14 mins· loading · loading

AI Generated 🤗 Daily Papers Natural Language Processing Large Language Models 🏢 Peking University

This survey presents a methodology-centered taxonomy of LLM agent systems, linking design principles to emergent behaviors and identifying future research directions.

Challenging the Boundaries of Reasoning: An Olympiad-Level Math Benchmark for Large Language Models

27 March 2025·5419 words·26 mins· loading · loading

AI Generated 🤗 Daily Papers Natural Language Processing Large Language Models 🏢 Renmin University of China

OlymMATH: A new Olympiad-level math benchmark rigorously tests LLMs’ reasoning, revealing limitations and paving the way for advancements.

A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, and Beyond

27 March 2025·2301 words·11 mins· loading · loading

AI Generated 🤗 Daily Papers Natural Language Processing Large Language Models 🏢 Shanghai AI Laboratory

Survey on improving efficiency in large reasoning models across language, multimodality, and beyond.

Open Deep Search: Democratizing Search with Open-source Reasoning Agents

26 March 2025·1746 words·9 mins· loading · loading

AI Generated 🤗 Daily Papers Natural Language Processing Question Answering 🏢 University of Washington

Open Deep Search (ODS): Democratizing Search with Open-source Reasoning Agents.

MCTS-RAG: Enhancing Retrieval-Augmented Generation with Monte Carlo Tree Search

26 March 2025·2082 words·10 mins· loading · loading

AI Generated 🤗 Daily Papers Natural Language Processing Question Answering 🏢 Yale University

MCTS-RAG: Combines Monte Carlo Tree Search with Retrieval-Augmented Generation to enhance small LMs’ reasoning on complex tasks.

Think Twice: Enhancing LLM Reasoning by Scaling Multi-round Test-time Thinking

25 March 2025·1979 words·10 mins· loading · loading

AI Generated 🤗 Daily Papers Natural Language Processing Large Language Models 🏢 A-M-Team

Boost LLM reasoning by having models ‘Think Twice’! This novel method iteratively refines answers, significantly enhancing accuracy on complex tasks.

FFN Fusion: Rethinking Sequential Computation in Large Language Models

24 March 2025·3776 words·18 mins· loading · loading

AI Generated 🤗 Daily Papers Natural Language Processing Large Language Models 🏢 NVIDIA

FFN Fusion: Parallelizing sequential computation in large language models for significant speedups!

Lost in Cultural Translation: Do LLMs Struggle with Math Across Cultural Contexts?

23 March 2025·3575 words·17 mins· loading · loading

AI Generated 🤗 Daily Papers Natural Language Processing Large Language Models 🏢 155mv Research Lab

LLMs falter on culturally adapted math problems, revealing a critical cultural bias.

V-Seek: Accelerating LLM Reasoning on Open-hardware Server-class RISC-V Platforms

21 March 2025·1371 words·7 mins· loading · loading

AI Generated 🤗 Daily Papers Natural Language Processing Large Language Models 🏢 Politecnico of Turin

V-SEEK accelerates LLM reasoning on open-hardware RISC-V platforms, achieving up to 3.0x speedup through optimized kernels and memory management.

Modifying Large Language Model Post-Training for Diverse Creative Writing

21 March 2025·2548 words·12 mins· loading · loading

AI Generated 🤗 Daily Papers Natural Language Processing Text Generation 🏢 Midjourney

This paper introduces deviation-factored post-training methods to enhance diversity and quality in creative LLM writing.

MARS: A Multi-Agent Framework Incorporating Socratic Guidance for Automated Prompt Optimization

21 March 2025·8765 words·42 mins· loading · loading

AI Generated 🤗 Daily Papers Natural Language Processing Large Language Models 🏢 Xi'an Jiaotong University

MARS: Optimizing prompts with multi-agent collaboration and Socratic learning for better LLM performance!

LEMMA: Learning from Errors for MatheMatical Advancement in LLMs

21 March 2025·4802 words·23 mins· loading · loading

AI Generated 🤗 Daily Papers Natural Language Processing Large Language Models 🏢 Tsinghua University

LEMMA: LLMs learn math via mistake analysis and correction, boosting performance without external critics.

XAttention: Block Sparse Attention with Antidiagonal Scoring

20 March 2025·2960 words·14 mins· loading · loading

AI Generated 🤗 Daily Papers Natural Language Processing Large Language Models 🏢 Tsinghua University

XAttention: Antidiagonal scoring unlocks block-sparse attention, slashing compute costs in long-context Transformers without sacrificing accuracy.