Paper Reviews by AI

You Do Not Fully Utilize Transformer's Representation Capacity

13 February 2025·4126 words·20 mins· loading · loading

AI Generated 🤗 Daily Papers Natural Language Processing Large Language Models 🏢 T-Tech HSE University Moscow Institute of Physics and Technology

Boosting Transformer performance, Layer-Integrated Memory (LIMe) enhances representation capacity by enabling access to earlier layers’ hidden states, significantly improving performance across variou…

Typhoon T1: An Open Thai Reasoning Model

13 February 2025·3148 words·15 mins· loading · loading

AI Generated 🤗 Daily Papers Natural Language Processing Large Language Models 🏢 SCB 10X R&D

Typhoon T1: Open Thai reasoning model improves complex task performance by generating long chains of thought, detailed methodology, and open-source resources are provided.

The Stochastic Parrot on LLM's Shoulder: A Summative Assessment of Physical Concept Understanding

13 February 2025·2201 words·11 mins· loading · loading

AI Generated 🤗 Daily Papers Natural Language Processing Large Language Models 🏢 Tencent AI Lab

LLMs often fail to demonstrate true understanding of concepts, acting as ‘stochastic parrots’ – a phenomenon quantitatively proven by the PHYSICO benchmark.

SQuARE: Sequential Question Answering Reasoning Engine for Enhanced Chain-of-Thought in Large Language Models

13 February 2025·4327 words·21 mins· loading · loading

AI Generated 🤗 Daily Papers Natural Language Processing Question Answering 🏢 Intel Labs

SQUARE, a novel prompting technique, enhances LLM reasoning by prompting self-interrogation through sequential question answering, significantly outperforming traditional methods.

Show Me the Work: Fact-Checkers' Requirements for Explainable Automated Fact-Checking

13 February 2025·1354 words·7 mins· loading · loading

AI Generated 🤗 Daily Papers Natural Language Processing Question Answering 🏢 University of Copenhagen

Fact-checkers need explainable AI: This study reveals how AI tools can better support fact-checkers by providing explanations tailored to their workflows, addressing unmet needs, and improving the eff…

SelfCite: Self-Supervised Alignment for Context Attribution in Large Language Models

13 February 2025·4209 words·20 mins· loading · loading

AI Generated 🤗 Daily Papers Natural Language Processing Large Language Models 🏢 MIT

SelfCite: A self-supervised approach boosts LLM citation accuracy via context ablation. By removing or isolating cited text, SelfCite trains LLMs to generate high-quality citations without manual ann…

MUDDFormer: Breaking Residual Bottlenecks in Transformers via Multiway Dynamic Dense Connections

13 February 2025·2116 words·10 mins· loading · loading

AI Generated 🤗 Daily Papers Natural Language Processing Large Language Models 🏢 Beijing University of Posts and Telecommunications

MUDDFormer boosts Transformer performance by dynamically generating connection weights, improving cross-layer information flow and surpassing models trained with significantly more compute.

Exploring the Potential of Encoder-free Architectures in 3D LMMs

13 February 2025·3414 words·17 mins· loading · loading

AI Generated 🤗 Daily Papers Multimodal Learning Vision-Language Models 🏢 Northwestern Polytechnical University

Encoder-free 3D LMMs outperform state-of-the-art, achieving comparable results to significantly larger models.

DexTrack: Towards Generalizable Neural Tracking Control for Dexterous Manipulation from Human References

13 February 2025·4451 words·21 mins· loading · loading

AI Generated 🤗 Daily Papers AI Applications Robotics 🏢 Tsinghua University

DexTrack achieves highly generalizable neural tracking control for dexterous robot manipulation by iteratively training a controller using high-quality demonstrations refined via homotopy optimization…

CRANE: Reasoning with constrained LLM generation

13 February 2025·2445 words·12 mins· loading · loading

AI Generated 🤗 Daily Papers Natural Language Processing Large Language Models 🏢 University of Illinois Urbana-Champaign

CRANE: A novel constrained decoding algorithm boosts LLM reasoning accuracy by strategically alternating between unconstrained reasoning and constrained generation.

CoT-Valve: Length-Compressible Chain-of-Thought Tuning

13 February 2025·3429 words·17 mins· loading · loading

AI Generated 🤗 Daily Papers Natural Language Processing Large Language Models 🏢 National University of Singapore

CoT-Valve dynamically adjusts reasoning chain lengths based on task difficulty, significantly reducing inference costs in large language models without substantial accuracy loss.

Can this Model Also Recognize Dogs? Zero-Shot Model Search from Weights

13 February 2025·3096 words·15 mins· loading · loading

AI Generated 🤗 Daily Papers Machine Learning Deep Learning 🏢 School of Computer Science and Engineering

ProbeLog: Zero-shot model search directly from weights, boosting efficiency and accuracy!

An Open Recipe: Adapting Language-Specific LLMs to a Reasoning Model in One Day via Model Merging

13 February 2025·3494 words·17 mins· loading · loading

AI Generated 🤗 Daily Papers Natural Language Processing Large Language Models 🏢 SCB 10X R&D

Low-resource language LLMs gain strong reasoning abilities by merging with a high-resource reasoning model, achieving performance comparable to state-of-the-art models while maintaining target languag…

One Example Shown, Many Concepts Known! Counterexample-Driven Conceptual Reasoning in Mathematical LLMs

12 February 2025·2416 words·12 mins· loading · loading

AI Generated 🤗 Daily Papers Natural Language Processing Large Language Models 🏢 Tsinghua University

New benchmark COUNTERMATH enhances LLMs’ mathematical reasoning using counterexample-driven proofs, revealing current models’ limitations and paving the way for improved mathematical capabilities.

I Think, Therefore I Diffuse: Enabling Multimodal In-Context Reasoning in Diffusion Models

12 February 2025·3464 words·17 mins· loading · loading

AI Generated 🤗 Daily Papers Multimodal Learning Multimodal Reasoning 🏢 Hong Kong University of Science and Technology

ThinkDiff empowers text-to-image diffusion models with multimodal reasoning by aligning vision-language models to an LLM decoder, achieving state-of-the-art results on in-context reasoning benchmarks.

Cluster and Predict Latents Patches for Improved Masked Image Modeling

12 February 2025·7222 words·34 mins· loading · loading

AI Generated 🤗 Daily Papers Computer Vision Image Segmentation 🏢 Meta FAIR

CAPI: a novel masked image modeling framework boosts self-supervised visual representation learning by predicting latent clusterings, achieving state-of-the-art ImageNet accuracy and mIoU.

Better Embeddings with Coupled Adam

12 February 2025·2826 words·14 mins· loading · loading

AI Generated 🤗 Daily Papers Natural Language Processing Large Language Models 🏢 AI Sweden

Coupled Adam: A novel optimizer fixes anisotropic word embeddings in LLMs, boosting model performance.

We Can't Understand AI Using our Existing Vocabulary

11 February 2025·3226 words·16 mins· loading · loading

AI Generated 🤗 Daily Papers Natural Language Processing Large Language Models 🏢 Google DeepMind

To understand AI, we need new words! This paper argues that developing neologisms—new words for human & machine concepts—is key to bridging the communication gap and achieving better AI control.

VidCRAFT3: Camera, Object, and Lighting Control for Image-to-Video Generation

11 February 2025·3389 words·16 mins· loading · loading

AI Generated 🤗 Daily Papers Computer Vision Image Generation 🏢 Fudan University

VidCRAFT3 enables high-quality image-to-video generation with precise control over camera movement, object motion, and lighting, pushing the boundaries of visual content creation.

Next Block Prediction: Video Generation via Semi-Autoregressive Modeling

11 February 2025·3939 words·19 mins· loading · loading

AI Generated 🤗 Daily Papers Computer Vision Video Understanding 🏢 Peking University

Next-Block Prediction (NBP) revolutionizes video generation by using a semi-autoregressive model that predicts blocks of video content simultaneously, resulting in significantly faster inference.