↓Skip to main content

🏢 Saarland University

The Expressive Capacity of State Space Models: A Formal Language Perspective

26 September 2024·1723 words·9 mins· loading · loading

Natural Language Processing Large Language Models 🏢 Saarland University

State-space models (SSMs) rival transformers in language modeling, but their capabilities remain unclear; this paper rigorously analyzes SSM expressivity, revealing unique strengths and limitations, i…

Stabilized Proximal-Point Methods for Federated Optimization

26 September 2024·1402 words·7 mins· loading · loading

Federated Learning 🏢 Saarland University

S-DANE & ACC-S-DANE achieve best-known communication complexity for federated learning, improving local computation efficiency via stabilized proximal-point methods.

On the Complexity of Identification in Linear Structural Causal Models

26 September 2024·1403 words·7 mins· loading · loading

AI Theory Causality 🏢 Saarland University

New polynomial-space algorithm for causal parameter identification in linear models vastly improves upon existing methods, showing that this crucial task is computationally hard.

InversionView: A General-Purpose Method for Reading Information from Neural Activations

26 September 2024·10684 words·51 mins· loading · loading

AI Generated Natural Language Processing Large Language Models 🏢 Saarland University

InversionView unveils neural network inner workings by decoding information from activations. It identifies inputs producing similar activations, revealing the information content. Case studies on v…

Efficient Streaming Algorithms for Graphlet Sampling

26 September 2024·1741 words·9 mins· loading · loading

AI Generated AI Theory Optimization 🏢 Saarland University

STREAM-UGS: a novel semi-streaming algorithm for efficient graphlet sampling, enabling fast analysis of massive graphs with limited memory.