🏢 Shanghai Jiao Tong University

Mixture of Link Predictors on Graphs
·3247 words·16 mins
Machine Learning Deep Learning 🏢 Shanghai Jiao Tong University
Link-MoE boosts link prediction accuracy by strategically selecting the best model for each node pair, surpassing single-model approaches.
MECD: Unlocking Multi-Event Causal Discovery in Video Reasoning
·2752 words·13 mins
Video Understanding 🏢 Shanghai Jiao Tong University
MECD: A new task and dataset unlocks multi-event causal discovery in videos, enabling a novel framework that outperforms existing models by efficiently identifying causal relationships between chronol…
MC-DiT: Contextual Enhancement via Clean-to-Clean Reconstruction for Masked Diffusion Models
·2494 words·12 mins
Computer Vision Image Generation 🏢 Shanghai Jiao Tong University
MC-DiT: A novel training paradigm for masked diffusion models achieving state-of-the-art image generation by leveraging clean-to-clean reconstruction.
MADiff: Offline Multi-agent Learning with Diffusion Models
·2719 words·13 mins
Machine Learning Reinforcement Learning 🏢 Shanghai Jiao Tong University
MADiff: Offline multi-agent learning with attention-based diffusion models achieves effective coordination and teammate modeling, outperforming existing methods.
Learning Versatile Skills with Curriculum Masking
·2688 words·13 mins
AI Generated Machine Learning Reinforcement Learning 🏢 Shanghai Jiao Tong University
CurrMask: A novel curriculum masking paradigm for offline RL, achieving superior zero-shot and fine-tuning performance by dynamically adjusting masking schemes during pretraining, enabling versatile s…
Learning Plaintext-Ciphertext Cryptographic Problems via ANF-based SAT Instance Representation
·1855 words·9 mins
AI Generated AI Theory Optimization 🏢 Shanghai Jiao Tong University
CryptoANFNet accelerates solving cryptographic problems by 50x using a novel graph neural network and ANF representation, outperforming existing methods in accuracy.
Language-Driven Interactive Traffic Trajectory Generation
·2233 words·11 mins
AI Applications Autonomous Vehicles 🏢 Shanghai Jiao Tong University
InteractTraj: Generating realistic, interactive traffic trajectories from natural language!
Lambda: Learning Matchable Prior For Entity Alignment with Unlabeled Dangling Cases
·2851 words·14 mins
AI Generated Natural Language Processing Named Entity Recognition 🏢 Shanghai Jiao Tong University
Lambda: A novel framework tackles entity alignment challenges with unlabeled dangling entities using GNN-based encoding, spectral contrastive learning, and an iterative PU learning algorithm, achievin…
Kernel PCA for Out-of-Distribution Detection
·2628 words·13 mins
AI Generated Machine Learning Deep Learning 🏢 Shanghai Jiao Tong University
Boosting Out-of-Distribution Detection with Kernel PCA!
Initialization is Critical to Whether Transformers Fit Composite Functions by Reasoning or Memorizing
·2409 words·12 mins
Natural Language Processing Large Language Models 🏢 Shanghai Jiao Tong University
Transformer model initialization dramatically affects whether it reasons or memorizes, impacting performance on compositional tasks.
Improved Analysis for Bandit Learning in Matching Markets
·707 words·4 mins
AI Generated AI Theory Optimization 🏢 Shanghai Jiao Tong University
A new algorithm, AOGS, achieves significantly lower regret in two-sided matching markets by cleverly integrating exploration and exploitation, thus removing the dependence on the number of arms (K) in…
HuRef: HUman-REadable Fingerprint for Large Language Models
·2598 words·13 mins
Natural Language Processing Large Language Models 🏢 Shanghai Jiao Tong University
HuRef: Generate unique, human-readable fingerprints for LLMs to protect copyright without exposing model parameters or impeding training.
HumanVLA: Towards Vision-Language Directed Object Rearrangement by Physical Humanoid
·2769 words·13 mins
AI Generated Multimodal Learning Vision-Language Models 🏢 Shanghai Jiao Tong University
Humanoid robot learns to rearrange objects using vision and language instructions, achieving remarkable success on diverse tasks in a novel dataset.
Generalizing Weather Forecast to Fine-grained Temporal Scales via Physics-AI Hybrid Modeling
·2719 words·13 mins
AI Generated AI Applications Weather Forecasting 🏢 Shanghai Jiao Tong University
WeatherGFT generalizes weather forecasts to finer temporal scales using a physics-AI hybrid model, achieving state-of-the-art performance and 30-minute forecast capability with only hourly training da…
General Articulated Objects Manipulation in Real Images via Part-Aware Diffusion Process
·2623 words·13 mins
Computer Vision Image Generation 🏢 Shanghai Jiao Tong University
Part-Aware Diffusion Model (PA-Diffusion) enables precise and efficient manipulation of articulated objects in real images by using abstract 3D models and dynamic feature maps, overcoming limitations …
FuseAnyPart: Diffusion-Driven Facial Parts Swapping via Multiple Reference Images
·1953 words·10 mins
Image Generation 🏢 Shanghai Jiao Tong University
FuseAnyPart: Swap facial parts seamlessly using multiple reference images via diffusion, achieving high-fidelity results.
Few-Shot Diffusion Models Escape the Curse of Dimensionality
·419 words·2 mins
Machine Learning Few-Shot Learning 🏢 Shanghai Jiao Tong University
Few-shot diffusion models efficiently generate customized images; this paper provides the first theoretical explanation, proving improved approximation and optimization bounds, escaping the curse of d…
Fast T2T: Optimization Consistency Speeds Up Diffusion-Based Training-to-Testing Solving for Combinatorial Optimization
·2382 words·12 mins
AI Theory Optimization 🏢 Shanghai Jiao Tong University
Fast T2T: Optimization Consistency Boosts Diffusion-Based Combinatorial Optimization!
Face2QR: A Unified Framework for Aesthetic, Face-Preserving, and Scannable QR Code Generation
·2451 words·12 mins
Computer Vision Image Generation 🏢 Shanghai Jiao Tong University
Face2QR: A unified framework generates aesthetically pleasing, scannable QR codes that faithfully preserve facial features, solving the conflict between aesthetics, identity, and scannability.
Embedding Trajectory for Out-of-Distribution Detection in Mathematical Reasoning
·3571 words·17 mins
AI Generated Natural Language Processing Large Language Models 🏢 Shanghai Jiao Tong University
Novel trajectory volatility score (TV Score) significantly improves out-of-distribution detection in mathematical reasoning by leveraging dynamic embedding trajectories, outperforming existing GLM met…