🏢 Shanghai Jiao Tong University
Dual-Diffusion for Binocular 3D Human Pose Estimation
3829 words · 18 mins
AI Generated
Computer Vision
3D Vision
🏢 Shanghai Jiao Tong University
Dual-Diffusion boosts binocular 3D human pose estimation accuracy by simultaneously denoising 2D and 3D pose uncertainties using a diffusion model.
DomainGallery: Few-shot Domain-driven Image Generation by Attribute-centric Finetuning
1917 words · 9 mins
Computer Vision
Image Generation
🏢 Shanghai Jiao Tong University
DomainGallery: Few-shot domain-driven image generation via attribute-centric finetuning, solving key issues of previous works by introducing attribute erasure, disentanglement, regularization, and enh…
Diffusion-DICE: In-Sample Diffusion Guidance for Offline Reinforcement Learning
2760 words · 13 mins
AI Generated
Machine Learning
Reinforcement Learning
🏢 Shanghai Jiao Tong University
Diffusion-DICE: A novel offline RL method using in-sample diffusion guidance for optimal policy transformation, achieving state-of-the-art performance.
CoVoMix: Advancing Zero-Shot Speech Generation for Human-like Multi-talker Conversations
2583 words · 13 mins
Natural Language Processing
Dialogue Systems
🏢 Shanghai Jiao Tong University
CoVoMix: Generating human-like, multi-speaker conversations with zero-shot speech synthesis.
Connectivity Shapes Implicit Regularization in Matrix Factorization Models for Matrix Completion
4538 words · 22 mins
AI Generated
Machine Learning
Deep Learning
🏢 Shanghai Jiao Tong University
Data connectivity profoundly shapes implicit regularization in matrix factorization for matrix completion, transitioning from low nuclear norm to low rank solutions as data shifts from disconnected to…
CondTSF: One-line Plugin of Dataset Condensation for Time Series Forecasting
3169 words · 15 mins
Machine Learning
Deep Learning
🏢 Shanghai Jiao Tong University
CondTSF: One-line plugin for time series forecasting dataset condensation, boosting performance at low condensation ratios.
Cluster-wise Graph Transformer with Dual-granularity Kernelized Attention
1558 words · 8 mins
🏢 Shanghai Jiao Tong University
Cluster-wise Graph Transformer (Cluster-GT) improves graph learning by using a novel Node-to-Cluster Attention mechanism that leverages multiple kernel learning to capture node and cluster-level infor…
Calibrating Reasoning in Language Models with Internal Consistency
2546 words · 12 mins
Natural Language Processing
Large Language Models
🏢 Shanghai Jiao Tong University
LLMs’ reasoning can be improved by using internal consistency to calibrate their outputs.
AnyFit: Controllable Virtual Try-on for Any Combination of Attire Across Any Scenario
3145 words · 15 mins
AI Generated
Computer Vision
Image Generation
🏢 Shanghai Jiao Tong University
AnyFit: Controllable virtual try-on for any attire combination across any scenario, exceeding existing methods in accuracy and scalability.
A Siamese Transformer with Hierarchical Refinement for Lane Detection
2636 words · 13 mins
AI Generated
Computer Vision
Object Detection
🏢 Shanghai Jiao Tong University
Siamese Transformer with Hierarchical Refinement achieves state-of-the-art lane detection accuracy by integrating global and local features and a novel Curve-IoU loss.
2DQuant: Low-bit Post-Training Quantization for Image Super-Resolution
2009 words · 10 mins
AI Generated
Computer Vision
Image Generation
🏢 Shanghai Jiao Tong University
2DQuant achieves highly efficient and accurate low-bit image super-resolution by using a dual-stage post-training quantization method that minimizes accuracy loss in transformer-based models, surpassi…