Skip to main content

🏢 Shanghai Jiao Tong University

Unified Gradient-Based Machine Unlearning with Remain Geometry Enhancement
·3015 words·15 mins· loading · loading
Image Generation 🏢 Shanghai Jiao Tong University
Enhance deep neural network privacy and trustworthiness with unified gradient-based machine unlearning, leveraging remain geometry for efficient forgetting and performance preservation.
Towards the Dynamics of a DNN Learning Symbolic Interactions
·1849 words·9 mins· loading · loading
AI Theory Interpretability 🏢 Shanghai Jiao Tong University
DNNs learn interactions in two phases: initially removing complex interactions, then gradually learning higher-order ones, leading to overfitting.
The Closeness of In-Context Learning and Weight Shifting for Softmax Regression
·2475 words·12 mins· loading · loading
Natural Language Processing Large Language Models 🏢 Shanghai Jiao Tong University
Softmax regression reveals in-context learning’s surprising similarity to gradient descent in self-attention Transformers, showing the models’ remarkable learning capabilities.
Spherical Frustum Sparse Convolution Network for LiDAR Point Cloud Semantic Segmentation
·2477 words·12 mins· loading · loading
Computer Vision 3D Vision 🏢 Shanghai Jiao Tong University
SFCNet, a novel spherical frustum sparse convolution network, tackles LiDAR point cloud semantic segmentation by eliminating quantized information loss, leading to superior performance, especially for…
Self-Calibrated Tuning of Vision-Language Models for Out-of-Distribution Detection
·2209 words·11 mins· loading · loading
Multimodal Learning Vision-Language Models 🏢 Shanghai Jiao Tong University
Self-Calibrated Tuning (SCT) enhances vision-language model OOD detection by adaptively weighting OOD regularization based on prediction uncertainty, mitigating issues caused by inaccurate feature ext…
SeeA*: Efficient Exploration-Enhanced A* Search by Selective Sampling
·3439 words·17 mins· loading · loading
AI Applications Gaming 🏢 Shanghai Jiao Tong University
SeeA* enhances A* search by selectively sampling promising nodes, improving exploration and efficiency, especially with less accurate heuristics.
SceneCraft: Layout-Guided 3D Scene Generation
·2040 words·10 mins· loading · loading
Multimodal Learning Vision-Language Models 🏢 Shanghai Jiao Tong University
SceneCraft generates highly detailed indoor scenes from user-provided textual descriptions and spatial layouts, overcoming limitations of previous text-to-3D methods in scale and control.
Scene Graph Disentanglement and Composition for Generalizable Complex Image Generation
·2541 words·12 mins· loading · loading
Image Generation 🏢 Shanghai Jiao Tong University
DisCo: a novel framework for generalizable complex image generation using scene graph disentanglement and composition, achieving superior performance over existing methods.
Rethinking Parity Check Enhanced Symmetry-Preserving Ansatz
·2377 words·12 mins· loading · loading
AI Theory Optimization 🏢 Shanghai Jiao Tong University
Enhanced VQAs via Hamming Weight Preserving ansatz and parity checks achieve superior performance on quantum chemistry and combinatorial problems, showcasing quantum advantage potential in NISQ era.
ResAD: A Simple Framework for Class Generalizable Anomaly Detection
·2059 words·10 mins· loading · loading
Anomaly Detection 🏢 Shanghai Jiao Tong University
ResAD, a novel framework, tackles class-generalizable anomaly detection by learning residual feature distributions, achieving remarkable results on diverse datasets without retraining.
ReLIZO: Sample Reusable Linear Interpolation-based Zeroth-order Optimization
·2192 words·11 mins· loading · loading
AI Theory Optimization 🏢 Shanghai Jiao Tong University
ReLIZO boosts zeroth-order optimization by cleverly reusing past queries, drastically cutting computation costs while maintaining gradient estimation accuracy.
Reinforcing LLM Agents via Policy Optimization with Action Decomposition
·2925 words·14 mins· loading · loading
Natural Language Processing Large Language Models 🏢 Shanghai Jiao Tong University
POAD enhances LLM agents by decomposing language agent optimization to the token level, achieving finer-grained credit assignment and improved learning efficiency and generalization.
QVAE-Mole: The Quantum VAE with Spherical Latent Variable Learning for 3-D Molecule Generation
·1891 words·9 mins· loading · loading
Machine Learning Deep Learning 🏢 Shanghai Jiao Tong University
Quantum VAE with spherical latent variable learning enables efficient, one-shot 3D molecule generation, outperforming classic and other quantum methods.
QuadMamba: Learning Quadtree-based Selective Scan for Visual State Space Model
·2714 words·13 mins· loading · loading
AI Generated Computer Vision Image Classification 🏢 Shanghai Jiao Tong University
QuadMamba: A novel vision model leveraging quadtree-based scanning for superior performance in visual tasks, achieving state-of-the-art results with linear-time complexity.
Probabilistic Conformal Distillation for Enhancing Missing Modality Robustness
·3353 words·16 mins· loading · loading
AI Generated Multimodal Learning Multimodal Understanding 🏢 Shanghai Jiao Tong University
Enhance multimodal model robustness against missing data with Probabilistic Conformal Distillation (PCD)! PCD models missing modalities probabilistically, achieving superior performance on multiple be…
PCP-MAE: Learning to Predict Centers for Point Masked Autoencoders
·2194 words·11 mins· loading · loading
3D Vision 🏢 Shanghai Jiao Tong University
PCP-MAE enhances point cloud self-supervised learning by cleverly predicting masked patch centers, leading to superior 3D object classification and scene segmentation.
On Learning Multi-Modal Forgery Representation for Diffusion Generated Video Detection
·2133 words·11 mins· loading · loading
Computer Vision Video Understanding 🏢 Shanghai Jiao Tong University
MM-Det, a novel algorithm, uses multimodal learning and spatiotemporal attention to detect diffusion-generated videos, achieving state-of-the-art performance on the new DVF dataset.
Nimbus: Secure and Efficient Two-Party Inference for Transformers
·3036 words·15 mins· loading · loading
AI Generated AI Theory Privacy 🏢 Shanghai Jiao Tong University
Nimbus achieves 2.7-4.7x speedup in BERT base inference using novel two-party computation techniques for efficient matrix multiplication and non-linear layer approximation.
NeuRodin: A Two-stage Framework for High-Fidelity Neural Surface Reconstruction
·2947 words·14 mins· loading · loading
Computer Vision 3D Vision 🏢 Shanghai Jiao Tong University
NeuRodin: A two-stage neural framework achieves high-fidelity 3D surface reconstruction from posed RGB images by innovatively addressing limitations in SDF-based methods, resulting in superior reconst…
Multi-times Monte Carlo Rendering for Inter-reflection Reconstruction
·1845 words·9 mins· loading · loading
Computer Vision 3D Vision 🏢 Shanghai Jiao Tong University
Ref-MC2 reconstructs high-fidelity 3D objects with inter-reflections by using a novel multi-times Monte Carlo sampling strategy, achieving superior performance in accuracy and efficiency.