🏢 Shanghai Jiao Tong University

Unified Gradient-Based Machine Unlearning with Remain Geometry Enhancement

26 September 2024·3015 words·15 mins

Image Generation 🏢 Shanghai Jiao Tong University

Enhance deep neural network privacy and trustworthiness with unified gradient-based machine unlearning, leveraging remain geometry for efficient forgetting and performance preservation.

Towards the Dynamics of a DNN Learning Symbolic Interactions

26 September 2024·1849 words·9 mins

AI Theory Interpretability 🏢 Shanghai Jiao Tong University

DNNs learn interactions in two phases: initially removing complex interactions, then gradually learning higher-order ones, leading to overfitting.

The Closeness of In-Context Learning and Weight Shifting for Softmax Regression

26 September 2024·2475 words·12 mins

Natural Language Processing Large Language Models 🏢 Shanghai Jiao Tong University

Softmax regression reveals in-context learning’s surprising similarity to gradient descent in self-attention Transformers, showing the models’ remarkable learning capabilities.

Spherical Frustum Sparse Convolution Network for LiDAR Point Cloud Semantic Segmentation

26 September 2024·2477 words·12 mins

Computer Vision 3D Vision 🏢 Shanghai Jiao Tong University

SFCNet, a novel spherical frustum sparse convolution network, tackles LiDAR point cloud semantic segmentation by eliminating quantized information loss, leading to superior performance, especially for…

Self-Calibrated Tuning of Vision-Language Models for Out-of-Distribution Detection

26 September 2024·2209 words·11 mins

Multimodal Learning Vision-Language Models 🏢 Shanghai Jiao Tong University

Self-Calibrated Tuning (SCT) enhances vision-language model OOD detection by adaptively weighting OOD regularization based on prediction uncertainty, mitigating issues caused by inaccurate feature ext…

SeeA*: Efficient Exploration-Enhanced A* Search by Selective Sampling

26 September 2024·3439 words·17 mins

AI Applications Gaming 🏢 Shanghai Jiao Tong University

SeeA* enhances A* search by selectively sampling promising nodes, improving exploration and efficiency, especially with less accurate heuristics.

SceneCraft: Layout-Guided 3D Scene Generation

26 September 2024·2040 words·10 mins

Multimodal Learning Vision-Language Models 🏢 Shanghai Jiao Tong University

SceneCraft generates highly detailed indoor scenes from user-provided textual descriptions and spatial layouts, overcoming limitations of previous text-to-3D methods in scale and control.

Scene Graph Disentanglement and Composition for Generalizable Complex Image Generation

26 September 2024·2541 words·12 mins

Image Generation 🏢 Shanghai Jiao Tong University

DisCo: a novel framework for generalizable complex image generation using scene graph disentanglement and composition, achieving superior performance over existing methods.

Rethinking Parity Check Enhanced Symmetry-Preserving Ansatz

26 September 2024·2377 words·12 mins

AI Theory Optimization 🏢 Shanghai Jiao Tong University

Enhanced VQAs via a Hamming Weight Preserving ansatz and parity checks achieve superior performance on quantum chemistry and combinatorial problems, showcasing quantum advantage potential in the NISQ era.

ResAD: A Simple Framework for Class Generalizable Anomaly Detection

26 September 2024·2059 words·10 mins

Anomaly Detection 🏢 Shanghai Jiao Tong University

ResAD, a novel framework, tackles class-generalizable anomaly detection by learning residual feature distributions, achieving remarkable results on diverse datasets without retraining.

ReLIZO: Sample Reusable Linear Interpolation-based Zeroth-order Optimization

26 September 2024·2192 words·11 mins

AI Theory Optimization 🏢 Shanghai Jiao Tong University

ReLIZO boosts zeroth-order optimization by cleverly reusing past queries, drastically cutting computation costs while maintaining gradient estimation accuracy.

Reinforcing LLM Agents via Policy Optimization with Action Decomposition

26 September 2024·2925 words·14 mins

Natural Language Processing Large Language Models 🏢 Shanghai Jiao Tong University

POAD enhances LLM agents by decomposing language agent optimization to the token level, achieving finer-grained credit assignment and improved learning efficiency and generalization.

QVAE-Mole: The Quantum VAE with Spherical Latent Variable Learning for 3-D Molecule Generation

26 September 2024·1891 words·9 mins

Machine Learning Deep Learning 🏢 Shanghai Jiao Tong University

Quantum VAE with spherical latent variable learning enables efficient, one-shot 3D molecule generation, outperforming classic and other quantum methods.

QuadMamba: Learning Quadtree-based Selective Scan for Visual State Space Model

26 September 2024·2714 words·13 mins

AI Generated Computer Vision Image Classification 🏢 Shanghai Jiao Tong University

QuadMamba: A novel vision model leveraging quadtree-based scanning for superior performance in visual tasks, achieving state-of-the-art results with linear-time complexity.

Probabilistic Conformal Distillation for Enhancing Missing Modality Robustness

26 September 2024·3353 words·16 mins

AI Generated Multimodal Learning Multimodal Understanding 🏢 Shanghai Jiao Tong University

Enhance multimodal model robustness against missing data with Probabilistic Conformal Distillation (PCD)! PCD models missing modalities probabilistically, achieving superior performance on multiple be…

PCP-MAE: Learning to Predict Centers for Point Masked Autoencoders

26 September 2024·2194 words·11 mins

3D Vision 🏢 Shanghai Jiao Tong University

PCP-MAE enhances point cloud self-supervised learning by cleverly predicting masked patch centers, leading to superior 3D object classification and scene segmentation.

On Learning Multi-Modal Forgery Representation for Diffusion Generated Video Detection

26 September 2024·2133 words·11 mins

Computer Vision Video Understanding 🏢 Shanghai Jiao Tong University

MM-Det, a novel algorithm, uses multimodal learning and spatiotemporal attention to detect diffusion-generated videos, achieving state-of-the-art performance on the new DVF dataset.

Nimbus: Secure and Efficient Two-Party Inference for Transformers

26 September 2024·3036 words·15 mins

AI Generated AI Theory Privacy 🏢 Shanghai Jiao Tong University

Nimbus achieves 2.7-4.7x speedup in BERT base inference using novel two-party computation techniques for efficient matrix multiplication and non-linear layer approximation.

NeuRodin: A Two-stage Framework for High-Fidelity Neural Surface Reconstruction

26 September 2024·2947 words·14 mins

Computer Vision 3D Vision 🏢 Shanghai Jiao Tong University

NeuRodin: A two-stage neural framework achieves high-fidelity 3D surface reconstruction from posed RGB images by innovatively addressing limitations in SDF-based methods, resulting in superior reconst…

Multi-times Monte Carlo Rendering for Inter-reflection Reconstruction

26 September 2024·1845 words·9 mins

Computer Vision 3D Vision 🏢 Shanghai Jiao Tong University

Ref-MC2 reconstructs high-fidelity 3D objects with inter-reflections by using a novel multi-times Monte Carlo sampling strategy, achieving superior performance in accuracy and efficiency.