DFormer++: Improving RGBD Representation Learning for Semantic Segmentation
Diffusion Models and Representation Learning: A Survey
Hierarchical Information Embeddings With Neural ODEs for Personalized Federated Learning
Calibrating Biased Distribution in VFM-Derived Latent Space via Cross-Domain Geometric Consistency
CLIP-Actor-X: Text-Driven 4D Human Avatar Generation via Cross-Modal Synthesis-Through-Optimization
DrivingGaussian++: Toward Realistic Reconstruction and Editable Simulation for Surrounding Dynamic Driving Scenes
Disentangling Consistent and Specific Information for Double Incomplete Multi-View Multi-Label Classification
Out-of-Distribution-Resistant Evaluations for Explanations of Graph Neural Networks
Reason-Align-Respond: Aligning LLM Reasoning With Knowledge Graphs for KGQA
50 Years of Automated Face Recognition
Aligning Few-Step Diffusion Models With Dense Reward Difference Learning
Attribution Explanations for Deep Neural Networks: A Theoretical Perspective
Deployment Prior Injection for Run-Time Re-Biasable Object Detection
Enhancing Multi-View Clustering: A Sufficient Information-Theoretic Approach for Consistency Acquisition and Redundancy Elimination
Consistent and Controllable Image Animation With Linear Motion Diffusion Transformers
DREAM: A Benchmark Study for Deepfake PhotoRealism AssessMent
DOtA++: Unsupervisely and Collaboratively Detect Objects From Multi-Agent Observations With Multi-Modal Prior Constraints
AdvDiffusion: Adversarial Patches Generation for Face Recognition With High Transferability in Physical Domain
STCF: Multi-View Clustering for Spatial Transcriptomics Based on Cross-View Fusion
Soft Label Pruning and Quantization for Large-Scale Dataset Distillation
Beyond Heat Dissipation: Optimizing Diffusion Models in Frequency Domain
FlowTurbo: Accelerating Flow-Based Image Generation Models via Multi-Stage Refinement
Safe Fairness Guarantees Without Demographics in Classification: Spectral Uncertainty Set Perspective
Learning Compact Semantic Information and Reliable Pseudo-Labels for Incomplete Multi-View Multi-Label Classification
Tail Task Risk Minimization in Meta-Learning From Theoretical Advances to Practical Strategies
MC#: Mixture Compressor for Mixture-of-Experts Large Models
DNGaussian++: Improving Sparse-View Gaussian Radiance Fields With Depth Normalization
Brightness-Aware Synthetic-to-Real Learning for Nighttime Hazy Image Enhancement
Codebook Transfer With Vision-to-Language Translation for Vector Quantization
Efficient Point Cloud Processing With High-Dimensional Positional Encoding and Non-Local MLPs
Enhancing MMDiT-Based Text-to-Image Models for Similar Subject Generation
Penny-Wise and Pound-Foolish in AI-Generated Image Detection
Toward a Unified Complementary Fusion Framework for Robust Polarimetric Imaging
Alignment-Invertibility Regularization for Explainable Neural Networks
FreeSplat++: Generalizable 3D Gaussian Splatting for Efficient Indoor Scene Reconstruction
D2S-RSG-SSD: Dual Double-Sampling With Random Sub-Samples Generation for Self-Supervised Real Image Denoising
Toward the Spectral Bias Alleviation by Normalizations in Coordinate Networks
A Natural Language Guided Approach for Blind Face Restoration: Methodology and Dataset
Generalized Distribution Aggregation Protocol for Federated Statistical Heterogeneity
Collaborative Feedback Discriminative Propagation for Video Super-Resolution
On the Adversarial Transferability of Generalized “Skip Connections”
SSD: Making Face Forgery Clues Evident Again With Self-Steganographic Detection
Reservoir-Based Graph Convolutional Networks
Graph-Embedded Deep Generative Clustering for Single-Cell Multi-Omics Data Integration
UDFStudio: A Unified Framework of Datasets, Benchmarks and Generative Models for Unsigned Distance Functions
Winsor-CAM: Human-Tunable Visual Explanations From Deep Networks via Layer-Wise Winsorization
Velocity Disambiguation for Video Frame Interpolation
Spatio-Temporal Decoupled Knowledge Compensator for Few-Shot Action Recognition
Local Causal Discovery With Background Knowledge
Visual-in-Visual: A Unified and Efficient Baseline for Image Restoration