Editorial: Introduction to the Special Section on Best of CVPR'2022
Learning to Solve Hard Minimal Problems
Dual-Shutter Optical Vibration Sensing
Ref-NeRF: Structured View-Dependent Appearance for Neural Radiance Fields
Ego4D: Around the World in 3,600 Hours of Egocentric Video
Transformer-Empowered Invariant Grounding for Video Question Answering
Light Field Neural Rendering
Beyond Binary: Improving Signed Message Passing in Graph Neural Networks for Multi-Class Graphs
Aligning Text-to-Image Diffusion Models With Constrained Reinforcement Learning
Unifying Graph Contrastive Learning via Graph Message Augmentation
Re-GAN: Data-Efficient GANs Training via Architectural Reconfiguration
Efficient Visual Transformer by Learnable Token Merging
SPOT: Scalable 3D Pre-Training via Occupancy Prediction for Learning Transferable 3D Representations
Spatial-Temporal Graph Mamba for Music-Guided Dance Video Synthesis
Protecting Feature Privacy in Person Re-Identification
Ins-HOI: Instance Aware Human-Object Interactions Recovery
Enhanced Dual-Pattern Matching With Vision-Language Representation for Out-of-Distribution Detection
ClusMatch: Improving Deep Clustering by Unified Positive and Negative Pseudo-Label Learning
One Neuron Saved is One Neuron Earned: On Parametric Efficiency of Quadratic Networks
Monocular-to-3D Virtual Try-On With Generative Semantic Articulated Fields
FLAG3D++: A Benchmark for 3D Fitness Activity Comprehension With Language Instruction
Another Vertical View: A Hierarchical Network for Heterogeneous Trajectory Prediction via Spectrums
Spatial Frequency Modulation for Semantic Segmentation
360VOTS: Visual Object Tracking and Segmentation in Omnidirectional Videos
Towards OOD Object Detection With Unknown-Concept Guided Feature Diffusion
Scalable Random Feature Latent Variable Models
Re-Boosting Self-Collaboration Parallel Prompt GAN for Unsupervised Image Restoration
An Efficient Image Fusion Network Exploiting Unifying Language and Mask Guidance
Accelerating Zero-Shot NAS With Feature Map-Based Proxy and Operation Scoring Function
Bringing Equity to Classification: Domain Generalization for Domain-Linked Classes
Towards Reliable and Faithful Explanations: A Disentanglement-Augmented Approach for Selective Rationalization
Sel4FT: Annotation Selection for Pretraining-Finetuning With Distribution Shift
Inv-Adapter: ID Customization Generation via Image Inversion and Lightweight Parameter Adapter
EPIC-SOUNDS: A Large-Scale Dataset of Actions That Sound
Reconstructing High Quality Raw Video Using Temporal Affinity and Diffusion Prior
M3DM-NR: RGB-3D Noisy-Resistant Industrial Anomaly Detection via Multimodal Denoising
Multilingual-Prompt-Guided Directional Feature Learning for Weakly Supervised Video Anomaly Detection
NUPES: Non-Uniform Post-Training Quantization via Power Exponent Search
SEMI-CAVA: A Causal Variational Approach to Semi-Supervised Learning
Simple Lifelong Learning Machines
Dynamic Inference by Model Reduction
Lightweight and Accurate Multi-View Stereo With Confidence-Aware Diffusion Model
Learning Dual-Stream Conditional Concepts in Compositional Zero-Shot Learning
Deep Equilibrium Object Detection and Segmentation
Selection, Ensemble, and Adaptation: Advancing Multi-Source-Free Domain Adaptation via Architecture Zoo
Parameter-Inverted Image Pyramid Networks for Visual Perception and Multimodal Understanding
Self-Supervised Hypergraph Training Framework via Structure-Aware Learning
AFC-RNN: Adaptive Forgetting-Controlled Recurrent Neural Network for Pedestrian Trajectory Prediction
Towards Universal Modal Tracking With Online Dense Temporal Token Learning
HAC++: Towards 100X Compression of 3D Gaussian Splatting