Evaluating and regulating agentic AI: A study of benchmarks, metrics, and regulation
Generalization saturation and reasoning divergence in synthetic persona construction for role-enacted language modeling
Knowledge-enhanced consistency learning with disentangled multimodal representation for misinformation detection
DAP-former: A physics-aware dual-stream fusion transformer for robust industrial time-series forecasting
FDSMM: fusion-driven signals-to-mechanisms modelling with physics-constrained information integration and entropy-based fault evolution
CausalCEM: Self-improved causal counterfactual emotion modeling for multimodal physiological signals
Federated learning with context-aware client collaboration: Challenges, advances, and open problems
SGA-SAC: Fusing Bayesian Stackelberg games to DRL with graph attention for explainable task offloading in vehicular edge computing
Fusion of advanced machine learning techniques in digital manipulation detection on identity card attacks
U2ENet: Unified end-to-end network for monocular image-based single-object and multi-object 3D visual grounding
Responsible AI in healthcare: Mitigating hallucinations and enhancing multimodal fusion - based reasoning in medical imaging
Knowledge-enhanced explicitly disentangled representation with missing modality for medical image diagnosis
A brain-inspired low-light and infrared multimodal remote sensing detection network for nighttime personnel search and rescue
Rank-aware routing decomposition for hyperspectral and multispectral image fusion
Probabilistic multi-label classification via a divide-and-conquer and fusion approach
Physics-guided stokes-consistent fusion of DoLP–AoP representations for polarization image restoration
MeSIF-Net: Multi-level cross-scale information fusion via knowledge distillation for low-resolution industrial surface defect detection
Motion estimation for multi-object tracking using KalmanNet with semantic-independent encoding
TA-Diff: Topology-aware diffusion networks for joint 3D hand pose estimation and mesh reconstruction
GIF-Calib: Geometry-intensity fusion for 4D radar-camera calibration in autonomous driving vehicles
Bridging 3D geometry and 1D temporal signals: A symmetric masked pre-training framework for heterogeneous multimodal fusion
PhraseEx: Enhancing back-translation data augmentation with phrase-level extraction for neural machine translation
Conditional diffusion transformer enabled few-shot spatiotemporal modeling
Toward trustworthy digital healthcare: A system-level convergence of IoMT, large language models, and explainable AI
VQFusion : Infrared and visible image fusion via codebook prior guidance
Trusted multi-factor fusion: A quantitative model for the confidence of naturalistic stimuli and multimodal neural responses
Listen2Track: Plug-and-play audio-visual matching for generic multi-object tracking
Hinting the unknown: Effective open-set multimodal emotion recognition with a hierarchical cross-modal emotion-interactive prompting approach
FEAP-DiT: Frequency-decoupled efficient adversarial purification with diffusion transformers
Decoding emotional nuances: A multimodal approach to detecting depression through audio, video, and text
Multi-Gaussian target tracking with binary detectors
SAFEDEC: Unifying proprioceptive and exteroceptive sensing for safe autonomous driving
Fusion of pseudo-label learning and subspace-structured graph for embedded feature selection
Nash equilibrium-guided dynamic cross-scale fusion and manifold alignment for remote sensing image semantic segmentation
Adaptive retrieval-augmented evidence-fusion for highly heterogeneous temporal entity alignment
Bridging quantum walks and graph learning: A fusion approach to structural encoding
Prediction of postoperative outcomes in spinal cord surgery via fusion of point-cloud geometric features and radiomics
DinoMamba: Semantic-prior-guided sequence modeling paradigm for underwater image enhancement
Mitigating Clever Hans strategies in image classifiers through generating counterexamples
LGCS-WA: Loss-guided clustering and dynamic client selection with weight adaptation for clustered federated learning
Diffusion-based frequency degradation prior fusion with hierarchical wavelet decompositions for underwater image enhancement
FedKSS: One-shot federated learning with kernel space statistics
A language-guided visual feature learning strategy for multimodal teacher action quality assessment
UFFL: Bridging slide- and cell-level annotations via unified feature fusion learning for cervical cancer screening
CCRDet: Cross-modal complementary region-aware framework for drone-based RGBT tiny person detection
BuildingCD-Mamba: An efficient shuffle-scan mamba with temporal-spatial feature fusion for urban building change detection
MTF-BFE: a multi-task learning framework for building footprint extraction by fusing historical cadastral maps with up-to-date remote sensing images
Dominance and complementarity in cross-modal representation learning for wearable time series
TGC-Net: a structure-aware and semantically-aligned framework for text-guided medical image segmentation
Heterogeneous model fusion for privacy-aware multi-Camera surveillance via synthetic domain adaptation