cs.CV
Spectral Evolution Search: Efficient Inference-Time Scaling for Reward-Aligned Image Generation
WebSplatter: Enabling Cross-Device Efficient Gaussian Splatting in Web Browsers via WebGPU
Hand3R: Online 4D Hand-Scene Reconstruction in the Wild
BinaryDemoire: Moiré-Aware Binarization for Image Demoiréing
LSGQuant: Layer-Sensitivity Guided Quantization for One-Step Diffusion Real-World Video Super-Resolution
FSOD-VFM: Few-Shot Object Detection with Vision Foundation Models and Graph Diffusion
Spiral RoPE: Rotate Your Rotary Positional Embeddings in the 2D Plane
EventFlash: Towards Efficient MLLMs for Event-Based Vision
From Single Scan to Sequential Consistency: A New Paradigm for LIDAR Relocalization
InstaDrive: Instance-Aware Driving World Models for Realistic and Consistent Video Generation
LaVPR: Benchmarking Language and Vision for Place Recognition
PISA: Piecewise Sparse Attention Is Wiser for Efficient Diffusion Transformers
CountZES: Counting via Zero-Shot Exemplar Selection
DeepUrban: Interaction-Aware Trajectory Prediction and Planning for Automated Driving by Aerial Imagery
Model Optimization for Multi-Camera 3D Detection and Tracking
MapDream: Task-Driven Map Learning for Vision-Language Navigation
Happy Young Women, Grumpy Old Men? Emotion-Driven Demographic Biases in Synthetic Face Generation
Towards Sustainable Universal Deepfake Detection with Frequency-Domain Masking
Alignment of Diffusion Models: Fundamentals, Challenges, and Future
Driving on Registers
Comprehensive Machine Learning Benchmarking for Fringe Projection Profilometry with Photorealistic Synthetic Data
Rethinking Efficient Mixture-of-Experts for Remote Sensing Modality-Missing Classification