Daily Papers
- Defect Detection
- Defect Segmentation
- Anomaly Detection
- 3D Anomaly Detection
- Multimodal Anomaly Detection
- Vector Quantization
Updated on 2026.04.07
Defect Detection
| Date | Title | Authors | Paper | Code | Comments |
|---|---|---|---|---|---|
| 2026-4-5 | A Self-Evolving Defect Detection Framework for Industrial Photovoltaic Systems | Haoyu He et.al | paper | - | - |
| 2026-4-4 | Shower-Aware Dual-Stream Voxel Networks for Structural Defect Detection in Cosmic-Ray Muon Tomography | Parthiv Dasgupta et.al | paper | - | - |
| 2026-4-1 | Open-Set Supervised 3D Anomaly Detection: An Industrial Dataset and a Generalisable Framework for Unknown Defects | Hanzhe Liang et.al | paper | code | <summary>detail</summary>Resources: https://github |
| 2026-3-23 | SteelDefectX: A Coarse-to-Fine Vision-Language Dataset and Benchmark for Generalizable Steel Surface Defect Detection | Shuxian Zhao et.al | paper | code | <summary>detail</summary>This paper was submitted to CVPR 2026 |
| 2026-3-15 | Multi-Period Texture Contrast Enhancement for Low-Contrast Wafer Defect Detection and Segmentation | Zihan Zhang et.al | paper | - | - |
| 2026-3-14 | Multi-View Camera System for Variant-Aware Autonomous Vehicle Inspection and Defect Detection | Yash Kulkarni et.al | paper | - | - |
| 2026-3-13 | Vision-Language Based Expert Reporting for Painting Authentication and Defect Detection | Eman Ouda et.al | paper | - | <summary>detail</summary>Submitted to Journal of Cultural Heritage |
| 2026-3-11 | StructDamage: A Large Scale Unified Crack and Surface Defect Dataset for Robust Structural Damage Detection | Misbah Ijaz et.al | paper | - | - |
| 2026-3-6 | ExDD: Explicit Dual Distribution Learning for Surface Defect Detection via Diffusion Synthesis | Muhammad Aqeel et.al | paper | code | <summary>detail</summary>ICIAP 2025 |
| 2026-2-20 | Cross-Modal Purification and Fusion for Small-Object RGB-D Transmission-Line Defect Detection | Jiaming Cui et.al | paper | - | - |
| 2026-2-11 | Defect-aware Hybrid Prompt Optimization via Progressive Tuning for Zero-Shot Multi-type Anomaly Detection and Segmentation | Nadeem Nazer et.al | paper | - | - |
| 2026-1-22 | A Segmentation-driven Editing Method for Bolt Defect Augmentation and Detection | Yangjie Xiao et.al | paper | code | - |
| 2026-1-16 | SME-YOLO: A Real-Time Detector for Tiny Defect Detection on PCB Surfaces | Meng Han et.al | paper | - | - |
| 2026-1-15 | NATLM: Detecting Defects in NFT Smart Contracts Leveraging LLM | Yuanzheng Niu et.al | paper | - | - |
| 2026-1-13 | LPCAN: Lightweight Pyramid Cross-Attention Network for Rail Surface Defect Detection Using RGB-D Data | Jackie Alex et.al | paper | - | - |
| 2026-1-9 | SSR: Safeguarding Staking Rewards by Defining and Detecting Logical Defects in DeFi Staking | Zewei Lin et.al | paper | - | - |
| 2026-1-1 | Application Research of a Deep Learning Model Integrating CycleGAN and YOLO in PCB Infrared Defect Detection | Chao Yang et.al | paper | - | - |
| 2025-12-16 | TACK Tunnel Data (TTD): A Benchmark Dataset for Deep Learning-Based Defect Detection in Tunnels | Andreas Sjölander et.al | paper | - | - |
| 2025-12-12 | A Comparative Analysis of Semiconductor Wafer Map Defect Detection with Image Transformer | Sushmita Nath et.al | paper | - | <summary>detail</summary>submit/7075585 |
| 2025-12-11 | Unsupervised Learning for Industrial Defect Detection: A Case Study on Shearographic Data | Jessica Plassmann et.al | paper | - | - |
| 2025-12-5 | SPFFNet: Strip Perception and Feature Fusion Spatial Pyramid Pooling for Fabric Defect Detection | Peizhe Zhao et.al | paper | - | - |
| 2025-12-5 | Automated Annotation of Shearographic Measurements Enabling Weakly Supervised Defect Detection | Jessica Plassmann et.al | paper | - | - |
| 2025-11-25 | Automated Neural Architecture Design for Industrial Defect Detection | Yuxi Liu et.al | paper | code | - |
| 2025-11-17 | Saliency-Guided Deep Learning for Bridge Defect Detection in Drone Imagery | Loucif Hebbache et.al | paper | - | - |
| 2025-11-13 | TinyDef-DETR: A Transformer-Based Framework for Defect Detection in Transmission Lines from UAV Imagery | Feng Shen et.al | paper | - | - |
Defect Segmentation
| Date | Title | Authors | Paper | Code | Comments |
|---|---|---|---|---|---|
| 2026-3-15 | Multi-Period Texture Contrast Enhancement for Low-Contrast Wafer Defect Detection and Segmentation | Zihan Zhang et.al | paper | - | - |
| 2026-2-11 | Defect-aware Hybrid Prompt Optimization via Progressive Tuning for Zero-Shot Multi-type Anomaly Detection and Segmentation | Nadeem Nazer et.al | paper | - | - |
| 2026-1-22 | A Segmentation-driven Editing Method for Bolt Defect Augmentation and Detection | Yangjie Xiao et.al | paper | code | - |
| 2025-11-24 | A Storage-Efficient Feature for 3D Concrete Defect Segmentation to Replace Normal Vector | Linxin Hua et.al | paper | - | - |
| 2025-11-8 | Point Cloud Segmentation of Integrated Circuits Package Substrates Surface Defects Using Causal Inference: Dataset Construction and Methodology | Bingyang Guo et.al | paper | - | - |
| 2025-11-6 | KARMA: Efficient Structural Defect Segmentation via Kolmogorov-Arnold Representation Learning | Md Meftahul Ferdaus et.al | paper | code | <summary>detail</summary>This work has been submitted to the IEEE for possible publication |
| 2025-10-15 | Sample-Centric Multi-Task Learning for Detection and Segmentation of Industrial Surface Defects | Hang-Cheng Dong et.al | paper | - | - |
| 2025-10-6 | Attention-Enhanced Prototypical Learning for Few-Shot Infrastructure Defect Segmentation | Christina Thrainer et.al | paper | - | - |
| 2025-10-1 | Defect Segmentation in OCT scans of ceramic parts for non-destructive inspection using deep learning | Andrés Laveda-Martínez et.al | paper | - | - |
| 2025-9-11 | Unsupervised Integrated-Circuit Defect Segmentation via Image-Intrinsic Normality | Botong Zhao et.al | paper | - | - |
| 2025-8-6 | MultiADS: Defect-aware Supervision for Multi-type Anomaly Detection and Segmentation in Zero-Shot Learning | Ylli Sadikaj et.al | paper | - | - |
| 2025-7-23 | Exploring Active Learning for Semiconductor Defect Segmentation | Lile Cai et.al | paper | - | <summary>detail</summary>accepted to ICIP 2022 |
| 2025-7-14 | Advancing Automatic Photovoltaic Defect Detection using Semi-Supervised Semantic Segmentation of Electroluminescence Images | Abhishek Jha et.al | paper | code | - |
| 2025-6-28 | Region-Aware CAM: High-Resolution Weakly-Supervised Defect Segmentation via Salient Region Perception | Hang-Cheng Dong et.al | paper | - | - |
| 2025-6-24 | Evolutionary computing-based image segmentation method to detect defects and features in Additive Friction Stir Deposition Process | Akshansh Mishra et.al | paper | - | - |
| 2025-6-17 | synth-dacl: Does Synthetic Defect Data Enhance Segmentation Accuracy and Robustness for Real-World Bridge Inspections? | Johannes Flotzinger et.al | paper | - | - |
| 2025-4-24 | Conformal Segmentation in Industrial Surface Defect Detection with Statistical Guarantees | Cheng Shen et.al | paper | - | <summary>detail</summary>Under Review |
| 2025-4-11 | Weakly Supervised Panoptic Segmentation for Defect-Based Grading of Fresh Produce | Manuel Knott et.al | paper | code | <summary>detail</summary>Accepted as a paper to the 6th International Workshop on Agriculture-Vision: Challenges & Opportunities for Computer Vision in Agriculture in conjunction with IEEE/CVF CVPR 2025 |
| 2025-2-11 | Exploring Few-Shot Defect Segmentation in General Industrial Scenarios with Metric Learning and Vision Foundation Models | Tongkun Liu et.al | paper | code | - |
| 2025-1-23 | Effective Defect Detection Using Instance Segmentation for NDI | Ashiqur Rahman et.al | paper | code | - |
| 2025-1-17 | Multi-Modal Attention Networks for Enhanced Segmentation and Depth Estimation of Subsurface Defects in Pulse Thermography | Mohammed Salah et.al | paper | - | <summary>detail</summary>Pulse thermography |
| 2024-10-24 | Synth4Seg – Learning Defect Data Synthesis for Defect Segmentation using Bi-level Optimization | Shancong Mou et.al | paper | - | - |
| 2024-10-1 | Application of Segment Anything Model for Civil Infrastructure Defect Assessment | Mohsen Ahmadi et.al | paper | - | - |
| 2024-9-20 | Cycle-Consistency Uncertainty Estimation for Visual Prompting based One-Shot Defect Segmentation | Geonuk Kim et.al | paper | - | <summary>detail</summary>ECCV 2024 VISION workshop Most Innovative Prize |
| 2024-8-31 | Change-Aware Siamese Network for Surface Defects Segmentation under Complex Background | Biyuan Liu et.al | paper | - | - |
Anomaly Detection
| Date | Title | Authors | Paper | Code | Comments |
|---|---|---|---|---|---|
| 2026-4-6 | Cyber-Physical Systems Security: A Comprehensive Review of Anomaly Detection Techniques | Danial Abshari et.al | paper | - | <summary>detail</summary>Journal ref:Internet of Things |
| 2026-4-6 | Synthesis4AD: Synthetic Anomalies are All You Need for 3D Anomaly Detection | Yihan Sun et.al | paper | code | - |
| 2026-4-6 | InCTRLv2: Generalist Residual Models for Few-Shot Anomaly Detection and Segmentation | Jiawen Zhu et.al | paper | - | - |
| 2026-4-5 | SubspaceAD: Training-Free Few-Shot Anomaly Detection via Subspace Modeling | Camile Lendering et.al | paper | code | <summary>detail</summary>CVPR 2026 |
| 2026-4-5 | Extended Hybrid Timed Petri Nets with Semi-Supervised Anomaly Detection for Switched Systems, Modelling and Fault Detection | Fatiha Hamdi et.al | paper | - | <summary>detail</summary>Journal ref:Journal of the Franklin Institute |
| 2026-4-5 | Hierarchical Point-Patch Fusion with Adaptive Patch Codebook for 3D Shape Anomaly Detection | Xueyang Kang et.al | paper | - | - |
| 2026-4-4 | Systematic Integration of Digital Twins and Constrained LLMs for Interpretable Cyber-Physical Anomaly Detection | Konstantinos E. Kampourakis et.al | paper | - | - |
| 2026-4-4 | Advancing Pre-trained Teacher: Towards Robust Feature Discrepancy for Anomaly Detection | Canhui Tang et.al | paper | - | <summary>detail</summary>Accepted by IEEE Transactions on Image Processing (TIP) |
| 2026-4-3 | QVAD: A Question-Centric Agentic Framework for Efficient and Training-Free Video Anomaly Detection | Lokman Bekit et.al | paper | - | - |
| 2026-4-3 | SPG: Sparse-Projected Guides with Sparse Autoencoders for Zero-Shot Anomaly Detection | Tomoyasu Nanaumi et.al | paper | - | - |
| 2026-4-3 | ADSeeker: A Knowledge-Grounded Reasoning Framework for Industry Anomaly Detection and Reasoning | Kai Zhang et.al | paper | - | - |
| 2026-4-2 | Financial Anomaly Detection for the Canadian Market | Luigi Caputi et.al | paper | - | <summary>detail</summary>MSC Class:68T09 |
| 2026-4-2 | Semantic Iterative Reconstruction: One-Shot Universal Anomaly Detection | Ning Zhu et.al | paper | - | - |
| 2026-4-2 | Matrix Profile for Time-Series Anomaly Detection: A Reproducible Open-Source Benchmark on TSB-AD | Chin-Chia Michael Yeh et.al | paper | code | - |
| 2026-4-2 | Modulate-and-Map: Crossmodal Feature Mapping with Cross-View Modulation for 3D Anomaly Detection | Alex Costanzino et.al | paper | - | <summary>detail</summary>CVPR Findings 2026 |
| 2026-4-2 | GenGait: A Transformer-Based Model for Human Gait Anomaly Detection and Normative Twin Generation | Elisa Motta et.al | paper | code | - |
| 2026-4-2 | CANDI: Curated Test-Time Adaptation for Multivariate Time-Series Anomaly Detection Under Distribution Shift | HyunGi Kim et.al | paper | - | <summary>detail</summary>AAAI 2026 |
| 2026-4-2 | GridVAD: Open-Set Video Anomaly Detection via Spatial Reasoning over Stratified Frame Grids | Mohamed Eltahir et.al | paper | code | - |
| 2026-4-2 | Labels Matter More Than Models: Rethinking the Unsupervised Paradigm in Time Series Anomaly Detection | Zhijie Zhong et.al | paper | code | - |
| 2026-4-2 | Towards Transparent and Efficient Anomaly Detection in Industrial Processes through ExIFFI | Davide Frizzo et.al | paper | - | <summary>detail</summary>Submitted to IEEE Transaction on Industry Applications |
| 2026-4-1 | Open-Set Supervised 3D Anomaly Detection: An Industrial Dataset and a Generalisable Framework for Unknown Defects | Hanzhe Liang et.al | paper | code | <summary>detail</summary>Resources: https://github |
| 2026-4-1 | VMAD: Visual-enhanced Multimodal Large Language Model for Zero-Shot Anomaly Detection | Huilin Deng et.al | paper | - | - |
| 2026-4-1 | Perturb-and-Restore: Simulation-driven Structural Augmentation Framework for Imbalance Chromosomal Anomaly Detection | Yilan Zhang et.al | paper | - | <summary>detail</summary>This preprint version of the manuscript has been submitted to the IEEE Journal of Biomedical and Health Informatics (JBHI) for review |
| 2026-4-1 | Neuro-Symbolic Process Anomaly Detection | Devashish Gaikwad et.al | paper | - | <summary>detail</summary>CAiSE2026 |
| 2026-3-31 | mmAnomaly: Leveraging Visual Context for Robust Anomaly Detection in the Non-Visual World with mmWave Radar | Tarik Reza Toha et.al | paper | - | <summary>detail</summary>the 24th ACM/IEEE International Conference on Embedded Artificial Intelligence and Sensing Systems (SenSys 2026) |
3D Anomaly Detection
| Date | Title | Authors | Paper | Code | Comments |
|---|---|---|---|---|---|
| 2026-4-6 | Synthesis4AD: Synthetic Anomalies are All You Need for 3D Anomaly Detection | Yihan Sun et.al | paper | code | - |
| 2026-4-5 | Hierarchical Point-Patch Fusion with Adaptive Patch Codebook for 3D Shape Anomaly Detection | Xueyang Kang et.al | paper | - | - |
| 2026-4-2 | Modulate-and-Map: Crossmodal Feature Mapping with Cross-View Modulation for 3D Anomaly Detection | Alex Costanzino et.al | paper | - | <summary>detail</summary>CVPR Findings 2026 |
| 2026-4-1 | Open-Set Supervised 3D Anomaly Detection: An Industrial Dataset and a Generalisable Framework for Unknown Defects | Hanzhe Liang et.al | paper | code | <summary>detail</summary>Resources: https://github |
| 2026-3-26 | A Semantically Disentangled Unified Model for Multi-category 3D Anomaly Detection | SuYeon Kim et.al | paper | - | <summary>detail</summary>Accepted by CVPR 2026 |
| 2026-3-22 | Back to Point: Exploring Point-Language Models for Zero-Shot 3D Anomaly Detection | Kaiqiang Li et.al | paper | code | <summary>detail</summary>Accepted by CVPR 2026 |
| 2026-3-4 | Cross-Modal Mapping and Dual-Branch Reconstruction for 2D-3D Multimodal Industrial Anomaly Detection | Radia Daci et.al | paper | code | - |
| 2026-2-25 | GS-CLIP: Zero-shot 3D Anomaly Detection by Geometry-Aware Prompt and Synergistic View Representation Learning | Zehao Deng et.al | paper | code | <summary>detail</summary>Accepted by CVPR 2026 |
| 2026-2-16 | Training-Free Zero-Shot Anomaly Detection in 3D Brain MRI with 2D Foundation Models | Tai Le-Gia et.al | paper | - | <summary>detail</summary>Accepted for MIDL 2026 |
| 2026-2-11 | DMP-3DAD: Cross-Category 3D Anomaly Detection via Realistic Depth Map Projection with Few Normal Samples | Zi Wang et.al | paper | - | - |
| 2025-12-15 | 3D Human-Human Interaction Anomaly Detection | Shun Maeda et.al | paper | - | - |
| 2025-12-14 | A Lightweight 3D Anomaly Detection Method with Rotationally Invariant Features | Hanzhe Liang et.al | paper | - | <summary>detail</summary>Preprint |
| 2025-11-23 | PointAD+: Learning Hierarchical Representations for Zero-shot 3D Anomaly Detection | Qihang Zhou et.al | paper | - | <summary>detail</summary>Submitted to TPAMI |
| 2025-11-16 | CASL: Curvature-Augmented Self-supervised Learning for 3D Anomaly Detection | Yaohua Zha et.al | paper | code | <summary>detail</summary>AAAI 2026 |
| 2025-11-5 | IEC3D-AD: A 3D Dataset of Industrial Equipment Components for Unsupervised Point Cloud Anomaly Detection | Bingyang Guo et.al | paper | - | - |
| 2025-10-19 | Registration is a Powerful Rotation-Invariance Learner for 3D Anomaly Detection | Yuyang Yu et.al | paper | - | - |
| 2025-10-14 | IterMask3D: Unsupervised Anomaly Detection and Segmentation with Test-Time Iterative Mask Refinement in 3D Brain MR | Ziyun Liang et.al | paper | code | <summary>detail</summary>Published in Medical Image Analysis |
| 2025-9-23 | 3D-ADAM: A Dataset for 3D Anomaly Detection in Additive Manufacturing | Paul McHard et.al | paper | - | - |
| 2025-9-16 | Taming Anomalies with Down-Up Sampling Networks: Group Center Preserving Reconstruction for 3D Anomaly Detection | Hanzhe Liang et.al | paper | - | <summary>detail</summary>ACM MM25 Accepted |
| 2025-9-12 | MCL-AD: Multimodal Collaboration Learning for Zero-Shot 3D Anomaly Detection | Gang Li et.al | paper | - | <summary>detail</summary>Page 14 |
| 2025-8-28 | IAENet: An Importance-Aware Ensemble Model for 3D Point Cloud-Based Anomaly Detection | Xuanming Cao et.al | paper | - | - |
| 2025-8-19 | Stereo-based 3D Anomaly Object Detection for Autonomous Driving: A New Dataset and Baseline | Shiyi Mu et.al | paper | code | <summary>detail</summary>under review |
| 2025-8-2 | C3D-AD: Toward Continual 3D Anomaly Detection via Kernel Attention with Learnable Advisor | Haoquan Lu et.al | paper | code | <summary>detail</summary>We have provided the code for C3D-AD with checkpoints and BASELINE at this link: https://github |
| 2025-8-1 | SiM3D: Single-instance Multiview Multimodal and Multisetup 3D Anomaly Detection Benchmark | Alex Costanzino et.al | paper | code | <summary>detail</summary>ICCV 2025 |
| 2025-8-1 | HyPCV-Former: Hyperbolic Spatio-Temporal Transformer for 3D Point Cloud Video Anomaly Detection | Jiaping Cao et.al | paper | - | - |
Multimodal Anomaly Detection
| Date | Title | Authors | Paper | Code | Comments |
|---|---|---|---|---|---|
| 2026-4-1 | VMAD: Visual-enhanced Multimodal Large Language Model for Zero-Shot Anomaly Detection | Huilin Deng et.al | paper | - | - |
| 2026-3-29 | Bidirectional Multimodal Prompt Learning with Scale-Aware Training for Few-Shot Multi-Class Anomaly Detection | Yujin Lee et.al | paper | - | <summary>detail</summary>accepted to CVPR 2026 |
| 2026-3-23 | Multimodal Industrial Anomaly Detection via Geometric Prior | Min Li et.al | paper | - | <summary>detail</summary>Accepted for publication in IEEE Transactions on Circuits and Systems for Video Technology (TCSVT) |
| 2026-3-23 | Towards Multimodal Time Series Anomaly Detection with Semantic Alignment and Condensed Interaction | Shiyan Hu et.al | paper | code | <summary>detail</summary>ICLR 2026 |
| 2026-3-23 | Exploring Multimodal Prompts For Unsupervised Continuous Anomaly Detection | Mingle Zhou et.al | paper | - | - |
| 2026-3-4 | Are Multimodal LLMs Ready for Surveillance? A Reality Check on Zero-Shot Anomaly Detection in the Wild | Shanle Yao et.al | paper | - | - |
| 2026-3-4 | Cross-Modal Mapping and Dual-Branch Reconstruction for 2D-3D Multimodal Industrial Anomaly Detection | Radia Daci et.al | paper | code | - |
| 2026-3-3 | Towards an Incremental Unified Multimodal Anomaly Detection: Augmenting Multimodal Denoising From an Information Bottleneck Perspective | Kaifang Long et.al | paper | - | - |
| 2026-2-26 | Leveraging Multimodal LLM Descriptions of Activity for Explainable Semi-Supervised Video Anomaly Detection | Furkan Mumcu et.al | paper | - | - |
| 2026-2-23 | EAGLE: Expert-Augmented Attention Guidance for Tuning-Free Industrial Anomaly Detection in Multimodal Large Language Models | Xiaomeng Peng et.al | paper | - | - |
| 2026-2-17 | Can Multimodal LLMs Perform Time Series Anomaly Detection? | Xiongxiao Xu et.al | paper | - | <summary>detail</summary>ACM Web Conference 2026 (WWW’26) |
| 2026-2-11 | Enhancing Weakly Supervised Multimodal Video Anomaly Detection through Text Guidance | Shengyang Sun et.al | paper | - | <summary>detail</summary>Accepted by IEEE Transactions on Multimedia |
| 2026-2-9 | AnomSeer: Reinforcing Multimodal LLMs to Reason for Time-Series Anomaly Detection | Junru Zhang et.al | paper | - | <summary>detail</summary>Preprint |
| 2026-1-22 | VTFusion: A Vision-Text Multimodal Fusion Network for Few-Shot Anomaly Detection | Yuxin Jiang et.al | paper | - | - |
| 2026-1-20 | Physic-HM: Restoring Physical Generative Logic in Multimodal Anomaly Detection via Hierarchical Modulation | Xiao Liu et.al | paper | - | <summary>detail</summary>Working in progress |
| 2025-11-23 | Multimodal Real-Time Anomaly Detection and Industrial Applications | Aman Verma et.al | paper | - | - |
| 2025-11-16 | Commonality in Few: Few-Shot Multimodal Anomaly Detection via Hypergraph-Enhanced Memory | Yuxuan Lin et.al | paper | code | <summary>detail</summary>Accepted by AAAI 2026 |
| 2025-11-10 | RobustA: Robust Anomaly Detection in Multimodal Data | Salem AlMarri et.al | paper | - | <summary>detail</summary>Submitted to IEEE Transactions on Image Processing |
| 2025-10-16 | Incomplete Multimodal Industrial Anomaly Detection via Cross-Modal Distillation | Wenbo Sui et.al | paper | - | <summary>detail</summary>For a published version refer to the Information Fusion |
| 2025-10-15 | IAD-GPT: Advancing Visual Knowledge in Multimodal Large Language Model for Industrial Anomaly Detection | Zewen Li et.al | paper | code | <summary>detail</summary>Accepted by IEEE Transactions on Instrumentation and Measurement (TIM) |
| 2025-9-12 | MCL-AD: Multimodal Collaboration Learning for Zero-Shot 3D Anomaly Detection | Gang Li et.al | paper | - | <summary>detail</summary>Page 14 |
| 2025-8-20 | PB-IAD: Utilizing multimodal foundation models for semantic industrial anomaly detection in dynamic manufacturing environments | Bernd Hofmann et.al | paper | - | - |
| 2025-8-6 | AD-FM: Multimodal LLMs for Anomaly Detection via Multi-Stage Reasoning and Fine-Grained Reward Optimization | Jingyi Liao et.al | paper | - | - |
| 2025-8-1 | SiM3D: Single-instance Multiview Multimodal and Multisetup 3D Anomaly Detection Benchmark | Alex Costanzino et.al | paper | code | <summary>detail</summary>ICCV 2025 |
| 2025-7-25 | BridgeNet: A Unified Multimodal Framework for Bridging 2D and 3D Industrial Anomaly Detection | An Xiang et.al | paper | code | - |
Vector Quantization
| Date | Title | Authors | Paper | Code | Comments |
|---|---|---|---|---|---|
| 2026-4-3 | Minimal Information Control Invariance via Vector Quantization | Ege Yuceel et.al | paper | - | - |
| 2026-3-29 | SmaAT-QMix-UNet: A Parameter-Efficient Vector-Quantized UNet for Precipitation Nowcasting | Nikolas Stavrou et.al | paper | code | - |
| 2026-3-23 | DiVeQ: Differentiable Vector Quantization Using the Reparameterization Trick | Mohammad Hassan Vali et.al | paper | - | - |
| 2026-3-22 | Texture Vector-Quantization and Reconstruction Aware Prediction for Generative Super-Resolution | Qifan Li et.al | paper | - | <summary>detail</summary>ICLR 2026 |
| 2026-3-17 | Mitigating Premature Discretization with Progressive Quantization for Robust Vector Tokenization | Wenhao Zhao et.al | paper | - | - |
| 2026-3-17 | VQKV: High-Fidelity and High-Ratio Cache Compression via Vector-Quantization | Yixuan Wang et.al | paper | - | - |
| 2026-3-14 | Purrception: Variational Flow Matching for Vector-Quantized Image Generation | Răzvan-Andrei Matişan et.al | paper | - | <summary>detail</summary>Published as a conference paper at ICLR 2026 |
| 2026-3-11 | Leech Lattice Vector Quantization for Efficient LLM Compression | Tycho F. A. van der Ouderaa et.al | paper | - | - |
| 2026-3-4 | Vector-Quantized Soft Label Compression for Dataset Distillation | Ali Abbasi et.al | paper | - | - |
| 2026-3-3 | ProGIC: Progressive and Lightweight Generative Image Compression with Residual Vector Quantization | Hao Cao et.al | paper | - | - |
| 2026-2-24 | KBVQ-MoE: KLT-guided SVD with Bias-Corrected Vector Quantization for MoE Large Language Models | Zukang Xu et.al | paper | - | <summary>detail</summary>Accepted by ICLR 2026 |
| 2026-2-22 | VQEL: Enabling Self-Play in Emergent Language Games via Agent-Internal Vector Quantization | Mohammad Mahdi Samiei Paqaleh et.al | paper | - | - |
| 2026-2-21 | Beyond Stationarity: Rethinking Codebook Collapse in Vector Quantization | Hao Lu et.al | paper | code | - |
| 2026-2-19 | VP-VAE: Rethinking Vector Quantization via Adaptive Vector Perturbation | Linwei Zhai et.al | paper | - | - |
| 2026-2-12 | Multiscale Vector-Quantized Variational Autoencoder for Endoscopic Image Synthesis | Dimitrios E. Diamantis et.al | paper | - | <summary>detail</summary>Journal ref:Proc |
| 2026-2-7 | Residual Vector Quantization For Communication-Efficient Multi-Agent Perception | Dereje Shenkut et.al | paper | - | <summary>detail</summary>ICASSP 2026 |
| 2026-2-6 | Online Vector Quantized Attention | Nick Alonso et.al | paper | - | - |
| 2026-2-5 | Price of universality in vector quantization is at most 0.11 bit | Alina Harbuzova et.al | paper | - | <summary>detail</summary>41 page |
| 2026-2-5 | Vector Quantization using Gaussian Variational Autoencoder | Tongda Xu et.al | paper | - | - |
| 2026-2-4 | VQ-DSC-R: Robust Vector Quantized-Enabled Digital Semantic Communication With OFDM Transmission | Jianqiao Chen et.al | paper | - | - |
| 2026-2-2 | Vector Quantized Latent Concepts: A Scalable Alternative to Clustering-Based Concept Discovery | Xuemin Yu et.al | paper | - | - |
| 2026-2-2 | Revisiting Adaptive Rounding with Vectorized Reparameterization for LLM Quantization | Yuli Zhou et.al | paper | code | - |
| 2026-2-2 | Bandwidth-Efficient Multi-Agent Communication through Information Bottleneck and Vector Quantization | Ahmad Farooq et.al | paper | - | <summary>detail</summary>the 2026 IEEE International Conference on Robotics and Automation (ICRA 2026) |
| 2026-2-2 | ParaGSE: Parallel Generative Speech Enhancement with Group-Vector-Quantization-based Neural Speech Codec | Fei Liu et.al | paper | - | <summary>detail</summary>Accepted by ICASSP 2026 |
| 2026-2-1 | Generalized Radius and Integrated Codebook Transforms for Differentiable Vector Quantization | Haochen You et.al | paper | - | <summary>detail</summary>This paper has been accepted as a conference paper at CPAL 2026 |