Daily Papers
- Defect Detection
- Defect Segmentation
- Anomaly Detection
- 3D Anomaly Detection
- Multimodal Anomaly Detection
- Vector Quantization
Updated on 2025.10.25
Defect Detection
| Date | Title | Authors | Paper | Code | Comments |
|---|---|---|---|---|---|
| 2025-10-16 | Real-Time Surgical Instrument Defect Detection via Non-Destructive Testing | Qurrat Ul Ain et.al | paper | - | - |
| 2025-10-16 | TinyDef-DETR: A Transformer-Based Framework for Defect Detection in Transmission Lines from UAV Imagery | Feng Shen et.al | paper | - | - |
| 2025-10-16 | BoardVision: Deployment-ready and Robust Motherboard Defect Detection with YOLO+Faster-RCNN Ensemble | Brandon Hill et.al | paper | - | This paper has been submitted to IEEE/CVF WACV 2026 Applications track and is currently under review |
| 2025-10-15 | InfraGPT Smart Infrastructure: An End-to-End VLM-Based Framework for Detecting and Managing Urban Defects | Ibrahim Sheikh Mohamed et.al | paper | - | - |
| 2025-10-15 | Sample-Centric Multi-Task Learning for Detection and Segmentation of Industrial Surface Defects | Hang-Cheng Dong et.al | paper | - | - |
| 2025-10-8 | Transformer-Based Indirect Structural Health Monitoring of Rail Infrastructure with Attention-Driven Detection and Localization of Transient Defects | Sizhe Ma et.al | paper | - | Preprint presented at the 15th International Workshop on Structural Health Monitoring (IWSHM) |
| 2025-10-8 | Automated Neural Architecture Design for Industrial Defect Detection | Yuxi Liu et.al | paper | code | - |
| 2025-10-7 | Kaputt: A Large-Scale Dataset for Visual Defect Detection | Sebastian Höfer et.al | paper | code | ICCV 2025 |
| 2025-10-3 | Automated Defect Detection for Mass-Produced Electronic Components Based on YOLO Object Detection Models | Wei-Lung Mao et.al | paper | - | - |
| 2025-10-2 | YOLO-Based Defect Detection for Metal Sheets | Po-Heng Chou et.al | paper | - | - |
| 2025-9-30 | Multi-View Camera System for Variant-Aware Autonomous Vehicle Inspection and Defect Detection | Yash Kulkarni et.al | paper | - | - |
| 2025-9-25 | Unsupervised Defect Detection for Surgical Instruments | Joseph Huang et.al | paper | - | - |
| 2025-9-25 | A Real-Time On-Device Defect Detection Framework for Laser Power-Meter Sensors via Unsupervised Learning | Dongqi Zheng et.al | paper | - | - |
| 2025-9-23 | Advancing Metallic Surface Defect Detection via Anomaly-Guided Pretraining on a Large Industrial Dataset | Chuni Liu et.al | paper | code | - |
| 2025-9-19 | ISP-AD: A Large-Scale Real-World Dataset for Advancing Industrial Anomaly Detection with Synthetic and Real Defects | Paul J. Krassnig et.al | paper | code | - |
| 2025-9-6 | On the Detection of Internal Defects in Structured Media | Bryl Nico M. Ong et.al | paper | - | - |
| 2025-9-4 | YOLO Ensemble for UAV-based Multispectral Defect Detection in Wind Turbine Components | Serhii Svystun et.al | paper | - | The 13th IEEE International Conference on Intelligent Data Acquisition and Advanced Computing Systems: Technology and Applications |
| 2025-9-3 | Joint Training of Image Generator and Detector for Road Defect Detection | Kuan-Chuan Peng et.al | paper | - | This paper is accepted to ICCV 2025 Workshop on Representation Learning with Very Limited Resources: When Data |
| 2025-9-2 | Resilient Multimodal Industrial Surface Defect Detection with Uncertain Sensors Availability | Shuai Jiang et.al | paper | code | IEEE/ASME Transactions on Mechatronics |
| 2025-9-2 | Vision-Based Object Detection for UAV Solar Panel Inspection Using an Enhanced Defects Dataset | Ashen Rodrigo et.al | paper | code | - |
| 2025-9-2 | FusWay: Multimodal hybrid fusion approach. Application to Railway Defect Detection | Alexey Zhukov et.al | paper | - | - |
| 2025-9-1 | TransMatch: A Transfer-Learning Framework for Defect Detection in Laser Powder Bed Fusion Additive Manufacturing | Mohsen Asghari Ilani et.al | paper | - | - |
| 2025-8-31 | Surface Defect Detection with Gabor Filter Using Reconstruction-Based Blurring U-Net-ViT | Jongwook Si et.al | paper | - | - |
| 2025-8-26 | No Label Left Behind: A Unified Surface Defect Detection Model for all Supervision Regimes | Blaž Rolih et.al | paper | code | Accepted by The Journal of Intelligent Manufacturing |
| 2025-8-22 | A Lightweight Group Multiscale Bidirectional Interactive Network for Real-Time Steel Surface Defect Detection | Yong Zhang et.al | paper | code | - |
Defect Segmentation
| Date | Title | Authors | Paper | Code | Comments |
|---|---|---|---|---|---|
| 2025-10-15 | Sample-Centric Multi-Task Learning for Detection and Segmentation of Industrial Surface Defects | Hang-Cheng Dong et.al | paper | - | - |
| 2025-10-6 | Attention-Enhanced Prototypical Learning for Few-Shot Infrastructure Defect Segmentation | Christina Thrainer et.al | paper | - | - |
| 2025-10-1 | Defect Segmentation in OCT scans of ceramic parts for non-destructive inspection using deep learning | Andrés Laveda-Martínez et.al | paper | - | - |
| 2025-9-11 | Unsupervised Integrated-Circuit Defect Segmentation via Image-Intrinsic Normality | Botong Zhao et.al | paper | - | - |
| 2025-8-14 | A Segmentation-driven Editing Method for Bolt Defect Augmentation and Detection | Yangjie Xiao et.al | paper | code | - |
| 2025-8-11 | KARMA: Efficient Structural Defect Segmentation via Kolmogorov-Arnold Representation Learning | Md Meftahul Ferdaus et.al | paper | code | submitted to IEEE Transactions on Pattern Analysis and Machine Intelligence |
| 2025-8-6 | MultiADS: Defect-aware Supervision for Multi-type Anomaly Detection and Segmentation in Zero-Shot Learning | Ylli Sadikaj et.al | paper | - | - |
| 2025-7-23 | Exploring Active Learning for Semiconductor Defect Segmentation | Lile Cai et.al | paper | - | accepted to ICIP 2022 |
| 2025-7-14 | Advancing Automatic Photovoltaic Defect Detection using Semi-Supervised Semantic Segmentation of Electroluminescence Images | Abhishek Jha et.al | paper | code | - |
| 2025-6-28 | Region-Aware CAM: High-Resolution Weakly-Supervised Defect Segmentation via Salient Region Perception | Hang-Cheng Dong et.al | paper | - | - |
| 2025-6-24 | Evolutionary computing-based image segmentation method to detect defects and features in Additive Friction Stir Deposition Process | Akshansh Mishra et.al | paper | - | - |
| 2025-6-17 | synth-dacl: Does Synthetic Defect Data Enhance Segmentation Accuracy and Robustness for Real-World Bridge Inspections? | Johannes Flotzinger et.al | paper | - | - |
| 2025-4-24 | Conformal Segmentation in Industrial Surface Defect Detection with Statistical Guarantees | Cheng Shen et.al | paper | - | Under Review |
| 2025-4-11 | Weakly Supervised Panoptic Segmentation for Defect-Based Grading of Fresh Produce | Manuel Knott et.al | paper | code | Accepted as a paper to the 6th International Workshop on Agriculture-Vision: Challenges & Opportunities for Computer Vision in Agriculture in conjunction with IEEE/CVF CVPR 2025 |
| 2025-2-11 | Exploring Few-Shot Defect Segmentation in General Industrial Scenarios with Metric Learning and Vision Foundation Models | Tongkun Liu et.al | paper | code | - |
| 2025-1-23 | Effective Defect Detection Using Instance Segmentation for NDI | Ashiqur Rahman et.al | paper | code | - |
| 2025-1-17 | Multi-Modal Attention Networks for Enhanced Segmentation and Depth Estimation of Subsurface Defects in Pulse Thermography | Mohammed Salah et.al | paper | - | Pulse thermography |
| 2024-10-24 | Synth4Seg – Learning Defect Data Synthesis for Defect Segmentation using Bi-level Optimization | Shancong Mou et.al | paper | - | - |
| 2024-10-1 | Application of Segment Anything Model for Civil Infrastructure Defect Assessment | Mohsen Ahmadi et.al | paper | - | - |
| 2024-9-20 | Cycle-Consistency Uncertainty Estimation for Visual Prompting based One-Shot Defect Segmentation | Geonuk Kim et.al | paper | - | ECCV 2024 VISION workshop Most Innovative Prize |
| 2024-8-31 | Change-Aware Siamese Network for Surface Defects Segmentation under Complex Background | Biyuan Liu et.al | paper | - | - |
| 2024-8-19 | Imbalance-Aware Culvert-Sewer Defect Segmentation Using an Enhanced Feature Pyramid Network | Rasha Alshawi et.al | paper | - | - |
| 2024-8-19 | Dynamic Label Injection for Imbalanced Industrial Defect Segmentation | Emanuele Caruso et.al | paper | code | ECCV 2024 VISION Workshop |
| 2024-6-26 | An unsupervised approach towards promptable defect segmentation in laser-based additive manufacturing by Segment Anything | Israt Zarin Era et.al | paper | - | - |
| 2024-4-17 | CarcassFormer: An End-to-end Transformer-based Framework for Simultaneous Localization, Segmentation and Classification of Poultry Carcass Defect | Minh Tran et.al | paper | code | Poultry Science Journal |
Anomaly Detection
| Date | Title | Authors | Paper | Code | Comments |
|---|---|---|---|---|---|
| 2025-10-23 | Uncovering Anomalous Events for Marine Environmental Monitoring via Visual Anomaly Detection | Laura Weihl et.al | paper | - | - |
| 2025-10-23 | Rebellious Student: A Complementary Learning Framework for Background Feature Enhancement in Hyperspectral Anomaly Detection | Wenping Jin et.al | paper | code | - |
| 2025-10-23 | GMFVAD: Using Grained Multi-modal Feature to Improve Video Anomaly Detection | Guangyu Dai et.al | paper | - | - |
| 2025-10-21 | Securing IoT Communications via Anomaly Traffic Detection: Synergy of Genetic Algorithm and Ensemble Method | Behnam Seyedi et.al | paper | - | - |
| 2025-10-21 | An Encode-then-Decompose Approach to Unsupervised Time Series Anomaly Detection on Contaminated Training Data–Extended Version | Buang Zhang et.al | paper | - | - |
| 2025-10-21 | DCAD-2000: A Multilingual Dataset across 2000+ Languages with Data Cleaning as Anomaly Detection | Yingli Shen et.al | paper | - | NeurIPS 2025 Datasets and Benchmarks Track |
| 2025-10-21 | ShortcutBreaker: Low-Rank Noisy Bottleneck with Global Perturbation Attention for Multi-Class Unsupervised Anomaly Detection | Peng Tang et.al | paper | - | Under Review |
| 2025-10-21 | Scalable, Explainable and Provably Robust Anomaly Detection with One-Step Flow Matching | Zhong Li et.al | paper | - | Paper accepted by NeurIPS 2025 |
| 2025-10-20 | BlockScan: Detecting Anomalies in Blockchain Transactions | Jiahao Yu et.al | paper | - | - |
| 2025-10-20 | VelocityNet: Real-Time Crowd Anomaly Detection via Person-Specific Velocity Analysis | Fatima AlGhamdi et.al | paper | - | - |
| 2025-10-20 | Batch Distillation Data for Developing Machine Learning Anomaly Detection Methods | Justus Arweiler et.al | paper | - | - |
| 2025-10-20 | SAVANT: Semantic Analysis with Vision-Augmented Anomaly deTection | Roberto Brusnicki et.al | paper | - | - |
| 2025-10-20 | One Dinomaly2 Detect Them All: A Unified Framework for Full-Spectrum Unsupervised Anomaly Detection | Jia Guo et.al | paper | - | Extended version of CVPR2025 |
| 2025-10-20 | Formally Exploring Time-Series Anomaly Detection Evaluation Metrics | Dennis Wagner et.al | paper | - | - |
| 2025-10-20 | OCR-APT: Reconstructing APT Stories from Audit Logs using Subgraph Anomaly Detection and LLMs | Ahmed Aly et.al | paper | code | This is the authors’ extended version of the paper accepted for publication at the ACM SIGSAC Conference on Computer and Communications Security (CCS 2025) |
| 2025-10-20 | Hyperspectral Anomaly Detection Fused Unified Nonconvex Tensor Ring Factors Regularization | Wenjin Qin et.al | paper | - | - |
| 2025-10-20 | Physics-Informed Large Language Models for HVAC Anomaly Detection with Autonomous Rule Generation | Subin Lin et.al | paper | - | NeurIPS 2025 Workshop of UrbanAI (Oral) |
| 2025-10-19 | Explainable Heterogeneous Anomaly Detection in Financial Networks via Adaptive Expert Routing | Zan Li et.al | paper | - | - |
| 2025-10-19 | Registration is a Powerful Rotation-Invariance Learner for 3D Anomaly Detection | Yuyang Yu et.al | paper | - | - |
| 2025-10-19 | Kick Bad Guys Out! Conditionally Activated Anomaly Detection in Federated Learning with Zero-Knowledge Proof Verification | Shanshan Han et.al | paper | - | - |
| 2025-10-18 | Cross-Domain Graph Anomaly Detection via Test-Time Training with Homophily-Guided Self-Supervision | Delaram Pirhayati et.al | paper | - | Transactions on Machine Learning Research (TMLR) |
| 2025-10-18 | Structured Temporal Causality for Interpretable Multivariate Time Series Anomaly Detection | Dongchan Cho et.al | paper | - | Accepted by NeurIPS 2025 |
| 2025-10-18 | MIRAD - A comprehensive real-world robust anomaly detection dataset for Mass Individualization | Pulin Li et.al | paper | code | https://github |
| 2025-10-17 | Cerberus: Real-Time Video Anomaly Detection via Cascaded Vision-Language Models | Yue Zheng et.al | paper | - | - |
| 2025-10-17 | Robust Anomaly Detection through Multi-Modal Autoencoder Fusion for Small Vehicle Damage Detection | Sara Khan et.al | paper | - | - |
3D Anomaly Detection
| Date | Title | Authors | Paper | Code | Comments |
|---|---|---|---|---|---|
| 2025-10-19 | Registration is a Powerful Rotation-Invariance Learner for 3D Anomaly Detection | Yuyang Yu et.al | paper | - | - |
| 2025-10-14 | IterMask3D: Unsupervised Anomaly Detection and Segmentation with Test-Time Iterative Mask Refinement in 3D Brain MR | Ziyun Liang et.al | paper | code | Published in Medical Image Analysis |
| 2025-10-9 | PointAD+: Learning Hierarchical Representations for Zero-shot 3D Anomaly Detection | Qihang Zhou et.al | paper | - | Submitted to TPAMI |
| 2025-9-23 | 3D-ADAM: A Dataset for 3D Anomaly Detection in Additive Manufacturing | Paul McHard et.al | paper | - | - |
| 2025-9-16 | Taming Anomalies with Down-Up Sampling Networks: Group Center Preserving Reconstruction for 3D Anomaly Detection | Hanzhe Liang et.al | paper | - | ACM MM25 Accepted |
| 2025-9-12 | MCL-AD: Multimodal Collaboration Learning for Zero-Shot 3D Anomaly Detection | Gang Li et.al | paper | - | Page 14 |
| 2025-8-28 | IAENet: An Importance-Aware Ensemble Model for 3D Point Cloud-Based Anomaly Detection | Xuanming Cao et.al | paper | - | - |
| 2025-8-19 | Stereo-based 3D Anomaly Object Detection for Autonomous Driving: A New Dataset and Baseline | Shiyi Mu et.al | paper | code | under review |
| 2025-8-2 | C3D-AD: Toward Continual 3D Anomaly Detection via Kernel Attention with Learnable Advisor | Haoquan Lu et.al | paper | code | We have provided the code for C3D-AD with checkpoints and BASELINE at this link: https://github |
| 2025-8-1 | SiM3D: Single-instance Multiview Multimodal and Multisetup 3D Anomaly Detection Benchmark | Alex Costanzino et.al | paper | code | ICCV 2025 |
| 2025-8-1 | HyPCV-Former: Hyperbolic Spatio-Temporal Transformer for 3D Point Cloud Video Anomaly Detection | Jiaping Cao et.al | paper | - | - |
| 2025-7-29 | Multi-View Reconstruction with Global Context for 3D Anomaly Detection | Yihan Sun et.al | paper | - | - |
| 2025-7-27 | Position: Untrained Machine Learning for Anomaly Detection by using 3D Point Cloud Data | Juan Du et.al | paper | - | - |
| 2025-7-25 | BridgeNet: A Unified Multimodal Framework for Bridging 2D and 3D Industrial Anomaly Detection | An Xiang et.al | paper | code | - |
| 2025-7-24 | MC3D-AD: A Unified Geometry-aware Reconstruction Model for Multi-category 3D Anomaly Detection | Jiayi Cheng et.al | paper | code | - |
| 2025-7-17 | 3DKeyAD: High-Resolution 3D Point Cloud Anomaly Detection via Keypoint-Guided Point Clustering | Zi Wang et.al | paper | - | - |
| 2025-7-10 | Towards High-Resolution 3D Anomaly Detection: A Scalable Dataset and Real-Time Framework for Subtle Industrial Defects | Yuqi Cheng et.al | paper | - | - |
| 2025-6-3 | DAS3D: Dual-modality Anomaly Synthesis for 3D Anomaly Detection | Kecen Li et.al | paper | code | Code available at https://github |
| 2025-5-27 | Mentor3AD: Feature Reconstruction-based 3D Anomaly Detection via Multi-modality Mentor Learning | Hanzhe Liang et.al | paper | - | arXiv admin comment: This version has been removed by arXiv administrators as the submitter did not have the rights to agree to the license at the time of submission |
| 2025-5-15 | Examining the Source of Defects from a Mechanical Perspective for 3D Anomaly Detection | Hanzhe Liang et.al | paper | code | - |
| 2025-4-19 | Real-IAD D3: A Real-World 2D/Pseudo-3D/3D Dataset for Industrial Anomaly Detection | Wenbing Zhu et.al | paper | code | - |
| 2025-3-30 | Self-Supervised Masked Mesh Learning for Unsupervised Anomaly Detection on 3D Cortical Surfaces | Hao-Chun Yang et.al | paper | - | - |
| 2025-3-21 | A Survey on RGB, 3D, and Multimodal Approaches for Unsupervised Industrial Image Anomaly Detection | Yuxuan Lin et.al | paper | code | Accepted by Information Fusion |
| 2025-3-10 | Look Inside for More: Internal Spatial Modality Perception for 3D Anomaly Detection | Hanzhe Liang et.al | paper | - | AAAI2025 Poster |
| 2025-3-3 | Fence Theorem: Towards Dual-Objective Semantic-Structure Isolation in Preprocessing Phase for 3D Anomaly Detection | Hanzhe Liang et.al | paper | - | - |
Multimodal Anomaly Detection
| Date | Title | Authors | Paper | Code | Comments |
|---|---|---|---|---|---|
| 2025-10-16 | Leveraging Multimodal LLM Descriptions of Activity for Explainable Semi-Supervised Video Anomaly Detection | Furkan Mumcu et.al | paper | - | - |
| 2025-10-16 | Incomplete Multimodal Industrial Anomaly Detection via Cross-Modal Distillation | Wenbo Sui et.al | paper | - | For a published version refer to the Information Fusion |
| 2025-10-15 | IAD-GPT: Advancing Visual Knowledge in Multimodal Large Language Model for Industrial Anomaly Detection | Zewen Li et.al | paper | code | Accepted by IEEE Transactions on Instrumentation and Measurement (TIM) |
| 2025-9-12 | MCL-AD: Multimodal Collaboration Learning for Zero-Shot 3D Anomaly Detection | Gang Li et.al | paper | - | Page 14 |
| 2025-8-20 | PB-IAD: Utilizing multimodal foundation models for semantic industrial anomaly detection in dynamic manufacturing environments | Bernd Hofmann et.al | paper | - | - |
| 2025-8-6 | AD-FM: Multimodal LLMs for Anomaly Detection via Multi-Stage Reasoning and Fine-Grained Reward Optimization | Jingyi Liao et.al | paper | - | - |
| 2025-8-1 | SiM3D: Single-instance Multiview Multimodal and Multisetup 3D Anomaly Detection Benchmark | Alex Costanzino et.al | paper | code | ICCV 2025 |
| 2025-7-25 | BridgeNet: A Unified Multimodal Framework for Bridging 2D and 3D Industrial Anomaly Detection | An Xiang et.al | paper | code | - |
| 2025-7-25 | Tuned Reverse Distillation: Enhancing Multimodal Industrial Anomaly Detection with Crossmodal Tuners | Xinyue Liu et.al | paper | code | - |
| 2025-7-23 | HiProbe-VAD: Video Anomaly Detection via Hidden States Probing in Tuning-Free Multimodal LLMs | Zhaolin Cai et.al | paper | - | Accepted by ACM MM 2025 |
| 2025-6-23 | Multimodal Anomaly Detection with a Mixture-of-Experts | Christoph Willibald et.al | paper | - | - |
| 2025-6-20 | When Every Millisecond Counts: Real-Time Anomaly Detection via the Multimodal Asynchronous Hybrid Network | Dong Xiao et.al | paper | - | ICML 2025 Spotlight |
| 2025-6-4 | MemoryOut: Learning Principal Features via Multimodal Sparse Filtering Network for Semi-supervised Video Anomaly Detection | Juntong Li et.al | paper | code | - |
| 2025-5-28 | OmniAD: Detect and Understand Industrial Anomaly via Multimodal Reasoning | Shifang Zhao et.al | paper | - | - |
| 2025-5-19 | Multimodal RAG-driven Anomaly Detection and Classification in Laser Powder Bed Fusion using Large Language Models | Kiarash Naghavi Khanghah et.al | paper | - | ASME 2025 International Design Engineering Technical Conferences and Computers and Information in Engineering Conference IDETC/CIE2025 |
| 2025-5-8 | Uncertainty-Weighted Image-Event Multimodal Fusion for Video Anomaly Detection | Sungheon Jeong et.al | paper | code | - |
| 2025-4-17 | LAD-Reasoner: Tiny Multimodal Models are Good Reasoners for Logical Anomaly Detection | Weijia Li et.al | paper | - | - |
| 2025-3-21 | A Survey on RGB, 3D, and Multimodal Approaches for Unsupervised Industrial Image Anomaly Detection | Yuxuan Lin et.al | paper | code | Accepted by Information Fusion |
| 2025-3-17 | Towards Zero-Shot Anomaly Detection and Reasoning with Multimodal Large Language Models | Jiacong Xu et.al | paper | code | - |
| 2025-2-24 | Can Multimodal LLMs Perform Time Series Anomaly Detection? | Xiongxiao Xu et.al | paper | code | - |
| 2025-2-20 | MMAD: A Comprehensive Benchmark for Multimodal Large Language Models in Industrial Anomaly Detection | Xi Jiang et.al | paper | code | Accepted by ICLR 2025 |
| 2025-2-18 | Anomaly Detection in Smart Power Grids with Graph-Regularized MS-SVDD: a Multimodal Subspace Learning Approach | Thomas Debelle et.al | paper | - | - |
| 2025-2-10 | Multimodal Task Representation Memory Bank vs. Catastrophic Forgetting in Anomaly Detection | You Zhou et.al | paper | - | - |
| 2025-2-9 | A 3D Multimodal Feature for Infrastructure Anomaly Detection | Yixiong Jing et.al | paper | code | - |
| 2025-1-27 | Can Multimodal Large Language Models be Guided to Improve Industrial Anomaly Detection? | Zhiling Chen et.al | paper | - | - |
Vector Quantization
| Date | Title | Authors | Paper | Code | Comments |
|---|---|---|---|---|---|
| 2025-10-21 | Channel-Aware Vector Quantization for Robust Semantic Communication on Discrete Channels | Zian Meng et.al | paper | - | - |
| 2025-10-18 | AnTKV: Anchor Token-Aware Sub-Bit Vector Quantization for KV Cache in Large Language Models | Zeyu Li et.al | paper | - | - |
| 2025-10-16 | Vector Quantization in the Brain: Grid-like Codes in World Models | Xiangyuan Peng et.al | paper | - | - |
| 2025-10-16 | Group-Wise Optimization for Self-Extensible Codebooks in Vector Quantized Models | Hong-Kai Zheng et.al | paper | - | - |
| 2025-10-15 | GranQ: Efficient Channel-wise Quantization via Vectorized Pre-Scaling for Zero-Shot QAT | Inpyo Hong et.al | paper | - | - |
| 2025-10-14 | CARVQ: Corrective Adaptor with Group Residual Vector Quantization for LLM Embedding Compression | Dayin Gou et.al | paper | - | EMNLP Findings 2025 |
| 2025-10-10 | SQ-GAN: Semantic Image Communications Using Masked Vector Quantization | Francesco Pezone et.al | paper | - | arXiv admin note: substantial text overlap with arXiv:2502 |
| 2025-10-7 | VecInfer: Efficient LLM Inference with Low-Bit KV Cache via Outlier-Suppressed Vector Quantization | Dingyu Yao et.al | paper | - | - |
| 2025-10-3 | Addressing Representation Collapse in Vector Quantized Models with One Linear Layer | Yongxin Zhu et.al | paper | code | ICCV2025 |
| 2025-10-1 | Purrception: Variational Flow Matching for Vector-Quantized Image Generation | Răzvan-Andrei Matişan et.al | paper | - | - |
| 2025-9-30 | DiVeQ: Differentiable Vector Quantization Using the Reparameterization Trick | Mohammad Hassan Vali et.al | paper | - | - |
| 2025-9-30 | Texture Vector-Quantization and Reconstruction Aware Prediction for Generative Super-Resolution | Qifan Li et.al | paper | - | - |
| 2025-9-30 | PUREVQ-GAN: Defending Data Poisoning Attacks through Vector-Quantized Bottlenecks | Alexander Branch et.al | paper | - | - |
| 2025-9-26 | Graph is a Natural Regularization: Revisiting Vector Quantization for Graph Representation Learning | Zian Zhai et.al | paper | - | - |
| 2025-9-26 | Pushing Toward the Simplex Vertices: A Simple Remedy for Code Collapse in Smoothed Vector Quantization | Takashi Morita et.al | paper | - | - |
| 2025-9-26 | AUV: Teaching Audio Universal Vector Quantization with Single Nested Codebook | Yushen Chen et.al | paper | code | Submitted to ICASSP 2026 |
| 2025-9-25 | Residual Vector Quantization For Communication-Efficient Multi-Agent Perception | Dereje Shenkut et.al | paper | - | - |
| 2025-9-23 | RSAVQ: Riemannian Sensitivity-Aware Vector Quantization for Large Language Models | Zukang Xu et.al | paper | - | - |
| 2025-9-22 | Individualized non-uniform quantization for vector search | Mariano Tepper et.al | paper | - | - |
| 2025-9-16 | Improving 3D Gaussian Splatting Compression by Scene-Adaptive Lattice Vector Quantization | Hao Xu et.al | paper | code | Code available at https://github |
| 2025-9-15 | SAQ: Pushing the Limits of Vector Quantization through Code Adjustment and Dimension Segmentation | Hui Li et.al | paper | - | - |
| 2025-9-12 | Scalable Training for Vector-Quantized Networks with 100% Codebook Utilization | Yifan Chang et.al | paper | - | - |
| 2025-9-11 | A Vector-Quantized Foundation Model for Patient Behavior Monitoring | Rodrigo Oliver et.al | paper | - | - |
| 2025-9-4 | Kernel $k$-Medoids as General Vector Quantization | Thore Gerlach et.al | paper | - | - |
| 2025-8-25 | Scene-Aware Vectorized Memory Multi-Agent Framework with Cross-Modal Differentiated Quantization VLMs for Visually Impaired Assistance | Xiangxiang Wang et.al | paper | - | - |