Daily Papers
- Defect Detection
- Defect Segmentation
- Anomaly Detection
- 3D Anomaly Detection
- Multimodal Anomaly Detection
- Vector Quantization
Updated on 2025.07.23
Defect Detection
Date | Title | Authors | Code | Comments | |
---|---|---|---|---|---|
2025-7-21 | A Steel Surface Defect Detection Method Based on Lightweight Convolution Optimization | Cong Chen et.al | paper | - | <summary>detail</summary>Journal ref:International Journal of Advanced Computer Science and Applications (IJACSA) |
2025-7-21 | RoadFusion: Latent Diffusion Model for Pavement Defect Detection | Muhammad Aqeel et.al | paper | - | <summary>detail</summary>ICIAP 2025 |
2025-7-21 | ExDD: Explicit Dual Distribution Learning for Surface Defect Detection via Diffusion Synthesis | Muhammad Aqeel et.al | paper | - | <summary>detail</summary>ICIAP 2025 |
2025-7-21 | VQGNet: An Unsupervised Defect Detection Approach for Complex Textured Steel Surfaces | R Yu et.al | paper | - | <summary>detail</summary>Sensors, 2024 mdpi.com |
2025-7-21 | Unsupervised Bearing Raceway Surface Defect Detection Based on Improved f-AnoGAN | Y Zhang et.al | paper | - | <summary>detail</summary>Measurement Science and…, 2024 iopscience.iop.org |
2025-7-15 | A Comprehensive Survey for Real-World Industrial Defect Detection: Challenges, Approaches, and Prospects | Yuqi Cheng et.al | paper | - | - |
2025-7-14 | Advancing Automatic Photovoltaic Defect Detection using Semi-Supervised Semantic Segmentation of Electroluminescence Images | Abhishek Jha et.al | paper | code | - |
2025-7-10 | NexViTAD: Few-shot Unsupervised Cross-Domain Defect Detection via Vision Foundation Models and Multi-Task Learning | Tianwei Mu et.al | paper | - | - |
2025-7-10 | Towards High-Resolution 3D Anomaly Detection: A Scalable Dataset and Real-Time Framework for Subtle Industrial Defects | Yuqi Cheng et.al | paper | - | - |
2025-7-7 | Semi-Supervised Defect Detection via Conditional Diffusion and CLIP-Guided Noise Filtering | Shuai Li et.al | paper | code | - |
2025-7-4 | MRC-DETR: An Adaptive Multi-Residual Coupled Transformer for Bare Board PCB Defect Detection | Jiangzhong Cao et.al | paper | - | - |
2025-6-30 | VR-YOLO: Enhancing PCB Defect Detection with Viewpoint Robustness Based on YOLO | Hengyi Zhu et.al | paper | - | - |
2025-6-26 | YOLO-FDA: Integrating Hierarchical Attention and Detail Enhancement for Surface Defect Detection | Jiawei Hu et.al | paper | - | - |
2025-6-24 | Evolutionary computing-based image segmentation method to detect defects and features in Additive Friction Stir Deposition Process | Akshansh Mishra et.al | paper | - | - |
2025-6-20 | From Lab to Factory: Pitfalls and Guidelines for Self-/Unsupervised Defect Detection on Low-Quality Industrial Images | Sebastian Hönel et.al | paper | - | - |
2025-6-16 | ESRPCB: an Edge guided Super-Resolution model and Ensemble learning for tiny Printed Circuit Board Defect detection | Xiem HoangVan et.al | paper | - | <summary>detail</summary>Published in Engineering Applications of Artificial Intelligence |
2025-6-12 | Deep Learning-based Multi Project InP Wafer Simulation for Unsupervised Surface Defect Detection | Emílio Dolgener Cantú et.al | paper | - | - |
2025-6-12 | Enhancing Glass Defect Detection with Diffusion Models: Addressing Imbalanced Datasets in Manufacturing Quality Control | Sajjad Rezvani Boroujeni et.al | paper | - | - |
2025-5-23 | Research on Defect Detection Method of Motor Control Board Based on Image Processing | Jingde Huang et.al | paper | - | - |
2025-5-15 | Examining the Source of Defects from a Mechanical Perspective for 3D Anomaly Detection | Hanzhe Liang et.al | paper | code | - |
2025-5-15 | Defect Detection in Photolithographic Patterns Using Deep Learning Models Trained on Synthetic Data | Prashant P. Shinde et.al | paper | - | - |
2025-5-12 | Self-Adaptive Gamma Context-Aware SSM-based Model for Metal Defect Detection | Sijin Sun et.al | paper | - | - |
2025-5-11 | Differentiable NMS via Sinkhorn Matching for End-to-End Fabric Defect Detection | Zhengyang Lu et.al | paper | - | - |
2025-5-11 | Transmission Line Defect Detection Based on UAV Patrol Images and Vision-language Pretraining | Ke Zhang et.al | paper | - | - |
2025-5-2 | A Comprehensive Survey on Machine Learning Driven Material Defect Detection | Jun Bai et.al | paper | - | <summary>detail</summary>ACM Computing Surveys |
2025-4-29 | SteelBlastQC: Shot-blasted Steel Surface Dataset with Interpretable Detection of Surface Defects | Irina Ruzavina et.al | paper | - | <summary>detail</summary>Accepted by IJCNN 2025 |
2025-4-24 | Conformal Segmentation in Industrial Surface Defect Detection with Statistical Guarantees | Cheng Shen et.al | paper | - | <summary>detail</summary>Under Review |
Defect Segmentation
Date | Title | Authors | Code | Comments | |
---|---|---|---|---|---|
2025-7-14 | Advancing Automatic Photovoltaic Defect Detection using Semi-Supervised Semantic Segmentation of Electroluminescence Images | Abhishek Jha et.al | paper | code | - |
2025-6-28 | Region-Aware CAM: High-Resolution Weakly-Supervised Defect Segmentation via Salient Region Perception | Hang-Cheng Dong et.al | paper | - | - |
2025-6-24 | Evolutionary computing-based image segmentation method to detect defects and features in Additive Friction Stir Deposition Process | Akshansh Mishra et.al | paper | - | - |
2025-6-17 | synth-dacl: Does Synthetic Defect Data Enhance Segmentation Accuracy and Robustness for Real-World Bridge Inspections? | Johannes Flotzinger et.al | paper | - | - |
2025-4-24 | Conformal Segmentation in Industrial Surface Defect Detection with Statistical Guarantees | Cheng Shen et.al | paper | - | <summary>detail</summary>Under Review |
2025-4-11 | Weakly Supervised Panoptic Segmentation for Defect-Based Grading of Fresh Produce | Manuel Knott et.al | paper | code | <summary>detail</summary>Accepted as a paper to the 6th International Workshop on Agriculture-Vision: Challenges & Opportunities for Computer Vision in Agriculture in conjunction with IEEE/CVF CVPR 2025 |
2025-4-9 | MultiADS: Defect-aware Supervision for Multi-type Anomaly Detection and Segmentation in Zero-Shot Learning | Ylli Sadikaj et.al | paper | - | - |
2025-2-11 | Exploring Few-Shot Defect Segmentation in General Industrial Scenarios with Metric Learning and Vision Foundation Models | Tongkun Liu et.al | paper | code | - |
2025-1-23 | Effective Defect Detection Using Instance Segmentation for NDI | Ashiqur Rahman et.al | paper | code | - |
2025-1-17 | Multi-Modal Attention Networks for Enhanced Segmentation and Depth Estimation of Subsurface Defects in Pulse Thermography | Mohammed Salah et.al | paper | - | <summary>detail</summary>Pulse thermography |
2024-10-24 | Synth4Seg – Learning Defect Data Synthesis for Defect Segmentation using Bi-level Optimization | Shancong Mou et.al | paper | - | - |
2024-10-1 | Application of Segment Anything Model for Civil Infrastructure Defect Assessment | Mohsen Ahmadi et.al | paper | - | - |
2024-9-20 | Cycle-Consistency Uncertainty Estimation for Visual Prompting based One-Shot Defect Segmentation | Geonuk Kim et.al | paper | - | <summary>detail</summary>ECCV 2024 VISION workshop Most Innovative Prize |
2024-8-31 | Change-Aware Siamese Network for Surface Defects Segmentation under Complex Background | Biyuan Liu et.al | paper | - | - |
2024-8-19 | Imbalance-Aware Culvert-Sewer Defect Segmentation Using an Enhanced Feature Pyramid Network | Rasha Alshawi et.al | paper | - | - |
2024-8-19 | Dynamic Label Injection for Imbalanced Industrial Defect Segmentation | Emanuele Caruso et.al | paper | code | <summary>detail</summary>ECCV 2024 VISION Workshop |
2024-6-26 | An unsupervised approach towards promptable defect segmentation in laser-based additive manufacturing by Segment Anything | Israt Zarin Era et.al | paper | - | - |
2024-4-17 | CarcassFormer: An End-to-end Transformer-based Framework for Simultaneous Localization, Segmentation and Classification of Poultry Carcass Defect | Minh Tran et.al | paper | code | <summary>detail</summary>Poultry Science Journal |
2024-3-17 | LERENet: Eliminating Intra-class Differences for Metal Surface Defect Few-shot Semantic Segmentation | Hanze Ding et.al | paper | - | - |
2024-2-6 | Multi-class Road Defect Detection and Segmentation using Spatial and Channel-wise Attention for Autonomous Road Repairing | Jongmin Yu et.al | paper | - | <summary>detail</summary>the ICRA 2024 |
2023-12-21 | Dual Attention U-Net with Feature Infusion: Pushing the Boundaries of Multiclass Defect Segmentation | Rasha Alshawi et.al | paper | - | <summary>detail</summary>under review in IEEE Transactions on Artificial Intelligence |
2023-12-8 | Continual learning for surface defect segmentation by subnetwork creation and selection | Aleksandr Dekhovich et.al | paper | - | - |
2023-12-6 | Overhead Line Defect Recognition Based on Unsupervised Semantic Segmentation | Weixi Wang et.al | paper | - | - |
2023-11-16 | Segment Anything in Defect Detection | Bozhen Hu et.al | paper | - | - |
2023-10-24 | Harmonizing output imbalance for defect segmentation on extremely-imbalanced photovoltaic module cells images | Jianye Yi et.al | paper | - | - |
Anomaly Detection
Date | Title | Authors | Code | Comments | |
---|---|---|---|---|---|
2025-7-22 | ViP$^2$-CLIP: Visual-Perception Prompting with Unified Alignment for Zero-Shot Anomaly Detection | Ziteng Yang et.al | paper | - | - |
2025-7-22 | Adaptive Gaussian Mixture Models-based Anomaly Detection for under-constrained Cable-Driven Parallel Robots | Julio Garrido et.al | paper | - | - |
2025-7-22 | One-for-More: Continual Diffusion Model for Anomaly Detection | Xiaofan Li et.al | paper | code | <summary>detail</summary>Accepted by CVPR2025 |
2025-7-22 | Diffusion-Based Electrocardiography Noise Quantification via Anomaly Detection | Tae-Seong Han et.al | paper | - | - |
2025-7-21 | SAGE: A Visual Language Model for Anomaly Detection via Fact Enhancement and Entropy-aware Alignment | Guoxin Zang et.al | paper | code | <summary>detail</summary>Accepted by ACMMM2025 |
2025-7-21 | Investigation of unsupervised and supervised hyperspectral anomaly detection | Mazharul Hossain et.al | paper | - | <summary>detail</summary>Published in Proceedings Volume 13138 |
2025-7-21 | Explainable Anomaly Detection for Electric Vehicles Charging Stations | Matteo Cederle et.al | paper | - | - |
2025-7-21 | Towards Explainable Anomaly Detection in Shared Mobility Systems | Elnur Isgandarov et.al | paper | - | - |
2025-7-21 | We Need to Rethink Benchmarking in Anomaly Detection | Philipp Röchner et.al | paper | - | - |
2025-7-21 | Foundation Models and Transformers for Anomaly Detection: A Survey | Mouïn Ben Ammar et.al | paper | - | - |
2025-7-21 | Self-Tuning Self-Supervised Image Anomaly Detection | Jaemin Yoo et.al | paper | code | <summary>detail</summary>KDD 2025 |
2025-7-20 | A Comprehensive Library for Benchmarking Multi-class Visual Anomaly Detection | Jiangning Zhang et.al | paper | code | - |
2025-7-19 | ALLO: A Photorealistic Dataset and Data Generation Pipeline for Anomaly Detection During Robotic Proximity Operations in Lunar Orbit | Selina Leveugle et.al | paper | - | <summary>detail</summary>Submitted to International Conference on Robotics and Automation (ICRA’25) |
2025-7-19 | Revisiting Graph Contrastive Learning on Anomaly Detection: A Structural Imbalance Perspective | Yiming Xu et.al | paper | - | <summary>detail</summary>Accepted by AAAI2025 |
2025-7-19 | MoViAD: A Modular Library for Visual Anomaly Detection | Manuel Barusco et.al | paper | - | - |
2025-7-18 | Unmasking Performance Gaps: A Comparative Study of Human Anonymization and Its Effects on Video Anomaly Detection | Sara Abdulaziz et.al | paper | - | <summary>detail</summary>ACIVS 2025 |
2025-7-18 | Robust Anomaly Detection with Graph Neural Networks using Controllability | Yifan Wei et.al | paper | - | <summary>detail</summary>conference paper published in IEEE CAI 2025 |
2025-7-17 | Position: Untrained Machine Learning for Anomaly Detection by using 3D Point Cloud Data | Juan Du et.al | paper | - | - |
2025-7-17 | Salvaging the Overlooked: Leveraging Class-Aware Contrastive Learning for Multi-Class Anomaly Detection | Lei Fan et.al | paper | code | <summary>detail</summary>Accepted by ICCV2025 |
2025-7-17 | 3DKeyAD: High-Resolution 3D Point Cloud Anomaly Detection via Keypoint-Guided Point Clustering | Zi Wang et.al | paper | - | - |
2025-7-17 | ProDisc-VAD: An Efficient System for Weakly-Supervised Anomaly Detection in Video Surveillance Applications | Tao Zhu et.al | paper | code | <summary>detail</summary>arXiv admin comment: This version has been removed by arXiv administrators as the submitter did not have the rights to agree to the license at the time of submission |
2025-7-16 | Unifying Explainable Anomaly Detection and Root Cause Analysis in Dynamical Systems | Yue Sun et.al | paper | - | <summary>detail</summary>Accepted by the AAAI-25 Workshop on Artificial Intelligence for Cyber Security (AICS) |
2025-7-16 | Text-ADBench: Text Anomaly Detection Benchmark based on LLMs Embedding | Feng Xiao et.al | paper | code | - |
2025-7-15 | Learning Representations of Event Time Series with Sparse Autoencoders for Anomaly Detection, Similarity Search, and Unsupervised Classification | Steven Dillmann et.al | paper | code | <summary>detail</summary>the 2025 ICML Workshop on Machine Learning for Astrophysics |
2025-7-15 | TAB: Unified Benchmarking of Time Series Anomaly Detection Methods | Xiangfei Qiu et.al | paper | code | <summary>detail</summary>Accepted by PVLDB2025 |
3D Anomaly Detection
Date | Title | Authors | Code | Comments | |
---|---|---|---|---|---|
2025-7-17 | Position: Untrained Machine Learning for Anomaly Detection by using 3D Point Cloud Data | Juan Du et.al | paper | - | - |
2025-7-17 | 3DKeyAD: High-Resolution 3D Point Cloud Anomaly Detection via Keypoint-Guided Point Clustering | Zi Wang et.al | paper | - | - |
2025-7-12 | Stereo-based 3D Anomaly Object Detection for Autonomous Driving: A New Dataset and Baseline | Shiyi Mu et.al | paper | code.) | <summary>detail</summary>under review |
2025-7-10 | 3D-ADAM: A Dataset for 3D Anomaly Detection in Advanced Manufacturing | Paul McHard et.al | paper | - | - |
2025-7-10 | Towards High-Resolution 3D Anomaly Detection: A Scalable Dataset and Real-Time Framework for Subtle Industrial Defects | Yuqi Cheng et.al | paper | - | - |
2025-7-5 | Taming Anomalies with Down-Up Sampling Networks: Group Center Preserving Reconstruction for 3D Anomaly Detection | Hanzhe Liang et.al | paper | - | <summary>detail</summary>ACM MM25 Accepted |
2025-6-26 | SiM3D: Single-instance Multiview Multimodal and Multisetup 3D Anomaly Detection Benchmark | Alex Costanzino et.al | paper | - | - |
2025-6-23 | Uni-3DAD: GAN-Inversion Aided Universal 3D Anomaly Detection on Model-free Products | J Liu et.al | paper | code | - |
2025-6-22 | Efficient Slice Anomaly Detection Network for 3D Brain MRI Volume | Z Zhang et.al | paper | code | - |
2025-6-16 | Knowledge-informed randomized machine learning and data fusion for anomaly areas detection in multimodal 3D images | N Alsahanova et.al | paper | - | <summary>detail</summary>Information…, 2025 Elsevier |
2025-6-3 | DAS3D: Dual-modality Anomaly Synthesis for 3D Anomaly Detection | Kecen Li et.al | paper | code | <summary>detail</summary>Code available at https://github |
2025-5-30 | Learning in CubeRes Model Space for Anomaly Detection in 3D GPR Data | X Zhou et.al | paper | - | <summary>detail</summary>ijcai.org |
2025-5-27 | Mentor3AD: Feature Reconstruction-based 3D Anomaly Detection via Multi-modality Mentor Learning | Hanzhe Liang et.al | paper | - | <summary>detail</summary>arXiv admin comment: This version has been removed by arXiv administrators as the submitter did not have the rights to agree to the license at the time of submission |
2025-5-27 | Anomaly Behavior Detection in Crowd via Lightweight 3D Convolution | J Wang et.al | paper | - | <summary>detail</summary>International Conference on Intelligent Computing, 2024 Springer |
2025-5-24 | 3D Industrial anomaly detection via dual reconstruction network | Z Li et.al | paper | - | <summary>detail</summary>Applied Intelligence, 2024 Springer |
2025-5-22 | Spatially Aware Fusion in 3D Convolutional Autoencoders for Video Anomaly Detection | A Niaz et.al | paper | - | <summary>detail</summary>IEEE Access, 2024 ieeexplore.ieee.org |
2025-5-18 | Towards High-resolution 3D Anomaly Detection via Group-Level Feature Contrastive Learning | H Zhu et.al | paper | code | - |
2025-5-15 | Examining the Source of Defects from a Mechanical Perspective for 3D Anomaly Detection | Hanzhe Liang et.al | paper | code | - |
2025-5-13 | IGSPAD: Inverting 3D Gaussian Splatting for Pose-agnostic Anomaly Detection | B Jiang et.al | paper | - | <summary>detail</summary>ACM Multimedia 2024 openreview.net |
2025-5-9 | R3D-AD: Reconstruction via Diffusion for 3D Anomaly Detection | Z Zhou et.al | paper | code | - |
2025-5-3 | MC3D-AD: A Unified Geometry-aware Reconstruction Model for Multi-category 3D Anomaly Detection | Jiayi Cheng et.al | paper | - | - |
2025-4-19 | Real-IAD D3: A Real-World 2D/Pseudo-3D/3D Dataset for Industrial Anomaly Detection | Wenbing Zhu et.al | paper | code | - |
2025-4-7 | IterMask3D: Unsupervised Anomaly Detection and Segmentation with Test-Time Iterative Mask Refinement in 3D Brain MR | Ziyun Liang et.al | paper | - | - |
2025-3-30 | Self-Supervised Masked Mesh Learning for Unsupervised Anomaly Detection on 3D Cortical Surfaces | Hao-Chun Yang et.al | paper | - | - |
2025-3-21 | A Survey on RGB, 3D, and Multimodal Approaches for Unsupervised Industrial Image Anomaly Detection | Yuxuan Lin et.al | paper | code | <summary>detail</summary>Accepted by Information Fusion |
2025-3-10 | Look Inside for More: Internal Spatial Modality Perception for 3D Anomaly Detection | Hanzhe Liang et.al | paper | - | <summary>detail</summary>AAAI2025 Poster |
2025-3-3 | Fence Theorem: Towards Dual-Objective Semantic-Structure Isolation in Preprocessing Phase for 3D Anomaly Detection | Hanzhe Liang et.al | paper | - | - |
2025-2-16 | Exploiting Point-Language Models with Dual-Prompts for 3D Anomaly Detection | Jiaxiang Wang et.al | paper | - | - |
2025-2-9 | A 3D Multimodal Feature for Infrastructure Anomaly Detection | Yixiong Jing et.al | paper | code | - |
2024-12-23 | Revisiting Multimodal Fusion for 3D Anomaly Detection from an Architectural Perspective | Kaifang Long et.al | paper | - | - |
2024-12-22 | PointAD: Comprehending 3D Anomalies from Points and Pixels for Zero-shot 3D Anomaly Detection | Qihang Zhou et.al | paper | - | <summary>detail</summary>NeurIPS 2024 |
2024-12-17 | PO3AD: Predicting Point Offsets toward Better 3D Point Cloud Anomaly Detection | Jianan Ye et.al | paper | - | - |
2024-10-15 | SplatPose+: Real-time Image-Based Pose-Agnostic 3D Anomaly Detection | Yizhe Liu et.al | paper | - | - |
Multimodal Anomaly Detection
Date | Title | Authors | Code | Comments | |
---|---|---|---|---|---|
2025-7-16 | Exploiting Multimodal Latent Diffusion Models for Accurate Anomaly Detection in Industry 5.0 | L Capogrosso et.al | paper | - | <summary>detail</summary>2024 ceur ws.org |
2025-7-12 | Multimodal Attention-Enhanced Feature Fusion-based Weekly Supervised Anomaly Violence Detection | Y Kaneko et.al | paper | code | - |
2025-7-4 | Memoryless Multimodal Anomaly Detection via Student-Teacher Network and Signed Distance Learning | Z Sun et.al | paper | code | - |
2025-6-26 | SiM3D: Single-instance Multiview Multimodal and Multisetup 3D Anomaly Detection Benchmark | Alex Costanzino et.al | paper | - | - |
2025-6-23 | Multimodal Anomaly Detection with a Mixture-of-Experts | Christoph Willibald et.al | paper | - | - |
2025-6-20 | When Every Millisecond Counts: Real-Time Anomaly Detection via the Multimodal Asynchronous Hybrid Network | Dong Xiao et.al | paper | - | <summary>detail</summary>ICML 2025 Spotlight |
2025-6-16 | Knowledge-informed randomized machine learning and data fusion for anomaly areas detection in multimodal 3D images | N Alsahanova et.al | paper | - | <summary>detail</summary>Information…, 2025 Elsevier |
2025-6-4 | MemoryOut: Learning Principal Features via Multimodal Sparse Filtering Network for Semi-supervised Video Anomaly Detection | Juntong Li et.al | paper | code | - |
2025-5-28 | OmniAD: Detect and Understand Industrial Anomaly via Multimodal Reasoning | Shifang Zhao et.al | paper | - | - |
2025-5-28 | DIP-ECOD: improving anomaly detection in multimodal distributions | K Yang et.al | paper | - | <summary>detail</summary>Conference on Applied…, 2024 pure.qub.ac.uk |
2025-5-19 | Multimodal RAG-driven Anomaly Detection and Classification in Laser Powder Bed Fusion using Large Language Models | Kiarash Naghavi Khanghah et.al | paper | - | <summary>detail</summary>ASME 2025 International Design Engineering Technical Conferences and Computers and Information in Engineering Conference IDETC/CIE2025 |
2025-5-8 | Uncertainty-Weighted Image-Event Multimodal Fusion for Video Anomaly Detection | Sungheon Jeong et.al | paper | code | - |
2025-4-23 | A Collaborative Framework Using Multimodal Data and Adaptive Noise for Human Behavior Anomaly Detection | G Yang et.al | paper | - | <summary>detail</summary>2024 International Joint…, 2024 ieeexplore.ieee.org |
2025-4-20 | Improving the accuracy of Anomaly Detection in Multimodal Sensors using 1D-CNN | M Imad et.al | paper | - | <summary>detail</summary>Proceedings of the 17th…, 2024 dl.acm.org |
2025-4-17 | LAD-Reasoner: Tiny Multimodal Models are Good Reasoners for Logical Anomaly Detection | Weijia Li et.al | paper | - | - |
2025-4-7 | Weakly-supervised anomaly detection for multimodal data distributions | X Tan et.al | paper | code | - |
2025-4-4 | Multimodal Industrial Anomaly Detection by Crossmodal Feature Mapping Supplementary Material | A Costanzino et.al | paper | - | <summary>detail</summary>openaccess.thecvf.com |
2025-3-29 | M3DM-NR: RGB-3D Noisy-Resistant Industrial Anomaly Detection via Multimodal Denoising | C Wang et.al | paper | code | - |
2025-3-21 | A Survey on RGB, 3D, and Multimodal Approaches for Unsupervised Industrial Image Anomaly Detection | Yuxuan Lin et.al | paper | code | <summary>detail</summary>Accepted by Information Fusion |
2025-3-19 | Multimodal Industrial Anomaly Detection by Crossmodal Reverse Distillation | Xinyue Liu et.al | paper | - | - |
2025-3-17 | Towards Zero-Shot Anomaly Detection and Reasoning with Multimodal Large Language Models | Jiacong Xu et.al | paper | code | - |
2025-2-24 | Can Multimodal LLMs Perform Time Series Anomaly Detection? | Xiongxiao Xu et.al | paper | code | - |
2025-2-20 | MMAD: A Comprehensive Benchmark for Multimodal Large Language Models in Industrial Anomaly Detection | Xi Jiang et.al | paper | code | <summary>detail</summary>Accepted by ICLR 2025 |
2025-2-18 | Anomaly Detection in Smart Power Grids with Graph-Regularized MS-SVDD: a Multimodal Subspace Learning Approach | Thomas Debelle et.al | paper | - | - |
2025-2-10 | Multimodal Task Representation Memory Bank vs. Catastrophic Forgetting in Anomaly Detection | You Zhou et.al | paper | - | - |
2025-2-9 | A 3D Multimodal Feature for Infrastructure Anomaly Detection | Yixiong Jing et.al | paper | code | - |
2025-1-27 | Can Multimodal Large Language Models be Guided to Improve Industrial Anomaly Detection? | Zhiling Chen et.al | paper | - | - |
2025-1-17 | Myriad: Large Multimodal Model by Applying Vision Experts for Industrial Anomaly Detection | Yuanze Li et.al | paper | code | - |
2024-12-23 | Revisiting Multimodal Fusion for 3D Anomaly Detection from an Architectural Perspective | Kaifang Long et.al | paper | - | - |
2024-9-30 | VMAD: Visual-enhanced Multimodal Large Language Model for Zero-Shot Anomaly Detection | Huilin Deng et.al | paper | - | - |
2024-9-26 | AnoVox: A Benchmark for Multimodal Anomaly Detection in Autonomous Driving | Daniel Bogdoll et.al | paper | - | <summary>detail</summary>Daniel Bogdoll |
2024-9-23 | Incomplete Multimodal Industrial Anomaly Detection via Cross-Modal Distillation | Wenbo Sui et.al | paper | - | - |
2024-7-8 | Multimodal Industrial Anomaly Detection by Crossmodal Feature Mapping | Alex Costanzino et.al | paper | - | <summary>detail</summary>CVPR 2024 |
Vector Quantization
Date | Title | Authors | Code | Comments | |
---|---|---|---|---|---|
2025-7-20 | Vector Quantization Prompting for Continual Learning | Li Jiao et.al | paper | code | <summary>detail</summary>Accepted by NeurIPS 2024 |
2025-7-14 | A Vector-Quantized Foundation Model for Patient Behavior Monitoring | Rodrigo Oliver et.al | paper | - | - |
2025-7-9 | Adversarial Defenses via Vector Quantization | Zhiyi Dong et.al | paper | code | <summary>detail</summary>This is the author-accepted version of our paper published in Neurocomputing |
2025-7-9 | Semi-fragile watermarking of remote sensing images using DWT, vector quantization and automatic tiling | Jordi Serra-Ruiz et.al | paper | - | - |
2025-7-9 | VQ-SGen: A Vector Quantized Stroke Representation for Creative Sketch Generation | Jiawei Wang et.al | paper | code | <summary>detail</summary>Project Page: https://enigma-li |
2025-7-8 | EdgeCodec: Onboard Lightweight High Fidelity Neural Compressor with Residual Vector Quantization | Benjamin Hodo et.al | paper | - | <summary>detail</summary>7 Pages |
2025-7-2 | Unsupervised Panoptic Interpretation of Latent Spaces in GANs Using Space-Filling Vector Quantization | Mohammad Hassan Vali et.al | paper | - | - |
2025-7-1 | Hierarchical Patch Compression for ColPali: Efficient Multi-Vector Document Retrieval with Dynamic Pruning and Quantization | Duong Bach et.al | paper | code | - |
2025-7-1 | VQ-VLA: Improving Vision-Language-Action Models via Scaling Vector-Quantized Action Tokenizers | Yating Wang et.al | paper | code | <summary>detail</summary>Accepted by ICCV 2025 |
2025-6-30 | VQ-LLM: High-performance Code Generation for Vector Quantization Augmented LLM Inference | Zihan Liu et.al | paper | - | - |
2025-6-28 | Hierarchical Characterization of Brain Dynamics via State Space-based Vector Quantization | Yanwu Yang et.al | paper | - | - |
2025-6-26 | PCDVQ: Enhancing Vector Quantization for Large Language Models via Polar Coordinate Decoupling | Yuxuan Yue et.al | paper | - | - |
2025-6-24 | AnTKV: Anchor Token-Aware Sub-Bit Vector Quantization for KV Cache in Large Language Models | Zeyu Li et.al | paper | - | - |
2025-6-23 | CommVQ: Commutative Vector Quantization for KV Cache Compression | Junyan Li et.al | paper | code | <summary>detail</summary>ICML 2025 poster |
2025-6-21 | StainPIDR: A Pathological Image Decouplingand Reconstruction Method for Stain Normalization Based on Color Vector Quantization and Structure Restaining | Zheng Chen et.al | paper | - | - |
2025-6-17 | Enhancing Vector Quantization with Distributional Matching: A Theoretical and Empirical Study | Xianghong Fang et.al | paper | - | - |
2025-6-11 | STAR: Learning Diverse Robot Skill Abstractions through Rotation-Augmented Vector Quantization | Hao Li et.al | paper | - | <summary>detail</summary>Accepted by ICML 2025 Spotlight |
2025-6-8 | Vector-Quantized Vision Foundation Models for Object-Centric Learning | Rongzhen Zhao et.al | paper | code | - |
2025-6-5 | Kernel $k$-Medoids as General Vector Quantization | Thore Gerlach et.al | paper | - | - |
2025-6-3 | A$^2$ATS: Retrieval-Based KV Cache Reduction via Windowed Rotary Position Embedding and Query-Aware Vector Quantization | Junhui He et.al | paper | - | - |
2025-6-2 | Efficient Generative Modeling with Residual Vector Quantization-Based Tokens | Jaehyeon Kim et.al | paper | - | <summary>detail</summary>ICML 2025 |
2025-5-31 | Concept-Centric Token Interpretation for Vector-Quantized Generative Models | Tianze Yang et.al | paper | code | - |
2025-5-27 | Autoregressive Speech Synthesis without Vector Quantization | Lingwei Meng et.al | paper | code | <summary>detail</summary>ACL 2025 Main |
2025-5-23 | NSNQuant: A Double Normalization Approach for Calibration-Free Low-Bit Vector Quantization of KV Cache | Donghyun Son et.al | paper | - | - |
2025-5-19 | BrainECHO: Semantic Brain Signal Decoding through Vector-Quantized Spectrogram Reconstruction for Whisper-Enhanced Text Generation | Jilong Li et.al | paper | - | - |