Daily Papers
- Defect Detection
- Defect Segmentation
- Anomaly Detection
- 3D Anomaly Detection
- Multimodal Anomaly Detection
- Vector Quantization
Updated on 2025.09.04
Defect Detection
Date | Title | Authors | Code | Comments | |
---|---|---|---|---|---|
2025-9-3 | Joint Training of Image Generator and Detector for Road Defect Detection | Kuan-Chuan Peng et.al | paper | - | <summary>detail</summary>This paper is accepted to ICCV 2025 Workshop on Representation Learning with Very Limited Resources: When Data |
2025-9-2 | Resilient Multimodal Industrial Surface Defect Detection with Uncertain Sensors Availability | Shuai Jiang et.al | paper | code | <summary>detail</summary>IEEE/ASME Transactions on Mechatronics |
2025-9-1 | TransMatch: A Transfer-Learning Framework for Defect Detection in Laser Powder Bed Fusion Additive Manufacturing | Mohsen Asghari Ilani et.al | paper | - | - |
2025-8-31 | Surface Defect Detection with Gabor Filter Using Reconstruction-Based Blurring U-Net-ViT | Jongwook Si et.al | paper | - | - |
2025-8-26 | No Label Left Behind: A Unified Surface Defect Detection Model for all Supervision Regimes | Blaž Rolih et.al | paper | code | <summary>detail</summary>Accepted by The Journal of Intelligent Manufacturing |
2025-8-22 | A Lightweight Group Multiscale Bidirectional Interactive Network for Real-Time Steel Surface Defect Detection | Yong Zhang et.al | paper | code | - |
2025-8-15 | Defects4Log: Benchmarking LLMs for Logging Code Defect Detection and Reasoning | Xin Wang et.al | paper | - | - |
2025-8-14 | A Segmentation-driven Editing Method for Bolt Defect Augmentation and Detection | Yangjie Xiao et.al | paper | code | - |
2025-8-12 | Masked Autoencoder Self Pre-Training for Defect Detection in Microelectronics | Nikolai Röhrich et.al | paper | - | - |
2025-8-10 | A Steel Surface Defect Detection Method Based on Lightweight Convolution Optimization | Cong Chen et.al | paper | code | <summary>detail</summary>This is a preprint of an article accepted for publication in the International Journal of Advanced Computer Science and Applications (IJACSA) |
2025-8-8 | Advancing Welding Defect Detection in Maritime Operations via Adapt-WeldNet and Defect Detection Interpretability Analysis | Kamal Basha S et.al | paper | - | - |
2025-8-6 | MultiADS: Defect-aware Supervision for Multi-type Anomaly Detection and Segmentation in Zero-Shot Learning | Ylli Sadikaj et.al | paper | - | - |
2025-8-2 | NATLM: Detecting Defects in NFT Smart Contracts Leveraging LLM | Yuanzheng Niu et.al | paper | - | - |
2025-7-29 | Enhancing Glass Defect Detection with Diffusion Models: Addressing Imbalanced Datasets in Manufacturing Quality Control | Sajjad Rezvani Boroujeni et.al | paper | - | - |
2025-7-22 | Multi-Scale PCB Defect Detection with YOLOv8 Network Improved via Pruning and Lightweight Network | Li Pingzhen et.al | paper | - | - |
2025-7-21 | RoadFusion: Latent Diffusion Model for Pavement Defect Detection | Muhammad Aqeel et.al | paper | - | <summary>detail</summary>ICIAP 2025 |
2025-7-21 | ExDD: Explicit Dual Distribution Learning for Surface Defect Detection via Diffusion Synthesis | Muhammad Aqeel et.al | paper | - | <summary>detail</summary>ICIAP 2025 |
2025-7-15 | A Comprehensive Survey for Real-World Industrial Defect Detection: Challenges, Approaches, and Prospects | Yuqi Cheng et.al | paper | - | - |
2025-7-14 | Advancing Automatic Photovoltaic Defect Detection using Semi-Supervised Semantic Segmentation of Electroluminescence Images | Abhishek Jha et.al | paper | code | - |
2025-7-10 | NexViTAD: Few-shot Unsupervised Cross-Domain Defect Detection via Vision Foundation Models and Multi-Task Learning | Tianwei Mu et.al | paper | - | - |
2025-7-10 | Towards High-Resolution 3D Anomaly Detection: A Scalable Dataset and Real-Time Framework for Subtle Industrial Defects | Yuqi Cheng et.al | paper | - | - |
2025-7-7 | Semi-Supervised Defect Detection via Conditional Diffusion and CLIP-Guided Noise Filtering | Shuai Li et.al | paper | code | - |
2025-7-4 | MRC-DETR: An Adaptive Multi-Residual Coupled Transformer for Bare Board PCB Defect Detection | Jiangzhong Cao et.al | paper | - | - |
2025-6-30 | VR-YOLO: Enhancing PCB Defect Detection with Viewpoint Robustness Based on YOLO | Hengyi Zhu et.al | paper | - | - |
2025-6-26 | YOLO-FDA: Integrating Hierarchical Attention and Detail Enhancement for Surface Defect Detection | Jiawei Hu et.al | paper | - | - |
Defect Segmentation
Date | Title | Authors | Code | Comments | |
---|---|---|---|---|---|
2025-8-14 | A Segmentation-driven Editing Method for Bolt Defect Augmentation and Detection | Yangjie Xiao et.al | paper | code | - |
2025-8-11 | KARMA: Efficient Structural Defect Segmentation via Kolmogorov-Arnold Representation Learning | Md Meftahul Ferdaus et.al | paper | code | <summary>detail</summary>submitted to IEEE Transactions on Pattern Analysis and Machine Intelligence |
2025-8-6 | MultiADS: Defect-aware Supervision for Multi-type Anomaly Detection and Segmentation in Zero-Shot Learning | Ylli Sadikaj et.al | paper | - | - |
2025-7-23 | Exploring Active Learning for Semiconductor Defect Segmentation | Lile Cai et.al | paper | - | <summary>detail</summary>accepted to ICIP 2022 |
2025-7-14 | Advancing Automatic Photovoltaic Defect Detection using Semi-Supervised Semantic Segmentation of Electroluminescence Images | Abhishek Jha et.al | paper | code | - |
2025-6-28 | Region-Aware CAM: High-Resolution Weakly-Supervised Defect Segmentation via Salient Region Perception | Hang-Cheng Dong et.al | paper | - | - |
2025-6-24 | Evolutionary computing-based image segmentation method to detect defects and features in Additive Friction Stir Deposition Process | Akshansh Mishra et.al | paper | - | - |
2025-6-17 | synth-dacl: Does Synthetic Defect Data Enhance Segmentation Accuracy and Robustness for Real-World Bridge Inspections? | Johannes Flotzinger et.al | paper | - | - |
2025-4-24 | Conformal Segmentation in Industrial Surface Defect Detection with Statistical Guarantees | Cheng Shen et.al | paper | - | <summary>detail</summary>Under Review |
2025-4-11 | Weakly Supervised Panoptic Segmentation for Defect-Based Grading of Fresh Produce | Manuel Knott et.al | paper | code | <summary>detail</summary>Accepted as a paper to the 6th International Workshop on Agriculture-Vision: Challenges & Opportunities for Computer Vision in Agriculture in conjunction with IEEE/CVF CVPR 2025 |
2025-2-11 | Exploring Few-Shot Defect Segmentation in General Industrial Scenarios with Metric Learning and Vision Foundation Models | Tongkun Liu et.al | paper | code | - |
2025-1-23 | Effective Defect Detection Using Instance Segmentation for NDI | Ashiqur Rahman et.al | paper | code | - |
2025-1-17 | Multi-Modal Attention Networks for Enhanced Segmentation and Depth Estimation of Subsurface Defects in Pulse Thermography | Mohammed Salah et.al | paper | - | <summary>detail</summary>Pulse thermography |
2024-10-24 | Synth4Seg – Learning Defect Data Synthesis for Defect Segmentation using Bi-level Optimization | Shancong Mou et.al | paper | - | - |
2024-10-1 | Application of Segment Anything Model for Civil Infrastructure Defect Assessment | Mohsen Ahmadi et.al | paper | - | - |
2024-9-20 | Cycle-Consistency Uncertainty Estimation for Visual Prompting based One-Shot Defect Segmentation | Geonuk Kim et.al | paper | - | <summary>detail</summary>ECCV 2024 VISION workshop Most Innovative Prize |
2024-8-31 | Change-Aware Siamese Network for Surface Defects Segmentation under Complex Background | Biyuan Liu et.al | paper | - | - |
2024-8-19 | Imbalance-Aware Culvert-Sewer Defect Segmentation Using an Enhanced Feature Pyramid Network | Rasha Alshawi et.al | paper | - | - |
2024-8-19 | Dynamic Label Injection for Imbalanced Industrial Defect Segmentation | Emanuele Caruso et.al | paper | code | <summary>detail</summary>ECCV 2024 VISION Workshop |
2024-6-26 | An unsupervised approach towards promptable defect segmentation in laser-based additive manufacturing by Segment Anything | Israt Zarin Era et.al | paper | - | - |
2024-4-17 | CarcassFormer: An End-to-end Transformer-based Framework for Simultaneous Localization, Segmentation and Classification of Poultry Carcass Defect | Minh Tran et.al | paper | code | <summary>detail</summary>Poultry Science Journal |
2024-3-17 | LERENet: Eliminating Intra-class Differences for Metal Surface Defect Few-shot Semantic Segmentation | Hanze Ding et.al | paper | - | - |
2024-2-6 | Multi-class Road Defect Detection and Segmentation using Spatial and Channel-wise Attention for Autonomous Road Repairing | Jongmin Yu et.al | paper | - | <summary>detail</summary>the ICRA 2024 |
2023-12-21 | Dual Attention U-Net with Feature Infusion: Pushing the Boundaries of Multiclass Defect Segmentation | Rasha Alshawi et.al | paper | - | <summary>detail</summary>under review in IEEE Transactions on Artificial Intelligence |
2023-12-8 | Continual learning for surface defect segmentation by subnetwork creation and selection | Aleksandr Dekhovich et.al | paper | - | - |
Anomaly Detection
Date | Title | Authors | Code | Comments | |
---|---|---|---|---|---|
2025-9-3 | Machine Learning-Driven Anomaly Detection for 5G O-RAN Performance Metrics | Babak Azkaei et.al | paper | - | - |
2025-9-3 | PointAD+: Learning Hierarchical Representations for Zero-shot 3D Anomaly Detection | Qihang Zhou et.al | paper | - | <summary>detail</summary>Submitted to TPAMI |
2025-9-3 | Sequential keypoint density estimator: an overlooked baseline of skeleton-based video anomaly detection | Anja Delić et.al | paper | - | <summary>detail</summary>ICCV 2025 Highlight |
2025-9-2 | Learning local and global prototypes with optimal transport for unsupervised anomaly detection and localization | Robin Trombetta et.al | paper | - | - |
2025-9-2 | SALAD – Semantics-Aware Logical Anomaly Detection | Matic Fučka et.al | paper | code | <summary>detail</summary>ICCV 2025 |
2025-9-1 | Robust Anomaly Detection through Multi-Modal Autoencoder Fusion for Small Vehicle Damage Detection | Sara Khan et.al | paper | - | - |
2025-9-1 | Unsupervised Identification and Replay-based Detection (UIRD) for New Category Anomaly Detection in ECG Signal | Zhangyue Shi et.al | paper | - | - |
2025-9-1 | Anomaly detection in network flows using unsupervised online machine learning | Alberto Miguel-Diez et.al | paper | - | - |
2025-9-1 | ACD-CLIP: Decoupling Representation and Dynamic Fusion for Zero-Shot Anomaly Detection | Ke Ma et.al | paper | - | - |
2025-9-1 | Simplifying Traffic Anomaly Detection with Video Foundation Models | Svetlana Orlova et.al | paper | code | <summary>detail</summary>ICCVW 2025 accepted |
2025-8-31 | CCE: Confidence-Consistency Evaluation for Time Series Anomaly Detection | Zhijie Zhong et.al | paper | - | - |
2025-8-30 | Tighten The Lasso: A Convex Hull Volume-based Anomaly Detection Method | Uri Itai et.al | paper | - | - |
2025-8-29 | TMUAD: Enhancing Logical Capabilities in Unified Anomaly Detection Models with a Text Memory Bank | Jiawei Liu et.al | paper | code | - |
2025-8-29 | Silent Failures in Stateless Systems: Rethinking Anomaly Detection for Serverless Computing | Chanh Nguyen et.al | paper | - | - |
2025-8-29 | Quantum enhanced ensemble GANs for anomaly detection in continuous biomanufacturing | Rajiv Kailasanathan et.al | paper | - | - |
2025-8-28 | CALM: A Framework for Continuous, Adaptive, and LLM-Mediated Anomaly Detection in Time-Series Streams | Ashok Devireddy et.al | paper | - | - |
2025-8-28 | Automating the Deep Space Network Data Systems; A Case Study in Adaptive Anomaly Detection through Agentic AI | Evan J. Chou et.al | paper | - | <summary>detail</summary>ACM Class:I |
2025-8-28 | RANGAN: GAN-empowered Anomaly Detection in 5G Cloud RAN | Douglas Liao et.al | paper | - | <summary>detail</summary>Accepted for presentation in the 2025 IEEE Conference on Standards for Communications and Networking (CSCN) |
2025-8-28 | ATM-GAD: Adaptive Temporal Motif Graph Anomaly Detection for Financial Transaction Networks | Zeyue Zhang et.al | paper | - | - |
2025-8-28 | IAENet: An Importance-Aware Ensemble Model for 3D Point Cloud-Based Anomaly Detection | Xuanming Cao et.al | paper | - | - |
2025-8-27 | Anomaly Detection in Networked Bandits | Xiaotong Cheng et.al | paper | - | - |
2025-8-27 | Context-Aware Zero-Shot Anomaly Detection in Surveillance Using Contrastive and Predictive Spatiotemporal Modeling | Md. Rashid Shahriar Khan et.al | paper | code | - |
2025-8-27 | Topological Uncertainty for Anomaly Detection in the Neural-network EoS Inference with Neutron Star Data | Kenji Fukushima et.al | paper | - | - |
2025-8-27 | DNP-Guided Contrastive Reconstruction with a Reverse Distillation Transformer for Medical Anomaly Detection | Luhu Li et.al | paper | - | - |
2025-8-26 | CITADEL: Continual Anomaly Detection for Enhanced Learning in IoT Intrusion Detection | Elvin Li et.al | paper | - | <summary>detail</summary>Under review at IEEE IoTJ |
3D Anomaly Detection
Date | Title | Authors | Code | Comments | |
---|---|---|---|---|---|
2025-9-3 | PointAD+: Learning Hierarchical Representations for Zero-shot 3D Anomaly Detection | Qihang Zhou et.al | paper | - | <summary>detail</summary>Submitted to TPAMI |
2025-8-28 | IAENet: An Importance-Aware Ensemble Model for 3D Point Cloud-Based Anomaly Detection | Xuanming Cao et.al | paper | - | - |
2025-8-19 | Stereo-based 3D Anomaly Object Detection for Autonomous Driving: A New Dataset and Baseline | Shiyi Mu et.al | paper | code.) | <summary>detail</summary>under review |
2025-8-2 | C3D-AD: Toward Continual 3D Anomaly Detection via Kernel Attention with Learnable Advisor | Haoquan Lu et.al | paper | code | <summary>detail</summary>We have provided the code for C3D-AD with checkpoints and BASELINE at this link: https://github |
2025-8-1 | SiM3D: Single-instance Multiview Multimodal and Multisetup 3D Anomaly Detection Benchmark | Alex Costanzino et.al | paper | code | <summary>detail</summary>ICCV 2025 |
2025-8-1 | HyPCV-Former: Hyperbolic Spatio-Temporal Transformer for 3D Point Cloud Video Anomaly Detection | Jiaping Cao et.al | paper | - | - |
2025-7-29 | Multi-View Reconstruction with Global Context for 3D Anomaly Detection | Yihan Sun et.al | paper | - | - |
2025-7-27 | Position: Untrained Machine Learning for Anomaly Detection by using 3D Point Cloud Data | Juan Du et.al | paper | - | - |
2025-7-25 | BridgeNet: A Unified Multimodal Framework for Bridging 2D and 3D Industrial Anomaly Detection | An Xiang et.al | paper | code | - |
2025-7-24 | MC3D-AD: A Unified Geometry-aware Reconstruction Model for Multi-category 3D Anomaly Detection | Jiayi Cheng et.al | paper | code | - |
2025-7-17 | 3DKeyAD: High-Resolution 3D Point Cloud Anomaly Detection via Keypoint-Guided Point Clustering | Zi Wang et.al | paper | - | - |
2025-7-10 | 3D-ADAM: A Dataset for 3D Anomaly Detection in Advanced Manufacturing | Paul McHard et.al | paper | - | - |
2025-7-10 | Towards High-Resolution 3D Anomaly Detection: A Scalable Dataset and Real-Time Framework for Subtle Industrial Defects | Yuqi Cheng et.al | paper | - | - |
2025-7-5 | Taming Anomalies with Down-Up Sampling Networks: Group Center Preserving Reconstruction for 3D Anomaly Detection | Hanzhe Liang et.al | paper | - | <summary>detail</summary>ACM MM25 Accepted |
2025-6-3 | DAS3D: Dual-modality Anomaly Synthesis for 3D Anomaly Detection | Kecen Li et.al | paper | code | <summary>detail</summary>Code available at https://github |
2025-5-27 | Mentor3AD: Feature Reconstruction-based 3D Anomaly Detection via Multi-modality Mentor Learning | Hanzhe Liang et.al | paper | - | <summary>detail</summary>arXiv admin comment: This version has been removed by arXiv administrators as the submitter did not have the rights to agree to the license at the time of submission |
2025-5-15 | Examining the Source of Defects from a Mechanical Perspective for 3D Anomaly Detection | Hanzhe Liang et.al | paper | code | - |
2025-4-19 | Real-IAD D3: A Real-World 2D/Pseudo-3D/3D Dataset for Industrial Anomaly Detection | Wenbing Zhu et.al | paper | code | - |
2025-4-7 | IterMask3D: Unsupervised Anomaly Detection and Segmentation with Test-Time Iterative Mask Refinement in 3D Brain MR | Ziyun Liang et.al | paper | - | - |
2025-3-30 | Self-Supervised Masked Mesh Learning for Unsupervised Anomaly Detection on 3D Cortical Surfaces | Hao-Chun Yang et.al | paper | - | - |
2025-3-21 | A Survey on RGB, 3D, and Multimodal Approaches for Unsupervised Industrial Image Anomaly Detection | Yuxuan Lin et.al | paper | code | <summary>detail</summary>Accepted by Information Fusion |
2025-3-10 | Look Inside for More: Internal Spatial Modality Perception for 3D Anomaly Detection | Hanzhe Liang et.al | paper | - | <summary>detail</summary>AAAI2025 Poster |
2025-3-3 | Fence Theorem: Towards Dual-Objective Semantic-Structure Isolation in Preprocessing Phase for 3D Anomaly Detection | Hanzhe Liang et.al | paper | - | - |
2025-2-16 | Exploiting Point-Language Models with Dual-Prompts for 3D Anomaly Detection | Jiaxiang Wang et.al | paper | - | - |
2025-2-9 | A 3D Multimodal Feature for Infrastructure Anomaly Detection | Yixiong Jing et.al | paper | code | - |
Multimodal Anomaly Detection
Date | Title | Authors | Code | Comments | |
---|---|---|---|---|---|
2025-8-20 | PB-IAD: Utilizing multimodal foundation models for semantic industrial anomaly detection in dynamic manufacturing environments | Bernd Hofmann et.al | paper | - | - |
2025-8-6 | AD-FM: Multimodal LLMs for Anomaly Detection via Multi-Stage Reasoning and Fine-Grained Reward Optimization | Jingyi Liao et.al | paper | - | - |
2025-8-1 | SiM3D: Single-instance Multiview Multimodal and Multisetup 3D Anomaly Detection Benchmark | Alex Costanzino et.al | paper | code | <summary>detail</summary>ICCV 2025 |
2025-7-25 | BridgeNet: A Unified Multimodal Framework for Bridging 2D and 3D Industrial Anomaly Detection | An Xiang et.al | paper | code | - |
2025-7-25 | Tuned Reverse Distillation: Enhancing Multimodal Industrial Anomaly Detection with Crossmodal Tuners | Xinyue Liu et.al | paper | code | - |
2025-7-23 | HiProbe-VAD: Video Anomaly Detection via Hidden States Probing in Tuning-Free Multimodal LLMs | Zhaolin Cai et.al | paper | - | <summary>detail</summary>Accepted by ACM MM 2025 |
2025-6-23 | Multimodal Anomaly Detection with a Mixture-of-Experts | Christoph Willibald et.al | paper | - | - |
2025-6-20 | When Every Millisecond Counts: Real-Time Anomaly Detection via the Multimodal Asynchronous Hybrid Network | Dong Xiao et.al | paper | - | <summary>detail</summary>ICML 2025 Spotlight |
2025-6-4 | MemoryOut: Learning Principal Features via Multimodal Sparse Filtering Network for Semi-supervised Video Anomaly Detection | Juntong Li et.al | paper | code | - |
2025-5-28 | OmniAD: Detect and Understand Industrial Anomaly via Multimodal Reasoning | Shifang Zhao et.al | paper | - | - |
2025-5-19 | Multimodal RAG-driven Anomaly Detection and Classification in Laser Powder Bed Fusion using Large Language Models | Kiarash Naghavi Khanghah et.al | paper | - | <summary>detail</summary>ASME 2025 International Design Engineering Technical Conferences and Computers and Information in Engineering Conference IDETC/CIE2025 |
2025-5-8 | Uncertainty-Weighted Image-Event Multimodal Fusion for Video Anomaly Detection | Sungheon Jeong et.al | paper | code | - |
2025-4-17 | LAD-Reasoner: Tiny Multimodal Models are Good Reasoners for Logical Anomaly Detection | Weijia Li et.al | paper | - | - |
2025-3-21 | A Survey on RGB, 3D, and Multimodal Approaches for Unsupervised Industrial Image Anomaly Detection | Yuxuan Lin et.al | paper | code | <summary>detail</summary>Accepted by Information Fusion |
2025-3-17 | Towards Zero-Shot Anomaly Detection and Reasoning with Multimodal Large Language Models | Jiacong Xu et.al | paper | code | - |
2025-2-24 | Can Multimodal LLMs Perform Time Series Anomaly Detection? | Xiongxiao Xu et.al | paper | code | - |
2025-2-20 | MMAD: A Comprehensive Benchmark for Multimodal Large Language Models in Industrial Anomaly Detection | Xi Jiang et.al | paper | code | <summary>detail</summary>Accepted by ICLR 2025 |
2025-2-18 | Anomaly Detection in Smart Power Grids with Graph-Regularized MS-SVDD: a Multimodal Subspace Learning Approach | Thomas Debelle et.al | paper | - | - |
2025-2-10 | Multimodal Task Representation Memory Bank vs. Catastrophic Forgetting in Anomaly Detection | You Zhou et.al | paper | - | - |
2025-2-9 | A 3D Multimodal Feature for Infrastructure Anomaly Detection | Yixiong Jing et.al | paper | code | - |
2025-1-27 | Can Multimodal Large Language Models be Guided to Improve Industrial Anomaly Detection? | Zhiling Chen et.al | paper | - | - |
2025-1-17 | Myriad: Large Multimodal Model by Applying Vision Experts for Industrial Anomaly Detection | Yuanze Li et.al | paper | code | - |
2024-12-23 | Revisiting Multimodal Fusion for 3D Anomaly Detection from an Architectural Perspective | Kaifang Long et.al | paper | - | - |
2024-9-30 | VMAD: Visual-enhanced Multimodal Large Language Model for Zero-Shot Anomaly Detection | Huilin Deng et.al | paper | - | - |
2024-9-26 | AnoVox: A Benchmark for Multimodal Anomaly Detection in Autonomous Driving | Daniel Bogdoll et.al | paper | - | <summary>detail</summary>Daniel Bogdoll |
Vector Quantization
Date | Title | Authors | Code | Comments | |
---|---|---|---|---|---|
2025-8-25 | Scene-Aware Vectorized Memory Multi-Agent Framework with Cross-Modal Differentiated Quantization VLMs for Visually Impaired Assistance | Xiangxiang Wang et.al | paper | - | - |
2025-8-23 | VQL: An End-to-End Context-Aware Vector Quantization Attention for Ultra-Long User Behavior Modeling | Kaiyuan Li et.al | paper | - | - |
2025-8-20 | Making Pose Representations More Expressive and Disentangled via Residual Vector Quantization | Sukhyun Jeong et.al | paper | - | - |
2025-8-19 | Vector-Quantized Vision Foundation Models for Object-Centric Learning | Rongzhen Zhao et.al | paper | code | <summary>detail</summary>Accepted by ACM MM 2025 |
2025-8-8 | Graph is a Natural Regularization: Revisiting Vector Quantization for Graph Representation Learning | Zian Zhai et.al | paper | - | - |
2025-8-7 | Vector Quantized-Elites: Unsupervised and Problem-Agnostic Quality-Diversity Optimization | Constantinos Tsakonas et.al | paper | - | - |
2025-8-7 | Task Vector Quantization for Memory-Efficient Model Merging | Youngeun Kim et.al | paper | - | - |
2025-8-5 | CIVQLLIE: Causal Intervention with Vector Quantization for Low-Light Image Enhancement | Tongshun Zhang et.al | paper | - | - |
2025-8-5 | BrainECHO: Semantic Brain Signal Decoding through Vector-Quantized Spectrogram Reconstruction for Whisper-Enhanced Text Generation | Jilong Li et.al | paper | - | - |
2025-8-3 | SSVQ: Unleashing the Potential of Vector Quantization with Sign-Splitting | Shuaiting Li et.al | paper | code | <summary>detail</summary>ICCV’25 camera ready |
2025-7-31 | VQ-DeepISC: Vector Quantized-Enabled Digital Semantic Communication with Channel Adaptive Image Transmission | Jianqiao Chen et.al | paper | - | - |
2025-7-31 | Optimal and Near-Optimal Adaptive Vector Quantization | Ran Ben-Basat et.al | paper | - | - |
2025-7-30 | ViM-VQ: Efficient Post-Training Vector Quantization for Visual Mamba | Juncan Deng et.al | paper | - | - |
2025-7-30 | Addressing Representation Collapse in Vector Quantized Models with One Linear Layer | Yongxin Zhu et.al | paper | - | <summary>detail</summary>ICCV2025 |
2025-7-26 | All-in-One Medical Image Restoration with Latent Diffusion-Enhanced Vector-Quantized Codebook Prior | Haowei Chen et.al | paper | - | - |
2025-7-20 | Vector Quantization Prompting for Continual Learning | Li Jiao et.al | paper | code | <summary>detail</summary>Accepted by NeurIPS 2024 |
2025-7-14 | A Vector-Quantized Foundation Model for Patient Behavior Monitoring | Rodrigo Oliver et.al | paper | - | - |
2025-7-9 | Adversarial Defenses via Vector Quantization | Zhiyi Dong et.al | paper | code | <summary>detail</summary>This is the author-accepted version of our paper published in Neurocomputing |
2025-7-9 | Semi-fragile watermarking of remote sensing images using DWT, vector quantization and automatic tiling | Jordi Serra-Ruiz et.al | paper | - | - |
2025-7-9 | VQ-SGen: A Vector Quantized Stroke Representation for Creative Sketch Generation | Jiawei Wang et.al | paper | code | <summary>detail</summary>Project Page: https://enigma-li |
2025-7-8 | EdgeCodec: Onboard Lightweight High Fidelity Neural Compressor with Residual Vector Quantization | Benjamin Hodo et.al | paper | - | <summary>detail</summary>7 Pages |
2025-7-2 | Unsupervised Panoptic Interpretation of Latent Spaces in GANs Using Space-Filling Vector Quantization | Mohammad Hassan Vali et.al | paper | - | - |
2025-7-1 | Hierarchical Patch Compression for ColPali: Efficient Multi-Vector Document Retrieval with Dynamic Pruning and Quantization | Duong Bach et.al | paper | code | - |
2025-7-1 | VQ-VLA: Improving Vision-Language-Action Models via Scaling Vector-Quantized Action Tokenizers | Yating Wang et.al | paper | code | <summary>detail</summary>Accepted by ICCV 2025 |
2025-6-30 | VQ-LLM: High-performance Code Generation for Vector Quantization Augmented LLM Inference | Zihan Liu et.al | paper | - | - |