Daily Papers
- Defect Detection
- Defect Segmentation
- Anomaly Detection
- 3D Anomaly Detection
- Multimodal Anomaly Detection
- Vector Quantization
Updated on 2025.08.13
Defect Detection
Date | Title | Authors | Code | Comments | |
---|---|---|---|---|---|
2025-8-12 | Masked Autoencoder Self Pre-Training for Defect Detection in Microelectronics | Nikolai Röhrich et.al | paper | - | - |
2025-8-10 | A Steel Surface Defect Detection Method Based on Lightweight Convolution Optimization | Cong Chen et.al | paper | code | <summary>detail</summary>This is a preprint of an article accepted for publication in the International Journal of Advanced Computer Science and Applications (IJACSA) |
2025-8-8 | Advancing Welding Defect Detection in Maritime Operations via Adapt-WeldNet and Defect Detection Interpretability Analysis | Kamal Basha S et.al | paper | - | - |
2025-8-6 | MultiADS: Defect-aware Supervision for Multi-type Anomaly Detection and Segmentation in Zero-Shot Learning | Ylli Sadikaj et.al | paper | - | - |
2025-8-2 | NATLM: Detecting Defects in NFT Smart Contracts Leveraging LLM | Yuanzheng Niu et.al | paper | - | - |
2025-7-29 | Enhancing Glass Defect Detection with Diffusion Models: Addressing Imbalanced Datasets in Manufacturing Quality Control | Sajjad Rezvani Boroujeni et.al | paper | - | - |
2025-7-22 | Multi-Scale PCB Defect Detection with YOLOv8 Network Improved via Pruning and Lightweight Network | Li Pingzhen et.al | paper | - | - |
2025-7-21 | RoadFusion: Latent Diffusion Model for Pavement Defect Detection | Muhammad Aqeel et.al | paper | - | <summary>detail</summary>ICIAP 2025 |
2025-7-21 | ExDD: Explicit Dual Distribution Learning for Surface Defect Detection via Diffusion Synthesis | Muhammad Aqeel et.al | paper | - | <summary>detail</summary>ICIAP 2025 |
2025-7-15 | A Comprehensive Survey for Real-World Industrial Defect Detection: Challenges, Approaches, and Prospects | Yuqi Cheng et.al | paper | - | - |
2025-7-14 | Advancing Automatic Photovoltaic Defect Detection using Semi-Supervised Semantic Segmentation of Electroluminescence Images | Abhishek Jha et.al | paper | code | - |
2025-7-10 | NexViTAD: Few-shot Unsupervised Cross-Domain Defect Detection via Vision Foundation Models and Multi-Task Learning | Tianwei Mu et.al | paper | - | - |
2025-7-10 | Towards High-Resolution 3D Anomaly Detection: A Scalable Dataset and Real-Time Framework for Subtle Industrial Defects | Yuqi Cheng et.al | paper | - | - |
2025-7-7 | Semi-Supervised Defect Detection via Conditional Diffusion and CLIP-Guided Noise Filtering | Shuai Li et.al | paper | code | - |
2025-7-4 | MRC-DETR: An Adaptive Multi-Residual Coupled Transformer for Bare Board PCB Defect Detection | Jiangzhong Cao et.al | paper | - | - |
2025-6-30 | VR-YOLO: Enhancing PCB Defect Detection with Viewpoint Robustness Based on YOLO | Hengyi Zhu et.al | paper | - | - |
2025-6-26 | YOLO-FDA: Integrating Hierarchical Attention and Detail Enhancement for Surface Defect Detection | Jiawei Hu et.al | paper | - | - |
2025-6-24 | Evolutionary computing-based image segmentation method to detect defects and features in Additive Friction Stir Deposition Process | Akshansh Mishra et.al | paper | - | - |
2025-6-20 | From Lab to Factory: Pitfalls and Guidelines for Self-/Unsupervised Defect Detection on Low-Quality Industrial Images | Sebastian Hönel et.al | paper | - | - |
2025-6-16 | ESRPCB: an Edge guided Super-Resolution model and Ensemble learning for tiny Printed Circuit Board Defect detection | Xiem HoangVan et.al | paper | - | <summary>detail</summary>Published in Engineering Applications of Artificial Intelligence |
2025-6-12 | Deep Learning-based Multi Project InP Wafer Simulation for Unsupervised Surface Defect Detection | Emílio Dolgener Cantú et.al | paper | - | - |
2025-5-23 | Research on Defect Detection Method of Motor Control Board Based on Image Processing | Jingde Huang et.al | paper | - | - |
2025-5-15 | Examining the Source of Defects from a Mechanical Perspective for 3D Anomaly Detection | Hanzhe Liang et.al | paper | code | - |
2025-5-15 | Defect Detection in Photolithographic Patterns Using Deep Learning Models Trained on Synthetic Data | Prashant P. Shinde et.al | paper | - | - |
2025-5-12 | Self-Adaptive Gamma Context-Aware SSM-based Model for Metal Defect Detection | Sijin Sun et.al | paper | - | - |
Defect Segmentation
Date | Title | Authors | Code | Comments | |
---|---|---|---|---|---|
2025-8-11 | KARMA: Efficient Structural Defect Segmentation via Kolmogorov-Arnold Representation Learning | Md Meftahul Ferdaus et.al | paper | code | <summary>detail</summary>submitted to IEEE Transactions on Pattern Analysis and Machine Intelligence |
2025-8-6 | MultiADS: Defect-aware Supervision for Multi-type Anomaly Detection and Segmentation in Zero-Shot Learning | Ylli Sadikaj et.al | paper | - | - |
2025-7-23 | Exploring Active Learning for Semiconductor Defect Segmentation | Lile Cai et.al | paper | - | <summary>detail</summary>accepted to ICIP 2022 |
2025-7-14 | Advancing Automatic Photovoltaic Defect Detection using Semi-Supervised Semantic Segmentation of Electroluminescence Images | Abhishek Jha et.al | paper | code | - |
2025-6-28 | Region-Aware CAM: High-Resolution Weakly-Supervised Defect Segmentation via Salient Region Perception | Hang-Cheng Dong et.al | paper | - | - |
2025-6-24 | Evolutionary computing-based image segmentation method to detect defects and features in Additive Friction Stir Deposition Process | Akshansh Mishra et.al | paper | - | - |
2025-6-17 | synth-dacl: Does Synthetic Defect Data Enhance Segmentation Accuracy and Robustness for Real-World Bridge Inspections? | Johannes Flotzinger et.al | paper | - | - |
2025-4-24 | Conformal Segmentation in Industrial Surface Defect Detection with Statistical Guarantees | Cheng Shen et.al | paper | - | <summary>detail</summary>Under Review |
2025-4-11 | Weakly Supervised Panoptic Segmentation for Defect-Based Grading of Fresh Produce | Manuel Knott et.al | paper | code | <summary>detail</summary>Accepted as a paper to the 6th International Workshop on Agriculture-Vision: Challenges & Opportunities for Computer Vision in Agriculture in conjunction with IEEE/CVF CVPR 2025 |
2025-2-11 | Exploring Few-Shot Defect Segmentation in General Industrial Scenarios with Metric Learning and Vision Foundation Models | Tongkun Liu et.al | paper | code | - |
2025-1-23 | Effective Defect Detection Using Instance Segmentation for NDI | Ashiqur Rahman et.al | paper | code | - |
2025-1-17 | Multi-Modal Attention Networks for Enhanced Segmentation and Depth Estimation of Subsurface Defects in Pulse Thermography | Mohammed Salah et.al | paper | - | <summary>detail</summary>Pulse thermography |
2024-10-24 | Synth4Seg – Learning Defect Data Synthesis for Defect Segmentation using Bi-level Optimization | Shancong Mou et.al | paper | - | - |
2024-10-1 | Application of Segment Anything Model for Civil Infrastructure Defect Assessment | Mohsen Ahmadi et.al | paper | - | - |
2024-9-20 | Cycle-Consistency Uncertainty Estimation for Visual Prompting based One-Shot Defect Segmentation | Geonuk Kim et.al | paper | - | <summary>detail</summary>ECCV 2024 VISION workshop Most Innovative Prize |
2024-8-31 | Change-Aware Siamese Network for Surface Defects Segmentation under Complex Background | Biyuan Liu et.al | paper | - | - |
2024-8-19 | Imbalance-Aware Culvert-Sewer Defect Segmentation Using an Enhanced Feature Pyramid Network | Rasha Alshawi et.al | paper | - | - |
2024-8-19 | Dynamic Label Injection for Imbalanced Industrial Defect Segmentation | Emanuele Caruso et.al | paper | code | <summary>detail</summary>ECCV 2024 VISION Workshop |
2024-6-26 | An unsupervised approach towards promptable defect segmentation in laser-based additive manufacturing by Segment Anything | Israt Zarin Era et.al | paper | - | - |
2024-4-17 | CarcassFormer: An End-to-end Transformer-based Framework for Simultaneous Localization, Segmentation and Classification of Poultry Carcass Defect | Minh Tran et.al | paper | code | <summary>detail</summary>Poultry Science Journal |
2024-3-17 | LERENet: Eliminating Intra-class Differences for Metal Surface Defect Few-shot Semantic Segmentation | Hanze Ding et.al | paper | - | - |
2024-2-6 | Multi-class Road Defect Detection and Segmentation using Spatial and Channel-wise Attention for Autonomous Road Repairing | Jongmin Yu et.al | paper | - | <summary>detail</summary>the ICRA 2024 |
2023-12-21 | Dual Attention U-Net with Feature Infusion: Pushing the Boundaries of Multiclass Defect Segmentation | Rasha Alshawi et.al | paper | - | <summary>detail</summary>under review in IEEE Transactions on Artificial Intelligence |
2023-12-8 | Continual learning for surface defect segmentation by subnetwork creation and selection | Aleksandr Dekhovich et.al | paper | - | - |
2023-12-6 | Overhead Line Defect Recognition Based on Unsupervised Semantic Segmentation | Weixi Wang et.al | paper | - | - |
Anomaly Detection
Date | Title | Authors | Code | Comments | |
---|---|---|---|---|---|
2025-8-12 | ALFred: An Active Learning Framework for Real-world Semi-supervised Anomaly Detection with Adaptive Thresholds | Shanle Yao et.al | paper | - | - |
2025-8-12 | Triad: Empowering LMM-based Anomaly Detection with Vision Expert-guided Visual Tokenizer and Manufacturing Process | Yuanze Li et.al | paper | code | - |
2025-8-11 | Generative AI for Critical Infrastructure in Smart Grids: A Unified Framework for Synthetic Data Generation and Anomaly Detection | Aydin Zaboli et.al | paper | - | - |
2025-8-11 | Robust Anomaly Detection in O-RAN: Leveraging LLMs against Data Manipulation Attacks | Thusitha Dayaratne et.al | paper | - | - |
2025-8-11 | Safeguarding Generative AI Applications in Preclinical Imaging through Hybrid Anomaly Detection | Jakub Binda et.al | paper | - | <summary>detail</summary>Journal ref:2025 Conference on Information and Knowledge Management (CIKM) |
2025-8-11 | Architectural Co-Design for Zero-Shot Anomaly Detection: Decoupling Representation and Dynamically Fusing Features in CLIP | Ke Ma et.al | paper | - | - |
2025-8-11 | Crane: Context-Guided Prompt Learning and Attention Refinement for Zero-Shot Anomaly Detection | Alireza Salehi et.al | paper | code | - |
2025-8-11 | AIS-LLM: A Unified Framework for Maritime Trajectory Prediction, Anomaly Detection, and Collision Risk Assessment with Explainable Forecasting | Hyobin Park et.al | paper | - | - |
2025-8-11 | Enhancing Egocentric Object Detection in Static Environments using Graph-based Spatial Anomaly Detection and Correction | Vishakha Lall et.al | paper | - | - |
2025-8-10 | Robust Anomaly Detection in Network Traffic: Evaluating Machine Learning Models on CICIDS2017 | Zhaoyang Xu et.al | paper | - | <summary>detail</summary>submitted to IEEE CNS 2025 |
2025-8-10 | Levarging Learning Bias for Noisy Anomaly Detection | Yuxin Zhang et.al | paper | code | - |
2025-8-10 | Learning Multi-view Anomaly Detection with Efficient Adaptive Selection | Haoyang He et.al | paper | code | <summary>detail</summary>IEEE TRANSACTIONS ON MULTIMEDIA |
2025-8-10 | Self-Navigated Residual Mamba for Universal Industrial Anomaly Detection | Hanxi Li et.al | paper | - | - |
2025-8-10 | Towards Real-World Rumor Detection: Anomaly Detection Framework with Graph Supervised Contrastive Learning | Chaoqun Cui et.al | paper | - | <summary>detail</summary>This paper is accepted by COLING2025 |
2025-8-9 | Statistical Inference for Autoencoder-based Anomaly Detection after Representation Learning-based Domain Adaptation | Tran Tuan Kiet et.al | paper | - | - |
2025-8-8 | Segmented Confidence Sequences and Multi-Scale Adaptive Confidence Segments for Anomaly Detection in Nonstationary Time Series | Muyan Anna Li et.al | paper | - | - |
2025-8-8 | LLM meets ML: Data-efficient Anomaly Detection on Unstable Logs | Fatemeh Hadadi et.al | paper | - | - |
2025-8-8 | Mixture of Experts Guided by Gaussian Splatters Matters: A new Approach to Weakly-Supervised Video Anomaly Detection | Giacomo D’Amicantonio et.al | paper | - | - |
2025-8-8 | AnomalyMoE: Towards a Language-free Generalist Model for Unified Visual Anomaly Detection | Zhaopeng Gu et.al | paper | - | - |
2025-8-8 | Entropy Causal Graphs for Multivariate Time Series Anomaly Detection | Falih Gozi Febrinanto et.al | paper | - | - |
2025-8-7 | SincVAE: A new semi-supervised approach to improve anomaly detection on EEG data using SincNet and variational autoencoder | Andrea Pollastro et.al | paper | - | <summary>detail</summary>Journal ref:Computer Methods and Programs in Biomedicine Update |
2025-8-7 | AutoIAD: Manager-Driven Multi-Agent Collaboration for Automated Industrial Anomaly Detection | Dongwei Ji et.al | paper | - | - |
2025-8-7 | How and Why: Taming Flow Matching for Unsupervised Anomaly Detection and Localization | Liangwei Li et.al | paper | - | - |
2025-8-7 | CLIP Meets Diffusion: A Synergistic Approach to Anomaly Detection | Byeongchan Lee et.al | paper | - | - |
2025-8-7 | GuARD: Effective Anomaly Detection through a Text-Rich and Graph-Informed Language Model | Yunhe Pang et.al | paper | - | <summary>detail</summary>KDD 2025 |
3D Anomaly Detection
Date | Title | Authors | Code | Comments | |
---|---|---|---|---|---|
2025-8-2 | C3D-AD: Toward Continual 3D Anomaly Detection via Kernel Attention with Learnable Advisor | Haoquan Lu et.al | paper | code | <summary>detail</summary>We have provided the code for C3D-AD with checkpoints and BASELINE at this link: https://github |
2025-8-1 | SiM3D: Single-instance Multiview Multimodal and Multisetup 3D Anomaly Detection Benchmark | Alex Costanzino et.al | paper | code | <summary>detail</summary>ICCV 2025 |
2025-8-1 | HyPCV-Former: Hyperbolic Spatio-Temporal Transformer for 3D Point Cloud Video Anomaly Detection | Jiaping Cao et.al | paper | - | - |
2025-7-29 | Multi-View Reconstruction with Global Context for 3D Anomaly Detection | Yihan Sun et.al | paper | - | - |
2025-7-27 | Position: Untrained Machine Learning for Anomaly Detection by using 3D Point Cloud Data | Juan Du et.al | paper | - | - |
2025-7-25 | BridgeNet: A Unified Multimodal Framework for Bridging 2D and 3D Industrial Anomaly Detection | An Xiang et.al | paper | code | - |
2025-7-24 | MC3D-AD: A Unified Geometry-aware Reconstruction Model for Multi-category 3D Anomaly Detection | Jiayi Cheng et.al | paper | code | - |
2025-7-17 | 3DKeyAD: High-Resolution 3D Point Cloud Anomaly Detection via Keypoint-Guided Point Clustering | Zi Wang et.al | paper | - | - |
2025-7-12 | Stereo-based 3D Anomaly Object Detection for Autonomous Driving: A New Dataset and Baseline | Shiyi Mu et.al | paper | code.) | <summary>detail</summary>under review |
2025-7-10 | 3D-ADAM: A Dataset for 3D Anomaly Detection in Advanced Manufacturing | Paul McHard et.al | paper | - | - |
2025-7-10 | Towards High-Resolution 3D Anomaly Detection: A Scalable Dataset and Real-Time Framework for Subtle Industrial Defects | Yuqi Cheng et.al | paper | - | - |
2025-7-5 | Taming Anomalies with Down-Up Sampling Networks: Group Center Preserving Reconstruction for 3D Anomaly Detection | Hanzhe Liang et.al | paper | - | <summary>detail</summary>ACM MM25 Accepted |
2025-6-3 | DAS3D: Dual-modality Anomaly Synthesis for 3D Anomaly Detection | Kecen Li et.al | paper | code | <summary>detail</summary>Code available at https://github |
2025-5-27 | Mentor3AD: Feature Reconstruction-based 3D Anomaly Detection via Multi-modality Mentor Learning | Hanzhe Liang et.al | paper | - | <summary>detail</summary>arXiv admin comment: This version has been removed by arXiv administrators as the submitter did not have the rights to agree to the license at the time of submission |
2025-5-15 | Examining the Source of Defects from a Mechanical Perspective for 3D Anomaly Detection | Hanzhe Liang et.al | paper | code | - |
2025-4-19 | Real-IAD D3: A Real-World 2D/Pseudo-3D/3D Dataset for Industrial Anomaly Detection | Wenbing Zhu et.al | paper | code | - |
2025-4-7 | IterMask3D: Unsupervised Anomaly Detection and Segmentation with Test-Time Iterative Mask Refinement in 3D Brain MR | Ziyun Liang et.al | paper | - | - |
2025-3-30 | Self-Supervised Masked Mesh Learning for Unsupervised Anomaly Detection on 3D Cortical Surfaces | Hao-Chun Yang et.al | paper | - | - |
2025-3-21 | A Survey on RGB, 3D, and Multimodal Approaches for Unsupervised Industrial Image Anomaly Detection | Yuxuan Lin et.al | paper | code | <summary>detail</summary>Accepted by Information Fusion |
2025-3-10 | Look Inside for More: Internal Spatial Modality Perception for 3D Anomaly Detection | Hanzhe Liang et.al | paper | - | <summary>detail</summary>AAAI2025 Poster |
2025-3-3 | Fence Theorem: Towards Dual-Objective Semantic-Structure Isolation in Preprocessing Phase for 3D Anomaly Detection | Hanzhe Liang et.al | paper | - | - |
2025-2-16 | Exploiting Point-Language Models with Dual-Prompts for 3D Anomaly Detection | Jiaxiang Wang et.al | paper | - | - |
2025-2-9 | A 3D Multimodal Feature for Infrastructure Anomaly Detection | Yixiong Jing et.al | paper | code | - |
2024-12-23 | Revisiting Multimodal Fusion for 3D Anomaly Detection from an Architectural Perspective | Kaifang Long et.al | paper | - | - |
2024-12-22 | PointAD: Comprehending 3D Anomalies from Points and Pixels for Zero-shot 3D Anomaly Detection | Qihang Zhou et.al | paper | - | <summary>detail</summary>NeurIPS 2024 |
Multimodal Anomaly Detection
Date | Title | Authors | Code | Comments | |
---|---|---|---|---|---|
2025-8-6 | AD-FM: Multimodal LLMs for Anomaly Detection via Multi-Stage Reasoning and Fine-Grained Reward Optimization | Jingyi Liao et.al | paper | - | - |
2025-8-1 | SiM3D: Single-instance Multiview Multimodal and Multisetup 3D Anomaly Detection Benchmark | Alex Costanzino et.al | paper | code | <summary>detail</summary>ICCV 2025 |
2025-7-25 | BridgeNet: A Unified Multimodal Framework for Bridging 2D and 3D Industrial Anomaly Detection | An Xiang et.al | paper | code | - |
2025-7-25 | Tuned Reverse Distillation: Enhancing Multimodal Industrial Anomaly Detection with Crossmodal Tuners | Xinyue Liu et.al | paper | code | - |
2025-7-23 | HiProbe-VAD: Video Anomaly Detection via Hidden States Probing in Tuning-Free Multimodal LLMs | Zhaolin Cai et.al | paper | - | <summary>detail</summary>Accepted by ACM MM 2025 |
2025-6-23 | Multimodal Anomaly Detection with a Mixture-of-Experts | Christoph Willibald et.al | paper | - | - |
2025-6-20 | When Every Millisecond Counts: Real-Time Anomaly Detection via the Multimodal Asynchronous Hybrid Network | Dong Xiao et.al | paper | - | <summary>detail</summary>ICML 2025 Spotlight |
2025-6-4 | MemoryOut: Learning Principal Features via Multimodal Sparse Filtering Network for Semi-supervised Video Anomaly Detection | Juntong Li et.al | paper | code | - |
2025-5-28 | OmniAD: Detect and Understand Industrial Anomaly via Multimodal Reasoning | Shifang Zhao et.al | paper | - | - |
2025-5-19 | Multimodal RAG-driven Anomaly Detection and Classification in Laser Powder Bed Fusion using Large Language Models | Kiarash Naghavi Khanghah et.al | paper | - | <summary>detail</summary>ASME 2025 International Design Engineering Technical Conferences and Computers and Information in Engineering Conference IDETC/CIE2025 |
2025-5-8 | Uncertainty-Weighted Image-Event Multimodal Fusion for Video Anomaly Detection | Sungheon Jeong et.al | paper | code | - |
2025-4-17 | LAD-Reasoner: Tiny Multimodal Models are Good Reasoners for Logical Anomaly Detection | Weijia Li et.al | paper | - | - |
2025-3-21 | A Survey on RGB, 3D, and Multimodal Approaches for Unsupervised Industrial Image Anomaly Detection | Yuxuan Lin et.al | paper | code | <summary>detail</summary>Accepted by Information Fusion |
2025-3-17 | Towards Zero-Shot Anomaly Detection and Reasoning with Multimodal Large Language Models | Jiacong Xu et.al | paper | code | - |
2025-2-24 | Can Multimodal LLMs Perform Time Series Anomaly Detection? | Xiongxiao Xu et.al | paper | code | - |
2025-2-20 | MMAD: A Comprehensive Benchmark for Multimodal Large Language Models in Industrial Anomaly Detection | Xi Jiang et.al | paper | code | <summary>detail</summary>Accepted by ICLR 2025 |
2025-2-18 | Anomaly Detection in Smart Power Grids with Graph-Regularized MS-SVDD: a Multimodal Subspace Learning Approach | Thomas Debelle et.al | paper | - | - |
2025-2-10 | Multimodal Task Representation Memory Bank vs. Catastrophic Forgetting in Anomaly Detection | You Zhou et.al | paper | - | - |
2025-2-9 | A 3D Multimodal Feature for Infrastructure Anomaly Detection | Yixiong Jing et.al | paper | code | - |
2025-1-27 | Can Multimodal Large Language Models be Guided to Improve Industrial Anomaly Detection? | Zhiling Chen et.al | paper | - | - |
2025-1-17 | Myriad: Large Multimodal Model by Applying Vision Experts for Industrial Anomaly Detection | Yuanze Li et.al | paper | code | - |
2024-12-23 | Revisiting Multimodal Fusion for 3D Anomaly Detection from an Architectural Perspective | Kaifang Long et.al | paper | - | - |
2024-9-30 | VMAD: Visual-enhanced Multimodal Large Language Model for Zero-Shot Anomaly Detection | Huilin Deng et.al | paper | - | - |
2024-9-26 | AnoVox: A Benchmark for Multimodal Anomaly Detection in Autonomous Driving | Daniel Bogdoll et.al | paper | - | <summary>detail</summary>Daniel Bogdoll |
2024-9-23 | Incomplete Multimodal Industrial Anomaly Detection via Cross-Modal Distillation | Wenbo Sui et.al | paper | - | - |
Vector Quantization
Date | Title | Authors | Code | Comments | |
---|---|---|---|---|---|
2025-8-8 | Graph is a Natural Regularization: Revisiting Vector Quantization for Graph Representation Learning | Zian Zhai et.al | paper | - | - |
2025-8-7 | Vector Quantized-Elites: Unsupervised and Problem-Agnostic Quality-Diversity Optimization | Constantinos Tsakonas et.al | paper | - | - |
2025-8-7 | Task Vector Quantization for Memory-Efficient Model Merging | Youngeun Kim et.al | paper | - | - |
2025-8-5 | CIVQLLIE: Causal Intervention with Vector Quantization for Low-Light Image Enhancement | Tongshun Zhang et.al | paper | - | - |
2025-8-5 | BrainECHO: Semantic Brain Signal Decoding through Vector-Quantized Spectrogram Reconstruction for Whisper-Enhanced Text Generation | Jilong Li et.al | paper | - | - |
2025-8-3 | SSVQ: Unleashing the Potential of Vector Quantization with Sign-Splitting | Shuaiting Li et.al | paper | code | <summary>detail</summary>ICCV’25 camera ready |
2025-7-31 | VQ-DeepISC: Vector Quantized-Enabled Digital Semantic Communication with Channel Adaptive Image Transmission | Jianqiao Chen et.al | paper | - | - |
2025-7-31 | Vector-Quantized Vision Foundation Models for Object-Centric Learning | Rongzhen Zhao et.al | paper | code | <summary>detail</summary>Accepted by ACM MM 2025 |
2025-7-31 | Optimal and Near-Optimal Adaptive Vector Quantization | Ran Ben-Basat et.al | paper | - | - |
2025-7-30 | ViM-VQ: Efficient Post-Training Vector Quantization for Visual Mamba | Juncan Deng et.al | paper | - | - |
2025-7-30 | Addressing Representation Collapse in Vector Quantized Models with One Linear Layer | Yongxin Zhu et.al | paper | - | <summary>detail</summary>ICCV2025 |
2025-7-26 | All-in-One Medical Image Restoration with Latent Diffusion-Enhanced Vector-Quantized Codebook Prior | Haowei Chen et.al | paper | - | - |
2025-7-20 | Vector Quantization Prompting for Continual Learning | Li Jiao et.al | paper | code | <summary>detail</summary>Accepted by NeurIPS 2024 |
2025-7-14 | A Vector-Quantized Foundation Model for Patient Behavior Monitoring | Rodrigo Oliver et.al | paper | - | - |
2025-7-9 | Adversarial Defenses via Vector Quantization | Zhiyi Dong et.al | paper | code | <summary>detail</summary>This is the author-accepted version of our paper published in Neurocomputing |
2025-7-9 | Semi-fragile watermarking of remote sensing images using DWT, vector quantization and automatic tiling | Jordi Serra-Ruiz et.al | paper | - | - |
2025-7-9 | VQ-SGen: A Vector Quantized Stroke Representation for Creative Sketch Generation | Jiawei Wang et.al | paper | code | <summary>detail</summary>Project Page: https://enigma-li |
2025-7-8 | EdgeCodec: Onboard Lightweight High Fidelity Neural Compressor with Residual Vector Quantization | Benjamin Hodo et.al | paper | - | <summary>detail</summary>7 Pages |
2025-7-2 | Unsupervised Panoptic Interpretation of Latent Spaces in GANs Using Space-Filling Vector Quantization | Mohammad Hassan Vali et.al | paper | - | - |
2025-7-1 | Hierarchical Patch Compression for ColPali: Efficient Multi-Vector Document Retrieval with Dynamic Pruning and Quantization | Duong Bach et.al | paper | code | - |
2025-7-1 | VQ-VLA: Improving Vision-Language-Action Models via Scaling Vector-Quantized Action Tokenizers | Yating Wang et.al | paper | code | <summary>detail</summary>Accepted by ICCV 2025 |
2025-6-30 | VQ-LLM: High-performance Code Generation for Vector Quantization Augmented LLM Inference | Zihan Liu et.al | paper | - | - |
2025-6-28 | Hierarchical Characterization of Brain Dynamics via State Space-based Vector Quantization | Yanwu Yang et.al | paper | - | - |
2025-6-26 | PCDVQ: Enhancing Vector Quantization for Large Language Models via Polar Coordinate Decoupling | Yuxuan Yue et.al | paper | - | - |
2025-6-24 | AnTKV: Anchor Token-Aware Sub-Bit Vector Quantization for KV Cache in Large Language Models | Zeyu Li et.al | paper | - | - |