Daily Papers
- Defect Detection
- Defect Segmentation
- Anomaly Detection
- 3D Anomaly Detection
- Multimodal Anomaly Detection
- Vector Quantization
Updated on 2025.09.28
Defect Detection
Date | Title | Authors | Code | Comments | |
---|---|---|---|---|---|
2025-9-25 | A Real-Time On-Device Defect Detection Framework for Laser Power-Meter Sensors via Unsupervised Learning | Dongqi Zheng et.al | paper | - | - |
2025-9-23 | Advancing Metallic Surface Defect Detection via Anomaly-Guided Pretraining on a Large Industrial Dataset | Chuni Liu et.al | paper | code | - |
2025-9-23 | TinyDef-DETR: A DETR-based Framework for Defect Detection in Transmission Lines from UAV Imagery | Feng Shen et.al | paper | - | - |
2025-9-19 | ISP-AD: A Large-Scale Real-World Dataset for Advancing Industrial Anomaly Detection with Synthetic and Real Defects | Paul J. Krassnig et.al | paper | code | - |
2025-9-6 | On the Detection of Internal Defects in Structured Media | Bryl Nico M. Ong et.al | paper | - | - |
2025-9-4 | YOLO Ensemble for UAV-based Multispectral Defect Detection in Wind Turbine Components | Serhii Svystun et.al | paper | - | <summary>detail</summary>The 13th IEEE International Conference on Intelligent Data Acquisition and Advanced Computing Systems: Technology and Applications |
2025-9-3 | Joint Training of Image Generator and Detector for Road Defect Detection | Kuan-Chuan Peng et.al | paper | - | <summary>detail</summary>This paper is accepted to ICCV 2025 Workshop on Representation Learning with Very Limited Resources: When Data |
2025-9-2 | Resilient Multimodal Industrial Surface Defect Detection with Uncertain Sensors Availability | Shuai Jiang et.al | paper | code | <summary>detail</summary>IEEE/ASME Transactions on Mechatronics |
2025-9-2 | Vision-Based Object Detection for UAV Solar Panel Inspection Using an Enhanced Defects Dataset | Ashen Rodrigo et.al | paper | code | - |
2025-9-2 | FusWay: Multimodal hybrid fusion approach. Application to Railway Defect Detection | Alexey Zhukov et.al | paper | - | - |
2025-9-1 | TransMatch: A Transfer-Learning Framework for Defect Detection in Laser Powder Bed Fusion Additive Manufacturing | Mohsen Asghari Ilani et.al | paper | - | - |
2025-8-31 | Surface Defect Detection with Gabor Filter Using Reconstruction-Based Blurring U-Net-ViT | Jongwook Si et.al | paper | - | - |
2025-8-26 | No Label Left Behind: A Unified Surface Defect Detection Model for all Supervision Regimes | Blaž Rolih et.al | paper | code | <summary>detail</summary>Accepted by The Journal of Intelligent Manufacturing |
2025-8-22 | A Lightweight Group Multiscale Bidirectional Interactive Network for Real-Time Steel Surface Defect Detection | Yong Zhang et.al | paper | code | - |
2025-8-15 | Defects4Log: Benchmarking LLMs for Logging Code Defect Detection and Reasoning | Xin Wang et.al | paper | - | - |
2025-8-14 | A Segmentation-driven Editing Method for Bolt Defect Augmentation and Detection | Yangjie Xiao et.al | paper | code | - |
2025-8-12 | Masked Autoencoder Self Pre-Training for Defect Detection in Microelectronics | Nikolai Röhrich et.al | paper | - | - |
2025-8-10 | A Steel Surface Defect Detection Method Based on Lightweight Convolution Optimization | Cong Chen et.al | paper | code | <summary>detail</summary>This is a preprint of an article accepted for publication in the International Journal of Advanced Computer Science and Applications (IJACSA) |
2025-8-8 | Advancing Welding Defect Detection in Maritime Operations via Adapt-WeldNet and Defect Detection Interpretability Analysis | Kamal Basha S et.al | paper | - | - |
2025-8-6 | MultiADS: Defect-aware Supervision for Multi-type Anomaly Detection and Segmentation in Zero-Shot Learning | Ylli Sadikaj et.al | paper | - | - |
2025-8-2 | NATLM: Detecting Defects in NFT Smart Contracts Leveraging LLM | Yuanzheng Niu et.al | paper | - | - |
2025-7-29 | Enhancing Glass Defect Detection with Diffusion Models: Addressing Imbalanced Datasets in Manufacturing Quality Control | Sajjad Rezvani Boroujeni et.al | paper | - | - |
2025-7-22 | Multi-Scale PCB Defect Detection with YOLOv8 Network Improved via Pruning and Lightweight Network | Li Pingzhen et.al | paper | - | - |
2025-7-21 | RoadFusion: Latent Diffusion Model for Pavement Defect Detection | Muhammad Aqeel et.al | paper | - | <summary>detail</summary>ICIAP 2025 |
2025-7-21 | ExDD: Explicit Dual Distribution Learning for Surface Defect Detection via Diffusion Synthesis | Muhammad Aqeel et.al | paper | - | <summary>detail</summary>ICIAP 2025 |
Defect Segmentation
Date | Title | Authors | Code | Comments | |
---|---|---|---|---|---|
2025-9-11 | Unsupervised Integrated-Circuit Defect Segmentation via Image-Intrinsic Normality | Botong Zhao et.al | paper | - | - |
2025-8-14 | A Segmentation-driven Editing Method for Bolt Defect Augmentation and Detection | Yangjie Xiao et.al | paper | code | - |
2025-8-11 | KARMA: Efficient Structural Defect Segmentation via Kolmogorov-Arnold Representation Learning | Md Meftahul Ferdaus et.al | paper | code | <summary>detail</summary>submitted to IEEE Transactions on Pattern Analysis and Machine Intelligence |
2025-8-6 | MultiADS: Defect-aware Supervision for Multi-type Anomaly Detection and Segmentation in Zero-Shot Learning | Ylli Sadikaj et.al | paper | - | - |
2025-7-23 | Exploring Active Learning for Semiconductor Defect Segmentation | Lile Cai et.al | paper | - | <summary>detail</summary>accepted to ICIP 2022 |
2025-7-14 | Advancing Automatic Photovoltaic Defect Detection using Semi-Supervised Semantic Segmentation of Electroluminescence Images | Abhishek Jha et.al | paper | code | - |
2025-6-28 | Region-Aware CAM: High-Resolution Weakly-Supervised Defect Segmentation via Salient Region Perception | Hang-Cheng Dong et.al | paper | - | - |
2025-6-24 | Evolutionary computing-based image segmentation method to detect defects and features in Additive Friction Stir Deposition Process | Akshansh Mishra et.al | paper | - | - |
2025-6-17 | synth-dacl: Does Synthetic Defect Data Enhance Segmentation Accuracy and Robustness for Real-World Bridge Inspections? | Johannes Flotzinger et.al | paper | - | - |
2025-4-24 | Conformal Segmentation in Industrial Surface Defect Detection with Statistical Guarantees | Cheng Shen et.al | paper | - | <summary>detail</summary>Under Review |
2025-4-11 | Weakly Supervised Panoptic Segmentation for Defect-Based Grading of Fresh Produce | Manuel Knott et.al | paper | code | <summary>detail</summary>Accepted as a paper to the 6th International Workshop on Agriculture-Vision: Challenges & Opportunities for Computer Vision in Agriculture in conjunction with IEEE/CVF CVPR 2025 |
2025-2-11 | Exploring Few-Shot Defect Segmentation in General Industrial Scenarios with Metric Learning and Vision Foundation Models | Tongkun Liu et.al | paper | code | - |
2025-1-23 | Effective Defect Detection Using Instance Segmentation for NDI | Ashiqur Rahman et.al | paper | code | - |
2025-1-17 | Multi-Modal Attention Networks for Enhanced Segmentation and Depth Estimation of Subsurface Defects in Pulse Thermography | Mohammed Salah et.al | paper | - | <summary>detail</summary>Pulse thermography |
2024-10-24 | Synth4Seg – Learning Defect Data Synthesis for Defect Segmentation using Bi-level Optimization | Shancong Mou et.al | paper | - | - |
2024-10-1 | Application of Segment Anything Model for Civil Infrastructure Defect Assessment | Mohsen Ahmadi et.al | paper | - | - |
2024-9-20 | Cycle-Consistency Uncertainty Estimation for Visual Prompting based One-Shot Defect Segmentation | Geonuk Kim et.al | paper | - | <summary>detail</summary>ECCV 2024 VISION workshop Most Innovative Prize |
2024-8-31 | Change-Aware Siamese Network for Surface Defects Segmentation under Complex Background | Biyuan Liu et.al | paper | - | - |
2024-8-19 | Imbalance-Aware Culvert-Sewer Defect Segmentation Using an Enhanced Feature Pyramid Network | Rasha Alshawi et.al | paper | - | - |
2024-8-19 | Dynamic Label Injection for Imbalanced Industrial Defect Segmentation | Emanuele Caruso et.al | paper | code | <summary>detail</summary>ECCV 2024 VISION Workshop |
2024-6-26 | An unsupervised approach towards promptable defect segmentation in laser-based additive manufacturing by Segment Anything | Israt Zarin Era et.al | paper | - | - |
2024-4-17 | CarcassFormer: An End-to-end Transformer-based Framework for Simultaneous Localization, Segmentation and Classification of Poultry Carcass Defect | Minh Tran et.al | paper | code | <summary>detail</summary>Poultry Science Journal |
2024-3-17 | LERENet: Eliminating Intra-class Differences for Metal Surface Defect Few-shot Semantic Segmentation | Hanze Ding et.al | paper | - | - |
2024-2-6 | Multi-class Road Defect Detection and Segmentation using Spatial and Channel-wise Attention for Autonomous Road Repairing | Jongmin Yu et.al | paper | - | <summary>detail</summary>the ICRA 2024 |
2023-12-21 | Dual Attention U-Net with Feature Infusion: Pushing the Boundaries of Multiclass Defect Segmentation | Rasha Alshawi et.al | paper | - | <summary>detail</summary>under review in IEEE Transactions on Artificial Intelligence |
Anomaly Detection
Date | Title | Authors | Code | Comments | |
---|---|---|---|---|---|
2025-9-25 | Towards Foundation Models for Zero-Shot Time Series Anomaly Detection: Leveraging Synthetic Data and Relative Context Discrepancy | Tian Lan et.al | paper | - | - |
2025-9-25 | FracAug: Fractional Augmentation boost Graph-level Anomaly Detection under Limited Supervision | Xiangyu Dong et.al | paper | - | - |
2025-9-25 | A Deep Learning Framework for Evaluating Dynamic Network Generative Models and Anomaly Detection | Alireza Rashnu et.al | paper | - | <summary>detail</summary>Journal ref:Journal of Innovations in Computer Science and Engineering (JICSE) |
2025-9-24 | Unsupervised Log Anomaly Detection with Few Unique Tokens | Antonin Sulc et.al | paper | - | - |
2025-9-24 | An Improved Time Series Anomaly Detection by Applying Structural Similarity | Tiejun Wang et.al | paper | - | - |
2025-9-24 | Anomaly Detection by Clustering DINO Embeddings using a Dirichlet Process Mixture | Nico Schulthess et.al | paper | code | <summary>detail</summary>Paper accepted at MICCAI 2025 |
2025-9-24 | Pi-Transformer: A Physics-informed Attention Mechanism for Time Series Anomaly Detection | Sepehr Maleki et.al | paper | code | - |
2025-9-24 | Anomaly Detection in Complex Dynamical Systems: A Systematic Framework Using Embedding Theory and Physics-Inspired Consistency | Michael Somma et.al | paper | - | - |
2025-9-23 | PointAD+: Learning Hierarchical Representations for Zero-shot 3D Anomaly Detection | Qihang Zhou et.al | paper | - | <summary>detail</summary>Submitted to TPAMI |
2025-9-23 | 3D-ADAM: A Dataset for 3D Anomaly Detection in Additive Manufacturing | Paul McHard et.al | paper | - | - |
2025-9-23 | Advancing Metallic Surface Defect Detection via Anomaly-Guided Pretraining on a Large Industrial Dataset | Chuni Liu et.al | paper | code | - |
2025-9-23 | HDM: Hybrid Diffusion Model for Unified Image Anomaly Detection | Zekang Weng et.al | paper | - | <summary>detail</summary>The paper is withdrawn owing to issues found in the experimental results |
2025-9-23 | Injecting Explainability and Lightweight Design into Weakly Supervised Video Anomaly Detection Systems | Wen-Dong Jiang et.al | paper | - | - |
2025-9-22 | Graph Enhanced Trajectory Anomaly Detection | Jonathan Kabala Mbuya et.al | paper | - | - |
2025-9-22 | Budgeted Adversarial Attack against Graph-Based Anomaly Detection in Sensor Networks | Sanju Xaviar et.al | paper | - | - |
2025-9-22 | Medical priority fusion: achieving dual optimization of sensitivity and interpretability in nipt anomaly detection | Xiuqi Ge et.al | paper | - | - |
2025-9-22 | Tailored Transformation Invariance for Industrial Anomaly Detection | Mariette Schönfeld et.al | paper | code | - |
2025-9-22 | From Benchmarks to Reality: Advancing Visual Anomaly Detection by the VAND 3.0 Challenge | Lars Heckler-Kram et.al | paper | - | - |
2025-9-22 | Robust Anomaly Detection Under Normality Distribution Shift in Dynamic Graphs | Xiaoyang Xu et.al | paper | - | - |
2025-9-21 | RationAnomaly: Log Anomaly Detection with Rationality via Chain-of-Thought and Reinforcement Learning | Song Xu et.al | paper | - | - |
2025-9-21 | Prospective Multi-Graph Cohesion for Multivariate Time Series Anomaly Detection | Jiazhen Chen et.al | paper | - | <summary>detail</summary>Accepted by the 18th ACM International Conference on Web Search and Data Mining (ACM WSDM 2025) |
2025-9-21 | Intention-aware Hierarchical Diffusion Model for Long-term Trajectory Anomaly Detection | Chen Wang et.al | paper | - | - |
2025-9-20 | DCAD-2000: A Multilingual Dataset across 2000+ Languages with Data Cleaning as Anomaly Detection | Yingli Shen et.al | paper | - | - |
2025-9-20 | AnyAnomaly: Zero-Shot Customizable Video Anomaly Detection with LVLM | Sunghyun Ahn et.al | paper | - | - |
2025-9-19 | BlockScan: Detecting Anomalies in Blockchain Transactions | Jiahao Yu et.al | paper | - | - |
3D Anomaly Detection
Date | Title | Authors | Code | Comments | |
---|---|---|---|---|---|
2025-9-23 | PointAD+: Learning Hierarchical Representations for Zero-shot 3D Anomaly Detection | Qihang Zhou et.al | paper | - | <summary>detail</summary>Submitted to TPAMI |
2025-9-23 | 3D-ADAM: A Dataset for 3D Anomaly Detection in Additive Manufacturing | Paul McHard et.al | paper | - | - |
2025-9-16 | Taming Anomalies with Down-Up Sampling Networks: Group Center Preserving Reconstruction for 3D Anomaly Detection | Hanzhe Liang et.al | paper | - | <summary>detail</summary>ACM MM25 Accepted |
2025-9-12 | MCL-AD: Multimodal Collaboration Learning for Zero-Shot 3D Anomaly Detection | Gang Li et.al | paper | - | <summary>detail</summary>Page 14 |
2025-8-28 | IAENet: An Importance-Aware Ensemble Model for 3D Point Cloud-Based Anomaly Detection | Xuanming Cao et.al | paper | - | - |
2025-8-19 | Stereo-based 3D Anomaly Object Detection for Autonomous Driving: A New Dataset and Baseline | Shiyi Mu et.al | paper | code.) | <summary>detail</summary>under review |
2025-8-2 | C3D-AD: Toward Continual 3D Anomaly Detection via Kernel Attention with Learnable Advisor | Haoquan Lu et.al | paper | code | <summary>detail</summary>We have provided the code for C3D-AD with checkpoints and BASELINE at this link: https://github |
2025-8-1 | SiM3D: Single-instance Multiview Multimodal and Multisetup 3D Anomaly Detection Benchmark | Alex Costanzino et.al | paper | code | <summary>detail</summary>ICCV 2025 |
2025-8-1 | HyPCV-Former: Hyperbolic Spatio-Temporal Transformer for 3D Point Cloud Video Anomaly Detection | Jiaping Cao et.al | paper | - | - |
2025-7-29 | Multi-View Reconstruction with Global Context for 3D Anomaly Detection | Yihan Sun et.al | paper | - | - |
2025-7-27 | Position: Untrained Machine Learning for Anomaly Detection by using 3D Point Cloud Data | Juan Du et.al | paper | - | - |
2025-7-25 | BridgeNet: A Unified Multimodal Framework for Bridging 2D and 3D Industrial Anomaly Detection | An Xiang et.al | paper | code | - |
2025-7-24 | MC3D-AD: A Unified Geometry-aware Reconstruction Model for Multi-category 3D Anomaly Detection | Jiayi Cheng et.al | paper | code | - |
2025-7-17 | 3DKeyAD: High-Resolution 3D Point Cloud Anomaly Detection via Keypoint-Guided Point Clustering | Zi Wang et.al | paper | - | - |
2025-7-10 | Towards High-Resolution 3D Anomaly Detection: A Scalable Dataset and Real-Time Framework for Subtle Industrial Defects | Yuqi Cheng et.al | paper | - | - |
2025-6-3 | DAS3D: Dual-modality Anomaly Synthesis for 3D Anomaly Detection | Kecen Li et.al | paper | code | <summary>detail</summary>Code available at https://github |
2025-5-27 | Mentor3AD: Feature Reconstruction-based 3D Anomaly Detection via Multi-modality Mentor Learning | Hanzhe Liang et.al | paper | - | <summary>detail</summary>arXiv admin comment: This version has been removed by arXiv administrators as the submitter did not have the rights to agree to the license at the time of submission |
2025-5-15 | Examining the Source of Defects from a Mechanical Perspective for 3D Anomaly Detection | Hanzhe Liang et.al | paper | code | - |
2025-4-19 | Real-IAD D3: A Real-World 2D/Pseudo-3D/3D Dataset for Industrial Anomaly Detection | Wenbing Zhu et.al | paper | code | - |
2025-4-7 | IterMask3D: Unsupervised Anomaly Detection and Segmentation with Test-Time Iterative Mask Refinement in 3D Brain MR | Ziyun Liang et.al | paper | - | - |
2025-3-30 | Self-Supervised Masked Mesh Learning for Unsupervised Anomaly Detection on 3D Cortical Surfaces | Hao-Chun Yang et.al | paper | - | - |
2025-3-21 | A Survey on RGB, 3D, and Multimodal Approaches for Unsupervised Industrial Image Anomaly Detection | Yuxuan Lin et.al | paper | code | <summary>detail</summary>Accepted by Information Fusion |
2025-3-10 | Look Inside for More: Internal Spatial Modality Perception for 3D Anomaly Detection | Hanzhe Liang et.al | paper | - | <summary>detail</summary>AAAI2025 Poster |
2025-3-3 | Fence Theorem: Towards Dual-Objective Semantic-Structure Isolation in Preprocessing Phase for 3D Anomaly Detection | Hanzhe Liang et.al | paper | - | - |
2025-2-16 | Exploiting Point-Language Models with Dual-Prompts for 3D Anomaly Detection | Jiaxiang Wang et.al | paper | - | - |
Multimodal Anomaly Detection
Date | Title | Authors | Code | Comments | |
---|---|---|---|---|---|
2025-9-12 | MCL-AD: Multimodal Collaboration Learning for Zero-Shot 3D Anomaly Detection | Gang Li et.al | paper | - | <summary>detail</summary>Page 14 |
2025-8-20 | PB-IAD: Utilizing multimodal foundation models for semantic industrial anomaly detection in dynamic manufacturing environments | Bernd Hofmann et.al | paper | - | - |
2025-8-6 | AD-FM: Multimodal LLMs for Anomaly Detection via Multi-Stage Reasoning and Fine-Grained Reward Optimization | Jingyi Liao et.al | paper | - | - |
2025-8-1 | SiM3D: Single-instance Multiview Multimodal and Multisetup 3D Anomaly Detection Benchmark | Alex Costanzino et.al | paper | code | <summary>detail</summary>ICCV 2025 |
2025-7-25 | BridgeNet: A Unified Multimodal Framework for Bridging 2D and 3D Industrial Anomaly Detection | An Xiang et.al | paper | code | - |
2025-7-25 | Tuned Reverse Distillation: Enhancing Multimodal Industrial Anomaly Detection with Crossmodal Tuners | Xinyue Liu et.al | paper | code | - |
2025-7-23 | HiProbe-VAD: Video Anomaly Detection via Hidden States Probing in Tuning-Free Multimodal LLMs | Zhaolin Cai et.al | paper | - | <summary>detail</summary>Accepted by ACM MM 2025 |
2025-6-23 | Multimodal Anomaly Detection with a Mixture-of-Experts | Christoph Willibald et.al | paper | - | - |
2025-6-20 | When Every Millisecond Counts: Real-Time Anomaly Detection via the Multimodal Asynchronous Hybrid Network | Dong Xiao et.al | paper | - | <summary>detail</summary>ICML 2025 Spotlight |
2025-6-4 | MemoryOut: Learning Principal Features via Multimodal Sparse Filtering Network for Semi-supervised Video Anomaly Detection | Juntong Li et.al | paper | code | - |
2025-5-28 | OmniAD: Detect and Understand Industrial Anomaly via Multimodal Reasoning | Shifang Zhao et.al | paper | - | - |
2025-5-19 | Multimodal RAG-driven Anomaly Detection and Classification in Laser Powder Bed Fusion using Large Language Models | Kiarash Naghavi Khanghah et.al | paper | - | <summary>detail</summary>ASME 2025 International Design Engineering Technical Conferences and Computers and Information in Engineering Conference IDETC/CIE2025 |
2025-5-8 | Uncertainty-Weighted Image-Event Multimodal Fusion for Video Anomaly Detection | Sungheon Jeong et.al | paper | code | - |
2025-4-17 | LAD-Reasoner: Tiny Multimodal Models are Good Reasoners for Logical Anomaly Detection | Weijia Li et.al | paper | - | - |
2025-3-21 | A Survey on RGB, 3D, and Multimodal Approaches for Unsupervised Industrial Image Anomaly Detection | Yuxuan Lin et.al | paper | code | <summary>detail</summary>Accepted by Information Fusion |
2025-3-17 | Towards Zero-Shot Anomaly Detection and Reasoning with Multimodal Large Language Models | Jiacong Xu et.al | paper | code | - |
2025-2-24 | Can Multimodal LLMs Perform Time Series Anomaly Detection? | Xiongxiao Xu et.al | paper | code | - |
2025-2-20 | MMAD: A Comprehensive Benchmark for Multimodal Large Language Models in Industrial Anomaly Detection | Xi Jiang et.al | paper | code | <summary>detail</summary>Accepted by ICLR 2025 |
2025-2-18 | Anomaly Detection in Smart Power Grids with Graph-Regularized MS-SVDD: a Multimodal Subspace Learning Approach | Thomas Debelle et.al | paper | - | - |
2025-2-10 | Multimodal Task Representation Memory Bank vs. Catastrophic Forgetting in Anomaly Detection | You Zhou et.al | paper | - | - |
2025-2-9 | A 3D Multimodal Feature for Infrastructure Anomaly Detection | Yixiong Jing et.al | paper | code | - |
2025-1-27 | Can Multimodal Large Language Models be Guided to Improve Industrial Anomaly Detection? | Zhiling Chen et.al | paper | - | - |
2025-1-17 | Myriad: Large Multimodal Model by Applying Vision Experts for Industrial Anomaly Detection | Yuanze Li et.al | paper | code | - |
2024-12-23 | Revisiting Multimodal Fusion for 3D Anomaly Detection from an Architectural Perspective | Kaifang Long et.al | paper | - | - |
2024-9-30 | VMAD: Visual-enhanced Multimodal Large Language Model for Zero-Shot Anomaly Detection | Huilin Deng et.al | paper | - | - |
Vector Quantization
Date | Title | Authors | Code | Comments | |
---|---|---|---|---|---|
2025-9-22 | Individualized non-uniform quantization for vector search | Mariano Tepper et.al | paper | - | - |
2025-9-16 | Improving 3D Gaussian Splatting Compression by Scene-Adaptive Lattice Vector Quantization | Hao Xu et.al | paper | code | <summary>detail</summary>Code available at https://github |
2025-9-15 | SAQ: Pushing the Limits of Vector Quantization through Code Adjustment and Dimension Segmentation | Hui Li et.al | paper | - | - |
2025-9-12 | Scalable Training for Vector-Quantized Networks with 100% Codebook Utilization | Yifan Chang et.al | paper | - | - |
2025-9-11 | A Vector-Quantized Foundation Model for Patient Behavior Monitoring | Rodrigo Oliver et.al | paper | - | - |
2025-9-4 | Kernel $k$-Medoids as General Vector Quantization | Thore Gerlach et.al | paper | - | - |
2025-8-25 | Scene-Aware Vectorized Memory Multi-Agent Framework with Cross-Modal Differentiated Quantization VLMs for Visually Impaired Assistance | Xiangxiang Wang et.al | paper | - | - |
2025-8-23 | VQL: An End-to-End Context-Aware Vector Quantization Attention for Ultra-Long User Behavior Modeling | Kaiyuan Li et.al | paper | - | - |
2025-8-20 | Making Pose Representations More Expressive and Disentangled via Residual Vector Quantization | Sukhyun Jeong et.al | paper | - | - |
2025-8-19 | Vector-Quantized Vision Foundation Models for Object-Centric Learning | Rongzhen Zhao et.al | paper | code | <summary>detail</summary>Accepted by ACM MM 2025 |
2025-8-8 | Graph is a Natural Regularization: Revisiting Vector Quantization for Graph Representation Learning | Zian Zhai et.al | paper | - | - |
2025-8-7 | Vector Quantized-Elites: Unsupervised and Problem-Agnostic Quality-Diversity Optimization | Constantinos Tsakonas et.al | paper | - | - |
2025-8-7 | Task Vector Quantization for Memory-Efficient Model Merging | Youngeun Kim et.al | paper | - | - |
2025-8-5 | CIVQLLIE: Causal Intervention with Vector Quantization for Low-Light Image Enhancement | Tongshun Zhang et.al | paper | - | - |
2025-8-5 | BrainECHO: Semantic Brain Signal Decoding through Vector-Quantized Spectrogram Reconstruction for Whisper-Enhanced Text Generation | Jilong Li et.al | paper | - | - |
2025-8-3 | SSVQ: Unleashing the Potential of Vector Quantization with Sign-Splitting | Shuaiting Li et.al | paper | code | <summary>detail</summary>ICCV’25 camera ready |
2025-7-31 | VQ-DeepISC: Vector Quantized-Enabled Digital Semantic Communication with Channel Adaptive Image Transmission | Jianqiao Chen et.al | paper | - | - |
2025-7-31 | Optimal and Near-Optimal Adaptive Vector Quantization | Ran Ben-Basat et.al | paper | - | - |
2025-7-30 | ViM-VQ: Efficient Post-Training Vector Quantization for Visual Mamba | Juncan Deng et.al | paper | - | - |
2025-7-30 | Addressing Representation Collapse in Vector Quantized Models with One Linear Layer | Yongxin Zhu et.al | paper | - | <summary>detail</summary>ICCV2025 |
2025-7-26 | All-in-One Medical Image Restoration with Latent Diffusion-Enhanced Vector-Quantized Codebook Prior | Haowei Chen et.al | paper | - | - |
2025-7-20 | Vector Quantization Prompting for Continual Learning | Li Jiao et.al | paper | code | <summary>detail</summary>Accepted by NeurIPS 2024 |
2025-7-9 | Adversarial Defenses via Vector Quantization | Zhiyi Dong et.al | paper | code | <summary>detail</summary>This is the author-accepted version of our paper published in Neurocomputing |
2025-7-9 | Semi-fragile watermarking of remote sensing images using DWT, vector quantization and automatic tiling | Jordi Serra-Ruiz et.al | paper | - | - |
2025-7-9 | VQ-SGen: A Vector Quantized Stroke Representation for Creative Sketch Generation | Jiawei Wang et.al | paper | code | <summary>detail</summary>Project Page: https://enigma-li |