Daily Papers
- Defect Detection
- Defect Segmentation
- Anomaly Detection
- 3D Anomaly Detection
- Multimodal Anomaly Detection
- Vector Quantization
Updated on 2025.05.25
Defect Detection
Date | Title | Authors | Paper | Code | Comments |
---|---|---|---|---|---|
2025-5-15 | Examining the Source of Defects from a Mechanical Perspective for 3D Anomaly Detection | Hanzhe Liang et.al | paper | code | - |
2025-5-15 | Defect Detection in Photolithographic Patterns Using Deep Learning Models Trained on Synthetic Data | Prashant P. Shinde et.al | paper | - | - |
2025-5-12 | Self-Adaptive Gamma Context-Aware SSM-based Model for Metal Defect Detection | Sijin Sun et.al | paper | - | - |
2025-5-11 | Differentiable NMS via Sinkhorn Matching for End-to-End Fabric Defect Detection | Zhengyang Lu et.al | paper | - | - |
2025-5-11 | Transmission Line Defect Detection Based on UAV Patrol Images and Vision-language Pretraining | Ke Zhang et.al | paper | - | - |
2025-5-5 | Enhancing Glass Defect Detection with Diffusion Models: Addressing Imbalanced Datasets in Manufacturing Quality Control | Sajjad Rezvani Boroujeni et.al | paper | - | - |
2025-5-2 | A Comprehensive Survey on Machine Learning Driven Material Defect Detection | Jun Bai et.al | paper | - | ACM Computing Surveys |
2025-4-29 | SteelBlastQC: Shot-blasted Steel Surface Dataset with Interpretable Detection of Surface Defects | Irina Ruzavina et.al | paper | - | Accepted by IJCNN 2025 |
2025-4-24 | Conformal Segmentation in Industrial Surface Defect Detection with Statistical Guarantees | Cheng Shen et.al | paper | - | Under Review |
2025-4-22 | LogUpdater: Automated Detection and Repair of Specific Defects in Logging Statements | Renyi Zhong et.al | paper | - | Accepted by ACM Transactions on Software Engineering and Methodology (TOSEM) |
2025-4-15 | CFIS-YOLO: A Lightweight Multi-Scale Fusion Network for Edge-Deployable Wood Defect Detection | Jincheng Kang et.al | paper | - | - |
2025-4-14 | Masked Autoencoder Self Pre-Training for Defect Detection in Microelectronics | Nikolai Röhrich et.al | paper | - | - |
2025-4-9 | MultiADS: Defect-aware Supervision for Multi-type Anomaly Detection and Segmentation in Zero-Shot Learning | Ylli Sadikaj et.al | paper | - | - |
2025-4-8 | Event-based Civil Infrastructure Visual Defect Detection: ev-CIVIL Dataset and Benchmark | Udayanga G. W. K. N. Gamage et.al | paper | - | Journal paper submitted to the Sage SHM journal; currently under review |
2025-3-31 | NeRF-Based defect detection | Tianqi et.al | paper | - | - |
2025-3-30 | Semantic-Preserving Transformations as Mutation Operators: A Study on Their Effectiveness in Defect Detection | Max Hort et.al | paper | - | Accepted for publication in Mutation 2025 at the 18th IEEE International Conference on Software Testing |
2025-3-27 | Multimodal surface defect detection from wooden logs for sawing optimization | Bořek Reich et.al | paper | - | - |
2025-3-20 | ISP-AD: A Large-Scale Real-World Dataset for Advancing Industrial Anomaly Detection with Synthetic and Real Defects | Paul J. Krassnig et.al | paper | code | - |
2025-3-5 | AI-Driven Multi-Stage Computer Vision System for Defect Detection in Laser-Engraved Industrial Nameplates | Adhish Anitha Vilasan et.al | paper | - | - |
2025-3-2 | Acoustic Anomaly Detection on UAM Propeller Defect with Acoustic dataset for Crack of drone Propeller (ADCP) | Juho Lee et.al | paper | - | - |
2025-2-28 | Background-Aware Defect Generation for Robust Industrial Anomaly Detection | Youngjae Cho et.al | paper | - | - |
2025-2-27 | A Survey on Foundation-Model-Based Industrial Defect Detection | Tianle Yang et.al | paper | - | This work has been submitted to the IEEE for possible publication |
2025-2-25 | Improved YOLOv7x-Based Defect Detection Algorithm for Power Equipment | Jin Hou et.al | paper | - | - |
2025-2-15 | SEM-CLIP: Precise Few-Shot Learning for Nanoscale Defect Detection in Scanning Electron Microscope Image | Qian Jin et.al | paper | - | Published in ACM/IEEE International Conference on Computer-Aided Design (ICCAD) |
2025-2-13 | Unit Testing Past vs. Present: Examining LLMs’ Impact on Defect Detection and Efficiency | Rudolf Ramler et.al | paper | - | - |
Defect Segmentation
Date | Title | Authors | Paper | Code | Comments |
---|---|---|---|---|---|
2025-4-24 | Conformal Segmentation in Industrial Surface Defect Detection with Statistical Guarantees | Cheng Shen et.al | paper | - | Under Review |
2025-4-11 | Weakly Supervised Panoptic Segmentation for Defect-Based Grading of Fresh Produce | Manuel Knott et.al | paper | code | Accepted as a paper to the 6th International Workshop on Agriculture-Vision: Challenges & Opportunities for Computer Vision in Agriculture in conjunction with IEEE/CVF CVPR 2025 |
2025-4-9 | MultiADS: Defect-aware Supervision for Multi-type Anomaly Detection and Segmentation in Zero-Shot Learning | Ylli Sadikaj et.al | paper | - | - |
2025-2-11 | Exploring Few-Shot Defect Segmentation in General Industrial Scenarios with Metric Learning and Vision Foundation Models | Tongkun Liu et.al | paper | code | - |
2025-1-30 | PV-S3: Advancing Automatic Photovoltaic Defect Detection using Semi-Supervised Semantic Segmentation of Electroluminescence Images | Abhishek Jha et.al | paper | code | - |
2025-1-23 | Effective Defect Detection Using Instance Segmentation for NDI | Ashiqur Rahman et.al | paper | code | - |
2025-1-17 | Multi-Modal Attention Networks for Enhanced Segmentation and Depth Estimation of Subsurface Defects in Pulse Thermography | Mohammed Salah et.al | paper | - | Pulse thermography |
2024-10-24 | Synth4Seg – Learning Defect Data Synthesis for Defect Segmentation using Bi-level Optimization | Shancong Mou et.al | paper | - | - |
2024-10-1 | Application of Segment Anything Model for Civil Infrastructure Defect Assessment | Mohsen Ahmadi et.al | paper | - | - |
2024-9-20 | Cycle-Consistency Uncertainty Estimation for Visual Prompting based One-Shot Defect Segmentation | Geonuk Kim et.al | paper | - | ECCV 2024 VISION workshop Most Innovative Prize |
2024-8-31 | Change-Aware Siamese Network for Surface Defects Segmentation under Complex Background | Biyuan Liu et.al | paper | - | - |
2024-8-19 | Imbalance-Aware Culvert-Sewer Defect Segmentation Using an Enhanced Feature Pyramid Network | Rasha Alshawi et.al | paper | - | - |
2024-8-19 | Dynamic Label Injection for Imbalanced Industrial Defect Segmentation | Emanuele Caruso et.al | paper | code | ECCV 2024 VISION Workshop |
2024-6-26 | An unsupervised approach towards promptable defect segmentation in laser-based additive manufacturing by Segment Anything | Israt Zarin Era et.al | paper | - | - |
2024-4-17 | CarcassFormer: An End-to-end Transformer-based Framework for Simultaneous Localization, Segmentation and Classification of Poultry Carcass Defect | Minh Tran et.al | paper | code | Poultry Science Journal |
2024-3-17 | LERENet: Eliminating Intra-class Differences for Metal Surface Defect Few-shot Semantic Segmentation | Hanze Ding et.al | paper | - | - |
2024-2-6 | Multi-class Road Defect Detection and Segmentation using Spatial and Channel-wise Attention for Autonomous Road Repairing | Jongmin Yu et.al | paper | - | ICRA 2024 |
2023-12-21 | Dual Attention U-Net with Feature Infusion: Pushing the Boundaries of Multiclass Defect Segmentation | Rasha Alshawi et.al | paper | - | Under review at IEEE Transactions on Artificial Intelligence |
2023-12-8 | Continual learning for surface defect segmentation by subnetwork creation and selection | Aleksandr Dekhovich et.al | paper | - | - |
2023-12-6 | Overhead Line Defect Recognition Based on Unsupervised Semantic Segmentation | Weixi Wang et.al | paper | - | - |
2023-11-16 | Segment Anything in Defect Detection | Bozhen Hu et.al | paper | - | - |
2023-10-24 | Harmonizing output imbalance for defect segmentation on extremely-imbalanced photovoltaic module cells images | Jianye Yi et.al | paper | - | - |
2023-10-3 | Photonic Accelerators for Image Segmentation in Autonomous Driving and Defect Detection | Lakshmi Nair et.al | paper | - | MSC Class:I |
2023-9-28 | Investigating Shift Equivalence of Convolutional Neural Networks in Industrial Defect Segmentation | Zhen Qu et.al | paper | code | Submitted to IEEE Transactions on Instrumentation & Measurement |
2023-9-22 | CINFormer: Transformer network with multi-stage CNN feature injection for surface defect segmentation | Xiaoheng Jiang et.al | paper | - | - |
Anomaly Detection
Date | Title | Authors | Paper | Code | Comments |
---|---|---|---|---|---|
2025-5-22 | KAN-AD: Time Series Anomaly Detection with Kolmogorov-Arnold Networks | Quan Zhou et.al | paper | - | - |
2025-5-22 | A Multi-Step Comparative Framework for Anomaly Detection in IoT Data Streams | Mohammed Al-Qudah et.al | paper | - | - |
2025-5-22 | Zero-Shot Anomaly Detection in Battery Thermal Images Using Visual Question Answering with Prior Knowledge | Marcella Astrid et.al | paper | - | Accepted in EUSIPCO 2025 |
2025-5-22 | SD-MAD: Sign-Driven Few-shot Multi-Anomaly Detection in Medical Images | Kaiyu Guo et.al | paper | - | - |
2025-5-22 | Unsupervised Network Anomaly Detection with Autoencoders and Traffic Images | Michael Neri et.al | paper | code | Accepted for publication in EUSIPCO 2025 |
2025-5-22 | Interpretable Anomaly Detection in Encrypted Traffic Using SHAP with Machine Learning Models | Kalindi Singh et.al | paper | - | - |
2025-5-22 | MADCluster: Model-agnostic Anomaly Detection with Self-supervised Clustering Network | Sangyong Lee et.al | paper | - | - |
2025-5-21 | Unsupervised Log Anomaly Detection with Few Unique Tokens | Antonin Sulc et.al | paper | - | - |
2025-5-21 | Flashback: Memory-Driven Zero-shot, Real-time Video Anomaly Detection | Hyogun Lee et.al | paper | - | - |
2025-5-20 | Anomaly Detection Based on Critical Paths for Deep Neural Networks | Fangzhen Zhao et.al | paper | - | - |
2025-5-20 | Customizing Visual-Language Foundation Models for Multi-modal Anomaly Detection and Reasoning | Xiaohao Xu et.al | paper | code | Best Student Paper Award at IEEE International Conference on Computer Supported Cooperative Work in Design |
2025-5-20 | LogicQA: Logical Anomaly Detection with Vision Language Model Generated Questions | Yejin Kwon et.al | paper | - | Accepted Industry Track at ACL 2025 |
2025-5-19 | Multimodal RAG-driven Anomaly Detection and Classification in Laser Powder Bed Fusion using Large Language Models | Kiarash Naghavi Khanghah et.al | paper | - | ASME 2025 International Design Engineering Technical Conferences and Computers and Information in Engineering Conference IDETC/CIE2025 |
2025-5-19 | Unsupervised anomaly detection in MeV ultrafast electron diffraction | Mariana A. Fazio et.al | paper | - | - |
2025-5-19 | View-Invariant Pixelwise Anomaly Detection in Multi-object Scenes with Adaptive View Synthesis | Subin Varghese et.al | paper | code | - |
2025-5-19 | Just Dance with $π$! A Poly-modal Inductor for Weakly-supervised Video Anomaly Detection | Snehashis Majhi et.al | paper | - | - |
2025-5-19 | Structure-based Anomaly Detection and Clustering | Filippo Leveni et.al | paper | - | Doctoral dissertation at Politecnico di Milano |
2025-5-18 | AdaptCLIP: Adapting CLIP for Universal Visual Anomaly Detection | Bin-Bin Gao et.al | paper | code | - |
2025-5-18 | Deep Probabilistic Modeling of User Behavior for Anomaly Detection via Mixture Density Networks | Lu Dai et.al | paper | - | - |
2025-5-18 | AD-AGENT: A Multi-agent Framework for End-to-end Anomaly Detection | Tiankai Yang et.al | paper | - | - |
2025-5-16 | DiffusionAD: Norm-guided One-step Denoising Diffusion for Anomaly Detection | Hui Zhang et.al | paper | code | Accepted by TPAMI |
2025-5-16 | CL-BioGAN: Biologically-Inspired Cross-Domain Continual Learning for Hyperspectral Anomaly Detection | Jianing Wang et.al | paper | - | Journal ref: IEEE Transactions on Geoscience and Remote Sensing |
2025-5-16 | CL-CaGAN: Capsule differential adversarial continuous learning for cross-domain hyperspectral anomaly detection | Jianing Wang et.al | paper | - | Journal ref: IEEE Transactions on Geoscience and Remote Sensing |
2025-5-16 | Kick Bad Guys Out! Conditionally Activated Anomaly Detection in Federated Learning with Zero-Knowledge Proof Verification | Shanshan Han et.al | paper | - | - |
2025-5-16 | Benchmarking Anomaly Detection Algorithms: Deep Learning and Beyond | Shanay Mehta et.al | paper | - | - |
3D Anomaly Detection
Date | Title | Authors | Paper | Code | Comments |
---|---|---|---|---|---|
2025-5-15 | Examining the Source of Defects from a Mechanical Perspective for 3D Anomaly Detection | Hanzhe Liang et.al | paper | code | - |
2025-5-3 | MC3D-AD: A Unified Geometry-aware Reconstruction Model for Multi-category 3D Anomaly Detection | Jiayi Cheng et.al | paper | - | - |
2025-4-19 | Real-IAD D3: A Real-World 2D/Pseudo-3D/3D Dataset for Industrial Anomaly Detection | Wenbing Zhu et.al | paper | code | - |
2025-4-7 | IterMask3D: Unsupervised Anomaly Detection and Segmentation with Test-Time Iterative Mask Refinement in 3D Brain MR | Ziyun Liang et.al | paper | - | - |
2025-3-30 | Self-Supervised Masked Mesh Learning for Unsupervised Anomaly Detection on 3D Cortical Surfaces | Hao-Chun Yang et.al | paper | - | - |
2025-3-21 | A Survey on RGB, 3D, and Multimodal Approaches for Unsupervised Industrial Image Anomaly Detection | Yuxuan Lin et.al | paper | code | Accepted by Information Fusion |
2025-3-10 | Look Inside for More: Internal Spatial Modality Perception for 3D Anomaly Detection | Hanzhe Liang et.al | paper | - | AAAI 2025 Poster |
2025-3-3 | Fence Theorem: Towards Dual-Objective Semantic-Structure Isolation in Preprocessing Phase for 3D Anomaly Detection | Hanzhe Liang et.al | paper | - | - |
2025-2-16 | Exploiting Point-Language Models with Dual-Prompts for 3D Anomaly Detection | Jiaxiang Wang et.al | paper | - | - |
2025-2-9 | A 3D Multimodal Feature for Infrastructure Anomaly Detection | Yixiong Jing et.al | paper | code | - |
2024-12-23 | Revisiting Multimodal Fusion for 3D Anomaly Detection from an Architectural Perspective | Kaifang Long et.al | paper | - | - |
2024-12-22 | PointAD: Comprehending 3D Anomalies from Points and Pixels for Zero-shot 3D Anomaly Detection | Qihang Zhou et.al | paper | - | NeurIPS 2024 |
2024-12-17 | PO3AD: Predicting Point Offsets toward Better 3D Point Cloud Anomaly Detection | Jianan Ye et.al | paper | - | - |
2024-11-9 | Uni-3DAD: GAN-Inversion Aided Universal 3D Anomaly Detection on Model-free Products | Jiayu Liu et.al | paper | - | - |
2024-10-15 | SplatPose+: Real-time Image-Based Pose-Agnostic 3D Anomaly Detection | Yizhe Liu et.al | paper | - | - |
2024-10-13 | DAS3D: Dual-modality Anomaly Synthesis for 3D Anomaly Detection | Kecen Li et.al | paper | - | - |
2024-8-28 | Efficient Slice Anomaly Detection Network for 3D Brain MRI Volume | Zeduo Zhang et.al | paper | code | - |
2024-8-8 | Towards High-resolution 3D Anomaly Detection via Group-Level Feature Contrastive Learning | Hongze Zhu et.al | paper | code | ACMMM24 |
2024-7-15 | R3D-AD: Reconstruction via Diffusion for 3D Anomaly Detection | Zheyuan Zhou et.al | paper | - | ECCV 2024 |
2024-6-27 | Looking 3D: Anomaly Detection with 2D-3D Alignment | Ankan Bhunia et.al | paper | code | CVPR’24 |
2024-6-27 | CLIP3D-AD: Extending CLIP for 3D Few-Shot Anomaly Detection with Multi-View Images Generation | Zuo Zuo et.al | paper | - | - |
2024-6-4 | M3DM-NR: RGB-3D Noisy-Resistant Industrial Anomaly Detection via Multimodal Denoising | Chengjie Wang et.al | paper | - | - |
2024-4-11 | 3D-CSAD: Untrained 3D Anomaly Detection for Complex Manufacturing Surfaces | Xuanming Cao et.al | paper | - | - |
2024-4-10 | SplatPose & Detect: Pose-Agnostic 3D Anomaly Detection | Mathis Kruse et.al | paper | - | Visual Anomaly and Novelty Detection 2 |
2024-1-17 | Self-supervised Feature Adaptation for 3D Industrial Anomaly Detection | Yuanpeng Tu et.al | paper | - | - |
Multimodal Anomaly Detection
Date | Title | Authors | Paper | Code | Comments |
---|---|---|---|---|---|
2025-5-19 | Multimodal RAG-driven Anomaly Detection and Classification in Laser Powder Bed Fusion using Large Language Models | Kiarash Naghavi Khanghah et.al | paper | - | ASME 2025 International Design Engineering Technical Conferences and Computers and Information in Engineering Conference IDETC/CIE2025 |
2025-5-8 | Uncertainty-Weighted Image-Event Multimodal Fusion for Video Anomaly Detection | Sungheon Jeong et.al | paper | code | - |
2025-4-17 | LAD-Reasoner: Tiny Multimodal Models are Good Reasoners for Logical Anomaly Detection | Weijia Li et.al | paper | - | - |
2025-3-21 | A Survey on RGB, 3D, and Multimodal Approaches for Unsupervised Industrial Image Anomaly Detection | Yuxuan Lin et.al | paper | code | Accepted by Information Fusion |
2025-3-19 | Multimodal Industrial Anomaly Detection by Crossmodal Reverse Distillation | Xinyue Liu et.al | paper | - | - |
2025-3-17 | Towards Zero-Shot Anomaly Detection and Reasoning with Multimodal Large Language Models | Jiacong Xu et.al | paper | code | - |
2025-2-24 | Can Multimodal LLMs Perform Time Series Anomaly Detection? | Xiongxiao Xu et.al | paper | code | - |
2025-2-20 | MMAD: A Comprehensive Benchmark for Multimodal Large Language Models in Industrial Anomaly Detection | Xi Jiang et.al | paper | code | Accepted by ICLR 2025 |
2025-2-18 | Anomaly Detection in Smart Power Grids with Graph-Regularized MS-SVDD: a Multimodal Subspace Learning Approach | Thomas Debelle et.al | paper | - | - |
2025-2-10 | Multimodal Task Representation Memory Bank vs. Catastrophic Forgetting in Anomaly Detection | You Zhou et.al | paper | - | - |
2025-2-9 | A 3D Multimodal Feature for Infrastructure Anomaly Detection | Yixiong Jing et.al | paper | code | - |
2025-1-27 | Can Multimodal Large Language Models be Guided to Improve Industrial Anomaly Detection? | Zhiling Chen et.al | paper | - | - |
2025-1-17 | Myriad: Large Multimodal Model by Applying Vision Experts for Industrial Anomaly Detection | Yuanze Li et.al | paper | code | - |
2024-12-23 | Revisiting Multimodal Fusion for 3D Anomaly Detection from an Architectural Perspective | Kaifang Long et.al | paper | - | - |
2024-9-30 | VMAD: Visual-enhanced Multimodal Large Language Model for Zero-Shot Anomaly Detection | Huilin Deng et.al | paper | - | - |
2024-9-26 | AnoVox: A Benchmark for Multimodal Anomaly Detection in Autonomous Driving | Daniel Bogdoll et.al | paper | - | Daniel Bogdoll |
2024-9-23 | Incomplete Multimodal Industrial Anomaly Detection via Cross-Modal Distillation | Wenbo Sui et.al | paper | - | - |
2024-9-17 | Multimodal Attention-Enhanced Feature Fusion-based Weekly Supervised Anomaly Violence Detection | Yuta Kaneko et.al | paper | - | Journal ref: IEEE Open Journal of the Computer Society |
2024-9-9 | Memoryless Multimodal Anomaly Detection via Student-Teacher Network and Signed Distance Learning | Zhongbin Sun et.al | paper | - | - |
2024-7-8 | Multimodal Industrial Anomaly Detection by Crossmodal Feature Mapping | Alex Costanzino et.al | paper | - | CVPR 2024 |
2024-6-13 | Weakly-supervised anomaly detection for multimodal data distributions | Xu Tan et.al | paper | - | - |
2024-6-4 | M3DM-NR: RGB-3D Noisy-Resistant Industrial Anomaly Detection via Multimodal Denoising | Chengjie Wang et.al | paper | - | - |
2024-5-21 | Multimodal video analysis for crowd anomaly detection using open access tourism cameras | Alejandro Dionis-Ros et.al | paper | - | - |
2024-3-6 | Multimodal Anomaly Detection based on Deep Auto-Encoder for Object Slip Perception of Mobile Manipulation Robots | Youngjae Yoo et.al | paper | - | - |
2024-3-5 | TSRNet: Simple Framework for Real-time ECG Anomaly Detection with Multimodal Time and Spectrogram Restoration Network | Nhat-Tan Bui et.al | paper | code | ISBI 2024 |
Vector Quantization
Date | Title | Authors | Paper | Code | Comments |
---|---|---|---|---|---|
2025-5-19 | BrainECHO: Semantic Brain Signal Decoding through Vector-Quantized Spectrogram Reconstruction for Whisper-Enhanced Text Generation | Jilong Li et.al | paper | - | - |
2025-5-18 | Does Vector Quantization Fail in Spatio-Temporal Forecasting? Exploring a Differentiable Sparse Soft-Vector Quantization Approach | Chao Chen et.al | paper | code | Accepted by KDD 2025 research track |
2025-5-15 | VQ-Logits: Compressing the Output Bottleneck of Large Language Models via Vector Quantized Logits | Jintian Shao et.al | paper | - | - |
2025-5-9 | A vector quantized masked autoencoder for audiovisual speech emotion recognition | Samir Sadok et.al | paper | code | - |
2025-5-5 | RobSurv: Vector Quantization-Based Multi-Modal Learning for Robust Cancer Survival Prediction | Aiman Farooq et.al | paper | - | - |
2025-5-2 | RWKVQuant: Quantizing the RWKV Family with Proxy Guided Hybrid of Scalar and Vector Quantization | Chen Xu et.al | paper | - | - |
2025-4-28 | TurboQuant: Online Vector Quantization with Near-optimal Distortion Rate | Amir Zandieh et.al | paper | - | - |
2025-4-27 | Variable Bitrate Residual Vector Quantization for Audio Coding | Yunkee Chae et.al | paper | - | ICASSP 2025 camera-ready version |
2025-4-18 | Lightweight Road Environment Segmentation using Vector Quantization | Jiyong Kwag et.al | paper | - | Journal ref: ISPRS Ann |
2025-4-17 | Hierarchical Vector Quantized Graph Autoencoder with Annealing-Based Code Selection | Long Zeng et.al | paper | - | Journal ref: WWW 2025 |
2025-4-16 | GT-SVQ: A Linear-Time Graph Transformer for Node Classification Using Spiking Vector Quantization | Huizhe Zhang et.al | paper | - | Work in progress |
2025-4-13 | Vector-Quantized Vision Foundation Models for Object-Centric Learning | Rongzhen Zhao et.al | paper | - | - |
2025-4-10 | Vector Quantized-Elites: Unsupervised and Problem-Agnostic Quality-Diversity Optimization | Constantinos Tsakonas et.al | paper | - | - |
2025-4-8 | A Streamable Neural Audio Codec with Residual Scalar-Vector Quantization for Real-Time Communication | Xiao-Hang Jiang et.al | paper | - | Accepted by IEEE Signal Processing Letters |
2025-4-8 | Compressing 3D Gaussian Splatting by Noise-Substituted Vector Quantization | Haishan Wang et.al | paper | - | Appearing in Scandinavian Conference on Image Analysis (SCIA) 2025 |
2025-4-6 | HyperVQ: MLR-based Vector Quantization in Hyperbolic Space | Nabarun Goswami et.al | paper | - | - |
2025-4-1 | Improving Vector-Quantized Image Modeling with Latent Consistency-Matching Diffusion | Bac Nguyen et.al | paper | - | - |
2025-3-18 | MAG: Multi-Modal Aligned Autoregressive Co-Speech Gesture Generation without Vector Quantization | Binjie Liu et.al | paper | - | - |
2025-3-17 | VQ-SGen: A Vector Quantized Stroke Representation for Creative Sketch Generation | Jiawei Wang et.al | paper | - | - |
2025-3-17 | STORM: A Spatio-Temporal Factor Model Based on Dual Vector Quantized Variational Autoencoders for Financial Trading | Yilei Zhao et.al | paper | - | - |
2025-3-15 | Restructuring Vector Quantization with the Rotation Trick | Christopher Fifty et.al | paper | code | - |
2025-3-12 | ViM-VQ: Efficient Post-Training Vector Quantization for Visual Mamba | Juncan Deng et.al | paper | - | - |
2025-3-11 | SSVQ: Unleashing the Potential of Vector Quantization with Sign-Splitting | Shuaiting Li et.al | paper | - | - |
2025-3-10 | Task Vector Quantization for Memory-Efficient Model Merging | Youngeun Kim et.al | paper | - | - |
2025-3-9 | PathVQ: Reforming Computational Pathology Foundation Model for Whole Slide Image Analysis via Vector Quantization | Honglin Li et.al | paper | - | - |