Skip to content

🎓Automatically Update Interested Papers Daily using Github Actions (Update Every 12th hours)

License

Notifications You must be signed in to change notification settings

lidq92/arxiv-daily

 
 

Repository files navigation

[![Contributors][contributors-shield]][contributors-url] [![Forks][forks-shield]][forks-url] [![Stargazers][stars-shield]][stars-url] [![Issues][issues-shield]][issues-url]

Updated on 2025.03.01

Usage instructions: here

Table of Contents
  1. Point Cloud Compression
  2. Compression
  3. Quality Assessment
  4. Super Resolution
  5. Remote Sensing

Point Cloud Compression

Publish Date Title Authors PDF Code
2025-02-26 SPU-IMR: Self-supervised Arbitrary-scale Point Cloud Upsampling via Iterative Mask-recovery Network Ziming Nie et.al. 2502.19452 null
2025-02-25 Deep-JGAC: End-to-End Deep Joint Geometry and Attribute Compression for Dense Colored Point Clouds Yun Zhang et.al. 2502.17939 null
2025-02-10 Real-Time LiDAR Point Cloud Compression and Transmission for Resource-constrained Robots Yuhao Cao et.al. 2502.06123 link
2025-02-07 DetVPCC: RoI-based Point Cloud Sequence Compression for 3D Object Detection Mingxuan Yan et.al. 2502.04804 null
2025-02-05 Deep Learning-based Event Data Coding: A Joint Spatiotemporal and Polarity Solution Abdelrahman Seleem et.al. 2502.03285 null
2025-02-22 Point Cloud Upsampling as Statistical Shape Model for Pelvic Tongxu Zhang et.al. 2501.16716 null
2025-01-25 Efficient Point Clouds Upsampling via Flow Matching Zhi-Song Liu et.al. 2501.15286 null
2025-01-13 Representation Learning of Point Cloud Upsampling in Global and Local Inputs Tongxu Zhang et.al. 2501.07076 null
2024-12-19 Color Enhancement for V-PCC Compressed Point Cloud via 2D Attribute Map Optimization Jingwei Bao et.al. 2412.14449 null
2024-12-16 EGP3D: Edge-guided Geometric Preserving 3D Point Cloud Super-resolution for RGB-D camera Zheng Fang et.al. 2412.11680 null
2024-12-11 Implicit Neural Compression of Point Clouds Hongning Ruan et.al. 2412.10433 null
2024-12-07 Rate-Distortion Optimized Skip Coding of Region Adaptive Hierarchical Transform Coefficients for MPEG G-PCC Zehan Wang et.al. 2412.05574 null
2025-01-09 Rendering-Oriented 3D Point Cloud Attribute Compression using Sparse Tensor-based Transformer Xiao Huo et.al. 2411.07899 null
2024-11-09 Linear Spherical Sliced Optimal Transport: A Fast Metric for Comparing Spherical Data Xinran Liu et.al. 2411.06055 null
2024-11-01 PLATYPUS: Progressive Local Surface Estimator for Arbitrary-Scale Point Cloud Upsampling Donghyun Kim et.al. 2411.00432 null
2024-10-28 Quality Analysis of the Coding Bitrate Tradeoff Between Geometry and Attributes for Colored Point Clouds Joao Prazeres et.al. 2410.21613 null
2024-10-09 Point Cloud Compression with Bits-back Coding Nguyen Quang Hieu et.al. 2410.18115 null
2024-10-23 Att2CPC: Attention-Guided Lossy Attribute Compression of Point Clouds Kai Liu et.al. 2410.17823 link
2024-10-22 Joint Point Cloud Upsampling and Cleaning with Octree-based CNNs Jihe Li et.al. 2410.17001 link
2024-10-21 MBPU: A Plug-and-Play State Space Model for Point Cloud Upsamping with Fast Point Rendering Jiayi Song et.al. 2410.15941 null
2024-10-13 Towards Reproducible Learning-based Compression Jiahao Pang et.al. 2410.09872 null
2024-10-06 Tensor-Train Point Cloud Compression and Efficient Approximate Nearest-Neighbor Search Georgii Novikov et.al. 2410.04462 null
2024-10-01 Can We Remove the Ground? Obstacle-aware Point Cloud Compression for Remote Object Detection Pengxi Zeng et.al. 2410.00582 null
2024-09-19 PVContext: Hybrid Context Model for Point Cloud Compression Guoqing Zhang et.al. 2409.12724 null
2024-09-12 The JPEG Pleno Learning-based Point Cloud Coding Standard: Serving Man and Machine André F. R. Guarda et.al. 2409.08130 null
2024-09-08 GET-UP: GEomeTric-aware Depth Estimation with Radar Points UPsampling Huawei Sun et.al. 2409.02720 link
2024-09-03 GaussianPU: A Hybrid 2D-3D Upsampling Framework for Enhancing Color Point Clouds via 3D Gaussian Splatting Zixuan Guo et.al. 2409.01581 null
2024-08-20 End-to-end learned Lossy Dynamic Point Cloud Attribute Compression Dat Thanh Nguyen et.al. 2408.10665 null
2024-08-20 Diff-PCC: Diffusion-based Neural Compression for 3D Point Clouds Kai Liu et.al. 2408.10543 null
2024-08-16 LLM-PCGC: Large Language Model-based Point Cloud Geometry Compression Yuqi Ye et.al. 2408.08682 null
2024-08-06 Fast Point Cloud Geometry Compression with Context-based Residual Coding and INR-based Refinement Hao Xu et.al. 2408.02966 null
2024-08-01 Learned Compression of Point Cloud Geometry and Attributes in a Single Model through Multimodal Rate-Control Michael Rudolph et.al. 2408.00599 null
2024-07-22 Double Deep Learning-based Event Data Coding and Classification Abdelrahman Seleem et.al. 2407.15531 null
2024-07-11 Enhancing octree-based context models for point cloud geometry compression with attention-based child node number prediction Chang Sun et.al. 2407.08528 null
2024-07-11 Enhancing context models for point cloud geometry compression with context feature residuals and multi-loss Chang Sun et.al. 2407.08520 null
2024-07-19 PCAC-GAN: A Sparse-Tensor-Based Generative Adversarial Network for 3D Point Cloud Attribute Compression Xiaolong Mao et.al. 2407.05677 null
2024-07-05 Rethinking Data Input for Point Cloud Upsampling Tongxu Zhang et.al. 2407.04476 null
2024-08-26 TSC-PCAC: Voxel Transformer and Sparse Convolution Based Point Cloud Attribute Compression for 3D Broadcasting Zixi Guo et.al. 2407.04284 link
2024-06-15 Full reference point cloud quality assessment using support vector regression Ryosuke Watanabe et.al. 2406.10520 link
2024-09-25 Bits-to-Photon: End-to-End Learned Scalable Point Cloud Compression for Direct Rendering Yueyu Hu et.al. 2406.05915 null
2024-06-02 Towards Point Cloud Compression for Machine Perception: A Simple and Strong Baseline by Learning the Octree Depth Level Predictor Lei Liu et.al. 2406.00791 null
2024-05-23 NeuroGauss4D-PCI: 4D Neural Fields and Gaussian Deformation Fields for Point Cloud Interpolation Chaokang Jiang et.al. 2405.14241 link
2024-05-19 Point Cloud Compression with Implicit Neural Representations: A Unified Framework Hongning Ruan et.al. 2405.11493 null
2024-05-02 PointCompress3D -- A Point Cloud Compression Framework for Roadside LiDARs in Intelligent Transportation Systems Walter Zimmer et.al. 2405.01750 null
2024-04-21 Pointsoup: High-Performance and Extremely Low-Decoding-Latency Learned Geometry Codec for Large-Scale Point Cloud Scenes Kang You et.al. 2404.13550 link
2024-04-16 Reward Learning from Suboptimal Demonstrations with Applications in Surgical Electrocautery Zohre Karimi et.al. 2404.07185 null
2024-04-10 Efficient and Generic Point Model for Lossless Point Cloud Attribute Compression Kang You et.al. 2404.06936 link
2024-04-09 Diffusion-Based Point Cloud Super-Resolution for mmWave Radar Data Kai Luan et.al. 2404.06012 null
2024-03-13 Point Cloud Compression via Constrained Optimal Transport Zezeng Li et.al. 2403.08236 link
2024-03-08 Arbitrary-Scale Point Cloud Upsampling by Voxel-Based Network with Latent Geometric-Consistent Learning Hang Du et.al. 2403.05117 link
2024-03-01 Assessing objective quality metrics for JPEG and MPEG point cloud coding Davi Lazzarotto et.al. 2403.00410 null
2024-02-23 Scalable Human-Machine Point Cloud Compression Mateen Ulhaq et.al. 2402.12532 link
2024-02-18 3D Point Cloud Compression with Recurrent Neural Network and Image Compression Methods Till Beemelmanns et.al. 2402.11680 link
2024-02-17 Hierarchical Prior-based Super Resolution for Point Cloud Geometry Compression Dingquan Li et.al. 2402.11250 link
2024-02-11 PIVOT-Net: Heterogeneous Point-Voxel-Tree-based Framework for Point Cloud Compression Jiahao Pang et.al. 2402.07243 null
2024-02-07 Performance analysis of Deep Learning-based Lossy Point Cloud Geometry Compression Coding Solutions Joao Prazeres et.al. 2402.05192 null
2024-02-08 Subjective performance evaluation of bitrate allocation strategies for MPEG and JPEG Pleno point cloud compression Davi Lazzarotto et.al. 2402.04760 null
2024-02-15 LiDAR-Forest Dataset: LiDAR Point Cloud Simulation Dataset for Forestry Application Yawen Lu et.al. 2402.04546 null
2023-12-23 Learning Continuous Implicit Field with Local Distance Indicator for Arbitrary-Scale Point Cloud Upsampling Shujuan Li et.al. 2312.15133 null
2024-03-13 DiffPMAE: Diffusion Masked Autoencoders for Point Cloud Reconstruction Yanlong Li et.al. 2312.03298 link
2023-12-03 A Conditional Denoising Diffusion Probabilistic Model for Point Cloud Upsampling Wentao Qu et.al. 2312.02719 link
2023-11-22 Learned Nonlinear Predictor for Critically Sampled 3D Point Cloud Attribute Compression Tam Thuc Do et.al. 2311.13539 null
2023-11-22 Volumetric 3D Point Cloud Attribute Compression: Learned polynomial bilateral filter for prediction Tam Thuc Do et.al. 2311.13533 null
2023-11-22 Test-Time Augmentation for 3D Point Cloud Classification and Segmentation Tuan-Anh Vu et.al. 2311.13152 null
2023-11-03 PDF: Point Diffusion Implicit Function for Large-scale Scene Neural Representation Yuhan Ding et.al. 2311.01773 null
2023-11-02 Lightweight super resolution network for point cloud geometry compression Wei Zhang et.al. 2311.00970 link
2023-11-17 Deep Learning-based Compressed Domain Multimedia for Man and Machine: A Taxonomy and Application to Point Cloud Classification Abdelrahman Seleem et.al. 2310.18849 null
2023-10-13 iPUNet:Iterative Cross Field Guided Point Cloud Upsampling Guangshun Wei et.al. 2310.09092 link
2024-03-15 PU-Ray: Domain-Independent Point Cloud Upsampling via Ray Marching on Neural Implicit Surface Sangwon Lim et.al. 2310.08755 link
2024-02-16 Quasi-Monte Carlo for 3D Sliced Wasserstein Khai Nguyen et.al. 2309.11713 link
2023-09-08 Poster: Making Edge-assisted LiDAR Perceptions Robust to Lossy Point Cloud Compression Jin Heo et.al. 2309.04549 null
2023-09-01 Test-Time Adaptation for Point Cloud Upsampling Using Meta-Learning Ahmed Hatem et.al. 2308.16484 null
2024-02-08 SCP: Spherical-Coordinate-based Learned Point Cloud Compression Ao Luo et.al. 2308.12535 null
2023-08-22 Learning a More Continuous Zero Level Set in Unsigned Distance Fields through Level Set Projection Junsheng Zhou et.al. 2308.11441 link
2023-08-11 Learned Point Cloud Compression for Classification Mateen Ulhaq et.al. 2308.05959 link
2023-07-27 FLiCR: A Fast and Lightweight LiDAR Point Cloud Compression Based on Lossy RI Jin Heo et.al. 2307.15005 null
2023-07-20 Aggressive saliency-aware point cloud compression Eleftheria Psatha et.al. 2307.10741 null
2023-07-18 Arbitrary point cloud upsampling via Dual Back-Projection Network Zhi-Song Liu et.al. 2307.08992 null
2023-06-01 4DSR-GCN: 4D Video Point Cloud Upsampling using Graph Convolutional Networks Lorenzo Berlincioni et.al. 2306.01081 null
2023-05-16 Learning Dynamic Point Cloud Compression via Hierarchical Inter-frame Block Matching Shuting Xia et.al. 2305.05356 null
2023-05-02 Geometric Prior Based Deep Human Point Cloud Geometry Compression Xinju Wu et.al. 2305.01309 null
2023-05-02 PU-EdgeFormer: Edge Transformer for Dense Prediction in Point Cloud Upsampling Dohoon Kim et.al. 2305.01148 link
2023-04-24 Grad-PU: Arbitrary-Scale Point Cloud Upsampling via Gradient Descent with Learned Distance Functions Yun He et.al. 2304.11846 link
2023-04-01 Volumetric Attribute Compression for 3D Point Clouds using Feedforward Network with Geometric Attention Tam Thuc Do et.al. 2304.00335 null
2023-03-27 NeuralPCI: Spatio-temporal Neural Field for 3D Point Cloud Multi-frame Non-linear Interpolation Zehan Zheng et.al. 2303.15126 link
2023-11-07 GQE-Net: A Graph-based Quality Enhancement Network for Point Cloud Color Attribute Jinrui Xing et.al. 2303.13764 link
2023-03-22 Lossless Point Cloud Attribute Compression Using Cross-scale, Cross-group, and Cross-color Prediction Jianqiang Wang et.al. 2303.12917 null
2023-12-28 Progressive Frame Patching for FoV-based Point Cloud Video Streaming Tongyu Zong et.al. 2303.08336 null
2023-12-03 Parametric Surface Constrained Upsampler Network for Point Cloud Pingping Cai et.al. 2303.08240 link
2024-03-20 Lossless Point Cloud Geometry and Attribute Compression Using a Learned Conditional Probability Model Dat Thanh Nguyen et.al. 2303.06519 link
2023-03-11 Deep probabilistic model for lossless scalable point cloud attribute compression Dat Thanh Nguyen et.al. 2303.06517 null
2023-03-09 BIRD-PCC: Bi-directional Range Image-based Deep LiDAR Point Cloud Compression Chia-Sheng Liu et.al. 2303.04027 null
2023-02-13 gpcgc: a green point cloud geometry coding method Qingyang Zhou et.al. 2302.06062 null
2023-02-09 BASICS: Broad quality Assessment of Static point clouds In Compression Scenarios Ali Ak et.al. 2302.04796 null
2023-04-27 Linear Optimal Partial Transport Embedding Yikun Bai et.al. 2302.03232 link
2023-01-31 Lidar Upsampling with Sliced Wasserstein Distance Artem Savkin et.al. 2301.13558 null
2023-01-28 Dynamic Point Cloud Geometry Compression Using Multiscale Inter Conditional Coding Jianqiang Wang et.al. 2301.12165 null
2023-01-27 Joint Geometry and Attribute Upsampling of Point Clouds Using Frequency-Selective Models with Overlapped Support Viktoria Heimann et.al. 2301.11630 null
2023-01-03 Reduced Reference Quality Assessment for Point Cloud Compression Yipeng Liu et.al. 2301.01009 null
2023-04-06 Neural Shape Compiler: A Unified Framework for Transforming between Text, Point Cloud, and Program Tiange Luo et.al. 2212.12952 null
2022-12-11 Learning Neural Volumetric Field for Point Cloud Geometry Compression Yueyu Hu et.al. 2212.05589 link
2022-12-01 Low-Rank Tensor Function Representation for Multi-Dimensional Data Recovery Yisi Luo et.al. 2212.00262 null
2023-12-09 ECM-OPCC: Efficient Context Model for Octree-based Point Cloud Compression Yiqi Jin et.al. 2211.10916 null
2022-11-19 Rate-Distortion Modeling for Bit Rate Constrained Point Cloud Compression Pan Gao et.al. 2211.10646 null
2022-10-21 Motion Policy Networks Adam Fishman et.al. 2210.12209 link
2022-10-28 Motion estimation and filtered prediction for dynamic point cloud attribute compression Haoran Hong et.al. 2210.08262 null
2022-10-08 Point Cloud Upsampling via Cascaded Refinement Network Hang Du et.al. 2210.03942 link
2023-02-14 Multiscale Latent-Guided Entropy Model for LiDAR Point Cloud Compression Tingyu Fan et.al. 2209.12512 null
2022-09-17 CARNet:Compression Artifact Reduction for Point Cloud Attribute Dandan Ding et.al. 2209.08276 null
2022-11-16 CU-Net: Real-Time High-Fidelity Color Upsampling for Point Clouds Lingdong Wang et.al. 2209.06112 link
2022-09-09 GRASP-Net: Geometric Residual Analysis and Synthesis for Point Cloud Compression Jiahao Pang et.al. 2209.04401 link
2022-09-06 Learning to Predict on Octree for Scalable Point Cloud Geometry Coding Yixiang Mao et.al. 2209.02226 null
2022-08-26 Efficient LiDAR Point Cloud Geometry Compression Through Neighborhood Point Attention Ruixiang Xue et.al. 2208.12573 null
2022-08-17 Efficient dynamic point cloud coding using Slice-Wise Segmentation Faranak Tohidi et.al. 2208.08061 null
2023-01-10 Arbitrary Point Cloud Upsampling with Spherical Mixture of Gaussians Anthony Dell'Eva et.al. 2208.05274 link
2022-08-04 IT/IST/IPLeiria Response to the Call for Proposals on JPEG Pleno Point Cloud Coding André F. R. Guarda et.al. 2208.02716 null
2022-08-04 IPDAE: Improved Patch-Based Deep Autoencoder for Lossy Point Cloud Geometry Compression Kang You et.al. 2208.02519 link
2022-07-25 Inter-Frame Compression for Dynamic Point Cloud Geometry Coding Anique Akhtar et.al. 2207.12554 null
2022-07-20 GIPSO: Geometrically Informed Propagation for Online Adaptation in 3D LiDAR Segmentation Cristiano Saltori et.al. 2207.09763 link
2022-06-25 BIMS-PU: Bi-Directional and Multi-Scale Point Cloud Upsampling Yechao Bai et.al. 2206.12648 null
2022-06-24 Rate-Distortion Optimal Transform Coefficient Selection for Unoccupied Regions in Video-Based Point Cloud Compression Christian Herglotz et.al. 2206.12186 null
2022-05-24 A Rate Control Algorithm for Video-based Point Cloud Compression Fangyu Shen et.al. 2205.11825 null
2022-05-19 A Comparative Study of Feature Expansion Unit for 3D Point Cloud Upsampling Qiang Li et.al. 2205.09594 null
2022-05-02 D-DPCC: Deep Dynamic Point Cloud Compression via 3D Motion Prediction Tingyu Fan et.al. 2205.01135 link
2022-05-02 Point Cloud Compression with Sibling Context and Surface Priors Zhili Chen et.al. 2205.00760 link
2022-04-29 Deep Geometry Post-Processing for Decompressed Point Clouds Xiaoqing Fan et.al. 2204.13952 link
2022-04-27 Density-preserving Deep Point Cloud Compression Yun He et.al. 2204.12684 null
2022-04-25 4DAC: Learning Attribute Compression for Dynamic Point Clouds Guangchi Fang et.al. 2204.11723 null
2022-04-25 Dynamic Point Cloud Compression with Cross-Sectional Approach Faranak Tohidi et.al. 2204.11409 null
2022-04-22 PU-EVA: An Edge Vector based Approximation Solution for Flexible-scale Point Cloud Upsampling Luqing Luo et.al. 2204.10750 null
2022-04-18 Self-Supervised Arbitrary-Scale Point Clouds Upsampling via Implicit Neural Representation Wenbo Zhao et.al. 2204.08196 link
2022-06-22 Learning-based Lossless Point Cloud Geometry Coding using Sparse Tensors Dat Thanh Nguyen et.al. 2204.05043 null
2022-04-03 Sparse Tensor-based Point Cloud Attribute Compression Jianqiang Wang et.al. 2204.01023 link
2022-03-22 IDEA-Net: Dynamic 3D Point Cloud Interpolation via Deep Embedding Alignment Yiming Zeng et.al. 2203.11590 link
2022-03-21 Upsampling Autoencoder for Self-Supervised Point Cloud Learning Cheng Zhang et.al. 2203.10768 null
2022-05-03 Frequency-Selective Mesh-to-Mesh Resampling for Color Upsampling of Point Clouds Viktoria Heimann et.al. 2203.09224 null
2022-03-02 PUFA-GAN: A Frequency-Aware Generative Adversarial Network for 3D Point Cloud Upsampling Hao Liu et.al. 2203.00914 null
2022-05-16 Variable Rate Compression for Raw 3D Point Clouds Md Ahmed Al Muzaddid et.al. 2202.13862 link
2022-09-14 Point cloud completion via structured feature maps using a feedback network Zejia Su et.al. 2202.08583 null
2022-05-08 OctAttention: Octree-Based Large-Scale Contexts Model for Point Cloud Compression Chunyang Fu et.al. 2202.06028 link
2022-02-01 Point Cloud Compression for Efficient Data Broadcasting: A Performance Comparison Francesco Nardo et.al. 2202.00719 null
2022-02-01 Fractional Motion Estimation for Point Cloud Compression Haoran Hong et.al. 2202.00172 null
2022-01-17 SimIPU: Simple 2D Image and 3D Point Cloud Unsupervised Pre-Training for Spatial-Aware Visual Representations Zhenyu Li et.al. 2112.04680 link
2022-03-31 Neural Points: Point Cloud Representation with Neural Fields for Arbitrary Upsampling Wanquan Feng et.al. 2112.04148 link
2022-03-01 Attribute Artifacts Removal for Geometry-based Point Cloud Compression Xihua Sheng et.al. 2112.00560 null
2022-10-03 PU-Transformer: Point Cloud Upsampling Transformer Shi Qiu et.al. 2111.12242 link
2022-10-21 Sparse Tensor-based Multiscale Representation for Point Cloud Geometry Compression Jianqiang Wang et.al. 2111.10633 link
2021-10-18 Patch-Based Deep Autoencoder for Point Cloud Geometry Compression Kang You et.al. 2110.09109 link
2022-07-12 PC $^2$ -PU: Patch Correlation and Point Correlation for Effective Point Cloud Upsampling Chen Long et.al. 2109.09337 link
2021-09-16 R-PCC: A Baseline for Range Image-based Point Cloud Compression Sukai Wang et.al. 2109.07717 link
2021-09-15 Which One is Better: Assessing Objective Metrics for Point Cloud Compression Yipeng Liu et.al. 2109.07158 null
2021-08-05 Joint Geometry and Color Projection-based Point Cloud Quality Metric Alireza Javaheri et.al. 2108.02481 link
2021-08-03 SSPU-Net: Self-Supervised Point Cloud Upsampling via Differentiable Rendering Yifan Zhao et.al. 2108.00454 link
2021-07-29 Video-based Point Cloud Compression Artifact Removal Anique Akhtar et.al. 2107.14179 null
2024-02-28 Score-Based Point Cloud Denoising Shitong Luo et.al. 2107.10981 link
2022-06-08 PU-Flow: a Point Cloud Upsampling Network with Normalizing Flows Aihua Mao et.al. 2107.05893 link
2022-04-18 "Zero-Shot" Point Cloud Upsampling Kaiyue Zhou et.al. 2106.13765 link
2021-06-23 Lossless Point Cloud Attribute Compression with Normal-based Intra Prediction Qian Yin et.al. 2106.12236 null
2021-06-21 Cylindrical coordinates for LiDAR point cloud compression Shashank N. Sridhara et.al. 2106.11237 null
2021-10-11 Neural Network Modeling of Probabilities for Coding the Octree Representation of Point Clouds Emre Can Kaya et.al. 2106.06482 link
2021-06-09 Point Cloud Upsampling via Disentangled Refinement Ruihui Li et.al. 2106.04779 link
2021-06-02 DeepCompress: Efficient Point Cloud Geometry Compression Ryan Killea et.al. 2106.01504 link
2021-06-01 RAI-Net: Range-Adaptive LiDAR Point Cloud Frame Interpolation Network Lili Zhao et.al. 2106.00496 null
2021-05-28 An Unsupervised Optical Flow Estimation For LiDAR Image Sequences Xuezhou Guo et.al. 2105.13879 null
2021-05-05 VoxelContext-Net: An Octree based Framework for Point Cloud Compression Zizheng Que et.al. 2105.02158 null
2021-04-20 Multiscale deep context modeling for lossless point cloud geometry compression Dat Thanh Nguyen et.al. 2104.09859 link
2021-04-12 Towards Efficient Graph Convolutional Networks for Point Cloud Handling Yawei Li et.al. 2104.05706 null
2021-03-11 Advanced Geometry Surface Coding for Dynamic Point Cloud Compression Jian Xiong et.al. 2103.06549 null
2021-03-05 Hybrid Point Cloud Semantic Compression for Automotive Sensors: A Performance Evaluation Andrea Varischio et.al. 2103.03819 null
2021-02-26 Point Cloud Upsampling and Normal Estimation using Deep Learning for Robust Surface Reconstruction Rajat Sharma et.al. 2102.13391 link
2021-02-25 A deep perceptual metric for 3D point clouds Maurice Quach et.al. 2102.12839 link
2021-02-08 Meta-PU: An Arbitrary-Scale Upsampling Network for Point Cloud Shuquan Ye et.al. 2102.04317 null
2020-12-15 NeuralQAAD: An Efficient Differentiable Framework for High Resolution Point Cloud Compression Nicolas Wagner et.al. 2012.08143 null
2022-06-11 SPU-Net: Self-Supervised Point Cloud Upsampling by Coarse-to-Fine Reconstruction with Self-Projection Optimization Xinhai Liu et.al. 2012.04439 link
2021-11-18 Vehicular Cooperative Perception Through Action Branching and Federated Reinforcement Learning Mohamed K. Abdel-Aziz et.al. 2012.03414 null
2020-12-05 ParaNet: Deep Regular Representation for 3D Point Clouds Qijian Zhang et.al. 2012.03028 null
2020-11-27 Spherical Interpolated Convolutional Network with Distance-Feature Density for 3D Semantic Segmentation of Point Clouds Guangming Wang et.al. 2011.13784 null
2020-11-25 Reduced Reference Perceptual Quality Model and Application to Rate Control for 3D Point Cloud Compression Qi Liu et.al. 2011.12688 null
2020-11-07 Multiscale Point Cloud Geometry Compression Jianqiang Wang et.al. 2011.03799 link
2020-10-29 Point Cloud Attribute Compression via Successive Subspace Graph Transform Yueru Chen et.al. 2010.15302 null
2020-08-16 Real-Time Spatio-Temporal LiDAR Point Cloud Compression Yu Feng et.al. 2008.06972 link
2021-08-03 Subjective Quality Database and Objective Study of Compressed Point Clouds With 6DoF Head-Mounted Display Xinju Wu et.al. 2008.02501 null
2020-06-20 Pseudo-LiDAR Point Cloud Interpolation Based on 3D Motion Representation and Spatial Supervision Haojie Liu et.al. 2006.11481 null
2020-06-24 Improved Deep Point Cloud Geometry Compression Maurice Quach et.al. 2006.09043 link
2020-04-03 Intrinsic Point Cloud Interpolation via Dual Latent Space Navigation Marie-Julie Rakotosaona et.al. 2004.01661 link
2020-03-30 A generalized Hausdorff distance based quality metric for point cloud geometry Alireza Javaheri et.al. 2003.13669 null
2020-03-30 Optimizing Geometry Compression using Quantum Annealing Sebastian Feld et.al. 2003.13253 null
2020-03-27 Model-based Joint Bit Allocation between Geometry and Color for Video-based 3D Point Cloud Compression Qi Liu et.al. 2002.10798 null
2020-03-07 PUGeo-Net: A Geometry-centric Network for 3D Point Cloud Upsampling Yue Qian et.al. 2002.10277 null
2020-06-22 Folding-based compression of point cloud attributes Maurice Quach et.al. 2002.04439 null
2020-01-13 Efficient 3D Road Map Data Exchange for Intelligent Vehicles in Vehicular Fog Networks Ivan Wang-Hei Ho et.al. 2001.04057 null
2020-01-12 Linear Model based Geometry Coding for Lidar Acquired Point Clouds Xiang Zhang et.al. 2001.03871 null
2021-04-09 PV-RCNN: Point-Voxel Feature Set Abstraction for 3D Object Detection Shaoshuai Shi et.al. 1912.13192 link
2019-12-20 A Comprehensive Study and Comparison of Core Technologies for MPEG 3D Point Cloud Compression Hao Liu et.al. 1912.09674 null
2020-10-15 Point Cloud Rendering after Coding: Impacts on Subjective and Objective Quality Alireza Javaheri et.al. 1912.09137 null
2021-03-29 PU-GCN: Point Cloud Upsampling using Graph Convolutional Networks Guocheng Qian et.al. 1912.03264 link
2019-11-04 Video-based compression for plenoptic point clouds Li Li et.al. 1911.01355 null
2019-09-26 Learned Point Cloud Geometry Compression Jianqiang Wang et.al. 1909.12037 link
2019-09-16 PLIN: A Network for Pseudo-LiDAR Point Cloud Interpolation Haojie Liu et.al. 1909.07137 null
2019-08-17 3D Point Cloud Super-Resolution via Graph Total Variation on Surface Normals Chinthaka Dinesh et.al. 1908.06261 null
2019-08-06 Point Cloud Super Resolution with Adversarial Residual Graph Networks Huikai Wu et.al. 1908.02111 link
2020-08-10 Predictive Generalized Graph Fourier Transform for Attribute Compression of Dynamic Point Clouds Yiqun Xu et.al. 1908.01970 null
2019-07-25 PU-GAN: a Point Cloud Upsampling Adversarial Network Ruihui Li et.al. 1907.10844 null
2019-06-27 A Convolutional Decoder for Point Clouds using Adaptive Instance Normalization Isaak Lim et.al. 1906.11478 null
2019-04-18 Deep AutoEncoder-based Lossy Geometry Compression for Point Clouds Wei Yan et.al. 1905.03691 null
2019-05-22 Learning Convolutional Transforms for Lossy Point Cloud Geometry Compression Maurice Quach et.al. 1903.08548 link
2019-09-30 Variational Graph Methods for Efficient Point Cloud Sparsification Daniel Tenbrinck et.al. 1903.02858 null
2019-03-05 Pose Estimation of Vehicles Over Uneven Terrain Yingchong Ma et.al. 1903.02052 null
2019-02-11 Occupancy-map-based rate distortion optimization for video-based point cloud compression Li Li et.al. 1902.04169 null
2018-09-30 A Volumetric Approach to Point Cloud Compression Maja Krivokuća et.al. 1810.00484 null
2018-05-29 Surface Light Field Compression using a Point Cloud Codec Xiang Zhang et.al. 1805.11203 null
2018-05-23 Comments on "Compression of 3D Point Clouds Using a Region-Adaptive Hierarchical Transform" Gustavo Sandri et.al. 1805.09146 null
2018-04-28 Hybrid Point Cloud Attribute Compression Using Slice-based Layered Structure and Block-based Intra Prediction Yiting Shao et.al. 1804.10783 null
2018-03-26 PU-Net: Point Cloud Upsampling Network Lequan Yu et.al. 1801.06761 link
2017-10-10 Attribute Compression of 3D Point Clouds Using Laplacian Sparsity Optimized Graph Transform Yiting Shao et.al. 1710.03532 null
2017-03-08 Dynamic Polygon Clouds: Representation and Compression for VR/AR Philip A. Chou et.al. 1610.00402 null

(back to top)

Compression

Publish Date Title Authors PDF Code
2025-02-27 Balanced Rate-Distortion Optimization in Learned Image Compression Yichi Zhang et.al. 2502.20161 null
2025-02-27 Transformer-Based Nonlinear Transform Coding for Multi-Rate CSI Compression in MIMO-OFDM Systems Bumsu Park et.al. 2502.19847 null
2025-02-26 Zipping many-body quantum states: a scalable approach to diagonal entropy Yu-Hsueh Chen et.al. 2502.18898 null
2025-02-25 Novel quantum circuit for image compression utilizing modified Toffoli gate and quantized transformed coefficient alongside a novel reset gate Ershadul Haque et.al. 2502.17815 null
2025-02-25 Quantum neural compressive sensing for ghost imaging Xinliang Zhai et.al. 2502.17790 null
2025-02-24 Optimized Memory System Architecture for VESA VDC-M Decoder with Multi-Slice Support Hannah Yang et.al. 2502.17729 null
2025-02-24 Pleno-Generation: A Scalable Generative Face Video Compression Framework with Bandwidth Intelligence Bolin Chen et.al. 2502.17085 null
2025-02-24 Hierarchical Semantic Compression for Consistent Image Semantic Restoration Shengxi Li et.al. 2502.16799 null
2025-02-24 Continuous Patch Stitching for Block-wise Image Compression Zifu Zhang et.al. 2502.16795 null
2025-02-27 Orchestrating Joint Offloading and Scheduling for Low-Latency Edge SLAM Yao Zhang et.al. 2502.16495 null
2025-02-22 Large Language Model for Lossless Image Compression with Visual Prompts Junhao Du et.al. 2502.16163 null
2025-02-21 Quantum autoencoders for image classification Hinako Asaoka et.al. 2502.15254 null
2025-02-21 Interleaved Block-based Learned Image Compression with Feature Enhancement and Quantization Error Compensation Shiqi Jiang et.al. 2502.15188 null
2025-02-21 FD-LSCIC: Frequency Decomposition-based Learned Screen Content Image Compression Shiqi Jiang et.al. 2502.15174 null
2025-02-20 Compact Latent Representation for Image Compression (CLRIC) Ayman A. Ameen et.al. 2502.14937 null
2025-02-20 Stereo Image Coding for Machines with Joint Visual Feature Compression Dengchao Jin et.al. 2502.14190 null
2025-02-19 A General Framework for Augmenting Lossy Compressors with Topological Guarantees Nathaniel Gorski et.al. 2502.14022 null
2025-02-19 A Lightweight Model for Perceptual Image Compression via Implicit Priors Hao Wei et.al. 2502.13988 null
2025-02-19 Improving the Sparse Structure Learning of Spiking Neural Networks from the View of Compression Efficiency Jiangrong Shen et.al. 2502.13572 null
2025-02-18 Guaranteed Conditional Diffusion: 3D Block-based Models for Scientific Data Compression Jaemoon Lee et.al. 2502.12951 null
2025-02-17 Fully Dynamic LZ77 in Sublinear Time Itai Boneh et.al. 2502.12000 null
2025-02-17 On Quantizing Neural Representation for Variable-Rate Video Coding Junqi Shi et.al. 2502.11729 link
2025-02-15 AquaScope: Reliable Underwater Image Transmission on Mobile Devices Beitong Tian et.al. 2502.10891 null
2025-02-15 ResiComp: Loss-Resilient Image Compression via Dual-Functional Masked Visual Token Modeling Sixian Wang et.al. 2502.10812 null
2025-02-15 A Fast Quantum Image Compression Algorithm based on Taylor Expansion Vu Tuan Hai et.al. 2502.10684 null
2025-02-15 Optimizing CNN Architectures for Advanced Thoracic Disease Classification Tejas Mirthipati et.al. 2502.10614 null
2025-02-14 Conditional Latent Coding with Learnable Synthesized Reference for Deep Image Compression Siqi Wu et.al. 2502.09971 null
2025-02-13 Differentially Private Compression and the Sensitivity of LZ77 Jeremiah Blocki et.al. 2502.09584 null
2025-02-13 SQ-GAN: Semantic Image Communications Using Masked Vector Quantization Francesco Pezone et.al. 2502.09520 link
2025-02-13 Large Images are Gaussians: High-Quality Large Image Representation with Levels of 2D Gaussian Splatting Lingting Zhu et.al. 2502.09039 link
2025-02-12 Compression of Site-Specific Deep Neural Networks for Massive MIMO Precoding Ghazal Kasalaee et.al. 2502.08758 null
2025-02-11 To clean or not to clean? Influence of pixel removal on event reconstruction using deep learning in CTAO Tom François et.al. 2502.07643 null
2025-02-19 HDCompression: Hybrid-Diffusion Image Compression for Ultra-Low Bitrates Lei Lu et.al. 2502.07160 null
2025-02-12 Lumina-Video: Efficient and Flexible Video Generation with Multi-scale Next-DiT Dongyang Liu et.al. 2502.06782 null
2025-02-10 Solving Optimal Power Flow on a Data-Budget: Feature Selection on Smart Meter Data Vassilis Kekatos et.al. 2502.06683 null
2025-02-13 CANeRV: Content Adaptive Neural Representation for Video Compression Lv Tang et.al. 2502.06181 null
2025-02-09 Online Reward-Weighted Fine-Tuning of Flow Matching with Wasserstein Regularization Jiajun Fan et.al. 2502.06061 null
2025-02-09 Constant sensitivity on the CDAWGs Rikuya Hamai et.al. 2502.05915 null
2025-02-09 Linear Attention Modeling for Learned Image Compression Donghui Feng et.al. 2502.05741 null
2025-02-08 Convolutional Deep Colorization for Image Compression: A Color Grid Based Approach Ian Tassin et.al. 2502.05402 null
2025-02-07 CMamba: Learned Image Compression with State Space Models Zhuojie Wu et.al. 2502.04988 null
2025-02-06 Semantic Feature Division Multiple Access for Digital Semantic Broadcast Channels Shuai Ma et.al. 2502.03949 null
2025-02-06 Enhancing Online Learning Efficiency Through Heterogeneous Resource Integration with a Multi-Agent RAG System Devansh Srivastav et.al. 2502.03948 null
2025-02-05 All-in-One Image Compression and Restoration Huimin Zeng et.al. 2502.03649 null
2025-02-05 Towards characterizing dark matter subhalo perturbations in stellar streams with graph neural networks Peter Xiangyuan Ma et.al. 2502.03522 null
2025-02-05 LED there be DoS: Exploiting variable bitrate IP cameras for network DoS Emmanuel Goldberg et.al. 2502.03177 null
2025-02-04 On likelihood-based analysis of the gravitationally (de)lensed CMB Julien Carron et.al. 2502.02399 null
2025-02-04 PALQA: A Novel Parameterized Position-Aware Lossy Quantum Autoencoder using LSB Control Qubit for Efficient Image Compression Ershadul Haque et.al. 2502.02188 null
2025-02-01 Semantic Communication based on Generative AI: A New Approach to Image Compression and Edge Optimization Francesco Pezone et.al. 2502.01675 null
2025-02-10 Compressed Image Generation with Denoising Diffusion Codebook Models Guy Ohayon et.al. 2502.01189 null
2025-02-02 S2CFormer: Reorienting Learned Image Compression from Spatial Interaction to Channel Aggregation Yunuo Chen et.al. 2502.00700 null
2025-01-28 Rate-Distortion under Neural Tracking of Speech: A Directed Redundancy Approach Jan Østergaard et.al. 2501.16762 null
2025-02-05 Hybrid Quantum Neural Networks with Amplitude Encoding: Advancing Recovery Rate Predictions Ying Chen et.al. 2501.15828 null
2025-01-23 The Redundancy of Non-Singular Channel Simulation Gergely Flamich et.al. 2501.14053 null
2025-02-01 On Disentangled Training for Nonlinear Transform in Learned Image Compression Han Li et.al. 2501.13751 link
2025-01-23 Diffusion-based Perceptual Neural Video Compression with Temporal Diffusion Information Reuse Wenzhuo Ma et.al. 2501.13528 null
2025-01-22 Using simulation based inference on tidally perturbed dwarf galaxies: the dynamics of NGC205 Axel Widmark et.al. 2501.13148 null
2025-01-22 Nonlinear reduction strategies for data compression: a comprehensive comparison from diffusion to advection problems Isabella Carla Gonnella et.al. 2501.12816 null
2025-01-22 Entropy Polarization-Based Data Compression Without Frozen Set Construction Zichang Ren et.al. 2501.12584 null
2025-01-21 The Gap Between Principle and Practice of Lossy Image Coding Haotian Zhang et.al. 2501.12330 null
2025-01-21 RL-RC-DoT: A Block-level RL agent for Task-Aware Video Compression Uri Gadot et.al. 2501.12216 null
2025-01-20 Efficient Bearing Sensor Data Compression via an Asymmetrical Autoencoder with a Lifting Wavelet Transform Layer Xin Zhu et.al. 2501.11737 null
2025-01-20 Towards Loss-Resilient Image Coding for Unstable Satellite Networks Hongwei Sha et.al. 2501.11263 null
2025-01-18 Mathematical model of parameters relevance in adaptive level-crossing sampling for electrocardiogram signals Silvio Zanoli et.al. 2501.10829 null
2025-01-30 Lossless data compression at pragmatic rates Andreas Theocharous et.al. 2501.10103 null
2025-01-17 Multi-Modal Attention Networks for Enhanced Segmentation and Depth Estimation of Subsurface Defects in Pulse Thermography Mohammed Salah et.al. 2501.09994 link
2025-01-31 A Simple Aerial Detection Baseline of Multimodal Language Models Qingyun Li et.al. 2501.09720 link
2025-01-16 Split Fine-Tuning for Large Language Models in Wireless Networks Songge Zhang et.al. 2501.09237 null
2025-01-13 Motion Tracks: A Unified Representation for Human-Robot Transfer in Few-Shot Imitation Learning Juntao Ren et.al. 2501.06994 null
2025-01-12 A General Framework for Error-controlled Unstructured Scientific Data Compression Qian Gong et.al. 2501.06910 null
2025-01-10 From My View to Yours: Ego-Augmented Learning in Large Vision Language Models for Understanding Exocentric Daily Living Activities Dominick Reilly et.al. 2501.05711 link
2025-01-09 Neural Architecture Codesign for Fast Physics Applications Jason Weitz et.al. 2501.05515 link
2025-01-09 Principles and Metrics of Extreme Learning Machines Using a Highly Nonlinear Fiber Mathilde Hary et.al. 2501.05233 null
2025-01-09 Emergence of Painting Ability via Recognition-Driven Evolution Yi Lin et.al. 2501.04966 null
2025-01-08 GaussianVideo: Efficient Video Representation via Hierarchical Gaussian Splatting Andrew Bond et.al. 2501.04782 null
2025-01-08 Unified Coding for Both Human Perception and Generalized Machine Analytics with CLIP Supervision Kangsheng Yin et.al. 2501.04579 link
2025-01-08 An Efficient Adaptive Compression Method for Human Perception and Machine Vision Tasks Lei Liu et.al. 2501.04329 null
2025-01-03 Listening and Seeing Again: Generative Error Correction for Audio-Visual Speech Recognition Rui Liu et.al. 2501.04038 link
2024-12-24 MERCURY: A fast and versatile multi-resolution based global emulator of compound climate hazards Shruti Nath et.al. 2501.04018 null
2025-01-06 A Novel Structure-Agnostic Multi-Objective Approach for Weight-Sharing Compression in Deep Neural Networks Rasa Khosrowshahli et.al. 2501.03095 null
2025-01-06 Region of Interest based Medical Image Compression Utkarsh Prakash Srivastava et.al. 2501.02895 null
2025-01-06 Constructing 4D Radio Map in LEO Satellite Networks with Limited Samples Haoxuan Yuan et.al. 2501.02775 null
2025-01-06 Artificial Intelligence in Creative Industries: Advances Prior to 2025 Nantheera Anantrasirichai et.al. 2501.02725 null
2025-01-05 Remote Inference over Dynamic Links via Adaptive Rate Deep Task-Oriented Vector Quantization Eyal Fishel et.al. 2501.02521 link
2025-01-17 MetaNeRV: Meta Neural Representations for Videos with Spatial-Temporal Guidance Jialong Guo et.al. 2501.02427 null
2025-01-03 Compressed Domain Prior-Guided Video Super-Resolution for Cloud Gaming Content Qizhe Wang et.al. 2501.01773 null
2025-01-01 CoordFlow: Coordinate Flow for Pixel-wise Neural Video Representation Daniel Silver et.al. 2501.00975 null
2025-01-01 Gradient Compression and Correlation Driven Federated Learning for Wireless Traffic Prediction Chuanting Zhang et.al. 2501.00732 link
2025-01-07 Rapid, High-resolution and Distortion-free $R_{2}^{*}$ Mapping of Fetal Brain using Multi-echo Radial FLASH and Model-based Reconstruction Xiaoqing Wang et.al. 2501.00256 null
2024-12-29 Distributed Hybrid Sketching for $\ell_2$ -Embeddings Neophytos Charalambides et.al. 2412.20301 null
2024-12-19 Quantum Implicit Neural Compression Takuya Fujihashi et.al. 2412.19828 null
2024-12-25 Adaptive Rate Control for Deep Video Compression with Rate-Distortion Prediction Bowen Gu et.al. 2412.18834 null
2024-12-24 Ultra-Low Complexity On-Orbit Compression for Remote Sensing Imagery via Block Modulated Imaging Zhibin Wang et.al. 2412.18417 link
2024-12-24 Semantics Disentanglement and Composition for Versatile Codec toward both Human-eye Perception and Machine Vision Task Jinming Liu et.al. 2412.18158 null
2024-12-23 CALLIC: Content Adaptive Learning for Lossless Image Compression Daxin Li et.al. 2412.17464 null
2024-12-23 AsymLLIC: Asymmetric Lightweight Learned Image Compression Shen Wang et.al. 2412.17270 null
2024-12-22 Foundation Model for Lossy Compression of Spatiotemporal Scientific Data Xiao Li et.al. 2412.17184 null
2024-12-24 L3TC: Leveraging RWKV for Learned Lossless Low-Complexity Text Compression Junxuan Zhang et.al. 2412.16642 link
2024-12-20 Schmidt quantum compressor Israel F. Araujo et.al. 2412.16337 null
2024-12-20 Sparse Point Clouds Assisted Learned Image Compression Yiheng Jiang et.al. 2412.15752 null
2024-12-18 Super-Resolution Generative Adversarial Network for Data Compression of Direct Numerical Simulations Ludovico Nista et.al. 2412.14150 null
2024-12-18 Efficient high performance computing with the ALICE Event Processing Nodes GPU-based farm Federico Ronchetti et.al. 2412.13755 null
2024-12-18 Robust UAV Jittering and Task Scheduling in Mobile Edge Computing with Data Compression Bin Li et.al. 2412.13676 null
2024-12-18 DarkIR: Robust Low-Light Image Restoration Daniel Feijoo et.al. 2412.13443 link
2024-12-17 Identifying Bias in Deep Neural Networks Using Image Transforms Sai Teja Erukude et.al. 2412.13079 link
2024-12-17 Stable Diffusion is a Natural Cross-Modal Decoder for Layered AI-generated Image Compression Ruijie Chen et.al. 2412.12982 null
2024-12-17 Invisible Watermarks: Attacks and Robustness Dongjun Hwang et.al. 2412.12511 link
2024-12-16 Representation learning for fast radio burst dynamic spectra Dirk Kuiper et.al. 2412.12394 link
2024-12-16 Point Cloud-Assisted Neural Image Compression Ziqun Li et.al. 2412.11771 null
2024-12-16 Whisper-GPT: A Hybrid Representation Audio Large Language Model Prateek Verma et.al. 2412.11449 null
2024-12-16 Controllable Distortion-Perception Tradeoff Through Latent Diffusion for Neural Image Compression Chuqin Zhou et.al. 2412.11379 null
2024-12-16 VRVVC: Variable-Rate NeRF-Based Volumetric Video Compression Qiang Hu et.al. 2412.11362 null
2024-12-14 Progressive Compression with Universally Quantized Diffusion Models Yibo Yang et.al. 2412.10935 null
2024-12-14 Learned Data Compression: Challenges and Opportunities for the Future Qiyu Liu et.al. 2412.10770 null
2024-12-11 Implicit Neural Compression of Point Clouds Hongning Ruan et.al. 2412.10433 null
2024-12-12 Video Seal: Open and Efficient Video Watermarking Pierre Fernandez et.al. 2412.09492 link
2024-12-12 Learned Compression for Compressed Learning Dan Jacobellis et.al. 2412.09405 link
2024-12-12 Versatile Volumetric Medical Image Coding for Human-Machine Vision Jietao Chen et.al. 2412.09231 null
2024-12-11 Unicorn: Unified Neural Image Compression with One Number Reconstruction Qi Zheng et.al. 2412.08210 null
2024-12-09 Splatter-360: Generalizable 360 $^{\circ}$ Gaussian Splatting for Wide-baseline Panoramic Images Zheng Chen et.al. 2412.06250 link
2024-12-08 Vision Transformer-based Semantic Communications With Importance-Aware Quantization Joohyuk Park et.al. 2412.06038 null
2024-12-08 Matrix Pre-orthogonal-Matching Pursuit as a Fundamental AI Algorithm Wei Qu et.al. 2412.05878 null
2024-12-09 UniMIC: Towards Universal Multi-modality Perceptual Image Compression Yixin Gao et.al. 2412.04912 null
2024-12-05 Solving High-dimensional Inverse Problems Using Amortized Likelihood-free Inference with Noisy and Incomplete Data Jice Zeng et.al. 2412.04565 null
2024-12-05 Diagnosing Systematic Effects Using the Inferred Initial Power Spectrum Tristan Hoellinger et.al. 2412.04443 null
2024-12-05 Multi-Scale Node Embeddings for Graph Modeling and Generation Riccardo Milocco et.al. 2412.04354 null
2024-12-05 Feature Coding in the Era of Large Models: Dataset, Test Conditions, and Benchmark Changsheng Gao et.al. 2412.04307 link
2024-12-05 LL-ICM: Image Compression for Low-level Machine Vision via Large Vision-Language Model Yuan Xue et.al. 2412.03841 null
2024-12-04 Electrocardiogram-based diagnosis of liver diseases: an externally validated and explainable machine learning approach Juan Miguel Lopez Alcaraz et.al. 2412.03717 link
2024-12-04 Is JPEG AI going to change image forensics? Edoardo Daniele Cannas et.al. 2412.03261 null
2024-12-03 Efficient Algorithms for Low Tubal Rank Tensor Approximation with Applications to Image Compression, Super-Resolution and Deep Learning Salman Ahmadi-Asl et.al. 2412.02598 null
2024-12-03 Randomized algorithms for Kroncecker tensor decomposition and applications Salman Ahmadi-Asl et.al. 2412.02597 null
2024-12-03 Efficient Model Compression Techniques with FishLeg Jamie McGowan et.al. 2412.02328 null
2024-12-02 Efficient Compression of Sparse Accelerator Data Using Implicit Neural Representations and Importance Sampling Xihaier Luo et.al. 2412.01754 link
2024-12-02 Robust and Transferable Backdoor Attacks Against Deep Image Compression With Selective Frequency Prior Yi Yu et.al. 2412.01646 null
2024-12-01 Construction of generalized samplets in Banach spaces Peter Balazs et.al. 2412.00954 null
2024-11-30 Good, Cheap, and Fast: Overfitted Image Compression with Wasserstein Distortion Jona Ballé et.al. 2412.00505 null
2024-11-30 Hybrid Local-Global Context Learning for Neural Video Compression Yongqi Zhai et.al. 2412.00446 null
2024-11-30 DeepFGS: Fine-Grained Scalable Coding for Learned Image Compression Yongqi Zhai et.al. 2412.00437 null
2024-11-29 AIDetx: a compression-based method for identification of machine-learning generated text Leonardo Almeida et.al. 2411.19869 link
2024-11-29 Memristive Nanowire Network for Energy Efficient Audio Classification: Pre-Processing-Free Reservoir Computing with Reduced Latency Akshaya Rajesh et.al. 2411.19611 null
2024-11-29 MCUCoder: Adaptive Bitrate Learned Video Compression for IoT Devices Ali Hojjat et.al. 2411.19442 link
2024-11-28 Generalized Gaussian Model for Learned Image Compression Haotian Zhang et.al. 2411.19320 null
2024-11-28 Upsampling Improvement for Overfitted Neural Coding Pierrick Philippe et.al. 2411.19249 null
2024-11-27 Learning Optimal Linear Block Transform by Rate Distortion Minimization Alessandro Gnutti et.al. 2411.18494 null
2024-11-27 HEMGS: A Hybrid Entropy Model for 3D Gaussian Splatting Data Compression Lei Liu et.al. 2411.18473 null
2024-11-26 Evaluating the Overhead of the Performance Profiler Cloudprofiler With MooBench Shinhyung Yang et.al. 2411.17413 null
2024-11-26 Motion Free B-frame Coding for Neural Video Compression Van Thang Nguyen et.al. 2411.17160 null
2024-11-30 An Information-Theoretic Regularizer for Lossy Neural Image Compression Yingwen Zhang et.al. 2411.16727 null
2024-11-25 WTDUN: Wavelet Tree-Structured Sampling and Deep Unfolding Network for Image Compressed Sensing Kai Han et.al. 2411.16336 null
2024-11-25 Learning Optimal Lattice Vector Quantizers for End-to-end Neural Image Compression Xi Zhang et.al. 2411.16119 null
2024-11-25 TransCompressor: LLM-Powered Multimodal Data Compression for Smart Transportation Huanqi Yang et.al. 2411.16020 null
2024-11-24 Variable-size Symmetry-based Graph Fourier Transforms for image compression Alessandro Gnutti et.al. 2411.15824 null
2024-11-24 M3-CVC: Controllable Video Compression with Multimodal Generative Models Rui Wan et.al. 2411.15798 null
2024-11-24 Advanced Learning-Based Inter Prediction for Future Video Coding Yanchen Zhao et.al. 2411.15759 null
2024-11-24 PEnG: Pose-Enhanced Geo-Localisation Tavis Shore et.al. 2411.15742 null
2024-11-21 U-Motion: Learned Point Cloud Video Compression with U-Structured Motion Estimation Tingyu Fan et.al. 2411.14501 null
2024-11-21 Differentiable SVD based on Moore-Penrose Pseudoinverse for Inverse Imaging Problems Yinghao Zhang et.al. 2411.14141 link
2024-11-21 Compact Visual Data Representation for Green Multimedia -- A Human Visual System Perspective Peilin Chen et.al. 2411.14135 null
2024-11-27 Image Compression Using Novel View Synthesis Priors Luyuan Peng et.al. 2411.13862 null
2024-11-20 Sparse Input View Synthesis: 3D Representations and Reliable Priors Nagabhushan Somraj et.al. 2411.13631 null
2024-11-20 Benchmarking Quantum Convolutional Neural Networks for Classification and Data Compression Tasks Jun Yong Khoo et.al. 2411.13468 null
2024-11-20 Practical Compact Deep Compressed Sensing Bin Chen et.al. 2411.13081 link
2024-11-20 LMM-driven Semantic Image-Text Coding for Ultra Low-bitrate Learned Image Compression Shimon Murai et.al. 2411.13033 link
2024-11-22 Large Language Models for Lossless Image Compression: Next-Pixel Prediction in Language Space is All You Need Kecheng Chen et.al. 2411.12448 null
2024-11-19 Breathless: An 8-hour Performance Contrasting Human and Robot Expressiveness Catie Cuan et.al. 2411.12361 null
2024-11-18 Variable Rate Neural Compression for Sparse Detector Data Yi Huang et.al. 2411.11942 link
2024-11-18 Exploring adversarial robustness of JPEG AI: methodology, comparison and new methods Egor Kovalev et.al. 2411.11795 null
2024-11-18 Additional Tests for TV 3.0 Eduardo Peixoto et.al. 2411.11755 null
2024-11-18 Towards fast DBSCAN via Spectrum-Preserving Data Compression Yongyu Wang et.al. 2411.11421 null
2024-11-17 BVI-CR: A Multi-View Human Dataset for Volumetric Video Compression Ge Gao et.al. 2411.11199 link
2024-11-16 An End-to-End Real-World Camera Imaging Pipeline Kepeng Xu et.al. 2411.10773 null
2024-11-16 Deep Learning-Based Image Compression for Wireless Communications: Impacts on Reliability,Throughput, and Latency Mostafa Naseri et.al. 2411.10650 link
2024-11-15 Efficient Progressive Image Compression with Variance-aware Masking Alberto Presta et.al. 2411.10185 link
2024-11-15 A Multi-Scale Spatial-Temporal Network for Wireless Video Transmission Xinyi Zhou et.al. 2411.09936 null
2024-11-14 Application of signal separation to diffraction image compression and serial crystallography Jérôme Kieffer et.al. 2411.09515 link
2024-11-14 DT-JRD: Deep Transformer based Just Recognizable Difference Prediction Model for Video Coding for Machines Junqi Liu et.al. 2411.09308 null
2024-11-14 Towards efficient compression and communication for prototype-based decentralized learning Pablo Fernández-Piñeiro et.al. 2411.09267 null
2024-11-13 Learning Optimal and Interpretable Summary Statistics of Galaxy Catalogs with SBI Kai Lehman et.al. 2411.08957 null
2024-11-13 LSH-MoE: Communication-efficient MoE Training via Locality-Sensitive Hashing Xiaonan Nie et.al. 2411.08446 null
2024-11-18 Rendering-Oriented 3D Point Cloud Attribute Compression using Sparse Tensor-based Transformer Xiao Huo et.al. 2411.07899 null
2024-11-11 Accelerating radio astronomy imaging with RICK Emanuele De Rubeis et.al. 2411.07321 link
2024-11-11 Low Complexity Learning-based Lossless Event-based Compression Ahmadreza Sezavar et.al. 2411.07155 null
2024-11-11 JPEG AI Image Compression Visual Artifacts: Detection Methods and Dataset Daria Tsereh et.al. 2411.06810 null
2024-11-11 Machine vision-aware quality metrics for compressed image and video assessment Mikhail Dremin et.al. 2411.06776 null
2024-11-11 High-Frequency Enhanced Hybrid Neural Representation for Video Compression Li Yu et.al. 2411.06685 null
2024-11-09 HiHa: Introducing Hierarchical Harmonic Decomposition to Implicit Neural Compression for Atmospheric Data Zhewen Xu et.al. 2411.06155 null
2024-11-08 A method based on Generative Adversarial Networks for disentangling physical and chemical properties of stars in astronomical spectra Raúl Santoveña et.al. 2411.05960 null
2024-11-07 Don't Look Twice: Faster Video Transformers with Run-Length Tokenization Rohan Choudhury et.al. 2411.05222 null
2024-11-05 Tuning into spatial frequency space: Satellite and space debris detection in the ZTF alert stream J. P. Carvajal et.al. 2411.03258 null
2024-11-15 ZipCache: A DRAM/SSD Cache with Built-in Transparent Compression Rui Xie et.al. 2411.03174 null
2024-11-05 Learning-based Lossless Event Data Compression Ahmadreza Sezavar et.al. 2411.03010 null
2024-11-04 Neural optical flow for planar and stereo PIV Andrew I. Masker et.al. 2411.02373 null
2024-11-04 The evolution of volumetric video: A survey of smart transcoding and compression approaches Preetish Kakkar et.al. 2411.02095 null
2024-11-03 Efficient Deep Learning Infrastructures for Embedded Computing Systems: A Comprehensive Survey and Future Envision Xiangzhong Luo et.al. 2411.01431 null
2024-11-02 Autoencoders for At-Source Data Reduction and Anomaly Detection in High Energy Particle Detectors Alexander Yue et.al. 2411.01118 null
2024-11-01 SANN-PSZ: Spatially Adaptive Neural Network for Head-Tracked Personal Sound Zones Yue Qiao et.al. 2411.00772 null
2024-10-28 MultiTok: Variable-Length Tokenization for Efficient LLMs Adapted from LZW Compression Noel Elias et.al. 2410.21548 link
2024-10-29 Enhancing Learned Image Compression via Cross Window-based Attention Priyanka Mudgal et.al. 2410.21144 link
2024-10-26 Cross-Platform Neural Video Coding: A Case Study Ruhan Conceição et.al. 2410.20145 null
2024-10-25 Conditional Hallucinations for Image Compression Till Aczel et.al. 2410.19493 null
2024-10-29 Integration of Communication and Computational Imaging Zhenming Yu et.al. 2410.19415 null
2024-10-24 DMVC: Multi-Camera Video Compression Network aimed at Improving Deep Learning Accuracy Huan Cui et.al. 2410.18400 null
2024-10-23 Predicting total time to compress a video corpus using online inference systems Xin Shu et.al. 2410.18260 null
2024-10-23 FIPER: Generalizable Factorized Fields for Joint Image Compression and Super-Resolution Yang-Che Sun et.al. 2410.18083 null
2024-10-23 Learning Lossless Compression for High Bit-Depth Volumetric Medical Image Kai Wang et.al. 2410.17814 null
2024-10-21 Variable Rate Learned Wavelet Video Coding with Temporal Layer Adaptivity Anna Meyer et.al. 2410.15873 link
2024-10-20 Extensions on low-complexity DCT approximations for larger blocklengths based on minimal angle similarity A. P. Radünz et.al. 2410.15244 null
2024-10-19 Standardizing Generative Face Video Compression using Supplemental Enhancement Information Bolin Chen et.al. 2410.15105 null
2024-10-16 MatryoshkaKV: Adaptive KV Compression via Trainable Orthogonal Projection Bokai Lin et.al. 2410.14731 null
2024-10-18 Design and Prototype of a Unified Framework for Error-robust Compression and Encryption in IoT Gajraj Kuldeep et.al. 2410.14396 null
2024-10-18 Compression using Discrete Multi-Level Divisor Transform for Heterogeneous Sensor Data Gajraj Kuldeep et.al. 2410.14287 null
2024-10-17 In-context learning and Occam's razor Eric Elmoznino et.al. 2410.14086 link
2024-10-17 Co-Segmentation without any Pixel-level Supervision with Application to Large-Scale Sketch Classification Nikolaos-Antonios Ypsilantis et.al. 2410.13582 null
2024-10-16 Test-time adaptation for image compression with distribution regularization Kecheng Chen et.al. 2410.12191 null
2024-10-16 Joint Data Compression, Secure Multi-Part Collaborative Task Offloading and Resource Assignment in Ultra-Dense Networks Tianqing Zhou et.al. 2410.12186 null
2024-10-14 Large Language Model Evaluation via Matrix Nuclear-Norm Yahan Li et.al. 2410.10672 link
2024-10-14 QIANets: Quantum-Integrated Adaptive Networks for Reduced Latency and Improved Inference Times in CNN Models Zhumazhan Balapanov et.al. 2410.10318 link
2024-10-14 Generative Human Video Compression with Multi-granularity Temporal Trajectory Factorization Shanzhi Yin et.al. 2410.10171 null
2024-10-13 Towards Reproducible Learning-based Compression Jiahao Pang et.al. 2410.09872 null
2024-10-13 Compressing Scene Dynamics: A Generative Approach Shanzhi Yin et.al. 2410.09768 link
2024-10-13 ECVC: Exploiting Non-Local Correlations in Multiple Frames for Contextual Video Compression Wei Jiang et.al. 2410.09706 link
2024-10-12 Fine-grained subjective visual quality assessment for high-fidelity compressed images Michela Testolina et.al. 2410.09501 link
2024-10-11 Fast Data-independent KLT Approximations Based on Integer Functions A. P. Radünz et.al. 2410.09227 null
2024-10-10 Compressing high-resolution data through latent representation encoding for downscaling large-scale AI weather forecast model Qian Liu et.al. 2410.09109 null
2024-10-11 Data-Driven Neural Estimation of Indirect Rate-Distortion Function Zichao Yu et.al. 2410.09018 null
2024-10-11 Compressing regularised dynamics improves link prediction in sparse networks Maja Lindström et.al. 2410.08777 link
2024-10-11 Beyond GFVC: A Progressive Face Video Compression Framework with Adaptive Visual Tokens Bolin Chen et.al. 2410.08485 link
2024-10-10 What is Left After Distillation? How Knowledge Transfer Impacts Fairness and Bias Aida Mohammadshahi et.al. 2410.08407 null
2024-10-16 Delta-ICM: Entropy Modeling with Delta Function for Learned Image Compression Takahiro Shindo et.al. 2410.07669 null
2024-10-10 MotionAura: Generating High-Quality and Motion Consistent Videos using Discrete Diffusion Onkar Susladkar et.al. 2410.07659 null
2024-10-10 R-Adaptive Mesh Optimization to Enhance Finite Element Basis Compression Graham Harper et.al. 2410.07646 null
2024-10-09 JPEG Inspired Deep Learning Ahmed H. Salamah et.al. 2410.07081 link
2024-10-09 SHRINK: Data Compression by Semantic Extraction and Residuals Encoding Guoyou Sun et.al. 2410.06713 null
2024-10-09 Convex Distillation: Efficient Compression of Deep Networks via Convex Optimization Prateek Varshney et.al. 2410.06567 null
2024-10-09 Efficient and Robust Knowledge Distillation from A Stronger Teacher Based on Correlation Matching Wenqi Niu et.al. 2410.06561 null
2024-10-08 Covering Numbers for Deep ReLU Networks with Applications to Function Approximation and Nonparametric Regression Weigutian Ou et.al. 2410.06378 null
2024-10-08 Toward Scalable Image Feature Compression: A Content-Adaptive and Diffusion-Based Approach Sha Guo et.al. 2410.06149 null
2024-10-08 Resolution limit of the eye: how many pixels can we see? Maliha Ashraf et.al. 2410.06068 null
2024-10-07 Transformers learn variable-order Markov chains in-context Ruida Zhou et.al. 2410.05493 null
2024-10-07 Salient Store: Enabling Smart Storage for Continuous Learning Edge Servers Cyan Subhra Mishra et.al. 2410.05435 null
2024-10-07 Causal Context Adjustment Loss for Learned Image Compression Minghao Han et.al. 2410.04847 link
2024-10-06 Channel-Aware Throughput Maximization for Cooperative Data Fusion in CAV Haonan An et.al. 2410.04320 null
2024-10-05 Robust Task-Oriented Communication Framework for Real-Time Collaborative Vision Perception Zhengru Fang et.al. 2410.04168 null
2024-10-04 On the Rate-Distortion-Complexity Trade-offs of Neural Video Coding Yi-Hsin Chen et.al. 2410.03898 null
2024-10-04 A Framework for Automatic Validation and Application of Lossy Data Compression in Ensemble Data Assimilation Kai Keller et.al. 2410.03184 null
2024-10-03 GABIC: Graph-based Attention Block for Image Compression Gabriele Spadaro et.al. 2410.02981 link
2024-10-03 Diffusion-based Extreme Image Compression with Compressed Feature Initialization Zhiyuan Li et.al. 2410.02640 link
2024-10-03 High-Efficiency Neural Video Compression via Hierarchical Predictive Learning Ming Lu et.al. 2410.02598 link
2024-10-02 A Spark of Vision-Language Intelligence: 2-Dimensional Autoregressive Transformer for Efficient Finegrained Image Generation Liang Chen et.al. 2410.01912 link
2024-10-02 COSMIC: Compress Satellite Images Efficiently via Diffusion Compensation Ziyuan Zhang et.al. 2410.01698 link
2024-10-03 Releasing the Parameter Latency of Neural Representation for High-Efficiency Video Compression Gai Zhang et.al. 2410.01654 null
2024-10-02 Task-Oriented Edge-Assisted Cooperative Data Compression, Communications and Computing for UGV-Enhanced Warehouse Logistics Jiaming Yang et.al. 2410.01515 null
2024-10-01 STanH : Parametric Quantization for Variable Rate Learned Image Compression Alberto Presta et.al. 2410.00557 null
2024-09-30 LaMMA-P: Generalizable Multi-Agent Long-Horizon Task Allocation and Planning with LM-Driven PDDL Planner Xiaopan Zhang et.al. 2409.20560 null
2024-09-30 PerCo (SD): Open Perceptual Compression Nikolai Körber et.al. 2409.20255 link
2024-09-29 All-in-One Image Coding for Joint Human-Machine Vision with Multi-Path Aggregation Xu Zhang et.al. 2409.19660 link
2024-09-28 Fast Encoding and Decoding for Implicit Video Representation Hao Chen et.al. 2409.19429 null
2024-09-27 Learning-Based Image Compression for Machines Kartik Gupta et.al. 2409.19184 link
2024-09-27 Effectiveness of learning-based image codecs on fingerprint storage Daniele Mari et.al. 2409.18730 link
2024-09-27 Decoding Complexity-Rate-Quality Pareto-Front for Adaptive VVC Streaming Angeliki Katsenou et.al. 2409.18713 null
2024-09-27 Neural Video Representation for Redundancy Reduction and Consistency Preservation Taiga Hayami et.al. 2409.18497 null
2024-09-20 Blockchain-Enabled Variational Information Bottleneck for Data Extraction Based on Mutual Information in Internet of Vehicles Cui Zhang et.al. 2409.17287 null
2024-09-25 Streaming Neural Images Marcos V. Conde et.al. 2409.17134 null
2024-09-25 PhD Forum: Efficient Privacy-Preserving Processing via Memory-Centric Computing Mpoki Mwaisela et.al. 2409.16777 null
2024-09-25 The Effect of Lossy Compression on 3D Medical Images Segmentation with Deep Learning Anvar Kurmukov et.al. 2409.16733 null
2024-09-24 AIM 2024 Challenge on UHD Blind Photo Quality Assessment Vlad Hosu et.al. 2409.16271 null
2024-09-25 COHERENT: Collaboration of Heterogeneous Multi-Robot System with Large Language Models Kehui Liu et.al. 2409.15146 link
2024-09-23 AlphaZip: Neural Network-Enhanced Lossless Text Compression Swathi Shree Narashiman et.al. 2409.15046 link
2024-09-23 Anomaly Detection from a Tensor Train Perspective Alejandro Mata Ali et.al. 2409.15030 null
2024-09-23 AIM 2024 Challenge on Video Saliency Prediction: Methods and Results Andrey Moskalenko et.al. 2409.14827 link
2024-09-21 Window-based Channel Attention for Wavelet-enhanced Learned Image Compression Heng Xu et.al. 2409.14090 null
2024-09-20 Reduced bit median quantization: A middle process for Efficient Image Compression Fikresilase Wondmeneh Abebayew et.al. 2409.13789 null
2024-09-20 Data Compression using Rank-1 Lattices for Parameter Estimation in Machine Learning Michael Gnewuch et.al. 2409.13453 null
2024-09-19 Breaking the Barriers of One-to-One Usage of Implicit Neural Representation in Image Compression: A Linear Combination Approach with Performance Guarantees Sai Sanjeet et.al. 2409.13117 link
2024-09-19 Optimal Coding for Randomized Kolmogorov Complexity and Its Applications Shuichi Hirahara et.al. 2409.12744 null
2024-09-19 Multi-Scale Feature Prediction with Auxiliary-Info for Neural Image Compression Chajin Shin et.al. 2409.12719 null
2024-09-18 One Map to Find Them All: Real-time Open-Vocabulary Mapping for Zero-shot Multi-Object Navigation Finn Lukas Busch et.al. 2409.11764 null
2024-09-18 LFIC-DRASC: Deep Light Field Image Compression Using Disentangled Representation and Asymmetrical Strip Convolution Shiyu Feng et.al. 2409.11711 null
2024-09-18 k-mer-based approaches to bridging pangenomics and population genetics Miles D. Roberts et.al. 2409.11683 null
2024-09-17 Few-Shot Domain Adaptation for Learned Image Compression Tianyu Zhang et.al. 2409.11111 null
2024-09-17 Edge-based Denoising Image Compression Ryugo Morita et.al. 2409.10978 null
2024-09-16 Frequency-Guided Masking for Enhanced Vision Self-Supervised Learning Amin Karimi Monsefi et.al. 2409.10362 null
2024-09-14 Lossy Image Compression with Stochastic Quantization Anton Kozyriev et.al. 2409.09488 null
2024-09-13 Fast DCT+: A Family of Fast Transforms Based on Rank-One Updates of the Path Graph Samuel Fernández-Menduiña et.al. 2409.08970 null
2024-09-13 On the Computation of BD-Rate over a Set of Videos for Fair Assessment of Performance of Learned Video Codecs M. Akin Yilmaz et.al. 2409.08772 null
2024-09-13 USTC-TD: A Test Dataset and Benchmark for Image and Video Coding in 2020s Zhuoyuan Li et.al. 2409.08481 null
2024-09-12 Learned Compression for Images and Point Clouds Mateen Ulhaq et.al. 2409.08376 link
2024-09-11 NVRC: Neural Video Representation Compression Ho Man Kwan et.al. 2409.07414 null
2024-09-11 Dynamic Error-Bounded Hierarchical Matrices in Neural Network Compression John Mango et.al. 2409.07028 null
2024-09-10 Universal End-to-End Neural Network for Lossy Image Compression Bouzid Arezki et.al. 2409.06586 null
2024-09-10 Rate-Constrained Quantization for Communication-Efficient Federated Learning Shayan Mohajer Hamidi et.al. 2409.06319 null
2024-09-09 Design and Implementation of TAO DAQ System Shuihan Zhang et.al. 2409.05522 null
2024-09-09 A Taxonomy of Miscompressions: Preparing Image Forensics for Neural Compression Nora Hofer et.al. 2409.05490 null
2024-09-09 Attention Based Machine Learning Methods for Data Reduction with Guaranteed Error Bounds Xiao Li et.al. 2409.05357 null
2024-09-06 Convolutional Transformer-Based Image Compression Bouzid Arezki et.al. 2409.04118 null
2024-09-06 3D-GP-LMVIC: Learning-based Multi-View Image Coding with 3D Gaussian Geometric Priors Yujun Huang et.al. 2409.04013 link
2024-09-05 TropNNC: Structured Neural Network Compression Using Tropical Geometry Konstantinos Fotopoulos et.al. 2409.03945 null
2024-09-05 Unified Framework for Neural Network Compression via Decomposition and Optimal Rank Selection Ali Aghababaei-Harandi et.al. 2409.03555 null
2024-09-05 Efficient Image Compression Using Advanced State Space Models Bouzid Arezki et.al. 2409.02743 null
2024-09-10 FrameCorr: Adaptive, Autoencoder-based Neural Compression for Video Reconstruction in Resource and Timing Constrained Network Settings John Li et.al. 2409.02453 null
2024-09-03 Compressed learning based onboard semantic compression for remote sensing platforms Protim Bhattacharjee et.al. 2409.01988 link
2024-09-03 Map-Assisted Remote-Sensing Image Compression at Extremely Low Bitrates Yixuan Ye et.al. 2409.01935 link
2024-09-03 Privacy-Preserving Multimedia Mobile Cloud Computing Using Protective Perturbation Zhongze Tang et.al. 2409.01710 null
2024-09-02 Multi-Reference Generative Face Video Compression with Contrastive Learning Goluck Konuko et.al. 2409.01029 link
2024-09-02 Accelerating block-level rate control for learned image compression Muchen Dong et.al. 2409.01009 null
2024-09-02 PNVC: Towards Practical INR-based Video Compression Ge Gao et.al. 2409.00953 null
2024-09-01 BWT construction and search at the terabase scale Heng Li et.al. 2409.00613 link
2024-08-30 Prioritized Information Bottleneck Theoretic Framework with Distributed Online Learning for Edge Video Analytics Zhengru Fang et.al. 2409.00146 link
2024-08-28 Quantum Kernel Principal Components Analysis for Compact Readout of Chemiresistive Sensor Arrays Zeheng Wang et.al. 2409.00115 null
2024-08-30 NDP: Next Distribution Prediction as a More Broad Target Junhao Ruan et.al. 2408.17377 null
2024-08-30 Approximately Invertible Neural Network for Learned Image Compression Yanbo Gao et.al. 2408.17073 null
2024-08-29 UAV-Based Human Body Detector Selection and Fusion for Geolocated Saliency Map Generation Piotr Rudol et.al. 2408.16501 null
2024-08-29 Convolutional Neural Network Compression Based on Low-Rank Decomposition Yaping He et.al. 2408.16289 null
2024-08-27 Bandwidth-Aware and Overlap-Weighted Compression for Communication-Efficient Federated Learning Zichen Tang et.al. 2408.14736 null
2024-08-25 Condensed Sample-Guided Model Inversion for Knowledge Distillation Kuluhan Binici et.al. 2408.13850 null
2024-08-12 Semantic Variational Bayes Based on a Semantic Information Theory for Solving Latent Variables Chenguang Lu et.al. 2408.13122 null
2024-08-22 Quantization-free Lossy Image Compression Using Integer Matrix Factorization Pooya Ashtari et.al. 2408.12691 link
2024-08-22 DeepHQ: Learned Hierarchical Quantizer for Progressive Deep Image Coding Jooyoung Lee et.al. 2408.12150 null
2024-08-28 AIM 2024 Challenge on Compressed Video Quality Assessment: Methods and Results Maksim Smirnov et.al. 2408.11982 link
2024-08-20 Trustworthy Compression? Impact of AI-based Codecs on Biometrics for Law Enforcement Sandra Bergmann et.al. 2408.10823 null
2024-08-20 Diff-PCC: Diffusion-based Neural Compression for 3D Point Clouds Kai Liu et.al. 2408.10543 null
2024-08-16 LLM-PCGC: Large Language Model-based Point Cloud Geometry Compression Yuqi Ye et.al. 2408.08682 null
2024-08-16 Bi-Directional Deep Contextual Video Compression Xihua Sheng et.al. 2408.08604 null
2024-08-16 Tell Codec What Worth Compressing: Semantically Disentangled Image Coding for Machine with LMMs Jinming Liu et.al. 2408.08575 null
2024-08-15 Algebraic Vertex Ordering of a Sparse Graph for Adjacency Access Locality and Graph Compression Dimitris Floros et.al. 2408.08439 null
2024-08-15 When Video Coding Meets Multimodal Large Language Models: A Unified Paradigm for Video Coding Pingping Zhang et.al. 2408.08093 null
2024-08-15 DM2RM: Dual-Mode Multimodal Ranking for Target Objects and Receptacles Based on Open-Vocabulary Instructions Ryosuke Korekata et.al. 2408.07910 null
2024-08-14 Towards Real-time Video Compressive Sensing on Mobile Devices Miao Cao et.al. 2408.07530 link
2024-08-14 Encoding and Decoding Algorithms of ANS Variants and Evaluation of Their Average Code Lengths Hirosuke Yamamoto et.al. 2408.07322 null
2024-08-13 Subjective and Objective Quality Assessment of Rendered Human Avatar Videos in Virtual Reality Yu-Chih Chen et.al. 2408.07041 null
2024-08-13 Feature-Preserving Rate-Distortion Optimization in Image Coding for Machines Samuel Fernández Menduiña et.al. 2408.07028 null
2024-08-19 Joint Source-Channel Optimization for UAV Video Coding and Transmission Kesong Wu et.al. 2408.06667 null
2024-08-08 Flow-Lenia.png: Evolving Multi-Scale Complexity by Means of Compression Tadashi Adachi et.al. 2408.06374 null
2024-08-09 Benchmarking Conventional and Learned Video Codecs with a Low-Delay Configuration Siyue Teng et.al. 2408.05042 null
2024-08-08 SG-JND: Semantic-Guided Just Noticeable Distortion Predictor For Image Compression Linhan Cao et.al. 2408.04273 null
2024-08-07 Bi-Level Spatial and Channel-aware Transformer for Learned Image Compression Hamidreza Soltani et.al. 2408.03842 null
2024-08-07 BVI-AOM: A New Training Dataset for Deep Video Compression Optimization Jakub Nawała et.al. 2408.03265 link
2024-08-06 Enabling High-Throughput Parallel I/O in Particle-in-Cell Monte Carlo Simulations with openPMD and Darshan I/O Monitoring Jeremy J. Williams et.al. 2408.02869 null
2024-08-05 Dimensionality Reduction and Nearest Neighbors for Improving Out-of-Distribution Detection in Medical Image Segmentation McKell Woodland et.al. 2408.02761 link
2024-08-04 CACE-Net: Co-guidance Attention and Contrastive Enhancement for Effective Audio-Visual Event Localization Xiang He et.al. 2408.01952 link
2024-08-03 Channel-Aware Distributed Transmission Control and Video Streaming in UAV Networks Masoud Ghazikor et.al. 2408.01885 null
2024-08-02 An Adaptive Tensor-Train Decomposition Approach for Efficient Deep Neural Network Compression Shiyi Luo et.al. 2408.01534 null
2024-07-31 Exploiting Change Blindness for Video Coding: Perspectives from a Less Promising User Study Mitra Amiri et.al. 2408.00052 null
2024-07-31 Tora: Trajectory-oriented Diffusion Transformer for Video Generation Zhenghao Zhang et.al. 2407.21705 link
2024-07-30 Edge Learning Based Collaborative Automatic Modulation Classification for Hierarchical Cognitive Radio Networks Peihao Dong et.al. 2407.20772 link
2024-07-30 Understanding the Impact of Synchronous, Asynchronous, and Hybrid In-Situ Techniques in Computational Fluid Dynamics Applications Yi Ju et.al. 2407.20717 null
2024-07-29 Homomorphic data compression for real time photon correlation analysis Sebastian Strempfer et.al. 2407.20356 null
2024-07-24 Accelerating the Low-Rank Decomposed Models Habib Hajimolahoseini et.al. 2407.20266 null
2024-07-29 ComNeck: Bridging Compressed Image Latents and Multimodal LLMs via Universal Transform-Neck Chia-Hao Kao et.al. 2407.19651 null
2024-07-28 NVC-1B: A Large Neural Video Coding Model Xihua Sheng et.al. 2407.19402 null
2024-07-18 Generative AI Augmented Induction-based Formal Verification Aman Kumar et.al. 2407.18965 null
2024-07-25 The seismic purifier: An unsupervised approach to seismic signal detection via representation learning Onur Efe et.al. 2407.18402 link
2024-07-25 Adaptable Deep Joint Source-and-Channel Coding for Small Satellite Applications Olga Kondrateva et.al. 2407.18146 null
2024-07-25 Scaling Training Data with Lossy Image Compression Katherine L. Mentzer et.al. 2407.17954 link
2024-07-25 Towards the Spectral bias Alleviation by Normalizations in Coordinate Networks Zhicheng Cai et.al. 2407.17834 link
2024-07-24 Lossy Data Compression By Adaptive Mesh Coarsening N. Böing et.al. 2407.17316 null
2024-07-24 High Efficiency Image Compression for Large Visual-Language Models Binzhe Li et.al. 2407.17060 null
2024-07-23 Accelerating Learned Video Compression via Low-Resolution Representation Learning Zidian Qiu et.al. 2407.16418 null
2024-07-24 FCNR: Fast Compressive Neural Representation of Visualization Images Yunfei Lu et.al. 2407.16369 link
2024-07-19 Shapley Pruning for Neural Network Compression Kamil Adamczewski et.al. 2407.15875 null
2024-07-18 CIC: Circular Image Compression Honggui Li et.al. 2407.15870 null
2024-07-22 Online String Attractors Philip Whittington et.al. 2407.15599 null
2024-07-22 Spectral properties of bright deposits in permanently shadowed craters on Ceres Stefan Schröder et.al. 2407.15327 null
2024-07-21 Lessons Learned on the Path to Guaranteeing the Error Bound in Lossy Quantizers Alex Fallin et.al. 2407.15037 null
2024-07-19 A Benchmark for Gaussian Splatting Compression and Quality Assessment Study Qi Yang et.al. 2407.14197 link
2024-07-18 Training Foundation Models as Data Compression: On Information, Model Weights and Copyright Law Giorgio Franceschelli et.al. 2407.13493 null
2024-07-18 Learned HDR Image Compression for Perceptually Optimal Storage and Display Peibei Cao et.al. 2407.13179 null
2024-07-17 High Frequency Matters: Uncertainty Guided Image Compression with Wavelet Diffusion Juan Song et.al. 2407.12538 link
2024-07-17 Enhancing Film Grain Coding in VVC: Improving Encoding Quality and Efficiency Vignesh V Menon et.al. 2407.12465 null
2024-07-17 Reliability Function of Classical-Quantum Channels Ke Li et.al. 2407.12403 null
2024-07-17 Exploiting Inter-Image Similarity Prior for Low-Bitrate Remote Sensing Image Compression Junhui Li et.al. 2407.12295 null
2024-07-16 Tiled Bit Networks: Sub-Bit Neural Network Compression Through Reuse of Learnable Binary Vectors Matt Gorbett et.al. 2407.12075 null
2024-07-17 Rate-Distortion-Cognition Controllable Versatile Neural Image Compression Jinming Liu et.al. 2407.11700 null
2024-07-16 MINI-LLM: Memory-Efficient Structured Pruning for Large Language Models Hongrong Cheng et.al. 2407.11681 null
2024-07-17 Neural Compression of Atmospheric States Piotr Mirowski et.al. 2407.11666 null
2024-07-16 Rethinking Learned Image Compression: Context is All You Need Jixiang Luo et.al. 2407.11590 null
2024-07-16 The impact of lossy data compression on the power spectrum of the high redshift 21-cm signal with LOFAR J. K. Chege et.al. 2407.11557 null
2024-07-21 Uniformly Accelerated Motion Model for Inter Prediction Zhuoyuan Li et.al. 2407.11541 null
2024-07-15 M18K: A Comprehensive RGB-D Dataset and Benchmark for Mushroom Detection and Instance Segmentation Abdollah Zakeri et.al. 2407.11275 link
2024-07-15 Enhancing Electrocardiogram Signal Analysis Using NLP-Inspired Techniques: A Novel Approach with Embedding and Self-Attention Prapti Ganguly et.al. 2407.11102 null
2024-07-15 In-Loop Filtering via Trained Look-Up Tables Zhuoyuan Li et.al. 2407.10926 null
2024-07-15 Bidirectional Stereo Image Compression with Cross-Dimensional Entropy Model Zhening Liu et.al. 2407.10632 link
2024-07-14 UMI on Legs: Making Manipulation Policies Mobile with Manipulation-Centric Whole-body Controllers Huy Ha et.al. 2407.10353 null
2024-07-13 WeConvene: Learned Image Compression with Wavelet-Domain Convolution and Entropy Model Haisheng Fu et.al. 2407.09983 null
2024-07-13 Zero-Shot Image Compression with Diffusion-Based Posterior Sampling Noam Elata et.al. 2407.09896 link
2024-07-13 Image Compression for Machine and Human Vision with Spatial-Frequency Adaptation Han Li et.al. 2407.09853 link
2024-07-13 Infinite families of optimal and minimal codes over rings using simplicial complexes Yanan Wu et.al. 2407.09783 null
2024-07-12 HPC: Hierarchical Progressive Coding Framework for Volumetric Video Zihan Zheng et.al. 2407.09026 null
2024-07-12 Hybrid Temporal Computing for Lower Power Hardware Accelerators Maliha Tasnim et.al. 2407.08975 null
2024-07-11 Manipulating a Tetris-Inspired 3D Video Representation Mihir Godbole et.al. 2407.08885 null
2024-07-11 OMR-NET: a two-stage octave multi-scale residual network for screen content image compression Shiqi Jiang et.al. 2407.08545 null
2024-07-11 CADC: Encoding User-Item Interactions for Compressing Recommendation Model Training Data Hossein Entezari Zarch et.al. 2407.08108 null
2024-07-10 Using Low-Discrepancy Points for Data Compression in Machine Learning: An Experimental Comparison Simone Göttlich et.al. 2407.07450 null
2024-07-10 Standard compliant video coding using low complexity, switchable neural wrappers Yueyu Hu et.al. 2407.07395 null
2024-07-10 MNeRV: A Multilayer Neural Representation for Videos Qingling Chang et.al. 2407.07347 link
2024-07-11 Entropy Law: The Story Behind Data Compression and LLM Performance Mingjia Yin et.al. 2407.06645 link
2024-07-08 A Hybrid Algorithm for Computing a Partial Singular Value Decomposition Satisfying a Given Threshold James Baglama et.al. 2407.06306 link
2024-07-08 TAPVid-3D: A Benchmark for Tracking Any Point in 3D Skanda Koppula et.al. 2407.05921 link
2024-07-05 The Impact of Quantization and Pruning on Deep Reinforcement Learning Models Heng Lu et.al. 2407.04803 null
2024-07-05 An autoencoder for compressing angle-resolved photoemission spectroscopy data Steinn Ymir Agustsson et.al. 2407.04631 link
2024-07-05 Rethinking Image Compression on the Web with Generative AI Shayan Ali Hassan et.al. 2407.04542 null
2024-07-11 A High-Quality Workflow for Multi-Resolution Scientific Data Reduction and Visualization Daoce Wang et.al. 2407.04267 null
2024-07-04 Autoencoded Image Compression for Secure and Fast Transmission Aryan Kashyap Naveen et.al. 2407.03990 link
2024-07-03 Value-Penalized Auxiliary Control from Examples for Learning without Rewards or Demonstrations Trevor Ablett et.al. 2407.03311 link
2024-07-03 KeyVideoLLM: Towards Large-scale Video Keyframe Selection Hao Liang et.al. 2407.03104 null
2024-07-01 Statistical Analysis of ZFP: Understanding Bias Alyson Fox et.al. 2407.01826 null
2024-07-01 An AI-based, Error-bounded Compression Scheme for High-frequency Power Quality Disturbance Data Markus Stroot et.al. 2407.01112 null
2024-06-28 Wavelets Are All You Need for Autoregressive Image Generation Wael Mattar et.al. 2406.19997 null
2024-06-28 Optimal Video Compression using Pixel Shift Tracking Hitesh Saai Mananchery Panneerselvam et.al. 2406.19630 link
2024-06-27 MCNC: Manifold Constrained Network Compression Chayne Thrash et.al. 2406.19301 null
2024-06-27 Staggered Quantizers for Perfect Perceptual Quality: A Connection between Quantizers with Common Randomness and Without Ruida Zhou et.al. 2406.19248 null
2024-06-25 Asymptotically Minimax Regret by Bayes Mixtures Jun'ichi Takeuchi et.al. 2406.17929 null
2024-06-24 Hierarchical B-frame Video Coding for Long Group of Pictures Ivan Kirillov et.al. 2406.16544 null
2024-06-20 Ranking LLMs by compression Peijia Guo et.al. 2406.14171 null
2024-06-21 Measuring Sample Importance in Data Pruning for Training LLMs from a Data Compression Perspective Minsang Kim et.al. 2406.14124 null
2024-06-20 Prediction and Reference Quality Adaptation for Learned Video Compression Xihua Sheng et.al. 2406.14118 null
2024-06-19 Convex-hull Estimation using XPSNR for Versatile Video Coding Vignesh V Menon et.al. 2406.13712 null
2024-06-19 A Study on the Effect of Color Spaces in Learned Image Compression Srivatsa Prativadibhayankaram et.al. 2406.13709 null
2024-06-19 Stability and Generalizability in SDE Diffusion Models with Measure-Preserving Dynamics Weitong Zhang et.al. 2406.13652 null
2024-06-18 Learned Image Compression for HE-stained Histopathological Images via Stain Deconvolution Maximilian Fischer et.al. 2406.12623 null
2024-06-18 Competitive Learning for Achieving Content-specific Filters in Video Coding for Machines Honglei Zhang et.al. 2406.12367 null
2024-06-15 How Should We Extract Discrete Audio Tokens from Self-Supervised Models? Pooneh Mousavi et.al. 2406.10735 null
2024-06-15 Object-Attribute-Relation Representation based Video Semantic Communication Qiyuan Du et.al. 2406.10469 null
2024-06-14 On Efficient Neural Network Architectures for Image Compression Yichi Zhang et.al. 2406.10361 link
2024-06-14 Information Compression in the AI Era: Recent Advances and Future Challenges Jun Chen et.al. 2406.10036 null
2024-06-13 CMC-Bench: Towards a New Paradigm of Visual Signal Compression Chunyi Li et.al. 2406.09356 link
2024-06-13 Neural NeRF Compression Tuan Pham et.al. 2406.08943 null
2024-06-14 Beyond LLaVA-HD: Diving into High-Resolution Large Multimodal Models Yi-Fan Zhang et.al. 2406.08487 link
2024-06-12 On Annotation-free Optimization of Video Coding for Machines Marc Windsheimer et.al. 2406.07938 null
2024-06-11 SSNVC: Single Stream Neural Video Compression with Implicit Temporal Information Feng Wang et.al. 2406.07645 null
2024-06-11 Image and Video Tokenization with Binary Spherical Quantization Yue Zhao et.al. 2406.07548 link
2024-06-11 Optimal Matrix-Mimetic Tensor Algebras via Variable Projection Elizabeth Newman et.al. 2406.06942 link
2024-06-10 Deep Generative Modeling Reshapes Compression and Transmission: From Efficiency to Resiliency Jincheng Dai et.al. 2406.06446 null
2024-06-10 Image Compression with Isotropic and Anisotropic Shepard Inpainting Rahul Mohideen Kaja Mohideen et.al. 2406.06247 null
2024-06-10 Efficient Neural Compression with Inference-time Decoding C. Metz et.al. 2406.06237 null
2024-06-10 Fiducial-Cosmology-dependent systematics for the DESI 2024 BAO Analysis A. Pérez-Fernández et.al. 2406.06085 null
2024-06-10 Quantum Sparse Coding and Decoding Based on Quantum Network Xun Ji et.al. 2406.06012 null
2024-06-09 Region of Interest Loss for Anonymizing Learned Image Compression Christoph Liebender et.al. 2406.05726 link
2024-06-08 Regularized Training with Generated Datasets for Name-Only Transfer of Vision-Language Models Minho Park et.al. 2406.05432 link
2024-06-07 PatchSVD: A Non-uniform SVD-based Image Compression Algorithm Zahra Golpayegani et.al. 2406.05129 link
2024-06-07 SMC++: Masked Learning of Unsupervised Video Semantic Compression Yuan Tian et.al. 2406.04765 link
2024-06-06 LDM-RSIC: Exploring Distortion Prior with Latent Diffusion Models for Remote Sensing Image Compression Junhui Li et.al. 2406.03961 link
2024-06-05 Lossless Image Compression Using Multi-level Dictionaries: Binary Images Samar Agnihotri et.al. 2406.03087 null
2024-06-05 On Jacob Ziv's Individual-Sequence Approach to Information Theory Neri Merhav et.al. 2406.02904 null
2024-06-04 Towards AI-Assisted Sustainable Adaptive Video Streaming Systems: Tutorial and Survey Reza Farahani et.al. 2406.02302 null
2024-06-03 Video Coding with Cross-Component Sample Offset Han Gao et.al. 2406.01795 null
2024-06-05 Once-for-All: Controllable Generative Image Compression with Dynamic Granularity Adaption Anqi Li et.al. 2406.00758 link
2024-06-01 Efficient Massive Black Hole Binary parameter estimation for LISA using Sequential Neural Likelihood Iván Martín Vílchez et.al. 2406.00565 null
2024-06-01 A Review of Pulse-Coupled Neural Network Applications in Computer Vision and Image Processing Nurul Rafi et.al. 2406.00239 null
2024-05-31 ContextGS: Compact 3D Gaussian Splatting with Anchor Level Context Model Yufei Wang et.al. 2405.20721 link
2024-05-30 Quantum encoder for fixed Hamming-weight subspaces Renato M. S. Farias et.al. 2405.20408 null
2024-05-29 Implicit Neural Image Field for Biological Microscopy Image Compression Gaole Dai et.al. 2405.19012 link
2024-05-28 Deep Network Pruning: A Comparative Study on CNNs in Face Recognition Fernando Alonso-Fernandez et.al. 2405.18302 null
2024-05-28 Channel Reciprocity Based Attack Detection for Securing UWB Ranging by Autoencoder Wenlong Gou et.al. 2405.18255 null
2024-05-27 Evaluation of Resource-Efficient Crater Detectors on Embedded Systems Simon Vellas et.al. 2405.16953 link
2024-05-27 UniCompress: Enhancing Multi-Data Medical Image Compression with Knowledge Distillation Runzhao Yang et.al. 2405.16850 null
2024-05-27 Controlling Rate, Distortion, and Realism: Towards a Single Comprehensive Neural Image Compression Model Shoma Iwai et.al. 2405.16817 link
2024-05-25 N-BVH: Neural ray queries with bounding volume hierarchies Philippe Weier et.al. 2405.16237 link
2024-05-25 A 7K Parameter Model for Underwater Image Enhancement based on Transmission Map Prior Fuheng Zhou et.al. 2405.16197 link
2024-05-24 Analytical proxy to families of numerical solutions: the case study of spherical mini-boson stars Jianzhi Yang et.al. 2405.15651 null
2024-05-24 SATSense: Multi-Satellite Collaborative Framework for Spectrum Sensing Haoxuan Yuan et.al. 2405.15542 null
2024-05-24 Meta-meshing and triangulating lattice structures at a large scale Qiang Zou et.al. 2405.15197 null
2024-05-23 NeCGS: Neural Compression for 3D Geometry Sets Siyu Ren et.al. 2405.15034 link
2024-05-23 An augmented Lagrangian trust-region method with inexact gradient evaluations to accelerate constrained optimization problems using model hyperreduction Tianshu Wen et.al. 2405.14827 null
2024-05-23 Motion-based video compression for resource-constrained camera traps Malika Nisal Ratnayake et.al. 2405.14419 null
2024-06-01 I $^2$ VC: A Unified Framework for Intra- & Inter-frame Video Compression Meiqin Liu et.al. 2405.14336 link
2024-05-23 Sparse $L^1$ -Autoencoders for Scientific Data Compression Matthias Chung et.al. 2405.14270 null
2024-05-22 "Turing Tests" For An AI Scientist Xiaoxin Yin et.al. 2405.13352 null
2024-05-21 Efficient Learned Wavelet Image and Video Coding Anna Meyer et.al. 2405.12631 null
2024-05-24 Accelerating Relative Entropy Coding with Space Partitioning Jiajun He et.al. 2405.12203 null
2024-05-20 Refining Coded Image in Human Vision Layer Using CNN-Based Post-Processing Takahiro Shindo et.al. 2405.11894 null
2024-05-19 Effective In-Context Example Selection through Data Compression Zhongxiang Sun et.al. 2405.11465 null
2024-05-18 InfRS: Incremental Few-Shot Object Detection in Remote Sensing Images Wuzhou Li et.al. 2405.11293 link
2024-05-17 Dark Energy Survey Year 3 results: simulation-based cosmological inference with wavelet harmonics, scattering transforms, and moments of weak lensing mass maps II. Cosmological results M. Gatti et.al. 2405.10881 null
2024-05-17 Reduced storage direct tensor ring decomposition for convolutional neural networks compression Mateusz Gabor et.al. 2405.10802 link
2024-05-17 Enhancing Perception Quality in Remote Sensing Image Compression via Invertible Neural Network Junhui Li et.al. 2405.10518 null
2024-05-15 Properties that allow or prohibit transferability of adversarial attacks among quantized networks Abhishek Shrestha et.al. 2405.09598 link
2024-05-15 Sensitivity Decouple Learning for Image Compression Artifacts Reduction Li Ma et.al. 2405.09291 null
2024-05-18 Scalable Image Coding for Humans and Machines Using Feature Fusion Network Takahiro Shindo et.al. 2405.09152 link
2024-05-14 Parameter-Efficient Instance-Adaptive Neural Video Compression Hyunmo Yang et.al. 2405.08530 link
2024-05-13 Goal-oriented compression for $L_p$ -norm-type goal functions: Application to power consumption scheduling Yifei Sun et.al. 2405.07808 null
2024-05-13 Neural Network Compression for Reinforcement Learning Tasks Dmitry A. Ivanov et.al. 2405.07748 null
2024-05-13 On the Adversarial Robustness of Learning-based Image Compression Against Rate-Distortion Attacks Chenhao Wu et.al. 2405.07717 null
2024-05-21 An Efficient Compression Method for Sign Information of DCT Coefficients via Sign Retrieval Chihiro Tsutake et.al. 2405.07487 link
2024-05-10 Time-of-arrival Estimation and Phase Unwrapping of Head-related Transfer Functions With Integer Linear Programming Chin-Yun Yu et.al. 2405.06804 link
2024-05-08 Urban Boundary Delineation from Commuting Data with Bayesian Stochastic Blockmodeling: Scale, Contiguity, and Hierarchy Sebastian Morel-Balbi et.al. 2405.04911 link
2024-05-14 Some Notes on the Sample Complexity of Approximate Channel Simulation Gergely Flamich et.al. 2405.04363 null
2024-05-07 Group-aware Parameter-efficient Updating for Content-Adaptive Neural Video Compression Zhenghao Chen et.al. 2405.04274 null
2024-05-08 Verified Neural Compressed Sensing Rudy Bunel et.al. 2405.04260 null
2024-05-15 Lossy Compression with Data, Perception, and Classification Constraints Yuhan Wang et.al. 2405.04144 null
2024-05-07 DMOFC: Discrimination Metric-Optimized Feature Compression Changsheng Gao et.al. 2405.04044 null
2024-05-06 Computational ghost imaging with hybrid transforms by integrating Hadamard, discrete cosine, and Haar matrices Yi-Ning Zhao et.al. 2405.03729 null
2024-05-06 A Rate-Distortion-Classification Approach for Lossy Image Compression Yuefeng Zhang et.al. 2405.03500 null
2024-05-06 Structure-Preserving Network Compression Via Low-Rank Induced Training Through Linear Layers Composition Xitong Zhang et.al. 2405.03089 link
2024-05-04 Deep Pulse-Signal Magnification for remote Heart Rate Estimation in Compressed Videos Joaquim Comas et.al. 2405.02652 null
2024-05-06 Torch2Chip: An End-to-end Customizable Deep Neural Network Compression and Deployment Toolkit for Prototype Hardware Accelerator Design Jian Meng et.al. 2405.01775 link
2024-05-02 PointCompress3D -- A Point Cloud Compression Framework for Roadside LiDARs in Intelligent Transportation Systems Walter Zimmer et.al. 2405.01750 null
2024-04-28 Lightweight Conceptual Dictionary Learning for Text Classification Using Information Compression Li Wan et.al. 2405.01584 null
2024-05-02 GroupedMixer: An Entropy Model with Group-wise Token-Mixers for Learned Image Compression Daxin Li et.al. 2405.01170 null
2024-04-30 Analysis and Enhancement of Lossless Image Compression in JPEG-XL Rustam Mamedov et.al. 2404.19755 null
2024-04-30 EfficientASR: Speech Recognition Network Compression via Attention Redundancy and Chunk-Level FFN Optimization Jianzong Wang et.al. 2404.19214 null
2024-04-29 Towards Extreme Image Compression with Latent Feature Guidance and Diffusion Prior Zhiyuan Li et.al. 2404.18820 link
2024-04-28 Joint Reference Frame Synthesis and Post Filter Enhancement for Versatile Video Coding Weijie Bao et.al. 2404.18058 null
2024-04-25 Learning Visuotactile Skills with Two Multifingered Hands Toru Lin et.al. 2404.16823 link
2024-04-24 Domain Adaptation for Learned Image Compression with Supervised Adapters Alberto Presta et.al. 2404.15591 link
2024-04-23 One-Pass Randomized Algorithm with Practical Rangefinder for Low-Rank Approximation to Quaternion Matrices Chao Chang et.al. 2404.14783 link
2024-04-22 Neural Compress-and-Forward for the Relay Channel Ezgi Ozyilkan et.al. 2404.14594 null
2024-04-22 Taming Server Memory TCO with Multiple Software-Defined Compressed Tiers Sandeep Kumar et.al. 2404.13886 null
2024-04-20 HybridFlow: Infusing Continuity into Masked Codebook for Extreme Low-Bitrate Image Compression Lei Lu et.al. 2404.13372 null
2024-04-18 Image Compression and Reconstruction Based on Quantum Network Xun Ji et.al. 2404.11994 null
2024-04-17 Spatio-Temporal Motion Retargeting for Quadruped Robots Taerim Yoon et.al. 2404.11557 null
2024-04-17 Multi-resolution Rescored ByteTrack for Video Object Detection on Ultra-low-power Embedded Systems Luca Bompani et.al. 2404.11488 link
2024-04-17 Image Generative Semantic Communication with Multi-Modal Similarity Estimation for Resource-Limited Networks Eri Hosonuma et.al. 2404.11280 null
2024-04-16 Tripod: Three Complementary Inductive Biases for Disentangled Representation Learning Kyle Hsu et.al. 2404.10282 link
2024-04-16 Compressible and Searchable: AI-native Multi-Modal Retrieval System with Learned Image Compression Jixiang Luo et.al. 2404.10234 null
2024-04-15 One-Click Upgrade from 2D to 3D: Sandwiched RGB-D Video Compression for Stereoscopic Teleconferencing Yueyu Hu et.al. 2404.09979 null
2024-04-15 Quantization of Large Language Models with an Overdetermined Basis Daniil Merkulov et.al. 2404.09737 null
2024-04-18 Post-Training Network Compression for 3D Medical Image Segmentation: Reducing Computational Efforts via Tucker Decomposition Tobias Weber et.al. 2404.09683 link
2024-04-15 MarsQE: Semantic-Informed Quality Enhancement for Compressed Martian Image Chengfeng Liu et.al. 2404.09433 null
2024-04-17 Incremental data compression for PDE-constrained optimization with a data assimilation application Xuejian Li et.al. 2404.09323 null
2024-04-14 A Joint Data Compression and Time-Delay Estimation Method For Distributed Systems via Extremum Encoding Amir Weiss et.al. 2404.09244 null
2024-04-12 Lossy Image Compression with Foundation Diffusion Models Lucas Relic et.al. 2404.08580 null
2024-04-12 Mitigating Challenges of the Space Environment for Onboard Artificial Intelligence: Design Overview of the Imaging Payload on SpIRIT Miguel Ortiz del Castillo et.al. 2404.08399 null
2024-04-11 Video Compression Beyond VVC: Quantitative Analysis of Intra Coding Tools in Enhanced Compression Model (ECM) Mohsen Abdoli et.al. 2404.07872 null
2024-04-11 Learning to Classify New Foods Incrementally Via Compressed Exemplars Justin Yang et.al. 2404.07507 null
2024-04-14 A comparison between Shapefit compression and Full-Modelling method with PyBird for DESI 2024 and beyond Y. Lai et.al. 2404.07283 link
2024-04-10 Exploring Repetitiveness Measures for Two-Dimensional Strings Giuseppe Romana et.al. 2404.07030 null
2024-04-10 Fine color guidance in diffusion models and its application to image compression at extremely low bitrates Tom Bordin et.al. 2404.06865 null
2024-04-09 Encoder-Quantization-Motion-based Video Quality Metrics Yixu Chen et.al. 2404.06620 null
2024-04-09 DiffHarmony: Latent Diffusion Model Meets Image Harmonization Pengfei Zhou et.al. 2404.06139 link
2024-04-09 Communication-Efficient Large-Scale Distributed Deep Learning: A Comprehensive Survey Feng Liang et.al. 2404.06114 null
2024-04-09 Image and Video Compression using Generative Sparse Representation with Fidelity Controls Wei Jiang et.al. 2404.06076 null
2024-04-07 Correcting Diffusion-Based Perceptual Image Compression with Privileged End-to-End Decoder Yiyang Ma et.al. 2404.04916 null
2024-04-07 Task-Aware Encoder Control for Deep Video Compression Xingtong Ge et.al. 2404.04848 null
2024-04-06 Power-Efficient Image Storage: Leveraging Super Resolution Generative Adversarial Network for Sustainable Compression and Reduced Carbon Footprint Ashok Mondal et.al. 2404.04642 null
2024-04-05 ClickDiffusion: Harnessing LLMs for Interactive Precise Image Editing Alec Helbling et.al. 2404.04376 link
2024-04-03 Convolutional variational autoencoders for secure lossy image compression in remote sensing Alessandro Giuliano et.al. 2404.03696 null
2024-03-25 RL for Consistency Models: Faster Reward Guided Text-to-Image Generation Owen Oertell et.al. 2404.03673 link
2024-04-04 Training LLMs over Neurally Compressed Text Brian Lester et.al. 2404.03626 null
2024-04-04 Leveraging Interpolation Models and Error Bounds for Verifiable Scientific Machine Learning Tyler Chang et.al. 2404.03586 link
2024-04-04 Semantic Compression with Information Lattice Learning Haizi Yu et.al. 2404.03131 null
2024-04-01 Accounting for contact network uncertainty in epidemic inferences with Approximate Bayesian Computation Maxwell H. Wang et.al. 2404.02924 null
2024-04-03 Building test batteries based on analysing random number generator tests within the framework of algorithmic information theory Boris Ryabko et.al. 2404.02708 null
2024-04-03 Optimizing traffic signs and lights visibility for the teleoperation of autonomous vehicles through ROI compression I. Dror et.al. 2404.02481 null
2024-04-03 MOPAR: A Model Partitioning Framework for Deep Learning Inference Services on Serverless Platforms Jiaang Duan et.al. 2404.02445 null
2024-04-02 NeRFCodec: Neural Feature Compression Meets Neural Radiance Fields for Memory-Efficient Scene Representation Sicheng Li et.al. 2404.02185 null
2024-04-01 The Rate-Distortion-Perception Trade-off: The Role of Private Randomness Yassine Hamdi et.al. 2404.01111 null
2024-03-31 Metric dimensions of generalized Sierpiński graphs over squares Savari Prabhu et.al. 2404.00771 null
2024-03-27 Computationally and Memory-Efficient Robust Predictive Analytics Using Big Data Daniel Menges et.al. 2403.19721 null
2024-03-28 RootInteractive tool for multidimensional statistical analysis, machine learning and analytical model validation Marian Invanov et.al. 2403.19330 null
2024-03-28 Uncertainty-Aware Deep Video Compression with Ensembles Wufei Ma et.al. 2403.19158 null
2024-04-08 Neural Embedding Compression For Efficient Multi-Task Earth Observation Modelling Carlos Gomes et.al. 2403.17886 link
2024-03-26 Low-Latency Neural Stereo Streaming Qiqi Hou et.al. 2403.17879 null
2024-03-26 Fully-fused Multi-Layer Perceptrons on Intel Data Center GPUs Kai Yuan et.al. 2403.17607 link
2024-03-25 Neural Image Compression with Quantization Rectifier Wei Luo et.al. 2403.17236 null
2024-03-25 Invertible Diffusion Models for Compressed Sensing Bin Chen et.al. 2403.17006 link
2024-03-25 Virtual Cylindrical PET for Efficient DOI Image Reconstruction with Sub-millimetre Resolution Francisco E Enríquez-Mier-y-Terán et.al. 2403.16465 null
2024-03-25 Impact of Video Compression Artifacts on Fisheye Camera Visual Perception Tasks Madhumitha Sakthi et.al. 2403.16338 null
2024-03-24 Laplacian-guided Entropy Model in Neural Codec with Blur-dissipated Synthesis Atefeh Khoshkhahtinat et.al. 2403.16258 null
2024-03-23 Understanding The Effectiveness of Lossy Compression in Machine Learning Training Sets Robert Underwood et.al. 2403.15953 null
2024-03-23 Droplet shape representation using Fourier series and autoencoders Mihir Durve et.al. 2403.15797 null
2024-03-21 S2LIC: Learned Image Compression with the SwinV2 Block, Adaptive Channel-wise and Global-inter Attention Context Yongqiang Wang et.al. 2403.14471 link
2024-03-21 Tensor network compressibility of convolutional models Sukhbinder Singh et.al. 2403.14379 null
2024-03-26 Powerful Lossy Compression for Noisy Images Shilv Cai et.al. 2403.14135 null
2024-03-20 String attractors and bi-infinite words Pierre Béaur et.al. 2403.13449 null
2024-03-19 Super-High-Fidelity Image Compression via Hierarchical-ROI and Adaptive Quantization Jixiang Luo et.al. 2403.13030 null
2024-03-19 Privacy-Preserving Face Recognition Using Trainable Feature Subtraction Yuxi Mi et.al. 2403.12457 link
2024-03-19 VQ-NeRV: A Vector Quantized Neural Representation for Videos Yunjie Xu et.al. 2403.12401 link
2024-03-18 Encoding of linear kinetic plasma problems in quantum circuits via data compression Ivan Novikau et.al. 2403.11989 null
2024-03-18 Object Segmentation-Assisted Inter Prediction for Versatile Video Coding Zhuoyuan Li et.al. 2403.11694 null
2024-03-18 Overfitted image coding at reduced complexity Théophile Blard et.al. 2403.11651 link
2024-03-18 Hierarchical Frequency-based Upsampling and Refining for Compressed Video Quality Enhancement Qianyu Zhang et.al. 2403.11556 null
2024-03-18 Earth+: on-board satellite imagery compression leveraging historical earth observations Kuntai Du et.al. 2403.11434 null
2024-03-17 Fidelity-preserving Learning-Based Image Compression: Loss Function and Subjective Evaluation Methodology Shima Mohammadi et.al. 2403.11241 link
2024-03-16 Channel-wise Feature Decorrelation for Enhanced Learned Image Compression Farhad Pakdaman et.al. 2403.10936 null
2024-03-16 NARRATE: Versatile Language Architecture for Optimal Control in Robotics Seif Ismail et.al. 2403.10762 link
2024-03-15 Process-and-Forward: Deep Joint Source-Channel Coding Over Cooperative Relay Networks Chenghong Bian et.al. 2403.10613 null
2024-03-15 CPGA: Coding Priors-Guided Aggregation Network for Compressed Video Quality Enhancement Qiang Zhu et.al. 2403.10362 link
2024-03-15 Interactive Distance Field Mapping and Planning to Enable Human-Robot Collaboration Usama Ali et.al. 2403.09988 link
2024-03-14 SketchINR: A First Look into Sketches as Implicit Neural Representations Hmrishav Bandyopadhyay et.al. 2403.09344 link
2024-03-14 Noise Dimension of GAN: An Image Compression Perspective Ziran Zhu et.al. 2403.09196 null
2024-03-20 Content-aware Masked Image Modeling Transformer for Stereo Image Compression Xinjie Zhang et.al. 2403.08505 link
2024-03-12 Approaching Rate-Distortion Limits in Neural Compression with Lattice Transform Coding Eric Lei et.al. 2403.07320 null
2024-03-11 Grid Monitoring and Protection with Continuous Point-on-Wave Measurements and Generative AI Lang Tong et.al. 2403.06942 null
2024-03-16 Enhancing Adversarial Training with Prior Knowledge Distillation for Robust Image Compression Zhi Cao et.al. 2403.06700 null
2024-03-13 FSViewFusion: Few-Shots View Generation of Novel Objects Rukhshanda Hussain et.al. 2403.06394 null
2024-03-10 Probing Image Compression For Class-Incremental Learning Justin Yang et.al. 2403.06288 null
2024-03-10 Blockchain-Enabled Variational Information Bottleneck for IoT Networks Qiong Wu et.al. 2403.06129 link
2024-03-09 Wavelet-Like Transform-Based Technology in Response to the Call for Proposals on Neural Network-Based Image Coding Cunhui Dong et.al. 2403.05937 null
2024-03-07 Complexity-constrained quantum thermodynamics Anthony Munson et.al. 2403.04828 null
2024-03-07 Image Coding for Machines with Edge Information Learning Using Segment Anything Takahiro Shindo et.al. 2403.04173 link
2024-03-06 3D Diffusion Policy Yanjie Ze et.al. 2403.03954 link
2024-03-06 Unifying Generation and Compression: Ultra-low bitrate Image Coding Via Multi-stage Transformer Naifu Xue et.al. 2403.03736 null
2024-03-06 ZF Beamforming Tensor Compression for Massive MIMO Fronthaul Libin Zheng et.al. 2403.03675 null
2024-03-06 Space Complexity of Euclidean Clustering Xiaoyi Zhu et.al. 2403.02971 null
2024-03-05 Neural Image Compression with Text-guided Encoding for both Pixel-level and Perceptual Fidelity Hagyeong Lee et.al. 2403.02944 link
2024-03-05 Enhancing the Rate-Distortion-Perception Flexibility of Learned Image Codecs with Conditional Diffusion Decoders Daniele Mari et.al. 2403.02887 null
2024-03-04 Dark Energy Survey Year 3 results: likelihood-free, simulation-based $w$ CDM inference with neural compression of weak-lensing map statistics N. Jeffrey et.al. 2403.02314 null
2024-03-04 Neural Network Assisted Lifting Steps For Improved Fully Scalable Lossy Image Compression in JPEG 2000 Xinyue Li et.al. 2403.01647 link
2024-03-03 On the Compressibility of Quantized Large Language Models Yu Mao et.al. 2403.01384 null
2024-03-02 Towards Accurate Lip-to-Speech Synthesis in-the-Wild Sindhu Hegde et.al. 2403.01087 null
2024-03-01 Region-Adaptive Transform with Segmentation Prior for Image Compression Yuxi Liu et.al. 2403.00628 link
2024-03-07 ODVista: An Omnidirectional Video Dataset for super-resolution and Quality Enhancement Tasks Ahmed Telili et.al. 2403.00604 link
2024-02-29 Towards Explaining Deep Neural Network Compression Through a Probabilistic Latent Space Mahsa Mozafari-Nia et.al. 2403.00155 null
2024-02-29 Deep Network for Image Compressed Sensing Coding Using Local Structural Sampling Wenxue Cui et.al. 2402.19111 null
2024-02-29 Variable-Rate Learned Image Compression with Multi-Objective Optimization and Quantization-Reconstruction Offsets Fatih Kamisli et.al. 2402.18930 link
2024-02-29 Towards Backward-Compatible Continual Learning of Image Compression Zhihao Duan et.al. 2402.18862 link
2024-02-29 Exploration of Learned Lifting-Based Transform Structures for Fully Scalable and Accessible Wavelet-Like Image Compression Xinyue Li et.al. 2402.18761 null
2024-01-10 Motion Guided Token Compression for Efficient Masked Video Modeling Yukun Feng et.al. 2402.18577 null
2024-02-28 Tokenization Is More Than Compression Craig W. Schmidt et.al. 2402.18376 link
2024-02-28 NERV++: An Enhanced Implicit Neural Video Representation Ahmed Ghorbel et.al. 2402.18305 null
2024-02-28 Computing Minimal Absent Words and Extended Bispecial Factors with CDAWG Space Shunsuke Inenaga et.al. 2402.18090 null
2024-03-03 Towards Optimal Learning of Language Models Yuxian Gu et.al. 2402.17759 null
2024-02-27 $ζ$ -QVAE: A Quantum Variational Autoencoder utilizing Regularized Mixed-state Latent Representations Gaoyuan Wang et.al. 2402.17749 null
2024-02-27 Bit Rate Matching Algorithm Optimization in JPEG-AI Verification Model Panqi Jia et.al. 2402.17487 null
2024-02-27 Bit Distribution Study and Implementation of Spatial Quality Map in the JPEG-AI Standardization Panqi Jia et.al. 2402.17470 null
2024-02-29 Neural Video Compression with Feature Modulation Jiahao Li et.al. 2402.17414 link
2024-01-19 MB-RACS: Measurement-Bounds-based Rate-Adaptive Image Compressed Sensing Network Yujun Huang et.al. 2402.16855 null
2024-02-29 MISC: Ultra-low Bitrate Image Semantic Compression Driven by Large Multimodal Model Chunyi Li et.al. 2402.16749 link
2024-02-26 Enabling robust sensor network design with data processing and optimization making use of local beehive image and video files Ephrance Eunice Namugenyi et.al. 2402.16655 null
2024-02-26 Resolution-Agnostic Neural Compression for High-Fidelity Portrait Video Conferencing via Implicit Radiance Fields Yifei Li et.al. 2402.16599 null
2024-02-26 Distortion-Controlled Dithering with Reduced Recompression Rate Morriel Kasher et.al. 2402.16447 null
2024-02-26 Adaptive Online Learning of Separable Path Graph Transforms for Intra-prediction Wen-Yang Lu et.al. 2402.16371 null
2024-02-26 SPC-NeRF: Spatial Predictive Compression for Voxel Based Radiance Field Zetian Song et.al. 2402.16366 null
2024-02-24 Traditional Transformation Theory Guided Model for Learned Image Compression Zhiyuan Li et.al. 2402.15744 null
2024-02-22 Distributed Radiance Fields for Edge Video Compression and Metaverse Integration in Autonomous Driving Eugen Šlapak et.al. 2402.14642 null
2024-02-21 Exploring the Limits of Semantic Image Compression at Micro-bits per Pixel Jordan Dotzel et.al. 2402.13536 null
2024-02-20 Compressing the two-particle Green's function using wavelets: Theory and application to the Hubbard atom Emin Moghadas et.al. 2402.13030 null
2024-02-20 RealCompo: Dynamic Equilibrium between Realism and Compositionality Improves Text-to-Image Diffusion Models Xinchen Zhang et.al. 2402.12908 link
2024-02-20 Transformer-based Learned Image Compression for Joint Decoding and Denoising Yi-Hsin Chen et.al. 2402.12888 null
2024-02-19 Weakly Supervised Object Detection in Chest X-Rays with Differentiable ROI Proposal Networks and Soft ROI Pooling Philip Müller et.al. 2402.11985 link
2024-02-18 3D Point Cloud Compression with Recurrent Neural Network and Image Compression Methods Till Beemelmanns et.al. 2402.11680 link
2024-02-18 Learning to Learn Faster from Human Feedback with Language Model Predictive Control Jacky Liang et.al. 2402.11450 null
2024-02-17 TinyLIC-High efficiency lossy image compression method Gaocheng Ma et.al. 2402.11164 null
2024-02-15 Analysis of Neural Video Compression Networks for 360-Degree Video Coding Andy Regensky et.al. 2402.10257 null
2024-02-14 Reducing Texture Bias of Deep Neural Networks via Edge Enhancing Diffusion Edgar Heinert et.al. 2402.09530 link
2024-02-14 A Comprehensive Review of Software and Hardware Energy Efficiency of Video Decoders Matthias Kränzler et.al. 2402.09001 null
2024-02-14 Extreme Video Compression with Pre-trained Diffusion Models Bohan Li et.al. 2402.08934 link
2024-02-14 Saliency-aware End-to-end Learned Variable-Bitrate 360-degree Image Compression Oguzhan Gungordu et.al. 2402.08862 null
2024-02-13 Learned Image Compression with Text Quality Enhancement Chih-Yu Lai et.al. 2402.08643 null
2024-02-13 Motion-Adaptive Inference for Flexible Learned B-Frame Compression M. Akin Yilmaz et.al. 2402.08550 null
2024-02-21 A Neural-network Enhanced Video Coding Framework beyond ECM Yanchen Zhao et.al. 2402.08397 null
2024-02-13 Improving Image Coding for Machines through Optimizing Encoder via Auxiliary Loss Kei Iino et.al. 2402.08267 null
2024-02-12 Distributed Compression in the Era of Machine Learning: A Review of Recent Advances Ezgi Ozyilkan et.al. 2402.07997 null
2024-02-13 Towards Meta-Pruning via Optimal Transport Alexander Theus et.al. 2402.07839 link
2024-02-09 Parameter estimation for quantum jump unraveling Marco Radaelli et.al. 2402.06556 link
2024-02-07 RAGE for the Machine: Image Compression with Low-Cost Random Access for Embedded Applications Christian D. Rask et.al. 2402.05974 null
2024-02-08 Sandwiched Compression: Repurposing Standard Codecs with Neural Network Wrappers Onur G. Guleryuz et.al. 2402.05887 link
2024-02-08 Joint End-to-End Image Compression and Denoising: Leveraging Contrastive Learning and Multi-Scale Self-ONNs Yuxin Xie et.al. 2402.05582 null
2024-02-05 TexShape: Information Theoretic Sentence Embedding for Language Models H. Kaan Kale et.al. 2402.05132 link
2024-02-07 Compression of Structured Data with Autoencoders: Provable Benefit of Nonlinearities and Depth Kevin Kögler et.al. 2402.05013 null
2024-02-06 A Novel Local and Hyper-Local Multicast Services Transmission Scheme for Beyond 5G Networks Sweta Singh et.al. 2402.03963 null
2024-02-06 Cool-chic video: Learned video coding with 800 parameters Thomas Leguay et.al. 2402.03179 link
2024-02-05 Perceptual Learned Image Compression via End-to-End JND-Based Optimization Farhad Pakdaman et.al. 2402.02836 null
2024-02-04 Discovering More Effective Tensor Network Structure Search Algorithms via Large Language Models (LLMs) Junhua Zeng et.al. 2402.02456 link
2024-03-04 RecNet: An Invertible Point Cloud Encoding through Range Image Embeddings for Multi-Robot Map Sharing and Reconstruction Nikolaos Stathoulopoulos et.al. 2402.02192 null
2024-02-03 Generative Visual Compression: A Review Bolin Chen et.al. 2402.02140 null
2024-02-23 Immersive Video Compression using Implicit Neural Representations Ho Man Kwan et.al. 2402.01596 link
2024-02-02 Efficient Dynamic-NeRF Based Volumetric Video Coding with Rate Distortion Optimization Zhiyu Zhang et.al. 2402.01380 null
2024-02-02 UCVC: A Unified Contextual Video Compression Framework with Joint P-frame and B-frame Coding Jiayu Yang et.al. 2402.01289 null
2024-02-02 Flexible Variational Information Bottleneck: Achieving Diverse Compression with a Single Training Sota Kudo et.al. 2402.01238 link
2024-02-02 The O2 software framework and GPU usage in ALICE online and offline reconstruction in Run 3 Giulio Eulisse et.al. 2402.01205 null
2024-02-01 Compressed image quality assessment using stacking S. Farhad Hosseini-Benvidi et.al. 2402.00993 null
2024-02-04 Evaluating Large Language Models for Generalization and Robustness via Data Compression Yucheng Li et.al. 2402.00861 link
2024-03-11 LVC-LGMC: Joint Local and Global Motion Compensation for Learned Video Compression Wei Jiang et.al. 2402.00680 null
2024-02-01 Gain of Grain: A Film Grain Handling Toolchain for VVC-based Open Implementations Vignesh V Menon et.al. 2402.00622 null
2024-01-31 EPSD: Early Pruning with Self-Distillation for Efficient Model Compression Dong Chen et.al. 2402.00084 null
2024-01-31 A Neural Enhancement Post-Processor with a Dynamic AV1 Encoder Configuration Strategy for CLIC 2024 Darren Ramsook et.al. 2401.18021 null
2024-01-31 Robustly overfitting latents for flexible neural image compression Yura Perugachi-Diaz et.al. 2401.17789 null
2024-01-30 A Group Theoretic Metric for Robot State Estimation Leveraging Chebyshev Interpolation Varun Agrawal et.al. 2401.17463 null
2024-01-30 SLIC: A Learned Image Codec Using Structure and Color Srivatsa Prativadibhayankaram et.al. 2401.17246 link
2024-01-30 Large Language Model Evaluation via Matrix Entropy Lai Wei et.al. 2401.17139 link
2024-01-30 Local integrals of motion in dipole-conserving models with Hilbert space fragmentation Patrycja Łydżba et.al. 2401.17097 null
2024-01-29 On Channel Simulation with Causal Rejection Samplers Daniel Goc et.al. 2401.16579 null
2024-01-29 Spatial Decomposition and Temporal Fusion based Inter Prediction for Learned Video Compression Xihua Sheng et.al. 2401.15864 null
2024-01-29 Bayesian one- and two-sided inference on the local effective dimension Eduard Belitser et.al. 2401.15816 null
2024-01-28 Towards Arbitrary-Scale Histopathology Image Super-resolution: An Efficient Dual-branch Framework via Implicit Self-texture Enhancement Minghong Duan et.al. 2401.15613 null
2024-01-26 Shadow simulation of quantum processes Xuanqiang Zhao et.al. 2401.14934 null
2024-01-26 Study of the gOMP Algorithm for Recovery of Compressed Sensed Hyperspectral Images Jon Alvarez Justo et.al. 2401.14786 null
2024-01-26 A Comparative Study of Compressive Sensing Algorithms for Hyperspectral Imaging Reconstruction Jon Alvarez Justo et.al. 2401.14762 null
2024-01-26 Residual Quantization with Implicit Neural Codebooks Iris Huijben et.al. 2401.14732 link
2024-01-25 Semantic Ensemble Loss and Latent Refinement for High-Fidelity Neural Image Compression Daxin Li et.al. 2401.14007 null
2024-02-07 Perceptual-oriented Learned Image Compression with Dynamic Kernel Nianxiang Fu et.al. 2401.13967 null
2024-01-25 Conditional Neural Video Coding with Spatial-Temporal Super-Resolution Henan Wang et.al. 2401.13959 null
2024-01-24 FLLIC: Functionally Lossless Image Compression Xi Zhang et.al. 2401.13616 null
2024-01-23 Fast Implicit Neural Representation Image Codec in Resource-limited Devices Xiang Liu et.al. 2401.12587 null
2024-01-22 PairwiseHist: Fast, Accurate and Space-Efficient Approximate Query Processing with Data Compression Aaron Hurst et.al. 2401.12018 null
2024-01-22 A Training-Free Defense Framework for Robust Learned Image Compression Myungseo Song et.al. 2401.11902 null
2024-01-21 Another Way to the Top: Exploit Contextual Clustering in Learned Image Coding Yichi Zhang et.al. 2401.11615 null
2024-01-21 ColorVideoVDP: A visual difference predictor for image, video and display distortions Rafal K. Mantiuk et.al. 2401.11485 link
2024-01-21 Data-driven compression of electron-phonon interactions Yao Luo et.al. 2401.11393 null
2024-01-20 Learned Image Compression with Dual-Branch Encoder and Conditional Information Coding Haisheng Fu et.al. 2401.11093 null
2024-01-19 NN-VVC: Versatile Video Coding boosted by self-supervisedly learned image coding for machines Jukka I. Ahonen et.al. 2401.10761 null
2024-01-19 Bridging the gap between image coding for machines and humans Nam Le et.al. 2401.10732 null
2024-01-18 Attack and Defense Analysis of Learned Image Compression Tianyu Zhu et.al. 2401.10345 null
2024-01-18 Explaining the Implicit Neural Canvas: Connecting Pixels to Neurons by Tracing their Contributions Namitha Padmanabhan et.al. 2401.10217 null
2024-01-18 Depth Over RGB: Automatic Evaluation of Open Surgery Skills Using Depth Camera Ido Zuckerman et.al. 2401.10037 null
2024-01-18 Memory Efficient Corner Detection for Event-driven Dynamic Vision Sensors Pao-Sheng Vincent Sun et.al. 2401.09797 null
2024-01-18 Compressing MIMO Channel Submatrices with Tucker Decomposition: Enabling Efficient Storage and Reducing SINR Computation Overhead Yuanwei Zhang et.al. 2401.09792 null
2024-01-17 Idempotence and Perceptual Image Compression Tongda Xu et.al. 2401.08920 link
2024-01-16 End-to-End Optimized Image Compression with the Frequency-Oriented Transform Yuefeng Zhang et.al. 2401.08194 null
2024-01-17 Learned Image Compression with ROI-Weighted Distortion and Bit Allocation Wei Jiang et.al. 2401.08154 null
2024-01-15 Convolutional Neural Network Compression via Dynamic Parameter Rank Pruning Manish Sharma et.al. 2401.08014 null
2024-01-15 Machine Perceptual Quality: Evaluating the Impact of Severe Lossy Compression on Audio and Image Models Dan Jacobellis et.al. 2401.07957 link
2024-01-14 Exploring Compressed Image Representation as a Perceptual Proxy: A Study Chen-Hsiu Huang et.al. 2401.07200 link
2024-01-13 Progressive Feature Fusion Network for Enhancing Image Quality Assessment Kaiqun Wu et.al. 2401.06992 null
2024-01-12 Efficient Parallel Algorithms for Inpainting-Based Representations of 4K Images -- Part II: Spatial and Tonal Data Optimization Niklas Kämper et.al. 2401.06747 null
2024-03-18 LiDAR Depth Map Guided Image Compression Model Alessandro Gnutti et.al. 2401.06517 null
2024-01-11 Transformer Masked Autoencoders for Next-Generation Wireless Communications: Architecture and Opportunities Abdullah Zayat et.al. 2401.06274 null
2024-01-11 MGARD: A multigrid framework for high-performance, error-controlled data compression and refactoring Qian Gong et.al. 2401.05994 null
2024-01-10 SnapCap: Efficient Snapshot Compressive Video Captioning Jianqiao Sun et.al. 2401.04903 null
2024-01-09 Modified Levenberg-Marquardt Algorithm For Tensor CP Decomposition in Image Compression Ramin Goudarzi Karim et.al. 2401.04670 null
2024-01-09 Optimal Transcoding Resolution Prediction for Efficient Per-Title Bitrate Ladder Estimation Jinhai Yang et.al. 2401.04405 null
2024-01-08 Low-light Image Enhancement via CLIP-Fourier Guided Wavelet Diffusion Minglong Xue et.al. 2401.03788 link
2024-01-08 A Video Coding Method Based on Neural Network for CLIC2024 Zhengang Li et.al. 2401.03623 null
2024-01-06 Spatiotemporally adaptive compression for scientific dataset with feature preservation -- a case study on simulation data with extreme climate events analysis Qian Gong et.al. 2401.03317 null
2024-01-06 Comparison of spectrum models as applied to single-particle $\bf p_t$ spectra from high-energy p-p collisions and their physical interpretations Thomas A. Trainor et.al. 2401.03290 null
2024-01-06 Transferable Learned Image Compression-Resistant Adversarial Perturbations Yang Sui et.al. 2401.03115 null
2024-01-05 MsDC-DEQ-Net: Deep Equilibrium Model (DEQ) with Multi-scale Dilated Convolution for Image Compressive Sensing (CS) Youhao Yu et.al. 2401.02884 null
2024-03-08 Importance Matching Lemma for Lossy Compression with Side Information Buu Phan et.al. 2401.02609 null
2024-01-04 Cool-Chic: Perceptually Tuned Low Complexity Overfitted Image Coder Théo Ladune et.al. 2401.02156 link
2024-01-04 ED: Perceptually tuned Enhanced Compression Model Pierrick Philippe et.al. 2401.02145 null
2024-01-02 NU-Class Net: A Novel Deep Learning-based Approach for Video Quality Enhancement Parham Zilouchian Moghaddam et.al. 2401.01163 null
2024-01-28 Higher-Order Cellular Automata Generated Symmetry-Protected Topological Phases and Detection Through Multi-Point Strange Correlators Jie-Yu Zhang et.al. 2401.00505 null
2023-12-28 Selective Run-Length Encoding Xutan Peng et.al. 2312.17024 null
2023-12-29 FFCA-Net: Stereo Image Compression via Fast Cascade Alignment of Side Information Yichong Xia et.al. 2312.16963 null
2023-12-26 Range Entropy Queries and Partitioning Sanjay Krishnan et.al. 2312.15959 null
2023-12-25 MaskCRT: Masked Conditional Residual Transformer for Learned Video Compression Yi-Hsin Chen et.al. 2312.15829 null
2023-12-25 On Robust Wasserstein Barycenter: The Model and Algorithm Xu Wang et.al. 2312.15762 null
2023-12-25 Scalable Face Image Coding via StyleGAN Prior: Towards Compression for Human-Machine Collaborative Vision Qi Mao et.al. 2312.15622 null
2023-12-22 The Rate-Distortion-Perception-Classification Tradeoff: Joint Source Coding and Modulation via Inverse-Domain GANs Junli Fang et.al. 2312.14792 null
2024-01-09 Enhanced Color Palette Modeling for Lossless Screen Content Compression Hannah Och et.al. 2312.14491 null
2023-12-30 Efficient Communication in Federated Learning Using Floating-Point Lossy Compression Grant Wilkins et.al. 2312.13461 null
2023-12-19 A Huffman based short message service compression technique using adjacent distance array Pranta Sarker et.al. 2312.12495 null
2023-12-19 Full-reference Video Quality Assessment for User Generated Content Transcoding Zihao Qi et.al. 2312.12317 null
2023-12-19 Low-Consumption Partial Transcoding by HEVC Mohsen Abdoli et.al. 2312.12174 link
2023-12-19 Comparative Study of Hardware and Software Power Measurements in Video Compression Angeliki Katsenou et.al. 2312.12150 null
2023-12-18 Blind-Touch: Homomorphic Encryption-Based Distributed Neural Network Inference for Privacy-Preserving Fingerprint Authentication Hyunmin Choi et.al. 2312.11575 link
2024-01-11 Quantized Decoder in Learned Image Compression for Deterministic Reconstruction Esin Koyuncu et.al. 2312.11209 null
2023-12-19 A Computationally Efficient Neural Video Compression Accelerator Based on a Sparse CNN-Transformer Hybrid Network Siyu Zhang et.al. 2312.10716 null
2023-12-17 IntraSeismic: a coordinate-based learning approach to seismic inversion Juan Romero et.al. 2312.10568 null
2023-12-17 Light-weight CNN-based VVC Inter Partitioning Acceleration Yiqun Liu et.al. 2312.10567 null
2023-12-16 Statistical Analysis of Inter Coding in VVC Test Model (VTM) Yiqun Liu et.al. 2312.10406 null
2023-12-15 IQNet: Image Quality Assessment Guided Just Noticeable Difference Prefiltering For Versatile Video Coding Yu-Han Sun et.al. 2312.09799 null
2023-12-15 Towards Neuromorphic Compression based Neural Sensing for Next-Generation Wireless Implantable Brain Machine Interface Vivek Mohan et.al. 2312.09503 null
2023-12-14 Geometry-Corrected Geodesic Motion Modeling with Per-Frame Camera Motion for 360-Degree Video Compression Andy Regensky et.al. 2312.09266 link
2023-12-14 Efficient Online Learning of Contact Force Models for Connector Insertion Kevin Tracy et.al. 2312.09190 null
2023-12-13 Balanced and Deterministic Weight-sharing Helps Network Performance Oscar Chang et.al. 2312.08401 null
2023-12-13 Preparing VVC for Streaming: A Fast Multi-Rate Encoding Approach Yiqun Liu et.al. 2312.08330 null
2023-12-13 CenterGrasp: Object-Aware Implicit Representation Learning for Simultaneous Shape Reconstruction and 6-DoF Grasp Estimation Eugenio Chisari et.al. 2312.08240 null
2023-12-13 Explainable Trajectory Representation through Dictionary Learning Yuanbo Tang et.al. 2312.08052 null
2023-12-12 Deep Hierarchical Video Compression Ming Lu et.al. 2312.07126 null
2023-12-12 Communication Cost Reduction for Subgraph Counting under Local Differential Privacy via Hash Functions Quentin Hillebrand et.al. 2312.07055 link
2023-12-11 RAFIC: Retrieval-Augmented Few-shot Image Classification Hangfei Lin et.al. 2312.06868 link
2023-12-11 A New Projection Pursuit Index for Big Data Yajie Duan et.al. 2312.06465 null
2023-12-11 Variational Auto-Encoder Based Deep Learning Technique For Filling Gaps in Reacting PIV Data Shashank Yellapantula et.al. 2312.06461 null
2023-12-07 Analysis of Coding Gain Due to In-Loop Reshaping Chau-Wai Wong et.al. 2312.04022 null
2023-12-05 C3: High-performance and low-complexity neural compression from a single image or video Hyunjik Kim et.al. 2312.02753 null
2023-12-05 Unified learning-based lossy and lossless JPEG recompression Jianghui Zhang et.al. 2312.02705 null
2023-12-05 Accelerating Learnt Video Codecs with Gradient Decay and Layer-wise Distillation Tianhao Peng et.al. 2312.02605 null
2023-12-04 Hyperspectral Image Compression Using Sampling and Implicit Neural Representations Shima Rezasoltani et.al. 2312.01558 null

(back to top)

Quality Assessment

Publish Date Title Authors PDF Code
2025-02-27 FlexVAR: Flexible Visual Autoregressive Modeling without Residual Prediction Siyu Jiao et.al. 2502.20313 null
2025-02-27 Mobius: Text to Seamless Looping Video Generation via Latent Shift Xiuli Bi et.al. 2502.20307 null
2025-02-27 Low-rank tensor completion via a novel minimax $p$ -th order concave penalty function Hongbing Zhang et.al. 2502.19979 null
2025-02-27 Alleviating Distribution Shift in Synthetic Data for Machine Translation Quality Estimation Xiang Geng et.al. 2502.19941 null
2025-02-27 Picking the Cream of the Crop: Visual-Centric Data Selection with Collaborative Agents Zhenyu Liu et.al. 2502.19917 null
2025-02-27 High-Fidelity Relightable Monocular Portrait Animation with Lighting-Controllable Video Diffusion Model Mingtao Guo et.al. 2502.19894 null
2025-02-27 Striving for Faster and Better: A One-Layer Architecture with Auto Re-parameterization for Low-Light Image Enhancement Nan An et.al. 2502.19867 null
2025-02-27 LMHLD: A Large-scale Multi-source High-resolution Landslide Dataset for Landslide Detection based on Deep Learning Guanting Liu et.al. 2502.19866 null
2025-02-27 Adaptive Score Alignment Learning for Continual Perceptual Quality Assessment of 360-Degree Videos in Virtual Reality Kanglei Zhou et.al. 2502.19644 null
2025-02-26 3D Nephrographic Image Synthesis in CT Urography with the Diffusion Model and Swin Transformer Hongkun Yu et.al. 2502.19623 null
2025-02-26 Distill Not Only Data but Also Rewards: Can Smaller Language Models Surpass Larger Ones? Yudi Zhang et.al. 2502.19557 null
2025-02-26 CLIP-Optimized Multimodal Image Enhancement via ISP-CNN Fusion for Coal Mine IoVT under Uneven Illumination Shuai Wang et.al. 2502.19450 null
2025-02-26 Does 3D Gaussian Splatting Need Accurate Volumetric Rendering? Adam Celarek et.al. 2502.19318 null
2025-02-27 RetinaRegen: A Hybrid Model for Readability and Detail Restoration in Fundus Images Yuhan Tang et.al. 2502.19153 null
2025-02-26 Max360IQ: Blind Omnidirectional Image Quality Assessment with Multi-axis Attention Jiebin Yan et.al. 2502.19046 null
2025-02-26 InternVQA: Advancing Compressed Video Quality Assessment with Distilling Large Foundation Model Fengbin Guan et.al. 2502.19026 null
2025-02-26 Hyperspectral image reconstruction by deep learning with super-Rayleigh speckles Ziyan Chen et.al. 2502.18777 null
2025-02-25 Is OpenAlex Suitable for Research Quality Evaluation and Which Citation Indicator is Best? Mike Thelwall et.al. 2502.18427 null
2025-02-25 LAG: LLM agents for Leaderboard Auto Generation on Demanding Jian Wu et.al. 2502.18209 null
2025-02-25 OpenFly: A Versatile Toolchain and Large-scale Benchmark for Aerial Vision-Language Navigation Yunpeng Gao et.al. 2502.18041 null
2025-02-25 Towards Better Understanding of Program-of-Thought Reasoning in Cross-Lingual and Multilingual Environments Patomporn Payoungkhamdee et.al. 2502.17956 null
2025-02-25 Integrating Boosted learning with Differential Evolution (DE) Optimizer: A Prediction of Groundwater Quality Risk Assessment in Odisha Sonalika Subudhi et.al. 2502.17929 null
2025-02-24 Optimized Memory System Architecture for VESA VDC-M Decoder with Multi-Slice Support Hannah Yang et.al. 2502.17729 null
2025-02-24 Requirements for Quality Assurance of AI Models for Early Detection of Lung Cancer Horst K. Hahn et.al. 2502.17639 null
2025-02-25 KV-Edit: Training-Free Image Editing for Precise Background Preservation Tianrui Zhu et.al. 2502.17363 link
2025-02-24 Motion-Robust T2 Quantification from Gradient Echo MRI with Physics-Informed Deep Learning* Hannah Eichhorn et.al. 2502.17209 null
2025-02-24 SFLD: Reducing the content bias for AI-generated Image Detection Seoyeon Gye et.al. 2502.17105 null
2025-02-24 Pleno-Generation: A Scalable Generative Face Video Compression Framework with Bandwidth Intelligence Bolin Chen et.al. 2502.17085 null
2025-02-24 PQDAST: Depth-Aware Arbitrary Style Transfer for Games via Perceptual Quality-Guided Distillation Eleftherios Ioannou et.al. 2502.16996 null
2025-02-24 Multi-Dimensional Quality Assessment for Text-to-3D Assets: Dataset and Model Kang Fu et.al. 2502.16915 null
2025-02-24 CRTrack: Low-Light Semi-Supervised Multi-object Tracking Based on Consistency Regularization Zijing Zhao et.al. 2502.16809 null
2025-02-23 Automatic Input Rewriting Improves Translation with Large Language Models Dayeon Ki et.al. 2502.16682 link
2025-02-23 AdverX-Ray: Ensuring X-Ray Integrity Through Frequency-Sensitive Adversarial VAEs Francisco Caetano et.al. 2502.16610 null
2025-02-22 Multi-Party Data Pricing for Complex Data Trading Markets: A Rubinstein Bargaining Approach Bing Mi et.al. 2502.16363 null
2025-02-21 Improved Partial Differential Equation and Fast Approximation Algorithm for Hazy/Underwater/Dust Storm Image Enhancement Uche A. Nnolim et.al. 2502.15986 null
2025-02-21 Evaluate with the Inverse: Efficient Approximation of Latent Explanation Quality Distribution Carlos Eiras-Franco et.al. 2502.15403 null
2025-02-21 Super-Resolution for Interferometric Imaging: Model Comparisons and Performance Analysis Hasan Berkay Abdioglu et.al. 2502.15397 null
2025-02-21 Ultrasound Phase Aberrated Point Spread Function Estimation with Convolutional Neural Network: Simulation Study Wei-Hsiang Shen et.al. 2502.15298 null
2025-02-21 Omnidirectional Image Quality Captioning: A Large-scale Database and A New Model Jiebin Yan et.al. 2502.15271 link
2025-02-21 Lung-DDPM: Semantic Layout-guided Diffusion Models for Thoracic CT Image Synthesis Yifan Jiang et.al. 2502.15204 link
2025-02-21 LUMINA-Net: Low-light Upgrade through Multi-stage Illumination and Noise Adaptation Network for Image Enhancement Namrah Siddiqua et.al. 2502.15186 null
2025-02-21 M3-AGIQA: Multimodal, Multi-Round, Multi-Aspect AI-Generated Image Quality Assessment Chuan Cui et.al. 2502.15167 null
2025-02-21 Optimized Pap Smear Image Enhancement: Hybrid PMD Filter-CLAHE Using Spider Monkey Optimization Ach Khozaimi et.al. 2502.15156 null
2025-02-20 Hardware-Friendly Static Quantization Method for Video Diffusion Transformers Sanghyun Yi et.al. 2502.15077 null
2025-02-20 Multi-Source Static CT with Adaptive Fluence Modulation to Minimize Hallucinations in Generative Reconstructions Matthew Tivnan et.al. 2502.15060 null
2025-02-20 GS-Cache: A GS-Cache Inference Framework for Large-scale Gaussian Splatting Models Miao Tao et.al. 2502.14938 null
2025-02-20 Compact Latent Representation for Image Compression (CLRIC) Ayman A. Ameen et.al. 2502.14937 null
2025-02-20 Benchmarking Multimodal RAG through a Chart-based Document Question-Answering Generation Framework Yuming Yang et.al. 2502.14864 null
2025-02-20 Towards a Perspectivist Turn in Argument Quality Assessment Julia Romberg et.al. 2502.14501 null
2025-02-20 Early-Exit and Instant Confidence Translation Quality Estimation Vilém Zouhar et.al. 2502.14429 null
2025-02-20 NeRF-3DTalker: Neural Radiance Field with 3D Prior Aided Audio Disentanglement for Talking Head Synthesis Xiaoxing Liu et.al. 2502.14178 null
2025-02-19 A Baseline Method for Removing Invisible Image Watermarks using Deep Image Prior Hengyue Liang et.al. 2502.13998 link
2025-02-19 Remote Sensing Semantic Segmentation Quality Assessment based on Vision Language Model Huiying Shi et.al. 2502.13990 null
2025-02-19 A Lightweight Model for Perceptual Image Compression via Implicit Priors Hao Wei et.al. 2502.13988 null
2025-02-19 An Overall Real-Time Mechanism for Classification and Quality Evaluation of Rice Wanke Xia et.al. 2502.13764 null
2025-02-19 HawkBench: Investigating Resilience of RAG Methods on Stratified Information-Seeking Tasks Hongjin Qian et.al. 2502.13465 null
2025-02-19 OGBoost: A Python Package for Ordinal Gradient Boosting Mansour T. A. Sharabiani et.al. 2502.13456 null
2025-02-18 VUS: Effective and Efficient Accuracy Measures for Time-Series Anomaly Detection Paul Boniol et.al. 2502.13318 link
2025-02-18 Optimal covering of rectangular grid graphs with tours of constrained length Sergey Bereg et.al. 2502.13306 null
2025-02-18 Application of Context-dependent Interpretation of Biosignals Recognition to Control a Bionic Multifunctional Hand Prosthesis Pawel Trajdos et.al. 2502.13301 null
2025-02-18 Enhancing Machine Learning Performance through Intelligent Data Quality Assessment: An Unsupervised Data-centric Framework Manal Rahal et.al. 2502.13198 null
2025-02-18 GS-QA: Comprehensive Quality Assessment Benchmark for Gaussian Splatting View Synthesis Pedro Martin et.al. 2502.13196 null
2025-02-18 Language Barriers: Evaluating Cross-Lingual Performance of CNN and Transformer Architectures for Speech Quality Estimation Wafaa Wardah et.al. 2502.13004 null
2025-02-18 VidCapBench: A Comprehensive Benchmark of Video Captioning for Controllable Text-to-Video Generation Xinlong Chen et.al. 2502.12782 null
2025-02-18 Efficient Machine Translation Corpus Generation: Integrating Human-in-the-Loop Post-Editing with Large Language Models Kamer Ali Yuksel et.al. 2502.12755 link
2025-02-18 3D Shape-to-Image Brownian Bridge Diffusion for Brain MRI Synthesis from Cortical Surfaces Fabian Bongratz et.al. 2502.12742 null
2025-02-18 Translate Smart, not Hard: Cascaded Translation Systems with Quality-Aware Deferral António Farinhas et.al. 2502.12701 null
2025-02-19 Spherical Dense Text-to-Image Synthesis Timon Winter et.al. 2502.12691 null
2025-02-18 Design and Implementation of a Dual Uncrewed Surface Vessel Platform for Bathymetry Research under High-flow Conditions Dinesh Kumar et.al. 2502.12539 null
2025-02-18 Comprehensive Assessment and Analysis for NSFW Content Erasure in Text-to-Image Diffusion Models Die Chen et.al. 2502.12527 null
2025-02-18 Local Flaw Detection with Adaptive Pyramid Image Fusion Across Spatial Sampling Resolution for SWRs Siyu You et.al. 2502.12512 null
2025-02-17 Token Communications: A Unified Framework for Cross-modal Context-aware Semantic Communications Li Qiao et.al. 2502.12096 null
2025-02-17 Low-Rank Thinning Annabelle Michael Carrell et.al. 2502.12063 null
2025-02-17 MultiFlow: A unified deep learning framework for multi-vessel classification, segmentation and clustering of phase-contrast MRI validated on a multi-site single ventricle patient cohort Tina Yao et.al. 2502.11993 null
2025-02-17 Deep Spatio-Temporal Neural Network for Air Quality Reanalysis Ammar Kheder et.al. 2502.11941 link
2025-02-17 No-reference geometry quality assessment for colorless point clouds via list-wise rank learning Zheng Li et.al. 2502.11726 link
2025-02-17 The Worse The Better: Content-Aware Viewpoint Generation Network for Projection-related Point Cloud Quality Assessment Zhiyong Su et.al. 2502.11710 link
2025-02-17 Assessing Correctness in LLM-Based Code Generation via Uncertainty Estimation Arindam Sharma et.al. 2502.11620 null
2025-02-17 Syllables to Scenes: Literary-Guided Free-Viewpoint 3D Scene Synthesis from Japanese Haiku Chunan Yu et.al. 2502.11586 null
2025-02-18 AI-Assisted Thin Section Image Processing for Pore-Throat Characterization in Tight Clastic Rocks Muhammad Risha et.al. 2502.11523 null
2025-02-17 Semantically Robust Unsupervised Image Translation for Paired Remote Sensing Images Sheng Fang et.al. 2502.11468 null
2025-02-17 HellaSwag-Pro: A Large-Scale Bilingual Benchmark for Evaluating the Robustness of LLMs in Commonsense Reasoning Xiaoyuan Li et.al. 2502.11393 null
2025-02-17 A Physics-Informed Blur Learning Framework for Imaging Systems Liqun Chen et.al. 2502.11382 null
2025-02-17 LLMs can Perform Multi-Dimensional Analytic Writing Assessments: A Case Study of L2 Graduate-Level Academic English Writing Zhengxiang Wang et.al. 2502.11368 null
2025-02-16 Generating Skyline Datasets for Data Science Models Mengying Wang et.al. 2502.11262 null
2025-02-16 Exploiting network optimization stability for enhanced PET image denoising using deep image prior Fumio Hashimoto et.al. 2502.11259 null
2025-02-16 Are Generative Models Underconfident? An Embarrassingly Simple Quality Estimation Approach Tu Anh Dinh et.al. 2502.11115 null
2025-02-16 Imaging current flow and injection in scalable graphene devices through NV-magnetometry Kaj Dockx et.al. 2502.11076 null
2025-02-15 Automatic Quality Assessment of First Trimester Crown-Rump-Length Ultrasound Images Sevim Cengiz et.al. 2502.10908 null
2025-02-15 AquaScope: Reliable Underwater Image Transmission on Mobile Devices Beitong Tian et.al. 2502.10891 null
2025-02-15 E-3DGS: Event-Based Novel View Rendering of Large-Scale Scenes Using 3D Gaussian Splatting Sohaib Zahid et.al. 2502.10827 null
2025-02-14 Large Language Models and Synthetic Data for Monitoring Dataset Mentions in Research Papers Aivin V. Solatorio et.al. 2502.10263 null
2025-02-14 Step-Video-T2V Technical Report: The Practice, Challenges, and Future of Video Foundation Model Guoqing Ma et.al. 2502.10248 link
2025-02-14 ProReco: A Process Discovery Recommender System Tsung-Hao Huang et.al. 2502.10230 null
2025-02-14 RealCam-I2V: Real-World Image-to-Video Generation with Interactive Complex Camera Control Teng Li et.al. 2502.10059 null
2025-02-14 AffectSRNet : Facial Emotion-Aware Super-Resolution Network Syed Sameen Ahmad Rizvi et.al. 2502.09932 null
2025-02-14 A Deep Learning Approach to Interface Color Quality Assessment in HCI Shixiao Wang et.al. 2502.09914 null
2025-02-14 Compression-Aware One-Step Diffusion Model for JPEG Artifact Removal Jinpei Guo et.al. 2502.09873 null
2025-02-14 Optimizing GPT for Video Understanding: Zero-Shot Performance and Prompt Engineering Mark Beliaev et.al. 2502.09573 null
2025-02-13 Learned Correction Methods for Ultrasound Computed Tomography Imaging Using Simplified Physics Models Luke Lozenski et.al. 2502.09546 null
2025-02-13 SQ-GAN: Semantic Image Communications Using Masked Vector Quantization Francesco Pezone et.al. 2502.09520 link
2025-02-13 A Physics-Informed Deep Learning Model for MRI Brain Motion Correction Mojtaba Safari et.al. 2502.09296 link
2025-02-13 ConsistentDreamer: View-Consistent Meshes Through Balanced Multi-View Gaussian Optimization Onat Şahin et.al. 2502.09278 null
2025-02-13 PixLift: Accelerating Web Browsing via AI Upscaling Yonas Atinafu et.al. 2502.08995 null
2025-02-13 Some problems of developing astrophysical equipment and combining it with optical telescopes Edward Emelianov et.al. 2502.08992 null
2025-02-13 Dynamic watermarks in images generated by diffusion models Yunzhuo Chen et.al. 2502.08927 null
2025-02-12 A procedure for assessing of machine health index data prediction quality Daniel Kuzio et.al. 2502.08837 null
2025-02-12 Ultrasound imaging of cortical bone: cortex geometry and measurement of porosity based on wave speed for bone remodeling estimation Amadou S. Dia et.al. 2502.08824 null
2025-02-12 Skrr: Skip and Re-use Text Encoder Layers for Memory Efficient Text-to-Image Generation Hoigi Seo et.al. 2502.08690 null
2025-02-12 Light-A-Video: Training-free Video Relighting via Progressive Light Fusion Yujie Zhou et.al. 2502.08590 link
2025-02-12 Quality-Aware Decoding: Unifying Quality Estimation and Decoding Sai Koneru et.al. 2502.08561 null
2025-02-12 A Survey on Image Quality Assessment: Insights, Analysis, and Future Outlook Chengqian Ma et.al. 2502.08540 null
2025-02-12 TuMag: the tunable magnetograph for the Sunrise III mission J. C. del Toro Iniesta et.al. 2502.08268 null
2025-02-12 Forward and Inverse Problems in Nonlinear Acoustics Barbara Kaltenbacher et.al. 2502.08194 null
2025-02-11 Automatic Prostate Volume Estimation in Transabdominal Ultrasound Images Tiziano Natali et.al. 2502.07859 null
2025-02-11 Magic 1-For-1: Generating One Minute Video Clips within One Minute Hongwei Yi et.al. 2502.07701 link
2025-02-11 An Improved Optimal Proximal Gradient Algorithm for Non-Blind Image Deblurring Qingsong Wang et.al. 2502.07602 null
2025-02-13 Enhance-A-Video: Better Generated Video for Free Yang Luo et.al. 2502.07508 link
2025-02-11 Compound Mask for Divergent Wave Imaging in Medical Ultrasound Zahraa Alzein et.al. 2502.07453 null
2025-02-11 On Iterative Evaluation and Enhancement of Code Quality Using GPT-4o Rundong Liu et.al. 2502.07399 link
2025-02-11 USRNet: Unified Scene Recovery Network for Enhancing Traffic Imaging under Multiple Adverse Weather Conditions Yuxu Lu et.al. 2502.07372 link
2025-02-11 Multi-Task-oriented Nighttime Haze Imaging Enhancer for Vision-driven Measurement Systems Ai Chen et.al. 2502.07351 link
2025-02-11 Playmate: Flexible Control of Portrait Animation via 3D-Implicit Space Guided Diffusion Xingpei Ma et.al. 2502.07203 null
2025-02-11 HDCompression: Hybrid-Diffusion Image Compression for Ultra-Low Bitrates Lei Lu et.al. 2502.07160 null
2025-02-10 Evaluation of Multilingual Image Captioning: How far can we get with CLIP models? Gonçalo Gomes et.al. 2502.06600 link
2025-02-10 Image Intrinsic Scale Assessment: Bridging the Gap Between Quality and Resolution Vlad Hosu et.al. 2502.06476 null
2025-02-10 How Humans Help LLMs: Assessing and Incentivizing Human Preference Annotators Shang Liu et.al. 2502.06387 null
2025-02-10 Guidance-base Diffusion Models for Improving Photoacoustic Image Quality Tatsuhiro Eguchi et.al. 2502.06354 null
2025-02-10 LANTERN++: Enhanced Relaxed Speculative Decoding with Static Tree Drafting for Visual Auto-regressive Models Sihwan Park et.al. 2502.06352 null
2025-02-10 A CT Geometry With Multiple Centers Of Rotation For Solving Sparse View Problem Jiayu Duan et.al. 2502.06125 null
2025-02-10 Token-Domain Multiple Access: Exploiting Semantic Orthogonality for Collision Mitigation Li Qiao et.al. 2502.06118 null
2025-02-09 Dual Caption Preference Optimization for Diffusion Models Amir Saeidi et.al. 2502.06023 null
2025-02-09 A Comprehensive Survey on Image Signal Processing Approaches for Low-Illumination Image Enhancement Muhammad Turab et.al. 2502.05995 null
2025-02-09 Multi-Branch Collaborative Learning Network for Video Quality Assessment in Industrial Video Search Hengzhu Tang et.al. 2502.05924 null
2025-02-09 Devil is in the Details: Density Guidance for Detail-Aware Generation with Flow Models Rafał Karczewski et.al. 2502.05807 null
2025-02-08 Semantic-Aware Adaptive Video Streaming Using Latent Diffusion Models for Wireless Networks Zijiang Yan et.al. 2502.05695 null
2025-02-08 FreeBlend: Advancing Concept Blending with Staged Feedback-Driven Interpolation Diffusion Yufan Zhou et.al. 2502.05606 null
2025-02-07 Distillation and Pruning for Scalable Self-Supervised Representation-Based Speech Quality Assessment Benjamin Stahl et.al. 2502.05356 link
2025-02-07 AuraFusion360: Augmented Unseen Region Alignment for Reference-based 360° Unbounded Scene Inpainting Chung-Ho Wu et.al. 2502.05176 null
2025-02-07 Meta Audiobox Aesthetics: Unified Automatic Quality Assessment for Speech, Music, and Sound Andros Tjandra et.al. 2502.05139 link
2025-02-07 Cached Multi-Lora Composition for Multi-Concept Image Generation Xiandong Zou et.al. 2502.04923 link
2025-02-07 Integration Concept of the CBM Micro Vertex Detector Franz Matejcek et.al. 2502.04858 null
2025-02-06 ADIFF: Explaining audio difference using natural language Soham Deshmukh et.al. 2502.04476 link
2025-02-05 DreamDPO: Aligning Text-to-3D Generation with Human Preferences via Direct Preference Optimization Zhenglin Zhou et.al. 2502.04370 null
2025-02-06 BOUQuET: dataset, Benchmark and Open initiative for Universal Quality Evaluation in Translation The Omnilingual MT Team et.al. 2502.04314 null
2025-02-06 Content-Rich AIGC Video Quality Assessment via Intricate Text Alignment and Motion-Aware Consistency Shangkun Sun et.al. 2502.04076 link
2025-02-06 DICE: Distilling Classifier-Free Guidance into Text Embeddings Zhenyu Zhou et.al. 2502.03726 null
2025-02-05 Quasi-Monte Carlo Methods: What, Why, and How? Fred J. Hickernell et.al. 2502.03644 null
2025-02-05 Efficient Image Restoration via Latent Consistency Flow Matching Elad Cohen et.al. 2502.03500 null
2025-02-05 A new method for structural diagnostics with muon tomography and deep learning Lorenzo Pezzotti et.al. 2502.03339 null
2025-02-05 A Framework for Measuring the Quality of Infrastructure-as-Code Scripts Pandu Ranga Reddy Konala et.al. 2502.03127 null
2025-02-05 Poisson Flow Joint Model for Multiphase contrast-enhanced CT Rongjun Ge et.al. 2502.03079 null
2025-02-05 A Decade of Action Quality Assessment: Largest Systematic Survey of Trends, Challenges, and Future Directions Hao Yin et.al. 2502.02817 null
2025-02-04 Muographic Image Upsampling with Machine Learning for Built Infrastructure Applications William O'Donnell et.al. 2502.02624 null
2025-02-04 A comparison of translation performance between DeepL and Supertext Alex Flückiger et.al. 2502.02577 link
2025-02-04 Privacy Attacks on Image AutoRegressive Models Antoni Kowalczuk et.al. 2502.02514 link
2025-02-04 VideoJAM: Joint Appearance-Motion Representations for Enhanced Motion Generation in Video Models Hila Chefer et.al. 2502.02492 null
2025-02-04 High-Fidelity Human Avatars from Laptop Webcams using Edge Compute Akash Haridas et.al. 2502.02468 null
2025-02-04 Exploring the Feasibility of AI-Assisted Spine MRI Protocol Optimization Using DICOM Image Metadata Alice Vian et.al. 2502.02351 null
2025-02-04 When Dimensionality Hurts: The Role of LLM Embedding Compression for Noisy Regression Tasks Felix Drinkall et.al. 2502.02199 link
2025-02-04 PALQA: A Novel Parameterized Position-Aware Lossy Quantum Autoencoder using LSB Control Qubit for Efficient Image Compression Ershadul Haque et.al. 2502.02188 null
2025-02-05 IPO: Iterative Preference Optimization for Text-to-Video Generation Xiaomeng Yang et.al. 2502.02088 null
2025-02-03 Spectra of He isotopes and the $^3$He/$^4$ He ratio M. J. Boschini et.al. 2502.01887 null
2025-02-03 Sparse Measurement Medical CT Reconstruction using Multi-Fused Block Matching Denoising Priors Maliha Hossain et.al. 2502.01832 null
2025-02-03 Generating Multi-Image Synthetic Data for Text-to-Image Customization Nupur Kumari et.al. 2502.01720 null
2025-02-03 CLIP-DQA: Blindly Evaluating Dehazed Images from Global and Local Perspectives Using CLIP Yirui Zeng et.al. 2502.01707 null
2025-02-03 Proposal and Evaluation of a Practical CBCT Dose Optimization Method S. Gros et.al. 2502.01509 null
2025-02-03 Human Body Restoration with One-Step Diffusion Model and A New Benchmark Jue Gong et.al. 2502.01411 null
2025-02-03 Explainability-Driven Quality Assessment for Rule-Based Systems Oshani Seneviratne et.al. 2502.01253 null
2025-02-03 Imaging simulation of a dual-panel PET geometry with ultrafast TOF detectors Taiyo Ishikawa et.al. 2502.01006 null
2025-02-02 Weak Supervision Dynamic KL-Weighted Diffusion Models Guided by Large Language Models Julian Perry et.al. 2502.00826 null
2025-02-02 EmoTalkingGaussian: Continuous Emotion-conditioned Talking Head Synthesis Junuk Cha et.al. 2502.00654 null
2025-02-01 Deep Task-Based Beamforming and Channel Data Augmentations for Enhanced Ultrasound Imaging Ariel Amar et.al. 2502.00524 null
2025-02-01 A framework for river connectivity classification using temporal image processing and attention based neural networks Timothy James Becker et.al. 2502.00474 null
2025-01-31 Trust and Trustworthiness from Human-Centered Perspective in HRI -- A Systematic Literature Review Debora Firmino de Souza et.al. 2501.19323 null
2025-01-31 Inference-Time Text-to-Video Alignment with Diffusion Latent Beam Search Yuta Oshima et.al. 2501.19252 null
2025-01-31 Ambient Denoising Diffusion Generative Adversarial Networks for Establishing Stochastic Object Models from Noisy Image Data Xichen Xu et.al. 2501.19094 null
2025-01-31 OmniPhysGS: 3D Constitutive Gaussians for General Physics-Based Dynamics Generation Yuchen Lin et.al. 2501.18982 null
2025-01-31 Distorting Embedding Space for Safety: A Defense Mechanism for Adversarially Robust Diffusion Models Jaesin Ahn et.al. 2501.18877 link
2025-01-29 Fake News Detection After LLM Laundering: Measurement and Explanation Rupak Kumar Das et.al. 2501.18649 link
2025-01-31 Task-based Regularization in Penalized Least-Squares for Binary Signal Detection Tasks in Medical Image Denoising Wentao Chen et.al. 2501.18418 null
2025-01-30 Adaptive Video Streaming with AI-Based Optimization for Dynamic Network Conditions Mohammad Tarik et.al. 2501.18332 null
2025-01-30 AGAV-Rater: Adapting Large Multimodal Model for AI-Generated Audio-Visual Quality Assessment Yuqin Cao et.al. 2501.18314 null
2025-02-03 Efficient Feature Fusion for UAV Object Detection Xudong Wang et.al. 2501.17983 null
2025-01-29 Discrete Dielectric Coatings for Length Control and Tunability of Half-Wave Dipole Antennas at 300 MHz Magnetic Resonance Imaging Applications Aditya A Bhosale et.al. 2501.17954 null
2025-01-29 Leveraging In-Context Learning and Retrieval-Augmented Generation for Automatic Question Generation in Educational Domains Subhankar Maity et.al. 2501.17397 null
2025-01-29 On the Coexistence and Ensembling of Watermarks Aleksandar Petrov et.al. 2501.17356 link
2025-01-28 Giving the Old a Fresh Spin: Quality Estimation-Assisted Constrained Decoding for Automatic Post-Editing Sourabh Deoghare et.al. 2501.17265 null
2025-01-27 Audio Large Language Models Can Be Descriptive Speech Quality Evaluators Chen Chen et.al. 2501.17202 null
2025-01-31 IC-Portrait: In-Context Matching for View-Consistent Personalized Portrait Han Yang et.al. 2501.17159 null
2025-01-28 Three-Dimensional Diffusion-Weighted Multi-Slab MRI With Slice Profile Compensation Using Deep Energy Model Reza Ghorbani et.al. 2501.17152 null
2025-01-28 Evaluating CrowdSplat: Perceived Level of Detail for Gaussian Crowds Xiaohan Sun et.al. 2501.17085 null
2025-01-28 EdgeMLOps: Operationalizing ML models with Cumulocity IoT and thin-edge.io for Visual quality Inspection Kanishk Chaturvedi et.al. 2501.17062 null
2025-01-28 EZOA: Nançay HI follow-up observations in the Zone of Avoidance A. C. Schröder et.al. 2501.17038 null
2025-01-28 Image-Space Gridding for Nonrigid Motion-Corrected MR Image Reconstruction Kwang Eun Jang et.al. 2501.16713 null
2025-01-25 MambaTron: Efficient Cross-Modal Point Cloud Enhancement using Aggregate Selective State Space Modeling Sai Tarun Inaganti et.al. 2501.16384 null
2025-01-27 Adaptive Iterative Compression for High-Resolution Files: an Approach Focused on Preserving Visual Quality in Cinematic Workflows Leonardo Melo et.al. 2501.16319 null
2025-01-27 UDBE: Unsupervised Diffusion-based Brightness Enhancement in Underwater Images Tatiana Taís Schein et.al. 2501.16211 link
2025-01-27 Skeleton-Guided-Translation: A Benchmarking Framework for Code Repository Translation with Fine-Grained Quality Evaluation Xing Zhang et.al. 2501.16050 null
2025-01-30 Can Location Embeddings Enhance Super-Resolution of Satellite Imagery? Daniel Panangian et.al. 2501.15847 null
2025-01-26 Advancing quantum imaging through learning theory Yunkai Wang et.al. 2501.15685 null
2025-01-26 Radiologist-in-the-Loop Self-Training for Generalizable CT Metal Artifact Reduction Chenglong Ma et.al. 2501.15610 link
2025-01-26 Differentiable Low-computation Global Correlation Loss for Monotonicity Evaluation in Quality Assessment Yipeng Liu et.al. 2501.15485 null
2025-01-25 Image formation theory of optical coherence tomography with optical aberrations and its application for computational aberration correction Shuichi Makita et.al. 2501.15011 null
2025-01-24 SyncAnimation: A Real-Time End-to-End Framework for Audio-Driven Human Pose and Talking Head Animation Yujian Liu et.al. 2501.14646 null
2025-01-24 WanJuanSiLu: A High-Quality Open-Source Webtext Dataset for Low-Resource Languages Jia Yu et.al. 2501.14506 link
2025-01-24 Enhancing Intelligibility for Generative Target Speech Extraction via Joint Optimization with Target Speaker ASR Hao Ma et.al. 2501.14477 null
2025-01-24 Deep Learning-Powered Classification of Thoracic Diseases in Chest X-Rays Yiming Lei et.al. 2501.14279 null
2025-01-24 CDI: Blind Image Restoration Fidelity Evaluation based on Consistency with Degraded Image Xiaojun Tang et.al. 2501.14264 null
2025-01-24 GreedyPixel: Fine-Grained Black-Box Adversarial Attack Via Greedy Algorithm Hanrui Wang et.al. 2501.14230 null
2025-01-24 Sparse Mixture-of-Experts for Non-Uniform Noise Reduction in MRI Images Zeyun Deng et.al. 2501.14198 null
2025-01-24 VideoShield: Regulating Diffusion-based Video Generation Models via Watermarking Runyi Hu et.al. 2501.14195 link
2025-01-23 AdEval: Alignment-based Dynamic Evaluation to Mitigate Data Contamination in Large Language Models Yang Fan et.al. 2501.13983 null
2025-01-23 Improving Video Generation with Human Feedback Jie Liu et.al. 2501.13918 null
2025-01-23 VARFVV: View-Adaptive Real-Time Interactive Free-View Video Streaming with Edge Computing Qiang Hu et.al. 2501.13630 null
2025-01-23 Diffusion-based Perceptual Neural Video Compression with Temporal Diffusion Information Reuse Wenzhuo Ma et.al. 2501.13528 null
2025-01-23 LDR-Net: A Novel Framework for AI-generated Image Detection via Localized Discrepancy Representation JiaXin Chen et.al. 2501.13475 null
2025-01-23 From Images to Point Clouds: An Efficient Solution for Cross-media Blind Quality Assessment without Annotated Training Yipeng Liu et.al. 2501.13387 null
2025-01-23 Enhanced Extractor-Selector Framework and Symmetrization Weighted Binary Cross-Entropy for Edge Detections Hao Shu et.al. 2501.13365 null
2025-01-22 UniRestore: Unified Perceptual and Task-Oriented Image Restoration Model Using Diffusion Prior I-Hsiang Chen et.al. 2501.13134 null
2025-01-23 Accelerate High-Quality Diffusion Models with Inner Loop Feedback Matthew Gwilliam et.al. 2501.13107 null
2025-01-22 Real-time Terahertz Compressive Optical-Digital Neural Network Imaging Shao-Hsuan Wu et.al. 2501.13065 null
2025-01-22 Sketch and Patch: Efficient 3D Gaussian Representation for Man-Made Scenes Yuang Shi et.al. 2501.13045 null
2025-01-22 Characterizing Collective Efforts in Content Sharing and Quality Control for ADHD-relevant Content on Video-sharing Platforms Hanxiu 'Hazel' Zhu et.al. 2501.13020 null
2025-01-22 Paper Quality Assessment based on Individual Wisdom Metrics from Open Peer Review Andrii Zahorodnii et.al. 2501.13014 null
2025-01-22 SoundSpring: Loss-Resilient Audio Transceiver with Dual-Functional Masked Language Modeling Shengshi Yao et.al. 2501.12696 null
2025-01-22 Approximate Puzzlepiece Compositing Xuan Huang et.al. 2501.12581 null
2025-01-21 Interaction Dataset of Autonomous Vehicles with Traffic Lights and Signs Zheng Li et.al. 2501.12536 null
2025-01-21 Bidirectional Brain Image Translation using Transfer Learning from Generic Pre-trained Models Fatima Haimour et.al. 2501.12488 null
2025-01-21 DiffDoctor: Diagnosing Image Diffusion Models Before Treating Yiyang Wang et.al. 2501.12382 null
2025-01-21 Regressor-Guided Image Editing Regulates Emotional Response to Reduce Online Engagement Christoph Gebhardt et.al. 2501.12289 null
2025-01-21 A Dynamic Programming Framework for Generating Approximately Diverse and Optimal Solutions Waldo Gálvez et.al. 2501.12261 null
2025-01-21 Joint Reconstruction and Motion Estimation in Sparse-View 4DCT Using Diffusion Models within a Blind Inverse Problem Framework Antoine De Paepe et.al. 2501.12249 null
2025-01-21 DLEN: Dual Branch of Transformer for Low-Light Image Enhancement in Dual Domains Junyu Xia et.al. 2501.12235 null
2025-01-21 RL-RC-DoT: A Block-level RL agent for Task-Aware Video Compression Uri Gadot et.al. 2501.12216 null
2025-01-21 Fast-RF-Shimming: Accelerate RF Shimming in 7T MRI using Deep Learning Zhengyi Lu et.al. 2501.12157 null
2025-01-21 A Multi-annotated and Multi-modal Dataset for Wide-angle Video Quality Assessment Bo Hu et.al. 2501.12082 null
2025-01-22 GSVC: Efficient Video Representation and Compression Through 2D Gaussian Splatting Longan Wang et.al. 2501.12060 null
2025-01-21 Power Amplifier-Aware Transmit Power Optimization for OFDM and SC-FDMA Systems Pawel Kryszkiewicz et.al. 2501.11994 null
2025-01-21 Bayesian Despeckling of Structured Sources Ali Zafari et.al. 2501.11860 null
2025-01-20 EfficientVITON: An Efficient Virtual Try-On Model using Optimized Diffusion Process Mostafa Atef et.al. 2501.11776 null
2025-01-20 Teaching Large Language Models to Regress Accurate Image Quality Scores using Score Distribution Zhiyuan You et.al. 2501.11561 null
2025-01-20 Fundus Image Quality Assessment and Enhancement: a Systematic Review Heng Li et.al. 2501.11520 null
2025-01-20 Multitask Auxiliary Network for Perceptual Quality Assessment of Non-Uniformly Distorted Omnidirectional Images Jiebin Yan et.al. 2501.11512 link
2025-01-20 Subjective and Objective Quality Assessment of Non-Uniformly Distorted Omnidirectional Images Jiebin Yan et.al. 2501.11511 link
2025-01-20 See In Detail: Enhancing Sparse-view 3D Gaussian Splatting with Local Depth and Semantic Regularization Zongqi He et.al. 2501.11508 null
2025-01-20 Advancing Oyster Phenotype Segmentation with Multi-Network Ensemble and Multi-Scale mechanism Wenli Yang et.al. 2501.11203 null
2025-01-19 Unit Region Encoding: A Unified and Compact Geometry-aware Representation for Floorplan Applications Huichao Zhang et.al. 2501.11097 null
2025-01-18 EMO2: End-Effector Guided Audio-Driven Avatar Video Generation Linrui Tian et.al. 2501.10687 null
2025-01-17 Fundamental mode power estimation through a $M^2$ -measurement Filipp Lausch et.al. 2501.10345 null
2025-01-17 DiffStereo: High-Frequency Aware Diffusion Model for Stereo Image Restoration Huiyun Cao et.al. 2501.10325 null
2025-01-17 CSHNet: A Novel Information Asymmetric Image Translation Method Xi Yang et.al. 2501.10197 link
2025-01-17 DiffVSR: Enhancing Real-World Video Super-Resolution with Diffusion Models for Advanced Visual Quality and Temporal Consistency Xiaohui Li et.al. 2501.10110 null
2025-01-17 CLIP-PCQA: Exploring Subjective-Aligned Vision-Language Modeling for Point Cloud Quality Assessment Yating Liu et.al. 2501.10071 link
2025-01-17 One-D-Piece: Image Tokenizer Meets Quality-Controllable Compression Keita Miwa et.al. 2501.10064 null
2025-01-17 CaFA: Cost-aware, Feasible Attacks With Database Constraints Against Neural Tabular Classifiers Matan Ben-Tov et.al. 2501.10013 link
2025-01-17 IE-Bench: Advancing the Measurement of Text-Driven Image Editing for Human Perception Alignment Shangkun Sun et.al. 2501.09927 null
2025-01-17 Decoding Patterns of Data Generation Teams for Clinical and Scientific Success: Insights from the Bridge2AI Talent Knowledge Graph Jiawei Xu et.al. 2501.09897 null
2025-01-16 EraseBench: Understanding The Ripple Effects of Concept Erasure Techniques Ibtihel Amara et.al. 2501.09833 null
2025-01-16 Scan-Adaptive MRI Undersampling Using Neighbor-based Optimization (SUNO) Siddhant Gautam et.al. 2501.09799 link
2025-01-16 Evaluating Conversational Recommender Systems with Large Language Models: A User-Centric Evaluation Framework Nuo Chen et.al. 2501.09493 null
2025-01-16 Joint Transmission and Deblurring: A Semantic Communication Approach Using Events Pujing Yang et.al. 2501.09396 null
2025-01-16 PATCHEDSERVE: A Patch Management Framework for SLO-Optimized Hybrid Resolution Diffusion Serving Desen Sun et.al. 2501.09253 null
2025-01-16 Estimating Task-based Performance Bounds for Accelerated MRI Image Reconstruction Methods by Use of Learned-Ideal Observers Kaiyan Li et.al. 2501.09224 null
2025-01-15 UNIR-Net: A Novel Approach for Restoring Underwater Images with Non-Uniform Illumination Using Synthetic Data Ezequiel Perez-Zarate et.al. 2501.09053 link
2025-01-15 Lights, Camera, Matching: The Role of Image Illumination in Fair Face Recognition Gabriella Pangelinan et.al. 2501.08910 null
2025-01-15 XMusic: Towards a Generalized and Controllable Symbolic Music Generation Framework Sida Tian et.al. 2501.08809 null
2025-01-16 Holoview: Interactive 3D visualization of medical data in AR Pankaj Kaushik et.al. 2501.08736 null
2025-01-15 DynamicFace: High-Quality and Consistent Video Face Swapping using Composable 3D Facial Priors Runqi Wang et.al. 2501.08553 null
2025-01-15 Comprehensive Subjective and Objective Evaluation Method for Text-generated Video Zelu Qi et.al. 2501.08545 null
2025-01-14 Head Motion Degrades Machine Learning Classification of Alzheimer's Disease from Positron Emission Tomography Eléonore V. Lieffrig et.al. 2501.08459 null
2025-01-14 Vchitect-2.0: Parallel Transformer for Scaling Up Video Diffusion Models Weichen Fan et.al. 2501.08453 null
2025-01-14 Cross-Modal Transferable Image-to-Video Attack on Video Quality Metrics Georgii Gotin et.al. 2501.08415 link
2025-01-14 Rolling phase modulation regime for dynamic full field OCT Tual Monfort et.al. 2501.08359 null
2025-01-15 Optical information encryption using general temporal ghost imaging with practical experimental condition Juan Wu et.al. 2501.08136 null
2025-01-13 Evaluating Human Perception of Novel View Synthesis: Subjective Quality Assessment of Gaussian Splatting and NeRF in Dynamic Scenes Yuhang Zhang et.al. 2501.08072 null
2025-01-14 VENOM: Text-driven Unrestricted Adversarial Example Generation with Diffusion Models Hui Kuurila-Zhang et.al. 2501.07922 link
2025-01-14 Demographic Variability in Face Image Quality Measures Wassim Kabbani et.al. 2501.07898 null
2025-01-14 State-of-the-Art Transformer Models for Image Super-Resolution: Techniques, Challenges, and Applications Debasish Dutta et.al. 2501.07855 null
2025-01-13 FaceOracle: Chat with a Face Image Oracle Wassim Kabbani et.al. 2501.07202 null
2025-01-13 Radial Distortion in Face Images: Detection and Impact Wassim Kabbani et.al. 2501.07179 null
2025-01-13 Eye Sclera for Fair Face Image Quality Assessment Wassim Kabbani et.al. 2501.07158 null
2025-01-13 Privacy-Preserving Data Quality Assessment for Time-Series IoT Sensors Novoneel Chakraborty et.al. 2501.07154 null
2025-01-13 Video Quality Assessment for Online Processing: From Spatial to Temporal Sampling Jiebin Yan et.al. 2501.07087 null
2025-01-12 Real-Time Neural-Enhancement for Online Cloud Gaming Shan Jiang et.al. 2501.06880 null
2025-01-14 Generalized and Efficient 2D Gaussian Splatting for Arbitrary-scale Super-Resolution Du Chen et.al. 2501.06838 null
2025-01-11 NVS-SQA: Exploring Self-Supervised Quality Representation Learning for Neurally Synthesized Scenes without References Qiang Qu et.al. 2501.06488 link
2025-01-10 VideoAuteur: Towards Long Narrative Video Generation Junfei Xiao et.al. 2501.06173 null
2025-01-10 CamCtrl3D: Single-Image Scene Exploration with Precise 3D Camera Control Stefan Popov et.al. 2501.06006 null
2025-01-10 Universal-2-TF: Robust All-Neural Text Formatting for ASR Yash Khare et.al. 2501.05948 null
2025-01-10 UltraRay: Full-Path Ray Tracing for Enhancing Realism in Ultrasound Simulation Felix Duelmer et.al. 2501.05828 null
2025-01-13 AI-Driven Diabetic Retinopathy Screening: Multicentric Validation of AIDRSS in India Amit Kr Dey et.al. 2501.05826 null
2025-01-10 Conditional Diffusion Model for Electrical Impedance Tomography Duanpeng Shi et.al. 2501.05769 null
2025-01-10 LLVD: LSTM-based Explicit Motion Modeling in Latent Space for Blind Video Denoising Loay Rashid et.al. 2501.05744 null
2025-01-10 FIRM: Federated Image Reconstruction using Multimodal Tomographic Data Geunyeong Byeon et.al. 2501.05642 null
2025-01-09 Interpretable deep learning illuminates multiple structures fluorescence imaging: a path toward trustworthy artificial intelligence in microscopy Mingyang Chen et.al. 2501.05490 null
2025-01-09 Consistent Flow Distillation for Text-to-3D Generation Runjie Yan et.al. 2501.05445 null
2025-01-09 Scaffold-SLAM: Structured 3D Gaussians for Simultaneous Localization and Photorealistic Mapping Wen Tianci et.al. 2501.05242 null
2025-01-09 3DIS-FLUX: simple and efficient multi-instance generation with DiT rendering Dewei Zhou et.al. 2501.05131 null
2025-01-09 TipSegNet: Fingertip Segmentation in Contactless Fingerprint Imaging Laurenz Ruzicka et.al. 2501.05076 null
2025-01-09 Towards Fingerprint Mosaicking Artifact Detection: A Self-Supervised Deep Learning Approach Laurenz Ruzicka et.al. 2501.05034 null
2025-01-08 Enhancing Virtual Try-On with Synthetic Pairs and Error-Aware Noise Scheduling Nannan Li et.al. 2501.04666 null
2025-01-08 Enhancing Low-Cost Video Editing with Lightweight Adaptors and Temporal-Aware Inversion Yangfan He et.al. 2501.04606 link
2025-01-08 When LLMs Struggle: Reference-less Translation Evaluation for Low-resource Languages Archchana Sindhujan et.al. 2501.04473 null
2025-01-08 Enhancing kidney quality assessment: Power Doppler during normothermic machine perfusion Yitian Fang et.al. 2501.04457 null
2025-01-08 iFADIT: Invertible Face Anonymization via Disentangled Identity Transform Lin Yuan et.al. 2501.04390 null
2025-01-08 DGQ: Distribution-Aware Group Quantization for Text-to-Image Diffusion Models Hyogon Ryu et.al. 2501.04304 link
2025-01-07 Spatiotemporal Gaussian Optimization for 4D Cone Beam CT Reconstruction from Sparse Projections Yabo Fu et.al. 2501.04140 link
2025-01-07 Motion-Aware Generative Frame Interpolation Guozhen Zhang et.al. 2501.03699 null
2025-01-07 Action Quality Assessment via Hierarchical Pose-guided Multi-stage Contrastive Regression Mengshi Qi et.al. 2501.03674 link
2025-01-07 Deep Learning-based Compression Detection for explainable Face Image Quality Assessment Laurin Jonientz et.al. 2501.03619 link
2025-01-07 A generative approach for lensless imaging in low-light conditions Ziyang Liu et.al. 2501.03511 null
2025-01-07 Can Deep Learning Trigger Alerts from Mobile-Captured Images? Pritisha Sarkar et.al. 2501.03499 null
2025-01-06 A Trust-Guided Approach to MR Image Reconstruction with Side Information Arda Atalık et.al. 2501.03021 link
2025-01-06 Quality Estimation based Feedback Training for Improving Pronoun Translation Harshit Dhankhar et.al. 2501.03008 null
2025-01-06 GLFC: Unified Global-Local Feature and Contrast Learning with Mamba-Enhanced UNet for Synthetic CT Generation from CBCT Xianhao Zhou et.al. 2501.02992 link
2025-01-06 Region of Interest based Medical Image Compression Utkarsh Prakash Srivastava et.al. 2501.02895 null
2025-01-06 COph100: A comprehensive fundus image registration dataset from infants constituting the "RIDIRP" database Yan Hu et.al. 2501.02800 null
2025-01-06 Ultrasound-QBench: Can LLMs Aid in Quality Assessment of Ultrasound Imaging? Hongyi Miao et.al. 2501.02751 null
2025-01-06 Brick-Diffusion: Generating Long Videos with Brick-to-Wall Denoising Yunlong Yuan et.al. 2501.02741 null
2025-01-06 Artificial Intelligence in Creative Industries: Advances Prior to 2025 Nantheera Anantrasirichai et.al. 2501.02725 null
2025-01-06 Multilevel Semantic-Aware Model for AI-Generated Video Quality Assessment Jiaze Li et.al. 2501.02706 null
2025-01-05 DepthMaster: Taming Diffusion Models for Monocular Depth Estimation Ziyang Song et.al. 2501.02576 link
2025-01-05 Multi-LLM Collaborative Caption Generation in Scientific Documents Jaeyoung Kim et.al. 2501.02552 link
2025-01-05 Pixel-Wise Feature Selection for Perceptual Edge Detection without post-processing Hao Shu et.al. 2501.02534 null
2025-01-07 ACE++: Instruction-Based Image Creation and Editing via Context-Aware Content Filling Chaojie Mao et.al. 2501.02487 null
2025-01-05 Reducing the Gap Between Pretrained Speech Enhancement and Recognition Models Using a Real Speech-Trained Bridging Module Zhongjian Cui et.al. 2501.02452 null
2025-01-05 Journey into Automation: Image-Derived Pavement Texture Extraction and Evaluation Bingjie Lu et.al. 2501.02414 null
2025-01-04 Optimizing Audio Compression Through Entropy-Controlled Dithering Ellison Murray et.al. 2501.02293 null
2025-01-04 TDM: Temporally-Consistent Diffusion Model for All-in-One Real-World Video Restoration Yizhou Li et.al. 2501.02269 null
2025-01-04 Exploring Secure Machine Learning Through Payload Injection and FGSM Attacks on ResNet-50 Umesh Yadav et.al. 2501.02147 null
2025-01-03 JoyGen: Audio-Driven 3D Depth-Aware Talking-Face Video Editing Qili Wang et.al. 2501.01798 link
2025-01-03 Multi-modal classification of forest biodiversity potential from 2D orthophotos and 3D airborne laser scanning point clouds Simon B. Jensen et.al. 2501.01728 null
2025-01-03 Aesthetic Matters in Music Perception for Image Stylization: A Emotion-driven Music-to-Visual Manipulation Junjie Xu et.al. 2501.01700 null
2025-01-02 A Metasemantic-Metapragmatic Framework for Taxonomizing Multimodal Communicative Alignment Eugene Yu Ji et.al. 2501.01535 null
2025-01-02 Embedding Similarity Guided License Plate Super Resolution Abderrezzaq Sendjasni et.al. 2501.01483 null
2024-12-31 Estimation of 3T MR images from 1.5T images regularized with Physics based Constraint Prabhjot Kaur et.al. 2501.01464 null
2024-12-31 GDSR: Global-Detail Integration through Dual-Branch Network with Wavelet Losses for Remote Sensing Image Super-Resolution Qiwei Zhu et.al. 2501.01460 null
2025-01-02 ScarNet: A Novel Foundation Model for Automated Myocardial Scar Quantification from LGE in Cardiac MRI Neda Tavakoli et.al. 2501.01372 link
2025-01-02 TexAVi: Generating Stereoscopic VR Video Clips from Text Descriptions Vriksha Srihari et.al. 2501.01156 null
2025-01-02 HarmonyIQA: Pioneering Benchmark and Model for Image Harmonization Quality Assessment Zitong Xu et.al. 2501.01116 null
2025-01-02 Generalized Task-Driven Medical Image Quality Enhancement with Gradient Promotion Dong Zhang et.al. 2501.01114 null
2025-01-02 EliGen: Entity-Level Controlled Image Generation with Regional Attention Hong Zhang et.al. 2501.01097 link
2025-01-02 Enhancing Precision of Automated Teller Machines Network Quality Assessment: Machine Learning and Multi Classifier Fusion Approaches Alireza Safarzadeh et.al. 2501.01067 null
2025-01-01 Deconstructing the emission order of protons, neutrons and $α$-particles following fusion in $^{28,30,32}$Si + $^{28}$ Si Rohit Kumar et.al. 2501.00963 null
2025-01-01 Enhancing Early Diabetic Retinopathy Detection through Synthetic DR1 Image Generation: A StyleGAN3 Approach Sagarnil Das et.al. 2501.00954 null
2025-01-01 SPADE: Enhancing Adaptive Cyber Deception Strategies with Generative AI and Structured Prompt Engineering Shihab Ahmed et.al. 2501.00940 null
2025-01-01 Hierarchical Vision-Language Alignment for Text-to-Image Generation via Diffusion Models Emily Johnson et.al. 2501.00917 null
2025-01-01 Text2Earth: Unlocking Text-driven Remote Sensing Image Generation with a Global-Scale Dataset and a Foundation Model Chenyang Liu et.al. 2501.00895 null
2025-01-01 RORem: Training a Robust Object Remover with Human-in-the-Loop Ruibin Li et.al. 2501.00740 link
2024-12-31 Token Pruning for Caching Better: 9 Times Acceleration on Stable Diffusion for Free Evelyn Zhang et.al. 2501.00375 link
2024-12-31 SG-Splatting: Accelerating 3D Gaussian Splatting with Spherical Gaussians Yiwen Wang et.al. 2501.00342 null
2024-12-31 Improving image quality of the Solar Disk Imager (SDI) of the Lyman-alpha Solar Telescope (LST) onboard the ASO-S mission Hui Liu et.al. 2501.00231 null
2024-12-30 What Makes for a Good Stereoscopic Image? Netanel Y. Tamir et.al. 2412.21127 null
2024-12-30 VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation Jiazheng Xu et.al. 2412.21059 link
2024-12-30 DDIM sampling for Generative AIBIM, a faster intelligent structural design framework Zhili He et.al. 2412.20899 null
2024-12-30 Acquisition-Independent Deep Learning for Quantitative MRI Parameter Estimation using Neural Controlled Differential Equations Daan Kuppens et.al. 2412.20844 null
2024-12-30 4D Gaussian Splatting: Modeling Dynamic Scenes with Native 4D Primitives Zeyu Yang et.al. 2412.20720 null
2024-12-29 Single-image reflection removal via self-supervised diffusion models Zhengyang Lu et.al. 2412.20466 null
2024-12-29 ESVQA: Perceptual Quality Assessment of Egocentric Spatial Videos Xilei Zhu et.al. 2412.20423 null
2024-12-29 Bringing Objects to Life: 4D generation from 3D objects Ohad Rahamim et.al. 2412.20422 null
2024-12-28 An Ordinary Differential Equation Sampler with Stochastic Start for Diffusion Bridge Models Yuang Wang et.al. 2412.19992 null
2024-12-27 Structural Similarity in Deep Features: Image Quality Assessment Robust to Geometrically Disparate Reference Keke Zhang et.al. 2412.19553 null
2024-12-30 DrivingWorld: Constructing World Model for Autonomous Driving via Video GPT Xiaotao Hu et.al. 2412.19505 link
2024-12-27 RAIN: Real-time Animation of Infinite Video Stream Zhilei Shu et.al. 2412.19489 null
2024-12-27 Generative Adversarial Network on Motion-Blur Image Restoration Zhengdong Li et.al. 2412.19479 null
2024-12-27 Adrenaline: Adaptive Rendering Optimization System for Scalable Cloud Gaming Jin Heo et.al. 2412.19446 null
2024-12-27 The Hobby-Eberly Telescope Dark Energy Experiment Survey (HETDEX) Active Galactic Nuclei Catalog: the Fourth Data Release Chenxu Liu et.al. 2412.19414 null
2024-12-26 Reflective Gaussian Splatting Yuxuan Yao et.al. 2412.19282 null
2024-12-26 FineVQ: Fine-Grained User Generated Content Video Quality Assessment Huiyu Duan et.al. 2412.19238 null
2024-12-26 FACEMUG: A Multimodal Generative and Fusion Framework for Local Facial Editing Wanglong Lu et.al. 2412.19009 null
2024-12-25 TINQ: Temporal Inconsistency Guided Blind Video Quality Assessment Yixiao Li et.al. 2412.18933 link
2024-12-25 ArtNVG: Content-Style Separated Artistic Neighboring-View Gaussian Stylization Zixiao Gu et.al. 2412.18783 null
2024-12-25 Embodied Image Quality Assessment for Robotic Intelligence Jianbo Zhang et.al. 2412.18774 link
2024-12-25 MRI Reconstruction with Regularized 3D Diffusion Model (R3DM) Arya Bangun et.al. 2412.18723 null
2024-12-24 ZenSVI: An Open-Source Software for the Integrated Acquisition, Processing and Analysis of Street View Imagery Towards Scalable Urban Science Koichi Ito et.al. 2412.18641 link
2024-12-24 Long-Form Speech Generation with Spoken Language Models Se Jin Park et.al. 2412.18603 link
2024-12-24 LatentCRF: Continuous CRF for Efficient Latent Diffusion Kanchana Ranasinghe et.al. 2412.18596 null
2024-12-24 Agreement of Image Quality Metrics with Radiological Evaluation in the Presence of Motion Artifacts Elisa Marchetto et.al. 2412.18389 null
2024-12-24 RSGaussian:3D Gaussian Splatting with LiDAR for Aerial Remote Sensing Novel View Synthesis Yiling Yao et.al. 2412.18380 null
2024-12-24 Expand VSR Benchmark for VLLM to Expertize in Spatial Rules Peijin Xie et.al. 2412.18224 link
2024-12-24 Image Quality Assessment: Exploring Regional Heterogeneity via Response of Adaptive Multiple Quality Factors in Dictionary Space Xuting Lan et.al. 2412.18160 null
2024-12-24 DepthLab: From Partial to Complete Zhiheng Liu et.al. 2412.18153 null
2024-12-24 AEIOU: A Unified Defense Framework against NSFW Prompts in Text-to-Image Models Yiming Wang et.al. 2412.18123 null
2024-12-24 SAR Despeckling via Log-Yeo-Johnson Transformation and Sparse Representation Xuran Hu et.al. 2412.18121 null
2024-12-24 An Ensemble Approach to Short-form Video Quality Assessment Using Multimodal LLM Wen Wen et.al. 2412.18060 null
2024-12-23 ANID: How Far Are We? Evaluating the Discrepancies Between AI-synthesized Images and Natural Images through Multimodal Guidance Renyang Liu et.al. 2412.17632 link
2024-12-23 HumanVBench: Exploring Human-Centric Video Understanding Capabilities of MLLMs with Synthetic Benchmark Data Ting Zhou et.al. 2412.17574 link
2024-12-24 An Evaluation Framework for Product Images Background Inpainting based on Human Feedback and Product Consistency Yuqi Liang et.al. 2412.17504 null
2024-12-23 Predicting Satisfied User and Machine Ratio for Compressed Images: A Unified Approach Qi Zhang et.al. 2412.17477 null
2024-12-23 Assessment of Deep-Learning Methods for the Enhancement of Experimental Low Dose Dental CBCT Volumes Louise Friot--Giroux et.al. 2412.17423 null
2024-12-23 Balanced 3DGS: Gaussian-wise Parallelism Rendering with Fine-Grained Tiling Hao Gui et.al. 2412.17378 null
2024-12-23 FFA Sora, video generation as fundus fluorescein angiography simulator Xinyuan Wu et.al. 2412.17346 null
2024-12-23 GCS-M3VLT: Guided Context Self-Attention based Multi-modal Medical Vision Language Transformer for Retinal Image Captioning Teja Krishna Cherukuri et.al. 2412.17251 null
2024-12-22 Deep Joint Source Channel Coding for Secure End-to-End Image Transmission Mehdi Letafati et.al. 2412.17110 null
2024-12-24 ErasableMask: A Robust and Erasable Privacy Protection Scheme against Black-box Face Recognition Models Sipeng Shen et.al. 2412.17038 null
2024-12-22 PromptDresser: Improving the Quality and Controllability of Virtual Try-On via Generative Textual Prompt and Prompt-aware Mask Jeongho Kim et.al. 2412.16978 link
2024-12-22 Image Quality Assessment: Investigating Causal Perceptual Effects with Abductive Counterfactual Inference Wenhao Shen et.al. 2412.16939 null
2024-12-22 Time-Graph Frequency Representation with Singular Value Decomposition for Neural Speech Enhancement Tingting Wang et.al. 2412.16823 link
2024-12-21 RoomPainter: View-Integrated Diffusion for Consistent Indoor Scene Texturing Zhipeng Huang et.al. 2412.16778 null
2024-12-21 VAST 1.0: A Unified Framework for Controllable and Consistent Video Generation Chi Zhang et.al. 2412.16677 null
2024-12-21 Complementary Advantages: Exploiting Cross-Field Frequency Correlation for NIR-Assisted Image Denoising Yuchen Wang et.al. 2412.16645 null
2024-12-21 OmniSplat: Taming Feed-Forward 3D Gaussian Splatting for Omnidirectional Images with Editable Capabilities Suyoung Lee et.al. 2412.16604 null
2024-12-21 A Generalizable 3D Diffusion Framework for Low-Dose and Few-View Cardiac SPECT Huidong Xie et.al. 2412.16573 null
2024-12-21 Federal Learning Framework for Quality Evaluation of Blastomere Cleavage Jung-Hua Wang et.al. 2412.16567 null
2024-12-21 Positive2Negative: Breaking the Information-Lossy Barrier in Self-Supervised Single Image Denoising Tong Li et.al. 2412.16460 null
2024-12-20 IMPLY-based Approximate Full Adders for Efficient Arithmetic Operations in Image Processing and Machine Learning Melanie Qiu et.al. 2412.15888 null
2024-12-20 Image Quality Assessment: Enhancing Perceptual Exploration and Interpretation with Collaborative Feature Refinement and Hausdorff distance Xuekai Wei et.al. 2412.15847 null
2024-12-20 DOLLAR: Few-Step Video Generation via Distillation and Latent Reward Optimization Zihan Ding et.al. 2412.15689 null
2024-12-20 AI-generated Image Quality Assessment in Visual Communication Yu Tian et.al. 2412.15677 link
2024-12-20 Underwater Image Quality Assessment: A Perceptual Framework Guided by Physical Imaging Weizhi Xian et.al. 2412.15527 null
2024-12-19 Log-Time K-Means Clustering for 1D Data: Novel Approaches with Proof and Implementation Jake Hyun et.al. 2412.15295 link
2024-12-18 A Systematic Examination of Preference Learning through the Lens of Instruction-Following Joongwon Kim et.al. 2412.15282 null
2024-12-19 SqueezeMe: Efficient Gaussian Avatars for VR Shunsuke Saito et.al. 2412.15171 null
2024-12-19 OnlineVPO: Align Video Diffusion Model with Online Video-Centric Preference Optimization Jiacheng Zhang et.al. 2412.15159 null
2024-12-19 Jet: A Modern Transformer-Based Normalizing Flow Alexander Kolesnikov et.al. 2412.15129 null
2024-12-19 Joint estimation of activity, attenuation and motion in respiratory-self-gated time-of-flight PET Masoud Elhamiasl et.al. 2412.15018 null
2024-12-19 Unified Image Restoration and Enhancement: Degradation Calibrated Cycle Reconstruction Diffusion Model Minglong Xue et.al. 2412.14630 link
2024-12-19 Qua $^2$ SeDiMo: Quantifiable Quantization Sensitivity of Diffusion Models Keith G. Mills et.al. 2412.14628 null
2024-12-19 Successive optimization of optics and post-processing with differentiable coherent PSF operator and field information Zheng Ren et.al. 2412.14603 link
2024-12-19 Enhancing Diffusion Models for High-Quality Image Generation Jaineet Shah et.al. 2412.14422 null
2024-12-18 Improving diabetic retinopathy screening using Artificial Intelligence: design, evaluation and before-and-after study of a custom development Imanol Pinto et.al. 2412.14221 null
2024-12-19 E-CAR: Efficient Continuous Autoregressive Image Generation via Multistage Modeling Zhihang Yuan et.al. 2412.14170 null
2024-12-18 VideoDPO: Omni-Preference Alignment for Video Diffusion Generation Runtao Liu et.al. 2412.14167 null
2024-12-18 AKiRa: Augmentation Kit on Rays for optical video generation Xi Wang et.al. 2412.14158 null
2024-12-18 Real-Time Position-Aware View Synthesis from Single-View Input Manu Gond et.al. 2412.14005 null
2024-12-18 Data-Efficient Inference of Neural Fluid Fields via SciML Foundation Model Yuqiu Liu et.al. 2412.13897 null
2024-12-18 VIIS: Visible and Infrared Information Synthesis for Severe Low-light Image Enhancement Chen Zhao et.al. 2412.13655 link
2024-12-18 PASCO (PArallel Structured COarsening): an overlay to speed up graph clustering algorithms Etienne Lasalle et.al. 2412.13592 link
2024-12-18 T $^3$ -S2S: Training-free Triplet Tuning for Sketch to Scene Generation Zhenhong Sun et.al. 2412.13486 link
2024-12-18 Real-time One-Step Diffusion-based Expressive Portrait Videos Generation Hanzhong Guo et.al. 2412.13479 link
2024-12-17 Optimisation of Magnetic Field Sensing with Optically Pumped Magnetometers for Magnetic Detection Electrical Impedance Tomography Kai Mason et.al. 2412.13354 null
2024-12-17 Real-time Free-view Human Rendering from Sparse-view RGB Videos using Double Unprojected Textures Guoxing Sun et.al. 2412.13183 null
2024-12-17 F-Bench: Rethinking Human Preference Evaluation Metrics for Benchmarking Face Generation, Customization, and Restoration Lu Liu et.al. 2412.13155 null
2024-12-17 Unlocking the Potential of Digital Pathology: Novel Baselines for Compression Maximilian Fischer et.al. 2412.13137 null
2024-12-18 AIR-Bench: Automated Heterogeneous Information Retrieval Benchmark Jianlyu Chen et.al. 2412.13102 link
2024-12-17 Smartphone-based Iris Recognition through High-Quality Visible Spectrum Iris Capture Naveenkumar G Venkataswamy et.al. 2412.13063 null
2024-12-17 Experimental Study of Low-Latency Video Streaming in an ORAN Setup with Generative AI Andreas Casparsen et.al. 2412.12751 null
2024-12-17 Subspace Implicit Neural Representations for Real-Time Cardiac Cine MR Imaging Wenqi Huang et.al. 2412.12742 link
2024-12-17 Complex extension of optical flow and its practical evaluation for undersampled dynamic MRI Matthias J. Ehrhardt et.al. 2412.12711 null
2024-12-17 A Two-Fold Patch Selection Approach for Improved 360-Degree Image Quality Assessment Abderrezzaq Sendjasni et.al. 2412.12667 link
2024-12-17 RDPI: A Refine Diffusion Probability Generation Method for Spatiotemporal Data Imputation Zijin Liu et.al. 2412.12642 link
2024-12-17 Consistent Diffusion: Denoising Diffusion Model with Data-Consistent Training for Image Restoration Xinlong Cheng et.al. 2412.12550 null
2024-12-17 Invisible Watermarks: Attacks and Robustness Dongjun Hwang et.al. 2412.12511 link
2024-12-16 PanSplat: 4K Panorama Synthesis with Feed-Forward Gaussian Splatting Cheng Zhang et.al. 2412.12096 link
2024-12-16 Wonderland: Navigating 3D Scenes from a Single Image Hanwen Liang et.al. 2412.12091 null
2024-12-16 SPADE: Spectroscopic Photoacoustic Denoising using an Analytical and Data-free Enhancement Framework Fangzhou Lin et.al. 2412.12068 null
2024-12-16 Industrial-scale Prediction of Cement Clinker Phases using Machine Learning Sheikh Junaid Fayaz et.al. 2412.11981 link
2024-12-16 Towards Physically-Based Sky-Modeling Ian J. Maquignaz et.al. 2412.11883 null
2024-12-16 Impact of Face Alignment on Face Image Quality Eren Onaran et.al. 2412.11779 null
2024-12-16 Formal Quality Measures for Predictors in Markov Decision Processes Christel Baier et.al. 2412.11754 null
2024-12-16 Comparison of three reconstruction algorithms for low-dose phase-contrast computed tomography of the breast with synchrotron radiation Sandro Donato et.al. 2412.11641 null
2024-12-16 MT-LENS: An all-in-one Toolkit for Better Machine Translation Evaluation Javier García Gilabert et.al. 2412.11615 link
2024-12-16 Block-Based Multi-Scale Image Rescaling Jian Li et.al. 2412.11468 null
2024-12-16 Controllable Distortion-Perception Tradeoff Through Latent Diffusion for Neural Image Compression Chuqin Zhou et.al. 2412.11379 null
2024-12-15 VividFace: A Diffusion-Based Hybrid Framework for High-Fidelity Video Face Swapping Hao Shao et.al. 2412.11279 null
2024-12-15 CATER: Leveraging LLM to Pioneer a Multidimensional, Reference-Independent Paradigm in Translation Quality Evaluation Kurando IIDA et.al. 2412.11261 null
2024-12-15 Benchmarking and Learning Multi-Dimensional Quality Evaluator for Text-to-3D Generation Yujie Zhang et.al. 2412.11170 null
2024-12-15 A Comprehensive Survey of Action Quality Assessment: Method and Benchmark Kanglei Zhou et.al. 2412.11149 null
2024-12-14 Zigzag Diffusion Sampling: The Path to Success Is Zigzag Lichen Bai et.al. 2412.10891 link
2024-12-14 Unbiased General Annotated Dataset Generation Dengyang Jiang et.al. 2412.10831 null
2024-12-14 Rapid Reconstruction of Extremely Accelerated Liver 4D MRI via Chained Iterative Refinement Di Xu et.al. 2412.10629 null
2024-12-13 RAID-Database: human Responses to Affine Image Distortions Paula Daudén-Oliver et.al. 2412.10211 null
2024-12-13 GT23D-Bench: A Comprehensive General Text-to-3D Generation Benchmark Sitong Su et.al. 2412.09997 null
2024-12-13 EP-CFG: Energy-Preserving Classifier-Free Guidance Kai Zhang et.al. 2412.09966 null
2024-12-13 $\textrm{A}^{\textrm{2}}$ RNet: Adversarial Attack Resilient Network for Robust Infrared and Visible Image Fusion Jiawei Li et.al. 2412.09954 link
2024-12-13 Prompt2Perturb (P2P): Text-Guided Diffusion-Based Adversarial Attacks on Breast Ultrasound Images Yasamin Medghalchi et.al. 2412.09910 link
2024-12-13 LinGen: Towards High-Resolution Minute-Length Text-to-Video Generation with Linear Computational Complexity Hongjie Wang et.al. 2412.09856 null
2024-12-13 A Single-Frame and Multi-Frame Cascaded Image Super-Resolution Method Jing Sun et.al. 2412.09846 null
2024-12-13 Super-Resolution for Remote Sensing Imagery via the Coupling of a Variational Model and Deep Learning Jing Sun et.al. 2412.09841 null
2024-12-13 Prospects for Systematic Planetary Nebulae Detection with the Census of the Local Universe Narrowband Survey Rong Du et.al. 2412.09836 null
2024-12-13 Speech-based Multimodel Pipeline for Vietnamese Services Quality Assessment Quang-Anh N. D. et.al. 2412.09829 null
2024-12-12 OFTSR: One-Step Flow for Image Super-Resolution with Tunable Fidelity-Realism Trade-offs Yuanzhi Zhu et.al. 2412.09465 link
2024-12-12 UFO: Enhancing Diffusion-Based Video Generation with a Uniform Frame Organizer Delong Liu et.al. 2412.09389 link
2024-12-13 Are Conditional Latent Diffusion Models Effective for Image Restoration? Yunchen Yuan et.al. 2412.09324 null
2024-12-12 Towards Understanding the Robustness of LLM-based Evaluations under Perturbations Manav Chaudhary et.al. 2412.09269 null
2024-12-12 Elevating Flow-Guided Video Inpainting with Reference Generation Suhwan Cho et.al. 2412.08975 link
2024-12-12 Reversing the Damage: A QP-Aware Transformer-Diffusion Approach for 8K Video Restoration under Codec Compression Ali Mollaahmadi Dehaghi et.al. 2412.08912 link
2024-12-11 DeepNose: An Equivariant Convolutional Neural Network Predictive Of Human Olfactory Percepts Sergey Shuvaev et.al. 2412.08747 null
2024-12-13 Utilizing Multi-step Loss for Single Image Reflection Removal Abdelrahman Elnenaey et.al. 2412.08582 link
2024-12-11 PointTalk: Audio-Driven Dynamic Lip Point Cloud for 3D Gaussian-based Talking Head Synthesis Yifan Xie et.al. 2412.08504 null
2024-12-12 Learning Flow Fields in Attention for Controllable Person Image Generation Zijian Zhou et.al. 2412.08486 link
2024-12-11 Visible and Infrared Image Fusion Using Encoder-Decoder Network Ferhat Can Ataman et.al. 2412.08073 link
2024-12-11 NeRF-NQA: No-Reference Quality Assessment for Scenes Generated by NeRF and Neural View Synthesis Methods Qiang Qu et.al. 2412.08029 link
2024-12-10 Graph convolutional networks enable fast hemorrhagic stroke monitoring with electrical impedance tomography J. Toivanen et.al. 2412.07888 null
2024-12-10 PETALface: Parameter Efficient Transfer Learning for Low-resolution Face Recognition Kartik Narayan et.al. 2412.07771 null
2024-12-10 3DTrajMaster: Mastering 3D Trajectory for Multi-Entity Motion in Video Generation Xiao Fu et.al. 2412.07759 null
2024-12-10 PortraitTalk: Towards Customizable One-Shot Audio-to-Talking Face Generation Fatemeh Nazarieh et.al. 2412.07754 null
2024-12-10 Multi-Shot Character Consistency for Text-to-Video Generation Yuval Atzmon et.al. 2412.07750 null
2024-12-11 Direct Low-Dose CT Image Reconstruction on GPU using Out-Of-Core: Precision and Quality Study M. Chillarón et.al. 2412.07631 null
2024-12-10 OmniDocBench: Benchmarking Diverse PDF Document Parsing with Comprehensive Annotations Linke Ouyang et.al. 2412.07626 link
2024-12-10 CoMA: Compositional Human Motion Generation with Multi-modal Agents Shanlin Sun et.al. 2412.07320 null
2024-12-10 Backdoor Attacks against No-Reference Image Quality Assessment Models via A Scalable Trigger Yi Yu et.al. 2412.07277 link
2024-12-10 Moderating the Generalization of Score-based Generative Model Wan Jiang et.al. 2412.07229 null
2024-12-11 Rate-In: Information-Driven Adaptive Dropout Rates for Improved Inference-Time Uncertainty Estimation Tal Zeevi et.al. 2412.07169 link
2024-12-10 QCResUNet: Joint Subject-level and Voxel-level Segmentation Quality Prediction Peijie Qiu et.al. 2412.07156 link
2024-12-10 Light Field Image Quality Assessment With Auxiliary Learning Based on Depthwise and Anglewise Separable Convolutions Qiang Qu et.al. 2412.07079 null
2024-12-11 Diff-GO $^\text{n}$ : Enhancing Diffusion Models for Goal-Oriented Communications Suchinthaka Wanninayaka et.al. 2412.06980 null
2024-12-09 Edge-SD-SR: Low Latency and Parameter Efficient On-device Super-Resolution with Stable Diffusion via Bidirectional Conditioning Mehdi Noroozi et.al. 2412.06978 null
2024-12-09 Ranking-aware adapter for text-driven image ordering with CLIP Wei-Hsiang Yu et.al. 2412.06760 link
2024-12-09 AutoDCWorkflow: LLM-based Data Cleaning Workflow Auto-Generation and Benchmark Lan Li et.al. 2412.06724 link
2024-12-10 A No-Reference Medical Image Quality Assessment Method Based on Automated Distortion Recognition Technology: Application to Preprocessing in MRI-guided Radiotherapy Zilin Wang et.al. 2412.06599 null
2024-12-09 How Certain are Uncertainty Estimates? Three Novel Earth Observation Datasets for Benchmarking Uncertainty Quantification in Machine Learning Yuanyuan Wang et.al. 2412.06451 null
2024-12-09 Sound2Vision: Generating Diverse Visuals from Audio through Cross-Modal Latent Alignment Kim Sung-Bin et.al. 2412.06209 null
2024-12-09 One-shot Human Motion Transfer via Occlusion-Robust Flow Prediction and Neural Texturing Yuzhu Ji et.al. 2412.06174 null
2024-12-09 A CT Image Denoising Method Based on Projection Domain Feature Mengyu Sun et.al. 2412.06135 null
2024-12-08 Latent-Reframe: Enabling Camera Control for Video Diffusion Model without Training Zhenghong Zhou et.al. 2412.06029 null
2024-12-08 Enhancing Content Representation for AR Image Quality Assessment Using Knowledge Distillation Aymen Sekhri et.al. 2412.06003 null
2024-12-08 Nested Diffusion Models Using Hierarchical Latent Priors Xiao Zhang et.al. 2412.05984 null
2024-12-08 Unsupervised Multi-Parameter Inverse Solving for Reducing Ring Artifacts in 3D X-Ray CBCT Qing Wu et.al. 2412.05853 null
2024-12-08 SizeGS: Size-aware Compression of 3D Gaussians with Hierarchical Mixed Precision Quantization Shuzhao Xie et.al. 2412.05808 null
2024-12-07 Emulating Clinical Quality Muscle B-mode Ultrasound Images from Plane Wave Images Using a Two-Stage Machine Learning Model Reed Chen et.al. 2412.05758 link
2024-12-07 A Tiered GAN Approach for Monet-Style Image Generation FNU Neha et.al. 2412.05724 null
2024-12-07 Temporally Compressed 3D Gaussian Splatting for Dynamic Scenes Saqib Javed et.al. 2412.05700 null
2024-12-07 Enhancing Research Methodology and Academic Publishing: A Structured Framework for Quality and Integrity Md. Jalil Piran et.al. 2412.05683 null
2024-12-07 Deep Reinforcement Learning-Based Resource Allocation for Hybrid Bit and Generative Semantic Communications in Space-Air-Ground Integrated Networks Chong Huang et.al. 2412.05647 null
2024-12-06 LoRA.rar: Learning to Merge LoRAs via Hypernetworks for Subject-Style Conditioned Image Generation Donald Shenaj et.al. 2412.05148 link
2024-12-06 Comprehensive Analysis and Improvements in Pansharpening Using Deep Learning Mahek Kantharia et.al. 2412.04896 null
2024-12-06 Building a Family of Data Augmentation Models for Low-cost LLM Fine-tuning on the Cloud Yuanhao Yue et.al. 2412.04871 null
2024-12-05 Motion-Guided Deep Image Prior for Cardiac MRI Marc Vornehm et.al. 2412.04639 null
2024-12-05 MetaFormer: High-fidelity Metalens Imaging via Aberration Correcting Transformers Byeonghyeon Lee et.al. 2412.04591 null
2024-12-05 4Real-Video: Learning Generalizable Photo-Realistic 4D Video Diffusion Chaoyang Wang et.al. 2412.04462 null
2024-12-05 LayerFusion: Harmonized Multi-Layer Text-to-Image Generation with Generative Priors Yusuf Dalva et.al. 2412.04460 null
2024-12-05 Multi-Subject Image Synthesis as a Generative Prior for Single-Subject PET Image Reconstruction George Webber et.al. 2412.04324 null
2024-12-05 T2I-FactualBench: Benchmarking the Factuality of Text-to-Image Models with Knowledge-Intensive Concepts Ziwei Huang et.al. 2412.04300 null
2024-12-05 IF-MDM: Implicit Face Motion Diffusion Model for High-Fidelity Realtime Talking Head Generation Sejong Yang et.al. 2412.04000 null
2024-12-05 Blind Underwater Image Restoration using Co-Operational Regressor Networks Ozer Can Devecioglu et.al. 2412.03995 null
2024-12-05 LL-ICM: Image Compression for Low-level Machine Vision via Large Vision-Language Model Yuan Xue et.al. 2412.03841 null
2024-12-04 Advancing Auto-Regressive Continuation for Video Frames Ruibo Ming et.al. 2412.03758 null
2024-12-04 MV-Adapter: Multi-view Consistent Image Generation Made Easy Zehuan Huang et.al. 2412.03632 null
2024-12-04 Style3D: Attention-guided Multi-view Style Transfer for 3D Object Generation Bingjie Song et.al. 2412.03571 null
2024-12-04 NODE-AdvGAN: Improving the transferability and perceptual similarity of adversarial examples by dynamic-system-driven adversarial generative model Xinheng Xie et.al. 2412.03539 null
2024-12-04 SGSST: Scaling Gaussian Splatting StyleTransfer Bruno Galerne et.al. 2412.03371 link
2024-12-04 Is JPEG AI going to change image forensics? Edoardo Daniele Cannas et.al. 2412.03261 null
2024-12-04 Task-driven Image Fusion with Learnable Fusion Loss Haowen Bai et.al. 2412.03240 null
2024-12-04 Parametric Enhancement of PerceptNet: A Human-Inspired Approach for Image Quality Assessment Jorge Vila-Tomás et.al. 2412.03210 link
2024-12-04 Unsupervised Network for Single Image Raindrop Removal Huijiao Wang et.al. 2412.03019 null
2024-12-04 Pixel-level and Semantic-level Adjustable Super-resolution: A Dual-LoRA Approach Lingchen Sun et.al. 2412.03017 link
2024-12-04 Partially Conditioned Patch Parallelism for Accelerated Diffusion Model Inference XiuYu Zhang et.al. 2412.02962 null
2024-12-04 Surrogate distributed radiological sources III: quantitative distributed source reconstructions Jayson R. Vavrek et.al. 2412.02926 null
2024-12-04 Assessing the performance of CT image denoisers using Laguerre-Gauss Channelized Hotelling Observer for lesion detection Prabhat Kc et.al. 2412.02920 null
2024-12-03 Improving Dynamic Object Interactions in Text-to-Video Generation with AI Feedback Hiroki Furuta et.al. 2412.02617 null
2024-12-03 High-Quality Passive Acoustic Mapping with the Cross-Correlated Angular Spectrum Method Yi Zeng et.al. 2412.02413 null
2024-12-03 Switchable deep beamformer for high-quality and real-time passive acoustic mapping Yi Zeng et.al. 2412.02327 null
2024-12-03 Initial Study On Improving Segmentation By Combining Preoperative CT And Intraoperative CBCT Using Synthetic Data Maximilian E. Tschuchnig et.al. 2412.02294 null
2024-12-02 NitroFusion: High-Fidelity Single-Step Diffusion through Dynamic Adversarial Training Dar-Yen Chen et.al. 2412.02030 null
2024-12-02 HybridMQA: Exploring Geometry-Texture Interactions for Colored Mesh Quality Assessment Armin Shafiee Sarvestani et.al. 2412.01986 null
2024-12-02 IQA-Adapter: Exploring Knowledge Transfer from Image Quality Assessment to Diffusion-based Generative Models Khaled Abud et.al. 2412.01794 link
2024-12-02 OmniGuard: Hybrid Manipulation Localization via Augmented Versatile Deep Image Watermarking Xuanyu Zhang et.al. 2412.01615 null
2024-12-02 Negative Token Merging: Image-based Adversarial Feature Guidance Jaskirat Singh et.al. 2412.01339 null
2024-12-02 Data Uncertainty-Aware Learning for Multimodal Aspect-based Sentiment Analysis Hao Yang et.al. 2412.01249 null
2024-12-02 Schedule On the Fly: Diffusion Time Prediction for Faster and Better Image Generation Zilyu Ye et.al. 2412.01243 null
2024-12-02 PainterNet: Adaptive Image Inpainting with Actual-Token Attention and Diverse Mask Control Ruichen Wang et.al. 2412.01223 null
2024-12-02 Assessing GPT Model Uncertainty in Mathematical OCR Tasks via Entropy Analysis Alexei Kaltchenko et.al. 2412.01221 link
2024-12-02 LoyalDiffusion: A Diffusion Model Guarding Against Data Replication Chenghao Li et.al. 2412.01118 null
2024-12-02 FLOAT: Generative Motion Latent Flow Matching for Audio-driven Talking Portrait Taekyung Ki et.al. 2412.01064 null
2024-12-02 Evaluating Automated Radiology Report Quality through Fine-Grained Phrasal Grounding of Clinical Findings Razi Mahmood et.al. 2412.01031 null
2024-12-01 Optimal Algorithms for Augmented Testing of Discrete Distributions Maryam Aliakbarpour et.al. 2412.00974 null
2024-12-01 Generating AI Literacy MCQs: A Multi-Agent LLM Approach Jiayi Wang et.al. 2412.00970 null
2024-12-01 Playable Game Generation Mingyu Yang et.al. 2412.00887 link
2024-11-30 Multi-resolution Guided 3D GANs for Medical Image Translation Juhyung Ha et.al. 2412.00575 null
2024-11-29 INCLUDE: Evaluating Multilingual Language Understanding with Regional Knowledge Angelika Romanou et.al. 2411.19799 null
2024-11-29 ChineseWebText 2.0: Large-Scale High-quality Chinese Web Text with Multi-dimensional and fine-grained information Wanyue Zhang et.al. 2411.19668 link
2024-11-29 Tortho-Gaussian: Splatting True Digital Orthophoto Maps Xin Wang et.al. 2411.19594 null
2024-11-29 Self-Supervised Denoiser Framework Emilien Valat et.al. 2411.19593 null
2024-11-29 Contextual Checkerboard Denoise -- A Novel Neural Network-Based Approach for Classification-Aware OCT Image Denoising Md. Touhidul Islam et.al. 2411.19549 link
2024-11-29 Subjective and Objective Quality Assessment Methods of Stereoscopic Videos with Visibility Affecting Distortions Sria Biswas et.al. 2411.19522 null
2024-11-29 Retrieval-guided Cross-view Image Synthesis Hongji Yang et.al. 2411.19510 null
2024-11-29 Fleximo: Towards Flexible Text-to-Human Motion Video Generation Yuhang Zhang et.al. 2411.19459 null
2024-11-28 AMO Sampler: Enhancing Text Rendering with Overshooting Xixi Hu et.al. 2411.19415 null
2024-11-28 3D Wasserstein generative adversarial network with dense U-Net based discriminator for preclinical fMRI denoising Sima Soltanpour et.al. 2411.19345 null
2024-11-28 Timestep Embedding Tells: It's Time to Cache for Video Diffusion Model Feng Liu et.al. 2411.19108 null
2024-11-28 SPAgent: Adaptive Task Decomposition and Model Selection for General Video Generation and Editing Rong-Cheng Tu et.al. 2411.18983 null
2024-11-28 Deep Plug-and-Play HIO Approach for Phase Retrieval Cagatay Isil et.al. 2411.18967 null
2024-12-02 AC3D: Analyzing and Improving 3D Camera Control in Video Diffusion Transformers Sherwin Bahmani et.al. 2411.18673 null
2024-11-27 HoliSDiP: Image Super-Resolution via Holistic Semantics and Diffusion Prior Li-Yuan Tsao et.al. 2411.18662 link
2024-11-27 Textured Gaussians for Enhanced 3D Scene Appearance Modeling Brian Chao et.al. 2411.18625 null
2024-11-27 Uncertainty-driven Sampling for Efficient Pairwise Comparison Subjective Assessment Shima Mohammadi et.al. 2411.18372 link
2024-11-29 HUPE: Heuristic Underwater Perceptual Enhancement with Semantic Collaborative Learning Zengxi Zhang et.al. 2411.18296 link
2024-11-27 Deep End-to-end Adaptive k-Space Sampling, Reconstruction, and Registration for Dynamic MRI George Yiasemis et.al. 2411.18249 null
2024-11-27 Towards Improved Objective Perceptual Audio Quality Assessment -- Part 1: A Novel Data-Driven Cognitive Model Pablo M. Delgado et.al. 2411.18222 null
2024-11-27 KAN See Your Face Dong Han et.al. 2411.18165 null
2024-11-27 Type-R: Automatically Retouching Typos for Text-to-Image Generation Wataru Shimoda et.al. 2411.18159 null
2024-11-26 MapEval: Towards Unified, Robust and Efficient SLAM Map Evaluation Framework Xiangcheng Hu et.al. 2411.17928 link
2024-11-26 SVGDreamer++: Advancing Editability and Diversity in Text-Guided SVG Generation Ximing Xing et.al. 2411.17832 null
2024-11-26 Collaborative Decoding Makes Visual Auto-Regressive Modeling Efficient Zigeng Chen et.al. 2411.17787 link
2024-11-27 Diffusion Autoencoders for Few-shot Image Generation in Hyperbolic Space Lingxiao Li et.al. 2411.17784 null
2024-11-26 Perceptually Optimized Super Resolution Volodymyr Karpenko et.al. 2411.17513 null
2024-11-26 Puzzle Similarity: A Perceptually-guided No-Reference Metric for Artifact Detection in 3D Scene Reconstructions Nicolai Hermann et.al. 2411.17489 null
2024-11-26 Structure-Guided MR-to-CT Synthesis with Spatial and Semantic Alignments for Attenuation Correction of Whole-Body PET/MR Imaging Jiaxu Zheng et.al. 2411.17488 null
2024-11-26 Dual-Representation Interaction Driven Image Quality Assessment with Restoration Assistance Jingtong Yue et.al. 2411.17390 link
2024-11-26 InsightEdit: Towards Better Instruction Following for Image Editing Yingjing Xu et.al. 2411.17323 null
2024-11-26 Reward Incremental Learning in Text-to-Image Generation Maorong Wang et.al. 2411.17310 null
2024-11-26 Grounding-IQA: Multimodal Language Grounding Model for Image Quality Assessment Zheng Chen et.al. 2411.17237 link
2024-11-26 AIGV-Assessor: Benchmarking and Evaluating the Perceptual Quality of Text-to-Video Generation with LMM Jiarui Wang et.al. 2411.17221 link
2024-11-26 ChatGen: Automatic Text-to-Image Generation From FreeStyle Chatting Chengyou Jia et.al. 2411.17176 null
2024-11-26 OSDFace: One-Step Diffusion Model for Face Restoration Jingkai Wang et.al. 2411.17163 link
2024-11-26 Motion Free B-frame Coding for Neural Video Compression Van Thang Nguyen et.al. 2411.17160 null
2024-11-26 4D Scaffold Gaussian Splatting for Memory Efficient Dynamic Scene Reconstruction Woong Oh Cho et.al. 2411.17044 null
2024-11-26 TED-VITON: Transformer-Empowered Diffusion Models for Virtual Try-On Zhenchen Wan et.al. 2411.17017 link
2024-11-25 G2SDF: Surface Reconstruction from Explicit Gaussians with Implicit SDFs Kunyi Li et.al. 2411.16898 null
2024-11-25 Fully Automatic Deep Learning Pipeline for Whole Slide Image Quality Assessment Falah Jabar et.al. 2411.16885 null
2024-11-25 LegoPET: Hierarchical Feature Guided Conditional Diffusion for PET Image Reconstruction Yiran Sun et.al. 2411.16629 link
2024-11-25 Human-Activity AGV Quality Assessment: A Benchmark Dataset and an Objective Evaluation Metric Zhichao Zhang et.al. 2411.16619 null
2024-11-25 Coherence Based Sound Speed Aberration Correction -- with clinical validation in obstetric ultrasound Anders Emil Vrålstad et.al. 2411.16551 null
2024-11-25 Synthesising Handwritten Music with GANs: A Comprehensive Evaluation of CycleWGAN, ProGAN, and DCGAN Elona Shatri et.al. 2411.16405 null
2024-11-25 Human-Calibrated Automated Testing and Validation of Generative Language Models Agus Sudjianto et.al. 2411.16391 null
2024-11-25 Bounds for the maximum modulus of polynomial roots with nearly optimal worst-case overestimation Prashant Batra et.al. 2411.16385 null
2024-11-25 Privacy-Preserving Federated Foundation Model for Generalist Ultrasound Artificial Intelligence Yuncheng Jiang et.al. 2411.16380 null
2024-11-25 Sonic: Shifting Focus to Global Audio Perception in Portrait Animation Xiaozhong Ji et.al. 2411.16331 null
2024-11-25 EPS: Efficient Patch Sampling for Video Overfitting in Deep Super-Resolution Model Training Yiying Wei et.al. 2411.16312 null
2024-11-25 Weakly supervised image segmentation for defect-based grading of fresh produce Manuel Knott et.al. 2411.16219 link
2024-11-25 VIRES: Video Instance Repainting with Sketch and Text Guidance Shuchen Weng et.al. 2411.16199 null
2024-11-25 Image Generation Diversity Issues and How to Tame Them Mischa Dombrowski et.al. 2411.16171 link
2024-11-25 ENCLIP: Ensembling and Clustering-Based Contrastive Language-Image Pretraining for Fashion Multimodal Search with Limited Data and Low-Quality Images Prithviraj Purushottam Naik et.al. 2411.16096 null
2024-11-25 AI-Generated Image Quality Assessment Based on Task-Specific Prompt and Multi-Granularity Similarity Jili Xia et.al. 2411.16087 null
2024-11-24 Distribution models of antennas in radio astronomy: Efficiency comparison of the golden spiral interferometry Elio Quiroga Rodriguez et.al. 2411.15904 null
2024-11-24 A review on Machine Learning based User-Centric Multimedia Streaming Techniques Monalisa Ghosh et.al. 2411.15801 null
2024-11-24 LTCF-Net: A Transformer-Enhanced Dual-Channel Fourier Framework for Low-Light Image Restoration Gaojing Zhang et.al. 2411.15740 null
2024-11-23 SPA: Efficient User-Preference Alignment against Uncertainty in Medical Image Segmentation Jiayuan Zhu et.al. 2411.15513 null
2024-11-23 Automatic Evaluation for Text-to-image Generation: Task-decomposed Framework, Distilled Training, and Meta-evaluation Benchmark Rong-Cheng Tu et.al. 2411.15488 link
2024-11-22 HeadRouter: A Training-free Image Editing Framework for MM-DiTs by Adaptively Routing Attention Heads Yu Xu et.al. 2411.15034 null
2024-11-22 FloAt: Flow Warping of Self-Attention for Clothing Animation Generation Swasti Shreya Mishra et.al. 2411.15028 null
2024-11-22 Information Extraction from Heterogenous Documents without Ground Truth Labels using Synthetic Label Generation and Knowledge Distillation Aniket Bhattacharyya et.al. 2411.14957 null
2024-11-22 Evaluating Vision Transformer Models for Visual Quality Control in Industrial Manufacturing Miriam Alber et.al. 2411.14953 link
2024-11-22 Fast High-Quality Enhanced Imaging Algorithm for Layered Dielectric Targets Based on MMW MIMO-SAR System Xu Chen et.al. 2411.14837 null
2024-11-22 BrightVAE: Luminosity Enhancement in Underexposed Endoscopic Images Farzaneh Koohestani et.al. 2411.14663 null
2024-11-22 VQalAttent: a Transparent Speech Generation Pipeline based on Transformer-learned VQ-VAE Latent Space Armani Rodriguez et.al. 2411.14642 null
2024-11-21 Unveiling the Hidden: A Comprehensive Evaluation of Underwater Image Enhancement and Its Impact on Object Detection Ali Awad et.al. 2411.14626 null
2024-11-21 Optimal Transcoding Preset Selection for Live Video Streaming Zahra Nabizadeh et.al. 2411.14613 null
2024-11-21 Roadmap on Advances in Visual and Physiological Optics Jesús E. Gómez-Correa et.al. 2411.14606 null
2024-11-21 Night-to-Day Translation via Illumination Degradation Disentanglement Guanzhou Lan et.al. 2411.14504 null
2024-11-21 Regional Attention for Shadow Removal Hengxing Liu et.al. 2411.14201 link
2024-11-21 Image Compression Using Novel View Synthesis Priors Luyuan Peng et.al. 2411.13862 null
2024-11-21 Detecting Human Artifacts from Text-to-Image Models Kaihong Wang et.al. 2411.13842 link
2024-11-21 Robust Steganography with Boundary-Preserving Overflow Alleviation and Adaptive Error Correction Yu Cheng et.al. 2411.13819 null
2024-11-21 Edge-Cloud Routing for Text-to-Image Model with Token-Level Multi-Metric Prediction Zewei Xin et.al. 2411.13787 null
2024-11-20 What You See Is What Matters: A Novel Visual and Physics-Based Metric for Evaluating Video Generation Quality Zihan Wang et.al. 2411.13609 null
2024-11-20 HF-Diff: High-Frequency Perceptual Loss and Distribution Matching for One-Step Diffusion-Based Image Super-Resolution Shoaib Meraj Sami et.al. 2411.13548 null
2024-11-20 RTSR: A Real-Time Super-Resolution Model for AV1 Compressed Content Yuxuan Jiang et.al. 2411.13362 null
2024-11-20 OceanLens: An Adaptive Backscatter and Edge Correction using Deep Learning Model for Enhanced Underwater Imaging Rajini Makam et.al. 2411.13230 link
2024-11-20 ESARM: 3D Emotional Speech-to-Animation via Reward Model from Automatically-Ranked Demonstrations Xulong Zhang et.al. 2411.13089 null
2024-11-20 LMM-driven Semantic Image-Text Coding for Ultra Low-bitrate Learned Image Compression Shimon Murai et.al. 2411.13033 link
2024-11-19 HyperGAN-CLIP: A Unified Framework for Domain Adaptation, Image Synthesis and Manipulation Abdul Basit Anees et.al. 2411.12832 link
2024-11-19 Mitigating Perception Bias: A Training-Free Approach to Enhance LMM for Image Quality Assessment Siyi Pan et.al. 2411.12791 null
2024-11-19 Stochastic BIQA: Median Randomized Smoothing for Certified Blind Image Quality Assessment Ekaterina Shumitskaya et.al. 2411.12575 null
2024-11-19 PR-ENDO: Physically Based Relightable Gaussian Splatting for Endoscopy Joanna Kaleta et.al. 2411.12510 link
2024-11-19 A $\ell_2-\ell_p$ regulariser based model for Poisson noise removal using augmented Lagrangian method Abdul Halim et.al. 2411.12457 null
2024-11-19 Frequency-Aware Guidance for Blind Image Restoration via Diffusion Models Jun Xiao et.al. 2411.12450 null
2024-11-19 Acquire Precise and Comparable Fundus Image Quality Score: FTHNet and FQS Dataset Zheng Gong et.al. 2411.12273 null
2024-11-19 Performance of Large Language Models in Technical MRI Question Answering: A Comparative Study Alan B McMillan et.al. 2411.12238 null
2024-11-19 Tangential Randomization in Linear Bandits (TRAiL): Guaranteed Inference and Regret Bounds Arda Güçlü et.al. 2411.12154 null
2024-11-18 FruitNinja: 3D Object Interior Texture Generation with Gaussian Splatting Fangyu Wu et.al. 2411.12089 null
2024-11-18 Edge-Enhanced Dilated Residual Attention Network for Multimodal Medical Image Fusion Meng Zhou et.al. 2411.11799 link
2024-11-18 Additional Tests for TV 3.0 Eduardo Peixoto et.al. 2411.11755 null
2024-11-18 Towards Degradation-Robust Reconstruction in Generalizable NeRF Chan Ho Park et.al. 2411.11691 null
2024-11-18 CLUE-MARK: Watermarking Diffusion Models using CLWE Kareem Shehata et.al. 2411.11434 null
2024-11-17 BVI-CR: A Multi-View Human Dataset for Volumetric Video Compression Ge Gao et.al. 2411.11199 link
2024-11-17 Enhanced Anime Image Generation Using USE-CMHSA-GAN J. Lu et.al. 2411.11179 null
2024-11-17 Pitch-and-Spectrum-Aware Singing Quality Assessment with Bias Correction and Model Fusion Yu-Fei Shi et.al. 2411.11123 null
2024-11-17 MolParser: End-to-end Visual Recognition of Molecule Structures in the Wild Xi Fang et.al. 2411.11098 null
2024-11-17 Spectral Subspace Clustering for Attributed Graphs Xiaoyang Lin et.al. 2411.11074 link
2024-11-17 Skeleton-Guided Spatial-Temporal Feature Learning for Video-Based Visible-Infrared Person Re-Identification Wenjia Jiang et.al. 2411.11069 null
2024-11-17 Hyperspectral Imaging-Based Grain Quality Assessment With Limited Labelled Data Priyabrata Karmakar et.al. 2411.10924 null
2024-11-16 HJ-Ky-0.1: an Evaluation Dataset for Kyrgyz Word Embeddings Anton Alekseev et.al. 2411.10724 link
2024-11-15 M-VAR: Decoupled Scale-wise Autoregressive Modeling for High-Quality Image Generation Sucheng Ren et.al. 2411.10433 link
2024-11-15 On the Foundation Model for Cardiac MRI Reconstruction Chi Zhang et.al. 2411.10403 null
2024-11-15 Modification Takes Courage: Seamless Image Stitching via Reference-Driven Inpainting Ziqi Xie et.al. 2411.10309 link
2024-11-15 The Unreasonable Effectiveness of Guidance for Diffusion Models Tim Kaiser et.al. 2411.10257 null
2024-11-15 Block based Adaptive Compressive Sensing with Sampling Rate Control Kosuke Iwama et.al. 2411.10200 null
2024-11-15 Visual question answering based evaluation metrics for text-to-image generation Mizuki Miyamoto et.al. 2411.10183 null
2024-11-15 SEAGULL: No-reference Image Quality Assessment for Regions of Interest via Vision-Language Instruction Tuning Zewen Chen et.al. 2411.10161 link
2024-11-15 Towards Multi-View Consistent Style Transfer with One-Step Diffusion via Vision Conditioning Yushen Zuo et.al. 2411.10130 null
2024-11-15 EveGuard: Defeating Vibration-based Side-Channel Eavesdropping with Audio Adversarial Perturbations Jung-Woo Chang et.al. 2411.10034 null
2024-11-14 Video Denoising in Fluorescence Guided Surgery Trevor Seets et.al. 2411.09798 null
2024-11-14 Research evaluation with ChatGPT: Is it age, country, length, or field biased? Mike Thelwall et.al. 2411.09768 null
2024-11-14 Evaluating the Predictive Capacity of ChatGPT for Academic Peer Review Outcomes Across Multiple Platforms Mike Thelwall et.al. 2411.09763 null
2024-11-14 MFTIQ: Multi-Flow Tracker with Independent Matching Quality Estimation Jonas Serych et.al. 2411.09551 link
2024-11-14 GAN-Based Architecture for Low-dose Computed Tomography Imaging Denoising Yunuo Wang et.al. 2411.09512 null
2024-11-14 Iterative tomographic reconstruction with TV prior for low-dose CBCT dental imaging Louise Friot-Giroux et.al. 2411.09306 null
2024-11-14 LLV-FSR: Exploiting Large Language-Vision Prior for Face Super-resolution Chenyang Wang et.al. 2411.09293 null
2024-11-14 LES-Talker: Fine-Grained Emotion Editing for Talking Head Generation in Linear Emotion Space Guanwen Feng et.al. 2411.09268 null
2024-11-14 JoyVASA: Portrait and Animal Image Animation with Diffusion-Based Audio-Driven Facial Dynamics and Head Motion Generation Xuyang Cao et.al. 2411.09209 link
2024-11-14 Orthogonal Linear Array based Product Beamforming for Real Time Underwater 3D Acoustical Imaging Mimisha M Menakath et.al. 2411.09197 null
2024-11-14 Advancing Diffusion Models: Alias-Free Resampling and Enhanced Rotational Equivariance Md Fahim Anjum et.al. 2411.09174 null
2024-11-13 Scale Contrastive Learning with Selective Attentions for Blind Image Quality Assessment Zihao Huang et.al. 2411.09007 null
2024-11-13 Causal Explanations for Image Classifiers Hana Chockler et.al. 2411.08875 link
2024-11-13 A novel imaging setup for hybrid radiotherapy tailored PET/MR in patients with head and neck cancer R. M. Winter et.al. 2411.08783 null
2024-11-13 Robust Divergence Learning for Missing-Modality Segmentation Runze Cheng et.al. 2411.08305 null
2024-11-13 Numerical Analysis of Lensless Imaging with Active Metasurfaces and Single-Pixel Detectors Julie Belleville et.al. 2411.08282 null
2024-11-12 DuoLift-GAN:Reconstructing CT from Single-view and Biplanar X-Rays with Generative Adversarial Networks Zhaoxi Zhang et.al. 2411.07941 null
2024-11-12 Learning Disentangled Representations for Perceptual Point Cloud Quality Assessment via Mutual Information Minimization Ziyu Shan et.al. 2411.07936 null
2024-11-12 CT-Mamba: A Hybrid Convolutional State Space Model for Low-Dose CT Denoising Linxuan Li et.al. 2411.07930 link
2024-11-12 Joint multi-dimensional dynamic attention and transformer for general image restoration Huan Zhang et.al. 2411.07893 link
2024-11-12 No-Reference Point Cloud Quality Assessment via Graph Convolutional Network Wu Chen et.al. 2411.07728 null
2024-11-12 SegQC: a segmentation network-based framework for multi-metric segmentation quality control and segmentation error detection in volumetric medical images Bella Specktor-Fadida et.al. 2411.07601 null
2024-11-12 IR image databases generation under target intrinsic thermal variability constraints Jerome Gilles et.al. 2411.07577 null
2024-11-12 Multi-task Feature Enhancement Network for No-Reference Image Quality Assessment Li Yu et.al. 2411.07556 null
2024-11-12 A Novel Automatic Real-time Motion Tracking Method for Magnetic Resonance Imaging-guided Radiotherapy: Leveraging the Enhanced Tracking-Learning-Detection Framework with Automatic Segmentation Shengqi Chen et.al. 2411.07503 null
2024-11-12 An Exploration of Parallel Imaging System for Very-low Field (50mT) MRI Scanner Lei Yang et.al. 2411.07489 null
2024-11-11 Evaluating Detection Thresholds: The Impact of False Positives and Negatives on Super-Resolution Ultrasound Localization Microscopy Sepideh K. Gharamaleki et.al. 2411.07426 null
2024-11-11 Exploring Variational Autoencoders for Medical Image Generation: A Comprehensive Study Khadija Rais et.al. 2411.07348 null
2024-11-11 Artificial Intelligence-Informed Handheld Breast Ultrasound for Screening: A Systematic Review of Diagnostic Test Accuracy Arianna Bunnell et.al. 2411.07322 null
2024-11-11 GPU-Accelerated Inverse Lithography Towards High Quality Curvy Mask Generation Haoyu Yang et.al. 2411.07311 null
2024-11-11 A Hierarchical Compression Technique for 3D Gaussian Splatting Compression He Huang et.al. 2411.06976 null
2024-11-11 Multi-scale Frequency Enhancement Network for Blind Image Deblurring Yawen Xiang et.al. 2411.06893 null
2024-11-11 Wavehax: Aliasing-Free Neural Waveform Synthesis Based on 2D Convolution and Harmonic Prior for Reliable Complex Spectrogram Estimation Reo Yoneyama et.al. 2411.06807 null
2024-11-11 Machine vision-aware quality metrics for compressed image and video assessment Mikhail Dremin et.al. 2411.06776 null
2024-11-11 Loss-tolerant neural video codec aware congestion control for real time video communication Zhengxu Xia et.al. 2411.06742 null
2024-11-11 360-Degree Video Super Resolution and Quality Enhancement Challenge: Methods and Results Ahmed Telili et.al. 2411.06738 null
2024-11-11 Accelerating Low-field MRI: Compressed Sensing and AI for fast noise-robust imaging Efrat Shimron et.al. 2411.06704 link
2024-11-10 CASC: Condition-Aware Semantic Communication with Latent Diffusion Models Weixuan Chen et.al. 2411.06552 null
2024-11-08 A Modular Conditional Diffusion Framework for Image Reconstruction Magauiya Zhussip et.al. 2411.05993 null
2024-11-08 Fine-Grained Reward Optimization for Machine Translation using Error Severity Mappings Miguel Moura Ramos et.al. 2411.05986 null
2024-11-08 Dictionary Learning with Convolutional Structure for Seismic Data Denoising and Interpolation Murad Almadani et.al. 2411.05956 null
2024-11-08 Alternative Learning Paradigms for Image Quality Transfer Ahmed Karam Eldaly et.al. 2411.05885 null
2024-11-08 Benchmarking 3D multi-coil NC-PDNet MRI reconstruction Asma Tanabene et.al. 2411.05883 null
2024-11-08 Evaluating Large Language Model Capability in Vietnamese Fact-Checking Data Generation Long Truong To et.al. 2411.05641 null
2024-11-08 DeepArUco++: Improved detection of square fiducial markers in challenging lighting conditions Rafael Berral-Soler et.al. 2411.05552 link
2024-11-08 Improving image synthesis with diffusion-negative sampling Alakh Desai et.al. 2411.05473 null
2024-11-08 RED: Residual Estimation Diffusion for Low-Dose PET Sinogram Reconstruction Xingyu Ai et.al. 2411.05354 null
2024-11-08 Enhancing Depth Image Estimation for Underwater Robots by Combining Image Processing and Machine Learning Quang Truong Nguyen et.al. 2411.05344 null
2024-11-08 A Quality-Centric Framework for Generic Deepfake Detection Wentang Song et.al. 2411.05335 null
2024-11-08 Adaptive Whole-Body PET Image Denoising Using 3D Diffusion Models with ControlNet Boxiao Yu et.al. 2411.05302 null
2024-11-07 Quantum Imaging and Metrology with Undetected squeezed Photons: Noise Canceling and Noise Based Imaging S. Samimi et.al. 2411.05175 null
2024-11-08 SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models Muyang Li et.al. 2411.05007 link
2024-11-07 Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation Models Weixin Liang et.al. 2411.04996 null
2024-11-07 SG-I2V: Self-Guided Trajectory Control in Image-to-Video Generation Koichi Namekata et.al. 2411.04989 null
2024-11-07 Uncovering Hidden Subspaces in Video Diffusion Models Using Re-Identification Mischa Dombrowski et.al. 2411.04956 null
2024-11-07 MVSplat360: Feed-Forward 360 Scene Synthesis from Sparse Views Yuedong Chen et.al. 2411.04924 link
2024-11-07 Differentiable Gaussian Representation for Incomplete CT Reconstruction Shaokai Wu et.al. 2411.04844 null
2024-11-07 Controlling Human Shape and Pose in Text-to-Image Diffusion Models via Domain Adaptation Benito Buchheim et.al. 2411.04724 null
2024-11-06 Multi-Reward as Condition for Instruction-based Image Editing Xin Gu et.al. 2411.04713 null
2024-11-06 SEE-DPO: Self Entropy Enhanced Direct Preference Optimization Shivanshu Shekhar et.al. 2411.04712 null
2024-11-07 Generative Semantic Communications with Foundation Models: Perception-Error Analysis and Semantic-Aware Power Allocation Chunmei Xu et.al. 2411.04575 null
2024-11-07 Bayesian Calibration of Win Rate Estimation with LLM Evaluators Yicheng Gao et.al. 2411.04424 link
2024-11-07 A Pre-training Framework that Encodes Noise Information for Speech Quality Assessment Subrina Sultana et.al. 2411.04379 null
2024-11-06 X-ray Single-Pixel Imaging with MPGD-based detectors M. Simões et.al. 2411.03907 null
2024-11-06 VQA $^2$ :Visual Question Answering for Video Quality Assessment Ziheng Jia et.al. 2411.03795 link
2024-11-06 MOS-Bench: Benchmarking Generalization Abilities of Subjective Speech Quality Assessment Models Wen-Chin Huang et.al. 2411.03715 link
2024-11-06 Evaluating Eye Tracking Signal Quality with Real-time Gaze Interaction Simulation Mehedi Hasan Raju et.al. 2411.03708 null
2024-11-06 Investigation of Inward-Outward Ring Permanent Magnet Array for Portable Magnetic Resonance Imaging (MRI) Ting-Ou Liang et.al. 2411.03249 null
2024-11-05 The Impact of Medicaid Expansion on Medicare Quality Measures Hala Algrain et.al. 2411.03140 null
2024-11-05 Investigating the Applicability of a Snapshot Computed Tomography Imaging Spectrometer for the Prediction of Brix and pH of Grapes Mads Svanborg Peters et.al. 2411.03114 null
2024-11-05 Advances in Photoacoustic Imaging Reconstruction and Quantitative Analysis for Biomedical Applications Lei Wang et.al. 2411.02843 null
2024-11-04 Interaction Design with Generative AI: An Empirical Study of Emerging Strategies Across the Four Phases of Design Marie Muehlhaus et.al. 2411.02662 null
2024-11-04 Euclid: High-precision imaging astrometry and photometry from Early Release Observations. I. Internal kinematics of NGC 6397 by combining Euclid and Gaia data M. Libralato et.al. 2411.02487 null
2024-11-02 Cross-D Conv: Cross-Dimensional Transferable Knowledge Base via Fourier Shifting Operation Mehmet Can Yavuz et.al. 2411.02441 link
2024-11-04 Physically Based Neural Bidirectional Reflectance Distribution Function Chenliang Zhou et.al. 2411.02347 null
2024-11-04 Diffusion-based Generative Multicasting with Intent-aware Semantic Decomposition Xinkai Liu et.al. 2411.02334 null
2024-11-03 Degradation-Aware Residual-Conditioned Optimal Transport for Unified Image Restoration Xiaole Tang et.al. 2411.01656 link
2024-11-03 Optical Flow Representation Alignment Mamba Diffusion Model for Medical Video Generation Zhenbin Wang et.al. 2411.01647 null
2024-11-03 TPOT: Topology Preserving Optimal Transport in Retinal Fundus Image Enhancement Xuanzhao Dong et.al. 2411.01403 null
2024-11-02 Interacting Large Language Model Agents. Interpretable Models and Social Learning Adit Jain et.al. 2411.01271 null
2024-11-02 The impact of MRI image quality on statistical and predictive analysis on voxel based morphology Felix Hoffstaedter et.al. 2411.01268 link
2024-11-02 Enhancing Diabetic Retinopathy Detection with CNN-Based Models: A Comparative Study of UNET and Stacked UNET Architectures Ameya Uppina et.al. 2411.01251 null
2024-11-02 Real-Time Spatio-Temporal Reconstruction of Dynamic Endoscopic Scenes with 4D Gaussian Splatting Fengze Li et.al. 2411.01218 null
2024-11-01 Evaluation Metric for Quality Control and Generative Models in Histopathology Images Pranav Jeevan et.al. 2411.01034 null
2024-11-01 Re-thinking Richardson-Lucy without Iteration Cutoffs: Physically Motivated Bayesian Deconvolution Zachary H. Hendrix et.al. 2411.00991 null
2024-11-01 Inter-Feature-Map Differential Coding of Surveillance Video Kei Iino et.al. 2411.00984 null
2024-11-01 Scalable AI Framework for Defect Detection in Metal Additive Manufacturing Duy Nhat Phan et.al. 2411.00960 null
2024-11-01 Intensity Field Decomposition for Tissue-Guided Neural Tomography Meng-Xun Li et.al. 2411.00900 null
2024-11-01 CityGaussianV2: Efficient and Geometrically Accurate Reconstruction for Large-Scale Scenes Yang Liu et.al. 2411.00771 null
2024-11-01 Face Anonymization Made Simple Han-Wei Kung et.al. 2411.00762 link
2024-11-01 Demystifying the use of Compression in Virtual Production Anil Kokaram et.al. 2411.00547 null
2024-11-01 MV-Adapter: Enhancing Underwater Instance Segmentation via Adaptive Channel Attention Lianjun Liu et.al. 2411.00472 null
2024-10-31 IO Transformer: Evaluating SwinV2-Based Reward Models for Computer Vision Maxwell Meyer et.al. 2411.00252 null
2024-10-31 Denoising study of Fluoroscopic Images in real time tumor tracking System based on Statistical model of noise Yongxuan Yan et.al. 2411.00199 null
2024-10-31 Enhancing Motion in Text-to-Video Generation with Decomposed Encoding and Conditioning Penghui Ruan et.al. 2410.24219 link
2024-10-31 AIDOVECL: AI-generated Dataset of Outpainted Vehicles for Eye-level Classification and Localization Amir Kazemi et.al. 2410.24116 null
2024-10-31 Parameter choices in HaarPSI for IQA with medical images Clemens Karner et.al. 2410.24098 link
2024-10-31 Advanced Predictive Quality Assessment for Ultrasonic Additive Manufacturing with Deep Learning Model Lokendra Poudel et.al. 2410.24055 null
2024-10-31 Image Synthesis with Class-Aware Semantic Diffusion Models for Surgical Scene Segmentation Yihang Zhou et.al. 2410.23962 null
2024-10-29 Temporal and Spatial Super Resolution with Latent Diffusion Model in Medical MRI images Vishal Dubey et.al. 2410.23898 null
2024-10-31 Cycle-Constrained Adversarial Denoising Convolutional Network for PET Image Denoising: Multi-Dimensional Validation on Large Datasets with Reader Study and Real Low-Dose Data Yucun Hou et.al. 2410.23628 null
2024-10-31 LBurst: Learning-Based Robotic Burst Feature Extraction for 3D Reconstruction in Low Light Ahalya Ravendran et.al. 2410.23522 null
2024-10-30 Plug-and-play superiorization Jon Henshaw et.al. 2410.23401 null
2024-10-30 Redundant Cross-Correlation for Drift Correction in SEM Nanoparticle Imaging Iago Bischoff Montenegro et.al. 2410.23390 link
2024-10-30 Variable Resolution Sampling and Deep Learning Image Recovery for Accelerated Multi-Spectral MRI Near Metal Implants Azadeh Sharafi et.al. 2410.23329 null
2024-10-30 AdaptiveISP: Learning an Adaptive Image Signal Processor for Object Detection Yujin Wang et.al. 2410.22939 null
2024-10-30 Prune and Repaint: Content-Aware Image Retargeting for any Ratio Feihong Shen et.al. 2410.22865 link
2024-10-30 Latent Diffusion, Implicit Amplification: Efficient Continuous-Scale Super-Resolution for Remote Sensing Images Hanlin Wu et.al. 2410.22830 null
2024-10-30 Diffusion Beats Autoregressive: An Evaluation of Compositional Generation in Text-to-Image Models Arash Marioriyad et.al. 2410.22775 null
2024-10-30 st-DTPM: Spatial-Temporal Guided Diffusion Transformer Probabilistic Model for Delayed Scan PET Image Prediction Ran Hong et.al. 2410.22732 null
2024-10-30 FlowDCN: Exploring DCN-like Architectures for Fast Image Generation with Arbitrary Resolution Shuai Wang et.al. 2410.22655 null
2024-10-31 Consistency Diffusion Bridge Models Guande He et.al. 2410.22637 null
2024-10-29 Deep Priors for Video Quality Prediction Siddharath Narayan Shakya et.al. 2410.22566 null
2024-10-29 Enhancing Code Annotation Reliability: Generative AI's Role in Comment Quality Assessment Models Seetharam Killivalavan et.al. 2410.22323 null
2024-10-29 Multimodal Semantic Communication for Generative Audio-Driven Video Conferencing Haonan Tong et.al. 2410.22112 null
2024-10-29 Data Generation for Hardware-Friendly Post-Training Quantization Lior Dikstein et.al. 2410.22110 link
2024-10-29 Adapting Diffusion Models for Improved Prompt Compliance and Controllable Image Synthesis Deepak Sridhar et.al. 2410.21638 link
2024-10-28 Exploring the Design Space of Diffusion Bridge Models via Stochasticity Control Shaorong Zhang et.al. 2410.21553 null
2024-10-28 SpeechQE: Estimating the Quality of Direct Speech Translation HyoJung Han et.al. 2410.21485 link
2024-10-28 Kandinsky 3: Text-to-Image Synthesis for Multifunctional Generative Framework Vladimir Arkhipkin et.al. 2410.21061 link
2024-10-28 A Simple Yet Effective Corpus Construction Framework for Indonesian Grammatical Error Correction Nankai Lin et.al. 2410.20838 link
2024-10-28 FreqMark: Invisible Image Watermarking via Frequency Based Optimization in Latent Space Yiyang Guo et.al. 2410.20824 null
2024-10-28 Grid4D: 4D Decomposed Hash Encoding for High-fidelity Dynamic Gaussian Splatting Jiawei Xu et.al. 2410.20815 null
2024-10-28 LoDAvatar: Hierarchical Embedding and Adaptive Levels of Detail with Gaussian Splatting for Enhanced Human Avatars Xiaonuo Dongye et.al. 2410.20789 null
2024-10-28 CompGS: Unleashing 2D Compositionality for Compositional Text-to-3D via Dynamically Optimizing 3D Gaussians Chongjian Ge et.al. 2410.20723 null
2024-10-28 ODGS: 3D Scene Reconstruction from Omnidirectional Images with 3D Gaussian Splattings Suyoung Lee et.al. 2410.20686 link
2024-10-27 Normal-GS: 3D Gaussian Splatting with Normal-Involved Rendering Meng Wei et.al. 2410.20593 null
2024-10-27 Sebica: Lightweight Spatial and Efficient Bidirectional Channel Attention Super Resolution Network Chongxiao Liu et.al. 2410.20546 link
2024-10-27 Enhancing Community Vision Screening -- AI Driven Retinal Photography for Early Disease Detection and Patient Trust Xiaofeng Lei et.al. 2410.20309 null
2024-10-27 GUMBEL-NERF: Representing Unseen Objects as Part-Compositional Neural Radiance Fields Yusuke Sekikawa et.al. 2410.20306 null
2024-10-26 OAR-Weighted Dice Score: A spatially aware, radiosensitivity aware metric for target structure contour quality assessment Lucas McCullum et.al. 2410.20243 null
2024-10-26 Cross-Platform Neural Video Coding: A Case Study Ruhan Conceição et.al. 2410.20145 null
2024-10-26 Super-resolved virtual staining of label-free tissue using diffusion models Yijie Zhang et.al. 2410.20073 null
2024-10-25 The Galaxy Zoo Catalogs for the Galaxy And Mass Assembly (GAMA) Survey Benne W. Holwerda et.al. 2410.19985 null
2024-10-25 FasterCache: Training-Free Video Diffusion Model Acceleration with High Quality Zhengyao Lv et.al. 2410.19355 null
2024-10-25 Simpler Diffusion (SiD2): 1.5 FID on ImageNet512 with pixel-space diffusion Emiel Hoogeboom et.al. 2410.19324 null
2024-10-24 Optimising image capture for low-light widefield quantitative fluorescence microscopy Zane Peterkovic et.al. 2410.19210 null
2024-10-24 Sort-free Gaussian Splatting via Weighted Sum Rendering Qiqi Hou et.al. 2410.18931 null
2024-10-24 SafeBench: A Safety Evaluation Framework for Multimodal Large Language Models Zonghao Ying et.al. 2410.18927 null
2024-10-24 Robust Watermarking Using Generative Priors Against Image Editing: From Benchmarking to Advances Shilin Lu et.al. 2410.18775 link
2024-10-24 Advancements in Image Resolution: Super-Resolution Algorithm for Enhanced EOS-06 OCM-3 Data Ankur Garg et.al. 2410.18690 null
2024-10-24 ODDN: Addressing Unpaired Data Challenges in Open-World Deepfake Detection on Online Social Networks Renshuai Tao et.al. 2410.18687 null
2024-10-24 Knowledge Distillation Using Frontier Open-source LLMs: Generalizability and the Role of Synthetic Data Anup Shirgaonkar et.al. 2410.18588 null
2024-10-24 ToolFlow: Boosting LLM Tool-Calling Through Natural and Coherent Dialogue Synthesis Zezhong Wang et.al. 2410.18447 null
2024-10-24 FreCaS: Efficient Higher-Resolution Image Generation via Frequency-aware Cascaded Sampling Zhengqiang Zhang et.al. 2410.18410 link
2024-10-23 Neural Cover Selection for Image Steganography Karl Chahine et.al. 2410.18216 link
2024-10-23 In-Pixel Foreground and Contrast Enhancement Circuits with Customizable Mapping Md Rahatul Islam Udoy et.al. 2410.18052 null
2024-10-23 Scalable Ranked Preference Optimization for Text-to-Image Generation Shyamgopal Karthik et.al. 2410.18013 null
2024-10-23 Together We Can: Multilingual Automatic Post-Editing for Low-Resource Languages Sourabh Deoghare et.al. 2410.17973 null
2024-10-23 Non-intrusive Speech Quality Assessment with Diffusion Models Trained on Clean Speech Danilo de Oliveira et.al. 2410.17834 null
2024-10-23 TopoQA: a topological deep learning-based approach for protein complex structure interface quality assessment Bingqing Han et.al. 2410.17815 null
2024-10-23 An Intelligent Agentic System for Complex Image Restoration Problems Kaiwen Zhu et.al. 2410.17809 link
2024-10-24 Testing Deep Learning Recommender Systems Models on Synthetic GAN-Generated Datasets Jesús Bobadilla et.al. 2410.17651 null
2024-10-25 Comprehensive Evaluation of Matrix Factorization Models for Collaborative Filtering Recommender Systems Jesús Bobadilla et.al. 2410.17644 null
2024-10-23 Bilateral Hippocampi Segmentation in Low Field MRIs Using Mutual Feature Learning via Dual-Views Himashi Peiris et.al. 2410.17502 link
2024-10-21 MvDrag3D: Drag-based Creative 3D Editing via Multi-view Generation-Reconstruction Priors Honghua Chen et.al. 2410.16272 null
2024-10-21 Multispectral Texture Synthesis using RGB Convolutional Neural Networks Sélim Ollivier et.al. 2410.16019 null
2024-10-22 Wireless Link Quality Estimation Using LSTM Model Yuki Kanto et.al. 2410.15357 null
2024-10-19 A Survey on All-in-One Image Restoration: Taxonomy, Evaluation and Future Trends Junjun Jiang et.al. 2410.15067 link
2024-10-18 DRACO: Differentiable Reconstruction for Arbitrary CBCT Orbits Chengze Ye et.al. 2410.14900 link
2024-10-18 Dynamic Negative Guidance of Diffusion Models Felix Koulischer et.al. 2410.14398 link
2024-10-18 Gaia Data Release 3: spectroscopic binary-star orbital solutions and the SB1 processing chain E. Gosset et.al. 2410.14372 null
2024-10-18 2D-3D Deformable Image Registration of Histology Slide and Micro-CT with ML-based Initialization Junan Chen et.al. 2410.14343 null
2024-10-18 Advanced Underwater Image Quality Enhancement via Hybrid Super-Resolution Convolutional Neural Networks and Multi-Scale Retinex-Based Defogging Techniques Yugandhar Reddy Gogireddy et.al. 2410.14285 null
2024-10-18 Takin-ADA: Emotion Controllable Audio-Driven Animation with Canonical and Landmark Loss Optimization Bin Lin et.al. 2410.14283 null
2024-10-18 Combining Hough Transform and Deep Learning Approaches to Reconstruct ECG Signals From Printouts Felix Krones et.al. 2410.14185 null
2024-10-18 Unlabeled Action Quality Assessment Based on Multi-dimensional Adaptive Constrained Dynamic Time Warping Renguang Chen et.al. 2410.14161 null
2024-10-17 Generating Signed Language Instructions in Large-Scale Dialogue Systems Mert İnan et.al. 2410.14026 null
2024-10-17 Fluid: Scaling Autoregressive Text-to-image Generative Models with Continuous Tokens Lijie Fan et.al. 2410.13863 null
2024-10-15 Comparison of Image Preprocessing Techniques for Vehicle License Plate Recognition Using OCR: Performance and Accuracy Evaluation Renato Augusto Tavares et.al. 2410.13622 null
2024-10-17 L3DG: Latent 3D Gaussian Diffusion Barbara Roessle et.al. 2410.13530 null
2024-10-17 Enhancing Crowdsourced Audio for Text-to-Speech Models José Giraldo et.al. 2410.13357 null
2024-10-17 Active inference and deep generative modeling for cognitive ultrasound Ruud JG van Sloun et.al. 2410.13310 null
2024-10-17 Latent Image and Video Resolution Prediction using Convolutional Neural Networks Rittwika Kansabanik et.al. 2410.13227 null
2024-10-17 Anchored Alignment for Self-Explanations Enhancement Luis Felipe Villa-Arenas et.al. 2410.13216 null
2024-10-17 Using RLHF to align speech enhancement approaches to mean-opinion quality scores Anurag Kumar et.al. 2410.13182 null
2024-10-16 Super-resolving Real-world Image Illumination Enhancement: A New Dataset and A Conditional Diffusion Model Yang Liu et.al. 2410.12961 null
2024-10-16 Embedding an Ethical Mind: Aligning Text-to-Image Synthesis via Lightweight Value Optimization Xingqi Wang et.al. 2410.12700 link
2024-10-16 SWIM: An Attention-Only Model for Speech Quality Assessment Under Subjective Variance Imran E Kibria et.al. 2410.12675 null
2024-10-16 MambaPainter: Neural Stroke-Based Rendering in a Single Step Tomoya Sawada et.al. 2410.12524 link
2024-10-16 Conditional Outcome Equivalence: A Quantile Alternative to CATE Josh Givens et.al. 2410.12454 link
2024-10-16 Triplet: Triangle Patchlet for Mesh-Based Inverse Rendering and Scene Parameters Approximation Jiajie Yang et.al. 2410.12414 link
2024-10-14 Learnable Optimization-Based Algorithms for Low-Dose CT Reconstruction Daisy Chen et.al. 2410.11903 null
2024-10-15 Generative Image Steganography Based on Point Cloud Zhong Yangjie et.al. 2410.11673 null
2024-10-15 Fast Local Neural Regression for Low-Cost, Path Traced Lambertian Global Illumination Arturo Salmi et.al. 2410.11625 null
2024-10-15 Rician Denoising Diffusion Probabilistic Models For Sodium Breast MRI Enhancement Shuaiyu Yuan et.al. 2410.11511 null
2024-10-15 Visual-Geometric Collaborative Guidance for Affordance Learning Hongchen Luo et.al. 2410.11363 link
2024-10-15 Evolutionary Retrofitting Mathurin Videau et.al. 2410.11330 null
2024-10-14 Watching the Watchers: Exposing Gender Disparities in Machine Translation Quality Estimation Emmanouil Zaranis et.al. 2410.10995 null
2024-10-14 LVD-2M: A Long-take Video Dataset with Temporally Dense Captions Tianwei Xiong et.al. 2410.10816 link
2024-10-14 Cavia: Camera-controllable Multi-view Video Diffusion with View-Integrated Attention Dejia Xu et.al. 2410.10774 null
2024-10-14 LISAC: Learned Coded Waveform Design for ISAC with OFDM Chenghong Bian et.al. 2410.10711 null
2024-10-14 A Novel No-Reference Image Quality Metric For Assessing Sharpness In Satellite Imagery Lucas Gonzalo Antonel et.al. 2410.10488 null
2024-10-14 Two-Stage Approach for Brain MR Image Synthesis: 2D Image Synthesis and 3D Refinement Jihoon Cho et.al. 2410.10269 null
2024-10-14 Saliency Guided Optimization of Diffusion Latents Xiwen Wang et.al. 2410.10257 null
2024-10-14 QE-EBM: Using Quality Estimators as Energy Loss for Machine Translation Gahyun Yoo et.al. 2410.10228 null
2024-10-14 Automated Filtering of Human Feedback Data for Aligning Text-to-Image Diffusion Models Yongjin Yang et.al. 2410.10166 null
2024-10-14 StegaINR4MIH: steganography by implicit neural representation for multi-image hiding Weina Dong et.al. 2410.10117 link
2024-10-13 Crowd IQ -- Aggregating Opinions to Boost Performance Michal Kosinski et.al. 2410.10004 null
2024-10-13 Combining Generative and Geometry Priors for Wide-Angle Portrait Correction Lan Yao et.al. 2410.09911 link
2024-10-13 Two-Stage Human Verification using HandCAPTCHA and Anti-Spoofed Finger Biometrics with Feature Selection Asish Bera et.al. 2410.09866 null
2024-10-12 Preserving Old Memories in Vivid Detail: Human-Interactive Photo Restoration Framework Seung-Yeon Back et.al. 2410.09529 null
2024-10-12 Fine-grained subjective visual quality assessment for high-fidelity compressed images Michela Testolina et.al. 2410.09501 link
2024-10-12 Enhancing Single Image to 3D Generation using Gaussian Splatting and Hybrid Diffusion Priors Hritam Basak et.al. 2410.09467 null
2024-10-11 TD-Paint: Faster Diffusion Inpainting Through Time Aware Pixel Conditioning Tsiry Mayet et.al. 2410.09306 null
2024-10-11 SceneCraft: Layout-Guided 3D Scene Generation Xiuyu Yang et.al. 2410.09049 link
2024-10-11 Learning Interaction-aware 3D Gaussian Splatting for One-shot Hand Avatars Xuan Huang et.al. 2410.08840 link
2024-10-11 Towards virtual painting recolouring using Vision Transformer on X-Ray Fluorescence datacubes Alessandro Bombini et.al. 2410.08826 null
2024-10-11 A Theoretical Framework for AI-driven data quality monitoring in high-volume data environments Nikhil Bangad et.al. 2410.08576 null
2024-10-11 Context-Aware Full Body Anonymization using Text-to-Image Diffusion Models Pascl Zwick et.al. 2410.08551 link
2024-10-11 Quality Prediction of AI Generated Images and Videos: Emerging Trends and Opportunities Abhijay Ghildyal et.al. 2410.08534 null
2024-10-10 Koala-36M: A Large-scale Video Dataset Improving Consistency between Fine-grained Conditions and Video Content Qiuheng Wang et.al. 2410.08260 null
2024-10-10 Exploring ASR-Based Wav2Vec2 for Automated Speech Disorder Assessment: Insights and Analysis Tuan Nguyen et.al. 2410.08250 null
2024-10-10 ZeroComp: Zero-shot Object Compositing from Image Intrinsics via Diffusion Zitian Zhang et.al. 2410.08168 link
2024-10-10 Efficient Perspective-Correct 3D Gaussian Splatting Using Hybrid Transparency Florian Hahlbohm et.al. 2410.08129 null
2024-10-10 Medical Image Quality Assessment based on Probability of Necessity and Sufficiency Boyu Chen et.al. 2410.08118 null
2024-10-10 High-redshift LBG selection from broadband and wide photometric surveys using a Random Forest algorithm C. Payerne et.al. 2410.08062 null
2024-10-10 Modeling User Preferences with Automatic Metrics: Creating a High-Quality Preference Dataset for Machine Translation Sweta Agrawal et.al. 2410.07779 null
2024-10-10 Synthesizing Multi-Class Surgical Datasets with Anatomy-Aware Diffusion Models Danush Kumar Venkatesh et.al. 2410.07753 link
2024-10-10 Multi-Facet Counterfactual Learning for Content Quality Evaluation Jiasheng Zheng et.al. 2410.07693 null
2024-10-10 DPL: Cross-quality DeepFake Detection via Dual Progressive Learning Dongliang Zhang et.al. 2410.07633 null
2024-10-10 Rank Aggregation in Crowdsourcing for Listwise Annotations Wenshui Luo et.al. 2410.07538 null
2024-10-10 A 3D-Printed Table for Hybrid X-ray CT and Optical Imaging of a Live Mouse Wenxuan Xue et.al. 2410.07517 null
2024-10-09 An undetectable watermark for generative image models Sam Gunn et.al. 2410.07369 link
2024-10-09 Secure Video Quality Assessment Resisting Adversarial Attacks Ao-Xiang Zhang et.al. 2410.06866 null
2024-10-09 Diff-FMT: Diffusion Models for Fluorescence Molecular Tomography Qianqian Xue et.al. 2410.06757 null
2024-10-09 MimicTalk: Mimicking a personalized and expressive 3D talking face in minutes Zhenhui Ye et.al. 2410.06734 null
2024-10-09 Perceptual Quality Assessment of Octree-RAHT Encoded 3D Point Clouds Dongshuai Duan et.al. 2410.06729 link
2024-10-09 Perceptual Quality Assessment of Trisoup-Lifting Encoded 3D Point Clouds Juncheng Long et.al. 2410.06689 link
2024-10-09 SCOREQ: Speech Quality Assessment with Contrastive Regression Alessandro Ragano et.al. 2410.06675 link
2024-10-09 InstantIR: Blind Image Restoration with Instant Generative Reference Jen-Yuan Huang et.al. 2410.06551 null
2024-10-08 Are Large Language Models State-of-the-art Quality Estimators for Machine Translation of User-generated Content? Shenbin Qian et.al. 2410.06338 link
2024-10-08 Automated quality assessment using appearance-based simulations and hippocampus segmentation on low-field paediatric brain MR images Vaanathi Sundaresan et.al. 2410.06161 link
2024-10-08 Toward Scalable Image Feature Compression: A Content-Adaptive and Diffusion-Based Approach Sha Guo et.al. 2410.06149 null
2024-10-08 AP-LDM: Attentive and Progressive Latent Diffusion Model for Training-Free High-Resolution Image Generation Boyuan Cao et.al. 2410.06055 link
2024-10-08 Is the MMI Criterion Necessary for Interpretability? Degenerating Non-causal Features to Plain Noise for Self-Rationalization Wei Liu et.al. 2410.06003 link
2024-10-08 Integrating Online Learning and Connectivity Maintenance for Communication-Aware Multi-Robot Coordination Yupeng Yang et.al. 2410.05798 link
2024-10-08 T2V-Turbo-v2: Enhancing Video Generation Model Post-Training through Data, Reward, and Conditional Guidance Design Jiachen Li et.al. 2410.05677 null
2024-10-08 Holistic Unlearning Benchmark: A Multi-Faceted Evaluation for Text-to-Image Diffusion Model Unlearning Saemi Moon et.al. 2410.05664 null
2024-10-08 Rethinking Reward Model Evaluation: Are We Barking up the Wrong Tree? Xueru Wen et.al. 2410.05584 null
2024-10-07 Image Watermarks are Removable Using Controllable Regeneration from Clean Noise Yepeng Liu et.al. 2410.05470 null
2024-10-07 SharpSLAM: 3D Object-Oriented Visual SLAM with Deblurring for Agile Drones Denis Davletshin et.al. 2410.05405 null
2024-10-07 Towards a Modern and Lightweight Rendering Engine for Dynamic Robotic Simulations Christopher John Allison et.al. 2410.05095 null
2024-10-07 Real-time cardiac cine MRI -- A comparison of a diffusion probabilistic model with alternative state-of-the-art image reconstruction techniques for undersampled spiral acquisitions Oliver Schad et.al. 2410.04843 null
2024-10-07 Learning Efficient and Effective Trajectories for Differential Equation-based Image Restoration Zhiyu Zhu et.al. 2410.04811 link
2024-10-07 Transforming Color: A Novel Image Colorization Method Hamza Shafiq et.al. 2410.04799 null
2024-10-07 CAR: Controllable Autoregressive Modeling for Visual Generation Ziyu Yao et.al. 2410.04671 link
2024-10-07 Federated Learning Nodes Can Reconstruct Peers' Image Data Ethan Wilson et.al. 2410.04661 null
2024-10-06 Towards Unsupervised Blind Face Restoration using Diffusion Prior Tianshu Kuai et.al. 2410.04618 null
2024-10-06 How Does the Disclosure of AI Assistance Affect the Perceptions of Writing? Zhuoyan Li et.al. 2410.04545 null
2024-10-06 VideoGuide: Improving Video Diffusion Models without Training Through a Teacher's Guide Dohun Lee et.al. 2410.04364 null
2024-10-05 Persona Knowledge-Aligned Prompt Tuning Method for Online Debate Chunkit Chan et.al. 2410.04239 link
2024-10-05 AIM 2024 Challenge on Video Super-Resolution Quality Assessment: Methods and Results Ivan Molodetskikh et.al. 2410.04225 null
2024-10-05 Deep Transfer Learning Based Peer Review Aggregation and Meta-review Generation for Scientific Articles Md. Tarek Hasan et.al. 2410.04202 null
2024-10-05 Overcoming False Illusions in Real-World Face Restoration with Multi-Modal Guided Diffusion Model Keda Tao et.al. 2410.04161 null
2024-10-05 Can the Variation of Model Weights be used as a Criterion for Self-Paced Multilingual NMT? Àlex R. Atrio et.al. 2410.04147 null
2024-10-05 Beyond Imperfections: A Conditional Inpainting Approach for End-to-End Artifact Removal in VTON and Pose Transfer Aref Tabatabaei et.al. 2410.04052 null
2024-10-04 LANTERN: Accelerating Visual Autoregressive Models with Relaxed Speculative Decoding Doohyuk Jang et.al. 2410.03355 null
2024-10-04 CLOVE: Travelling Salesman's approach to hyperbolic embeddings of complex networks with communities Sámuel G. Balogh et.al. 2410.03270 null
2024-10-04 Parallel Corpus Augmentation using Masked Language Models Vibhuti Kumari et.al. 2410.03194 null
2024-10-04 ECHOPulse: ECG controlled echocardio-grams video generation Yiwei Li et.al. 2410.03143 link
2024-10-03 Diffusion-based Extreme Image Compression with Compressed Feature Initialization Zhiyuan Li et.al. 2410.02640 link
2024-10-03 An Improved Variational Method for Image Denoising Jing-En Huang et.al. 2410.02587 null
2024-10-03 Combining Pre- and Post-Demosaicking Noise Removal for RAW Video Marco Sánchez-Beeckman et.al. 2410.02572 null
2024-10-03 Dog-IQA: Standard-guided Zero-shot MLLM for Mix-grained Image Quality Assessment Kai Liu et.al. 2410.02505 link
2024-10-03 Eliminating Oversaturation and Artifacts of High Guidance Scales in Diffusion Models Seyedmorteza Sadat et.al. 2410.02416 null
2024-10-03 Morphological evaluation of subwords vocabulary used by BETO language model Óscar García-Sierra et.al. 2410.02283 null
2024-10-03 SC-CDM: Enhancing Quality of Image Semantic Communication with a Compact Diffusion Model Kexin Zhang et.al. 2410.02121 null
2024-10-02 DisEnvisioner: Disentangled and Enriched Visual Prompt for Customized Image Generation Jing He et.al. 2410.02067 null
2024-10-02 Impact of White-Box Adversarial Attacks on Convolutional Neural Networks Rakesh Podder et.al. 2410.02043 null
2024-10-02 Social Media Authentication and Combating Deepfakes using Semi-fragile Invisible Image Watermarking Aakash Varma Nadimpalli et.al. 2410.01906 null
2024-10-02 Enhancing LLM Fine-tuning for Text-to-SQLs by SQL Quality Measurement Shouvon Sarker et.al. 2410.01869 null
2024-10-02 ComfyGen: Prompt-Adaptive Workflows for Text-to-Image Generation Rinon Gal et.al. 2410.01731 null
2024-10-04 HarmoniCa: Harmonizing Training and Inference for Better Feature Cache in Diffusion Transformer Acceleration Yushi Huang et.al. 2410.01723 null
2024-10-02 Accelerating Auto-regressive Text-to-Image Generation with Training-free Speculative Jacobi Decoding Yao Teng et.al. 2410.01699 link
2024-10-02 SAFE: Semantic Adaptive Feature Extraction with Rate Control for 6G Wireless Communications Yuna Yan et.al. 2410.01597 null
2024-10-02 Frequency-Dependent F-Numbers Suppress Grating Lobes and Improve the Lateral Resolution in Line-by-Line Scanning Martin F. Schiffner et.al. 2410.01593 null
2024-10-02 Imaging foundation model for universal enhancement of non-ideal measurement CT Yuxin Liu et.al. 2410.01591 link
2024-10-02 HARMONI at ELT: tolerance analysis and expected as-build imaging performance of the infrared spectrograph Eduard Muslimov et.al. 2410.01581 null
2024-10-02 Adaptive Radiofrequency Shimming in MRI using Reconfigurable Dielectric Materials Paulina Šiurytė et.al. 2410.01501 null
2024-10-02 Quo Vadis RankList-based System in Face Recognition? Xinyi Zhang et.al. 2410.01498 null
2024-10-02 Design of a custom wideband camera for MISTRAL imager-spectrograph Eduard Muslimov et.al. 2410.01414 null
2024-10-02 CSIM: A Copula-based similarity index sensitive to local changes for Image quality assessment Safouane El Ghazouali et.al. 2410.01411 link
2024-10-01 Generating Seamless Virtual Immunohistochemical Whole Slide Images with Content and Color Consistency Sitong Liu et.al. 2410.01072 null
2024-10-01 LaDTalk: Latent Denoising for Synthesizing Talking Head Videos with High Frequency Details Jian Yang et.al. 2410.00990 null
2024-10-01 Energy-Quality-aware Variable Framerate Pareto-Front for Adaptive Video Streaming Prajit T Rajendran et.al. 2410.00849 null
2024-10-01 Maximum entropy and quantized metric models for absolute category ratings Dietmar Saupe et.al. 2410.00817 null
2024-10-01 Basis function compression for field probe monitoring Paul Dubovan et.al. 2410.00754 null
2024-10-01 Development of the normalization method for the first large field-of-view plastic-based PET Modular scanner A. Coussat et.al. 2410.00669 null
2024-10-01 Contribution of soundscape appropriateness to soundscape quality assessment in space: a mediating variable affecting acoustic comfort Xinhao Yang et.al. 2410.00667 null
2024-10-01 AutoTM 2.0: Automatic Topic Modeling Framework for Documents Analysis Maria Khodorchenko et.al. 2410.00655 null
2024-10-01 Dynamic and Scalable Data Preparation for Object-Centric Process Mining Lien Bosmans et.al. 2410.00596 null
2024-09-30 UIR-LoRA: Achieving Universal Image Restoration through Multiple Low-Rank Adaptation Cheng Zhang et.al. 2409.20197 link
2024-09-30 Segmenting Wood Rot using Computer Vision Models Roland Kammerbauer et.al. 2409.20137 null
2024-09-30 Machine Learning in Industrial Quality Control of Glass Bottle Prints Maximilian Bundscherer et.al. 2409.20132 null
2024-09-30 Q-Bench-Video: Benchmarking the Video Quality Understanding of LMMs Zicheng Zhang et.al. 2409.20063 null
2024-09-30 Is Preference Alignment Always the Best Option to Enhance LLM-Based Translation? An Empirical Analysis Hippolyte Gisserot-Boukhlef et.al. 2409.20059 null
2024-10-01 UniSumEval: Towards Unified, Fine-Grained, Multi-Dimensional Summarization Evaluation for LLMs Yuho Lee et.al. 2409.19898 link
2024-09-29 OrganiQ: Mitigating Classical Resource Bottlenecks of Quantum Generative Adversarial Networks on NISQ-Era Machines Daniel Silver et.al. 2409.19823 null
2024-09-29 SemiDDM-Weather: A Semi-supervised Learning Framework for All-in-one Adverse Weather Removal Fang Long et.al. 2409.19679 link
2024-09-29 Effective Diffusion Transformer Architecture for Image Super-Resolution Kun Cheng et.al. 2409.19589 link
2024-09-29 High Quality Human Image Animation using Regional Supervision and Motion Blur Condition Zhongcong Xu et.al. 2409.19580 null
2024-09-27 A comprehensive review and new taxonomy on superpixel segmentation I. B. Barcelos et.al. 2409.19179 link
2024-09-27 Multimodal Pragmatic Jailbreak on Text-to-image Models Tong Liu et.al. 2409.19149 null
2024-09-27 ReviveDiff: A Universal Diffusion Model for Restoring Images in Adverse Weather Conditions Wenfeng Huang et.al. 2409.18932 null
2024-09-27 Unsupervised Low-light Image Enhancement with Lookup Tables and Diffusion Priors Yunlong Lin et.al. 2409.18899 null
2024-09-27 Effectiveness of learning-based image codecs on fingerprint storage Daniele Mari et.al. 2409.18730 link
2024-09-27 Decoding Complexity-Rate-Quality Pareto-Front for Adaptive VVC Streaming Angeliki Katsenou et.al. 2409.18713 null
2024-09-27 Align $^2$ LLaVA: Cascaded Human and Large Language Model Preference Alignment for Multi-modal Instruction Curation Hongzhe Huang et.al. 2409.18541 link
2024-09-27 Underwater Image Enhancement with Physical-based Denoising Diffusion Implicit Models Nguyen Gia Bach et.al. 2409.18476 link
2024-09-27 GenesisTex2: Stable, Consistent and High-Quality Text-to-Texture Generation Jiawei Lu et.al. 2409.18401 null
2024-09-27 SinoSynth: A Physics-based Domain Randomization Approach for Generalizable CBCT Image Enhancement Yunkui Pang et.al. 2409.18355 link
2024-09-26 FlowTurbo: Towards Real-time Flow-Based Image Generation with Velocity Refiner Wenliang Zhao et.al. 2409.18128 link
2024-09-26 Low Photon Number Non-Invasive Imaging Through Time-Varying Diffusers Adrian Makowski et.al. 2409.18072 null
2024-09-26 LightAvatar: Efficient Head Avatar as Dynamic Neural Light Field Huan Wang et.al. 2409.18057 link
2024-09-26 MARS: Multi-radio Architecture with Radio Selection using Decision Trees for emerging mesoscale CPS/IoT applications Jothi Prasanna Shanmuga Sundaram et.al. 2409.18043 null
2024-09-26 PhoCoLens: Photorealistic and Consistent Reconstruction in Lensless Imaging Xin Cai et.al. 2409.17996 null
2024-09-26 Resolving Multi-Condition Confusion for Finetuning-Free Personalized Image Generation Qihan Huang et.al. 2409.17920 link
2024-09-26 Cross-lingual Human-Preference Alignment for Neural Machine Translation with Direct Quality Optimization Kaden Uhlig et.al. 2409.17673 null
2024-09-26 FlowMAC: Conditional Flow Matching for Audio Coding at Low Bit Rates Nicola Pia et.al. 2409.17635 null
2024-09-26 Pixel-Space Post-Training of Latent Diffusion Models Christina Zhang et.al. 2409.17565 null
2024-09-26 Study of Subjective and Objective Quality in Super-Resolution Enhanced Broadcast Images on a Novel SR-IQA Dataset Yongrok Kim et.al. 2409.17451 null
2024-09-25 DreamWaltz-G: Expressive 3D Gaussian Avatars from Skeleton-Guided 2D Diffusion Yukun Huang et.al. 2409.17145 link
2024-09-25 Text2CAD: Generating Sequential CAD Models from Beginner-to-Expert Level Text Prompts Mohammad Sadil Khan et.al. 2409.17106 link
2024-09-25 Language-oriented Semantic Communication for Image Transmission with Fine-Tuned Diffusion Model Xinfeng Wei et.al. 2409.17104 null
2024-09-25 The effect of image quality on galaxy merger identification with deep learning Robert W. Bickley et.al. 2409.17081 null
2024-09-25 Degradation-Guided One-Step Image Super-Resolution with Diffusion Priors Aiping Zhang et.al. 2409.17058 link
2024-09-25 MaViLS, a Benchmark Dataset for Video-to-Slide Alignment, Assessing Baseline Accuracy with a Multimodal Alignment Algorithm Leveraging Speech, OCR, and Visual Features Katharina Anderer et.al. 2409.16765 link
2024-09-25 Pix2Next: Leveraging Vision Foundation Models for RGB to NIR Image Translation Youngwan Jin et.al. 2409.16706 null
2024-09-25 In which fields can ChatGPT detect journal article quality? An evaluation of REF2021 results Mike Thelwall et.al. 2409.16695 null
2024-09-25 Morphological-consistent Diffusion Network for Ultrasound Coronal Image Enhancement Yihao Zhou et.al. 2409.16661 null
2024-09-25 Pre-trained Language Models Return Distinguishable Probability Distributions to Unfaithfully Hallucinated Texts Taehun Cha et.al. 2409.16658 link
2024-09-25 Enabling Auditory Large Language Models for Automatic Speech Quality Evaluation Siyin Wang et.al. 2409.16644 null
2024-09-25 DeformStream: Deformation-based Adaptive Volumetric Video Streaming Boyan Li et.al. 2409.16615 null
2024-09-25 Prompt Sliders for Fine-Grained Control, Editing and Erasing of Concepts in Diffusion Models Deepak Sridhar et.al. 2409.16535 link
2024-09-24 Low Latency Point Cloud Rendering with Learned Splatting Yueyu Hu et.al. 2409.16504 link
2024-09-24 A Unified Hallucination Mitigation Framework for Large Vision-Language Models Yue Chang et.al. 2409.16494 link
2024-09-24 AIM 2024 Challenge on UHD Blind Photo Quality Assessment Vlad Hosu et.al. 2409.16271 null
2024-09-26 Enhanced Unsupervised Image-to-Image Translation Using Contrastive Learning and Histogram of Oriented Gradients Wanchen Zhao et.al. 2409.16042 null
2024-09-24 Deep chroma compression of tone-mapped images Xenios Milidonis et.al. 2409.16032 link
2024-09-24 VascX Models: Model Ensembles for Retinal Vascular Analysis from Color Fundus Images Jose Vargas Quiros et.al. 2409.16016 link
2024-09-24 Semantics-Controlled Gaussian Splatting for Outdoor Scene Reconstruction and Rendering in Virtual Reality Hannah Schieber et.al. 2409.15959 null
2024-09-24 Unsupervised dMRI Artifact Detection via Angular Resolution Enhancement and Cycle Consistency Learning Sheng Chen et.al. 2409.15883 null
2024-09-25 Ring Artifacts Removal Based on Implicit Neural Representation of Sinogram Data Ligen Shi et.al. 2409.15731 null
2024-09-23 Blind Localization of Early Room Reflections with Arbitrary Microphone Array Yogev Hadadi et.al. 2409.15484 null
2024-09-23 Simplifying Triangle Meshes in the Wild Hsueh-Ti Derek Liu et.al. 2409.15458 null
2024-09-23 MIMAFace: Face Animation via Motion-Identity Modulated Appearance Feature Learning Yue Han et.al. 2409.15179 null
2024-09-23 Advancing Video Quality Assessment for AIGC Xinli Yue et.al. 2409.14888 null
2024-09-23 Revisiting Video Quality Assessment from the Perspective of Generalization Xinli Yue et.al. 2409.14847 link
2024-09-23 AIM 2024 Challenge on Video Saliency Prediction: Methods and Results Andrey Moskalenko et.al. 2409.14827 link
2024-09-23 HiFi-Glot: Neural Formant Synthesis with Differentiable Resonant Filters Lauri Juvela et.al. 2409.14823 null
2024-09-22 Robust Audio-Visual Speech Enhancement: Correcting Misassignments in Complex Environments with Advanced Post-Processing Wenze Ren et.al. 2409.14554 null
2024-09-22 Improved direction of arrival estimations with a wearable microphone array for dynamic environments by reliability weighting Daniel A. Mitchell et.al. 2409.14346 null
2024-09-22 MQM-APE: Toward High-Quality Error Annotation Predictors with Automatic Post-Editing in LLM Translation Evaluators Qingyu Lu et.al. 2409.14335 link
2024-09-22 Quantitative and Qualitative Evaluation of NLM and Wavelet Methods in Image Enhancement Cameron Khanpour et.al. 2409.14334 null
2024-09-21 JVID: Joint Video-Image Diffusion for Visual-Quality and Temporal-Consistency in Video Generation Hadrien Reynaud et.al. 2409.14149 null
2024-09-21 N-Version Assessment and Enhancement of Generative AI Marcus Kessel et.al. 2409.14071 null
2024-09-18 An Efficient Projection-Based Next-best-view Planning Framework for Reconstruction of Unknown Objects Zhizhou Jia et.al. 2409.12096 null
2024-09-18 Dense-TSNet: Dense Connected Two-Stage Structure for Ultra-Lightweight Speech Enhancement Zizhen Lin et.al. 2409.11725 null
2024-09-18 DAF-Net: A Dual-Branch Feature Decomposition Fusion Network with Domain Adaptive for Infrared and Visible Image Fusion Jian Xu et.al. 2409.11642 link
2024-09-17 Noise-aware Dynamic Image Denoising and Positron Range Correction for Rubidium-82 Cardiac PET Imaging via Self-supervision Huidong Xie et.al. 2409.11543 null
2024-09-17 Online 4D Ultrasound-Guided Robotic Tracking Enables 3D Ultrasound Localisation Microscopy with Large Tissue Displacements Jipeng Yan et.al. 2409.11391 null
2024-09-17 Ultrasound Image Enhancement with the Variance of Diffusion Models Yuxin Zhang et.al. 2409.11380 link
2024-09-17 Uncertainty and Prediction Quality Estimation for Semantic Segmentation via Graph Neural Networks Edgar Heinert et.al. 2409.11373 link
2024-09-17 Edge-based Denoising Image Compression Ryugo Morita et.al. 2409.10978 null
2024-09-17 CUNSB-RFIE: Context-aware Unpaired Neural Schrödinger Bridge in Retinal Fundus Image Enhancement Xuanzhao Dong et.al. 2409.10966 link
2024-09-17 Towards Effective User Attribution for Latent Diffusion Models via Watermark-Informed Blending Yongyang Pan et.al. 2409.10958 null
2024-09-17 Neural Fields for Adaptive Photoacoustic Computed Tomography Tianao Li et.al. 2409.10876 link
2024-09-16 Investigating Training Objectives for Generative Speech Enhancement Julius Richter et.al. 2409.10753 link
2024-09-16 Taming Diffusion Models for Image Restoration: A Review Ziwei Luo et.al. 2409.10353 null
2024-09-16 FGR-Net:Interpretable fundus imagegradeability classification based on deepreconstruction learning Saif Khalid et.al. 2409.10246 null
2024-09-16 RF-GML: Reference-Free Generative Machine Listener Arijit Biswas et.al. 2409.10210 null
2024-09-16 Towards Explainable Automated Data Quality Enhancement without Domain Knowledge Djibril Sarr et.al. 2409.10139 null
2024-09-16 2S-ODIS: Two-Stage Omni-Directional Image Synthesis by Geometric Distortion Correction Atsuya Nakata et.al. 2409.09969 link
2024-09-15 A Global Perspective on the Past, Present, and Future of Video Streaming over Starlink Liz Izhikevich et.al. 2409.09846 null
2024-09-15 Underwater Image Enhancement via Dehazing and Color Restoration Chengqin Wu et.al. 2409.09779 null
2024-09-15 High Definition Map Mapping and Update: A General Overview and Future Directions Benny Wijaya et.al. 2409.09726 null
2024-09-15 Superconducting and low temperature RF Coils for Ultra-Low-Field MRI: A Study on SNR Performance Aditya A Bhosale et.al. 2409.09608 null
2024-09-14 Estimating Neural Orientation Distribution Fields on High Resolution Diffusion MRI Scans Mohammed Munzer Dwedari et.al. 2409.09387 link
2024-09-13 Emerging Reliance Behaviors in Human-AI Text Generation: Hallucinations, Data Quality Assessment, and Cognitive Forcing Functions Zahra Ashktorab et.al. 2409.08937 null
2024-09-13 Confocal Raman Microscopy with Adaptive Optics J. D. Munoz-Bolanos et.al. 2409.08725 null
2024-09-13 Joint image reconstruction and segmentation of real-time cardiac MRI in free-breathing using a model based on disentangled representation learning Tobias Wech et.al. 2409.08619 null
2024-09-13 DiffFAS: Face Anti-Spoofing via Generative Diffusion Models Xinxu Ge et.al. 2409.08572 link
2024-09-13 CasDyF-Net: Image Dehazing via Cascaded Dynamic Filters Wang Yinglong et.al. 2409.08510 link
2024-09-12 OpenACE: An Open Benchmark for Evaluating Audio Coding Performance Jozef Coldenhoff et.al. 2409.08374 link
2024-09-12 Expansive Supervision for Neural Radiance Field Weixiang Zhang et.al. 2409.08056 null
2024-09-12 OCTAMamba: A State-Space Model Approach for Precision OCTA Vasculature Segmentation Shun Zou et.al. 2409.08000 link
2024-09-14 Exploring Kolmogorov-Arnold networks for realistic image sharpness assessment Shaode Yu et.al. 2409.07762 null
2024-09-11 Foundation Models Boost Low-Level Perceptual Similarity Metrics Abhijay Ghildyal et.al. 2409.07650 link
2024-09-11 Machine Learning and Constraint Programming for Efficient Healthcare Scheduling Aymen Ben Said et.al. 2409.07547 null
2024-09-11 FreeEnhance: Tuning-Free Image Enhancement via Content-Consistent Noising-and-Denoising Process Yang Luo et.al. 2409.07451 null
2024-09-11 EMOdiffhead: Continuously Emotional Control in Talking Head Generation via Diffusion Jian Zhang et.al. 2409.07255 link
2024-09-12 3DGCQA: A Quality Assessment Database for 3D AI-Generated Contents Yingjie Zhou et.al. 2409.07236 link
2024-09-11 Phantom-based gradient waveform measurements with compensated variable-prephasing: Description and application to EPI at 7T Hannah Scholten et.al. 2409.07203 null
2024-09-11 Attention Down-Sampling Transformer, Relative Ranking and Self-Consistency for Blind Image Quality Assessment Mohammed Alsaafin et.al. 2409.07115 link
2024-09-11 CPSample: Classifier Protected Sampling for Guarding Training Data During Diffusion Joshua Kazdan et.al. 2409.07025 null
2024-09-11 AdvLogo: Adversarial Patch Attack against Object Detectors based on Diffusion Models Boming Miao et.al. 2409.07002 null
2024-09-10 ExIQA: Explainable Image Quality Assessment Using Distortion Attributes Sepehr Kazemi Ranjbar et.al. 2409.06853 null
2024-09-10 Universal End-to-End Neural Network for Lossy Image Compression Bouzid Arezki et.al. 2409.06586 null
2024-09-10 Three-dimensional generative adversarial networks for turbulent flow estimation from wall measurements Antonio Cuéllar et.al. 2409.06548 null
2024-09-11 AMNS: Attention-Weighted Selective Mask and Noise Label Suppression for Text-to-Image Person Retrieval Runqing Zhang et.al. 2409.06385 null
2024-09-10 Multi-Weather Image Restoration via Histogram-Based Transformer Feature Enhancement Yang Wen et.al. 2409.06334 null
2024-09-10 DeWinder: Single-Channel Wind Noise Reduction using Ultrasound Sensing Kuang Yuan et.al. 2409.06137 null
2024-09-09 Enhancing Cross-Modality Synthesis: Subvolume Merging for MRI-to-CT Conversion Fuxin Fan et.al. 2409.05982 null
2024-09-09 SynMorph: Generating Synthetic Face Morphing Dataset with Mated Samples Haoyu Zhang et.al. 2409.05595 null
2024-09-09 Efficient Quality Estimation of True Random Bit-streams Cesare Caratozzolo et.al. 2409.05543 null
2024-09-09 Exploring Rich Subjective Quality Information for Image Quality Assessment in the Wild Xiongkuo Min et.al. 2409.05540 null
2024-09-09 A Taxonomy of Miscompressions: Preparing Image Forensics for Neural Compression Nora Hofer et.al. 2409.05490 null
2024-09-09 Boosting CLIP Adaptation for Image Quality Assessment via Meta-Prompt Learning and Gradient Regularization Xudong Li et.al. 2409.05381 null
2024-09-09 PersonaTalk: Bring Attention to Your Persona in Visual Dubbing Longhao Zhang et.al. 2409.05379 null
2024-09-09 BigCodec: Pushing the Limits of Low-Bitrate Neural Speech Codec Detai Xin et.al. 2409.05377 link
2024-09-09 Adaptive Offloading and Enhancement for Low-Light Video Analytics on Mobile Devices Yuanyi He et.al. 2409.05297 null
2024-09-08 Ultron: Enabling Temporal Geometry Compression of 3D Mesh Sequences using Temporal Correspondence and Mesh Deformation Haichao Zhu et.al. 2409.05151 null
2024-09-07 Plug-and-Hide: Provable and Adjustable Diffusion Generative Steganography Jiahao Zhu et.al. 2409.04878 null
2024-09-07 Metadata augmented deep neural networks for wild animal classification Aslak Tøn et.al. 2409.04825 link
2024-09-11 Fisheye-GS: Lightweight and Extensible Gaussian Splatting Module for Fisheye Cameras Zimu Liao et.al. 2409.04751 link
2024-09-06 Whole Heart Perfusion with High-Multiband Simultaneous Multislice Imaging via Linear Phase Modulated Extended Field of View (SMILE) Shen Zhao et.al. 2409.04353 link
2024-09-06 Design and Characterization of MRI-compatible Plastic Ultrasonic Motor Zhanyue Zhao et.al. 2409.04006 null
2024-09-06 Bi-modality Images Transfer with a Discrete Process Matching Method Zhe Xiong et.al. 2409.03977 null
2024-09-03 Applications and Advances of Artificial Intelligence in Music Generation:A Review Yanxu Chen et.al. 2409.03715 null
2024-09-05 Enabling Practical and Privacy-Preserving Image Processing Chao Wang et.al. 2409.03568 null
2024-09-05 Use of triplet loss for facial restoration in low-resolution images Sebastian Pulgar et.al. 2409.03530 null
2024-09-05 Improving Uncertainty-Error Correspondence in Deep Bayesian Medical Image Segmentation Prerak Mody et.al. 2409.03470 link
2024-09-05 Multiple weather images restoration using the task transformer and adaptive mixup strategy Yang Wen et.al. 2409.03249 null
2024-09-05 Perceptual-Distortion Balanced Image Super-Resolution is a Multi-Objective Optimization Problem Qiwen Zhu et.al. 2409.03179 link
2024-09-05 Large Étendue 3D Holographic Display with Content-adpative Dynamic Fourier Modulation Brian Chao et.al. 2409.03143 null
2024-09-04 Incorporating dense metric depth into neural 3D representations for view synthesis and relighting Arkadeep Narayan Chaudhury et.al. 2409.03061 null
2024-09-04 Rate-Adaptive Generative Semantic Communication Using Conditional Diffusion Models Pujing Yang et.al. 2409.02597 null
2024-09-04 Coral Model Generation from Single Images for Virtual Reality Applications Jie Fu et.al. 2409.02376 null
2024-09-04 Image Registration with Averaging Network and Edge-Based Loss for Low-SNR Cardiac MRI Xuan Lei et.al. 2409.02348 null
2024-09-03 Coaching a Robotic Sonographer: Learning Robotic Ultrasound with Sparse Expert's Feedback Deepak Raina et.al. 2409.02337 null
2024-09-03 Unveiling Deep Shadows: A Survey on Image and Video Shadow Detection, Removal, and Generation in the Era of Deep Learning Xiaowei Hu et.al. 2409.02108 link
2024-09-03 AllWeatherNet:Unified Image enhancement for autonomous driving under adverse weather and lowlight-conditions Chenghao Qian et.al. 2409.02045 link
2024-09-03 Map-Assisted Remote-Sensing Image Compression at Extremely Low Bitrates Yixuan Ye et.al. 2409.01935 link
2024-09-03 UWStereo: A Large Synthetic Dataset for Underwater Stereo Matching Qingxuan Lv et.al. 2409.01782 null
2024-09-03 Boron Isotope Effects on Raman Scattering in Bulk BN, BP, and BAs: A Density-Functional Theory Study Nima Ghafari Cherati et.al. 2409.01671 null
2024-09-03 GaussianPU: A Hybrid 2D-3D Upsampling Framework for Enhancing Color Point Clouds via 3D Gaussian Splatting Zixuan Guo et.al. 2409.01581 null
2024-09-03 Learning Task-Specific Sampling Strategy for Sparse-View CT Reconstruction Liutao Yang et.al. 2409.01544 null
2024-09-03 Long-Range Biometric Identification in Real World Scenarios: A Comprehensive Evaluation Framework Based on Missions Deniz Aykac et.al. 2409.01540 null
2024-09-02 Real-Time Multi-Scene Visibility Enhancement for Promoting Navigational Safety of Vessels Under Complex Weather Conditions Ryan Wen Liu et.al. 2409.01500 link
2024-09-02 Spectron: Target Speaker Extraction using Conditional Transformer with Adversarial Refinement Tathagata Bandyopadhyay et.al. 2409.01352 link
2024-09-02 A Roadmap to Holographic Focused Ultrasound Approaches to Generate Thermal Patterns Ceren Cengiz et.al. 2409.01323 null
2024-09-02 Investigation of the spatial resolution of PET imaging system measuring polarization-correlated Compton events Ana Marija Kožuljević et.al. 2409.01238 null
2024-09-02 MobileIQA: Exploiting Mobile-level Diverse Opinion Network For No-Reference Image Quality Assessment Using Knowledge Distillation Zewen Chen et.al. 2409.01212 link
2024-09-02 Generating Synthetic Satellite Imagery for Rare Objects: An Empirical Comparison of Models and Metrics Tuong Vy Nguyen et.al. 2409.01138 null
2024-09-02 Rapid GPU-Based Pangenome Graph Layout Jiajie Li et.al. 2409.00876 null
2024-09-01 An Optimized Binning and Probabilistic Slice Sharing Algorithm for Motion Correction in Abdominal DW-MRI Michelle Su et.al. 2409.00798 null
2024-08-30 Subspace Diffusion Posterior Sampling for Travel-Time Tomography Xiang Cao et.al. 2408.17333 null
2024-08-30 Image-Perfect Imperfections: Safety, Bias, and Authenticity in the Shadow of Text-To-Image Model Evolution Yixin Wu et.al. 2408.17285 null
2024-08-30 LAR-IQA: A Lightweight, Accurate, and Robust No-Reference Image Quality Assessment Model Nasim Jamshidi Avanaki et.al. 2408.17057 link
2024-08-30 Efficient Camera Exposure Control for Visual Odometry via Deep Reinforcement Learning Shuyang Zhang et.al. 2408.17005 link
2024-08-29 Legacy Learning Using Few-Shot Font Generation Models for Automatic Text Design in Metaverse Content: Cases Studies in Korean and Chinese Younghwi Kim et.al. 2408.16900 null
2024-08-29 The Continuous Electron Beam Accelerator Facility at 12 GeV P. A. Adderley et.al. 2408.16880 null
2024-08-29 MSLIQA: Enhancing Learning Representations for Image Quality Assessment through Multi-Scale Learning Nasim Jamshidi Avanaki et.al. 2408.16879 null
2024-09-04 Auto-resolving atomic structure at van der Waal interfaces using a generative model Wenqiang Huang et.al. 2408.16802 link
2024-09-02 RLCP: A Reinforcement Learning-based Copyright Protection Method for Text-to-Image Diffusion Model Zhuan Shi et.al. 2408.16634 null
2024-09-02 A Deep-Learning-Based Label-free No-Reference Image Quality Assessment Metric: Application in Sodium MRI Denoising Shuaiyu Yuan et.al. 2408.16481 null
2024-08-29 LMT-GP: Combined Latent Mean-Teacher and Gaussian Process for Semi-supervised Low-light Image Enhancement Ye Yu et.al. 2408.16235 link
2024-08-28 TEDRA: Text-based Editing of Dynamic and Photoreal Actors Basavaraj Sunagad et.al. 2408.15995 null
2024-08-28 Segmentation-guided Layer-wise Image Vectorization with Gradient Fills Hengyu Zhou et.al. 2408.15741 link
2024-08-28 Merging and Splitting Diffusion Paths for Semantically Coherent Panoramas Fabio Quattrini et.al. 2408.15660 link
2024-08-28 Avoiding Generative Model Writer's Block With Embedding Nudging Ali Zand et.al. 2408.15450 null
2024-09-02 Pitfalls and Outlooks in Using COMET Vilém Zouhar et.al. 2408.15366 link
2024-08-27 Histo-Diffusion: A Diffusion Super-Resolution Method for Digital Pathology with Comprehensive Quality Assessment Xuan Xu et.al. 2408.15218 null
2024-08-27 CLIP-AGIQA: Boosting the Performance of AI-Generated Image Quality Assessment with CLIP Zhenchen Tang et.al. 2408.15098 null
2024-08-27 Towards Real-world Event-guided Low-light Video Enhancement and Deblurring Taewoo Kim et.al. 2408.14916 link
2024-08-27 Alfie: Democratising RGBA Image Generation With No $$$ Fabio Quattrini et.al. 2408.14826 link
2024-08-27 Sequential-Scanning Dual-Energy CT Imaging Using High Temporal Resolution Image Reconstruction and Error-Compensated Material Basis Image Generation Qiaoxin Li et.al. 2408.14754 null
2024-08-26 Gallery-Aware Uncertainty Estimation For Open-Set Face Recognition Leonid Erlygin et.al. 2408.14229 null
2024-08-27 SwiftBrush v2: Make Your One-step Diffusion Model Better Than Its Teacher Trung Dao et.al. 2408.14176 link
2024-08-27 Improving Water Quality Time-Series Prediction in Hong Kong using Sentinel-2 MSI Data and Google Earth Engine Cloud Computing Rohin Sood et.al. 2408.14010 null
2024-08-26 LMM-VQA: Advancing Video Quality Assessment with Large Multimodal Models Qihang Ge et.al. 2408.14008 null
2024-08-25 Draw Like an Artist: Complex Scene Generation with Diffusion Model via Composition, Painting, and Retouching Minghao Liu et.al. 2408.13858 null
2024-08-25 Guardians of the Machine Translation Meta-Evaluation: Sentinel Metrics Fall In! Stefano Perrella et.al. 2408.13831 link
2024-08-24 G3DST: Generalizing 3D Style Transfer with Neural Radiance Fields across Scenes and Styles Adil Meric et.al. 2408.13508 null
2024-08-23 ReCon: Reconfiguring Analog Rydberg Atom Quantum Computers for Quantum Generative Adversarial Networks Nicholas S. DiBrita et.al. 2408.13389 link
2024-08-23 Re-evaluation of Face Anti-spoofing Algorithm in Post COVID-19 Era Using Mask Based Occlusion Attack Vaibhav Sundharam et.al. 2408.13251 null
2024-08-23 ResSR: A Residual Approach to Super-Resolving Multispectral Images Haley Duba-Sullivan et.al. 2408.13225 link
2024-08-23 A density ratio framework for evaluating the utility of synthetic data Thom Benjamin Volker et.al. 2408.13167 null
2024-08-23 When Diffusion MRI Meets Diffusion Model: A Novel Deep Generative Model for Diffusion MRI Generation Xi Zhu et.al. 2408.12897 null
2024-08-22 Variable Stars in M31 Stellar Clusters from the Panchromatic Hubble Andromeda Treasury Richard Smith et.al. 2408.12765 null
2024-08-22 Visual Verity in AI-Generated Imagery: Computational Metrics and Human-Centric Analysis Memoona Aziz et.al. 2408.12762 null
2024-08-22 Unlocking Intrinsic Fairness in Stable Diffusion Eunji Kim et.al. 2408.12692 null
2024-08-22 Developing vocal system impaired patient-aimed voice quality assessment approach using ASR representation-included multiple features Shaoxiang Dang et.al. 2408.12279 null
2024-08-21 MBSS-T1: Model-Based Self-Supervised Motion Correction for Robust Cardiac T1 Mapping Eyal Hanania et.al. 2408.11992 null
2024-08-21 AIM 2024 Challenge on Compressed Video Quality Assessment: Methods and Results Maksim Smirnov et.al. 2408.11982 link
2024-08-21 Estimating Contribution Quality in Online Deliberations Using a Large Language Model Lodewijk Gelauff et.al. 2408.11936 null
2024-08-21 FRAP: Faithful and Realistic Text-to-Image Generation with Adaptive Prompt Weighting Liyao Jiang et.al. 2408.11706 null
2024-08-21 Interpretable Long-term Action Quality Assessment Xu Dong et.al. 2408.11687 link
2024-08-21 E-Bench: Subjective-Aligned Benchmark Suite for Text-Driven Video Editing Quality Assessment Shangkun Sun et.al. 2408.11481 link
2024-08-21 Fairness measures for biometric quality assessment André Dörsch et.al. 2408.11392 null
2024-08-21 Gender Bias Evaluation in Text-to-image Generation: A Survey Yankun Wu et.al. 2408.11358 null
2024-08-21 Image Score: Learning and Evaluating Human Preferences for Mercari Search Chingis Oinar et.al. 2408.11349 null
2024-08-21 High-quality imaging of large areas through path-difference ptychography Jizhe Cui et.al. 2408.11332 null
2024-08-21 Optimizing Transmit Field Inhomogeneity of Parallel RF Transmit Design in 7T MRI using Deep Learning Zhengyi Lu et.al. 2408.11323 null
2024-08-21 Transfer Learning and the Early Estimation of Single-Photon Source Quality using Machine Learning Methods David Jacob Kedziora et.al. 2408.11322 link
2024-08-20 Compress Guidance in Conditional Diffusion Sampling Anh-Dung Dinh et.al. 2408.11194 null
2024-08-20 Prompt-Guided Image-Adaptive Neural Implicit Lookup Tables for Interpretable Image Enhancement Satoshi Kosugi et.al. 2408.11055 link
2024-08-20 Denoising Plane Wave Ultrasound Images Using Diffusion Probabilistic Models Hojat Asgariandehkordi et.al. 2408.10987 null
2024-08-20 Influence of Medical Foreign Bodies on Dark-Field Chest Radiographs: First experiences Lennard Kaster et.al. 2408.10855 null
2024-08-19 Kubrick: Multimodal Agent Collaborations for Synthetic Video Generation Liu He et.al. 2408.10453 null
2024-08-19 Perceptual Depth Quality Assessment of Stereoscopic Omnidirectional Images Wei Zhou et.al. 2408.10134 null
2024-08-19 Sliced Maximal Information Coefficient: A Training-Free Approach for Image Quality Assessment Enhancement Kang Xiao et.al. 2408.09920 link
2024-08-19 Anim-Director: A Large Multimodal Model Powered Agent for Controllable Animation Video Generation Yunxin Li et.al. 2408.09787 link
2024-08-21 Reconstruct Spine CT from Biplanar X-Rays via Diffusion Learning Zhi Qiao et.al. 2408.09731 null
2024-08-18 FD2Talk: Towards Generalized Talking Head Generation with Facial Decoupled Diffusion Model Ziyu Yao et.al. 2408.09384 null
2024-08-17 Optimal Strip Attitude Command of Earth Observation Satellite using Differential Dynamic Programming Seungyeop Han et.al. 2408.09244 null
2024-08-16 Explore Cross-Codec Quality-Rate Convex Hulls Relation for Adaptive Streaming Masoumeh Farhadi Nia et.al. 2408.09044 null
2024-08-16 Evaluating the Evaluator: Measuring LLMs' Adherence to Task Evaluation Instructions Bhuvanashree Murugadoss et.al. 2408.08781 null
2024-08-16 Speckle Noise Analysis for Synthetic Aperture Radar (SAR) Space Data Sanjjushri Varshini R et.al. 2408.08774 null
2024-08-16 Tell Codec What Worth Compressing: Semantically Disentangled Image Coding for Machine with LMMs Jinming Liu et.al. 2408.08575 null
2024-08-16 Visual-Friendly Concept Protection via Selective Adversarial Perturbations Xiaoyue Mi et.al. 2408.08518 link
2024-08-16 Achieving Complex Image Edits via Function Aggregation with Diffusion Models Mohammadreza Samadi et.al. 2408.08495 null
2024-08-15 Level Up Your Tutorials: VLMs for Game Tutorials Quality Assessment Daniele Rege Cambrin et.al. 2408.08396 link
2024-08-15 METR: Image Watermarking with Large Number of Unique Messages Alexander Varlamov et.al. 2408.08340 link
2024-08-15 Accelerated Image-Aware Generative Diffusion Modeling Tanmay Asthana et.al. 2408.08306 null
2024-08-15 Rethinking Medical Anomaly Detection in Brain MRI: An Image Quality Assessment Perspective Zixuan Pan et.al. 2408.08228 link
2024-08-15 When Video Coding Meets Multimodal Large Language Models: A Unified Paradigm for Video Coding Pingping Zhang et.al. 2408.08093 null
2024-08-15 KGV: Integrating Large Language Models with Knowledge Graphs for Cyber Threat Intelligence Credibility Assessment Zongzong Wu et.al. 2408.08088 null
2024-08-15 Conditional Brownian Bridge Diffusion Model for VHR SAR to Optical Image Translation Seon-Hoon Kim et.al. 2408.07947 link
2024-08-15 MobileMEF: Fast and Efficient Method for Multi-Exposure Fusion Lucas Nedel Kirsten et.al. 2408.07932 link
2024-08-14 New Curriculum, New Chance -- Retrieval Augmented Generation for Lesson Planning in Ugandan Secondary Schools. Prototype Quality Evaluation Simon Kloker et.al. 2408.07542 null
2024-08-14 Unsupervised Blind Joint Dereverberation and Room Acoustics Estimation with Diffusion Models Jean-Marie Lemercier et.al. 2408.07472 null
2024-08-14 DPSNN: Spiking Neural Network for Low-Latency Streaming Speech Enhancement Tao Sun et.al. 2408.07388 null
2024-08-13 Direction of Arrival Correction through Speech Quality Feedback Caleb Rascon et.al. 2408.07234 link
2024-08-13 SeLoRA: Self-Expanding Low-Rank Adaptation of Latent Diffusion Model for Medical Image Synthesis Yuchen Mao et.al. 2408.07196 null
2024-08-13 BVI-UGC: A Video Quality Database for User-Generated Content Transcoding Zihao Qi et.al. 2408.07171 null
2024-08-13 Efficient Deep Model-Based Optoacoustic Image Reconstruction Christoph Dehner et.al. 2408.07109 null
2024-08-13 Subjective and Objective Quality Assessment of Rendered Human Avatar Videos in Virtual Reality Yu-Chih Chen et.al. 2408.07041 null
2024-08-13 Feature-Preserving Rate-Distortion Optimization in Image Coding for Machines Samuel Fernández Menduiña et.al. 2408.07028 null
2024-08-13 Low-Bitwidth Floating Point Quantization for Efficient High-Quality Diffusion Models Cheng Chen et.al. 2408.06995 null
2024-08-13 Evaluating Research Quality with Large Language Models: An Analysis of ChatGPT's Effectiveness with Different Settings and Inputs Mike Thelwall et.al. 2408.06752 null
2024-08-13 Hybrid SD: Edge-Cloud Collaborative Inference for Stable Diffusion Models Chenqian Yan et.al. 2408.06646 null
2024-08-13 Potamoi: Accelerating Neural Rendering via a Unified Streaming Architecture Yu Feng et.al. 2408.06608 null
2024-08-13 HDRGS: High Dynamic Range Gaussian Splatting Jiahao Wu et.al. 2408.06543 link
2024-08-12 FoVNet: Configurable Field-of-View Speech Enhancement with Low Computation and Distortion for Smart Glasses Zhongweiyang Xu et.al. 2408.06468 null
2024-08-12 Palantir: Towards Efficient Super Resolution for Ultra-high-definition Live Streaming Xinqi Jin et.al. 2408.06152 link
2024-08-12 A-BDD: Leveraging Data Augmentations for Safe Autonomous Driving in Adverse Weather and Lighting Felix Assion et.al. 2408.06071 null
2024-08-12 DiagESC: Dialogue Synthesis for Integrating Depression Diagnosis into Emotional Support Conversation Seungyeon Seo et.al. 2408.06044 link
2024-08-12 A Sharpness Based Loss Function for Removing Out-of-Focus Blur Uditangshu Aurangabadkar et.al. 2408.06014 link
2024-08-12 A Simple Early Exiting Framework for Accelerated Sampling in Diffusion Models Taehong Moon et.al. 2408.05927 link
2024-08-12 Creating Arabic LLM Prompts at Scale Abdelrahman El-Sheikh et.al. 2408.05882 null
2024-08-11 LaWa: Using Latent Space for In-Generation Image Watermarking Ahmad Rezaei et.al. 2408.05868 null
2024-08-14 Iterative Improvement of an Additively Regularized Topic Model Alex Gorbulev et.al. 2408.05840 null
2024-08-11 SSL: A Self-similarity Loss for Improving Generative Image Super-resolution Du Chen et.al. 2408.05713 link
2024-08-11 Efficient Diffusion Transformer with Step-wise Dynamic Attention Mediators Yifan Pu et.al. 2408.05710 link
2024-08-11 Evaluating BM3D and NBNet: A Comprehensive Study of Image Denoising Across Multiple Datasets Ghazal Kaviani et.al. 2408.05697 null
2024-08-09 CBCT scatter correction with dual-layer flat-panel detector Xin Zhang et.al. 2408.04943 null
2024-08-09 Self-augmented Gaussian Splatting with Structure-aware Masks for Sparse-view 3D Reconstruction Lingbei Meng et.al. 2408.04831 null
2024-08-08 DaedalusData: Exploration, Knowledge Externalization and Labeling of Particles in Medical Manufacturing -- A Design Study Alexander Wyss et.al. 2408.04749 null
2024-08-08 Sampling for View Synthesis: From Local Light Field Fusion to Neural Radiance Fields and Beyond Ravi Ramamoorthi et.al. 2408.04586 null
2024-08-11 Synchronous Multi-modal Semantic Communication System with Packet-level Coding Yun Tian et.al. 2408.04535 null
2024-08-08 Robustness investigation of quality measures for the assessment of machine learning models Thomas Most et.al. 2408.04391 null
2024-08-08 SG-JND: Semantic-Guided Just Noticeable Distortion Predictor For Image Compression Linhan Cao et.al. 2408.04273 null
2024-08-08 LLDif: Diffusion Models for Low-light Emotion Recognition Zhifeng Wang et.al. 2408.04235 null
2024-08-07 Performance and Non-adversarial Robustness of the Segment Anything Model 2 in Surgical Video Segmentation Yiqing Shen et.al. 2408.04098 null
2024-08-07 Machine Learning-Based Reward-Driven Tuning of Scanning Probe Microscopy: Towards Fully Automated Microscopy Yu Liu et.al. 2408.04055 null
2024-08-07 Global-Local Progressive Integration Network for Blind Image Quality Assessment Xiaoqi Wang et.al. 2408.03885 null
2024-08-07 Compact 3D Gaussian Splatting for Static and Dynamic Radiance Fields Joo Chan Lee et.al. 2408.03822 null
2024-08-07 Soft-Hard Attention U-Net Model and Benchmark Dataset for Multiscale Image Shadow Removal Eirini Cholopoulou et.al. 2408.03734 null
2024-08-07 Monitoring of Hermit Crabs Using drone-captured imagery and Deep Learning based Super-Resolution Reconstruction and Improved YOLOv8 Fan Zhao et.al. 2408.03559 null
2024-08-07 D2Styler: Advancing Arbitrary Style Transfer with Discrete Diffusion Methods Onkar Susladkar et.al. 2408.03558 link
2024-08-07 PRTGS: Precomputed Radiance Transfer of Gaussian Splats for Real-Time High-Quality Relighting Yijia Guo et.al. 2408.03538 null
2024-08-06 Image Quality Transfer of Diffusion MRI Guided By High-Resolution Structural MRI Alp G. Cicimen et.al. 2408.03216 null
2024-08-06 Iterative CT Reconstruction via Latent Variable Optimization of Shallow Diffusion Models Sho Ozaki et.al. 2408.03156 null
2024-08-05 VidGen-1M: A Large-Scale Dataset for Text-to-video Generation Zhiyu Tan et.al. 2408.02629 null
2024-08-05 Cascading Refinement Video Denoising with Uncertainty Adaptivity Xinyuan Yu et.al. 2408.02284 null
2024-08-04 PanoFree: Tuning-Free Holistic Multi-view Image Generation with Cross-view Self-Guidance Aoming Liu et.al. 2408.02157 null
2024-08-06 RICA2: Rubric-Informed, Calibrated Assessment of Actions Abrar Majeedi et.al. 2408.02138 link
2024-08-04 View-consistent Object Removal in Radiance Fields Yiren Lu et.al. 2408.02100 null
2024-08-04 Constructing Per-Shot Bitrate Ladders using Visual Information Fidelity Krishna Srikar Durbha et.al. 2408.01932 null
2024-08-03 Landmark-guided Diffusion Model for High-fidelity and Temporally Coherent Talking Head Generation Jintao Tan et.al. 2408.01732 null
2024-08-03 JambaTalk: Speech-Driven 3D Talking Head Generation Based on Hybrid Transformer-Mamba Model Farzaneh Jafari et.al. 2408.01627 null
2024-08-02 Guardians of Image Quality: Benchmarking Defenses Against Adversarial Attacks on Image Quality Metrics Alexander Gushchin et.al. 2408.01541 link
2024-08-02 Underwater Object Detection Enhancement via Channel Stabilization Muhammad Ali et.al. 2408.01293 link
2024-08-02 Wave-Mamba: Wavelet State Space Model for Ultra-High-Definition Low-Light Image Enhancement Wenbin Zou et.al. 2408.01276 link
2024-08-02 Reality Fusion: Robust Real-time Immersive Mobile Robot Teleoperation with Volumetric Visual Data Fusion Ke Li et.al. 2408.01225 link
2024-08-02 Validation of an Analysability Model in Hybrid Quantum Software Díaz-Muñoz Ana et.al. 2408.01105 null
2024-08-06 FBSDiff: Plug-and-Play Frequency Band Substitution of Diffusion Features for Highly Controllable Text-Driven Image Translation Xiang Gao et.al. 2408.00998 link
2024-08-01 SF3D: Stable Fast 3D Mesh Reconstruction with UV-unwrapping and Illumination Disentanglement Mark Boss et.al. 2408.00653 null
2024-08-01 Regional quality estimation for echocardiography using deep learning Gilles Van De Vyver et.al. 2408.00591 link
2024-08-01 Image Super-Resolution with Taylor Expansion Approximation and Large Field Reception Jiancong Feng et.al. 2408.00470 null
2024-08-01 RDP: Ranked Differential Privacy for Facial Feature Protection in Multiscale Sparsified Subspace Lu Ou et.al. 2408.00294 null
2024-07-31 Generative Diffusion Model for Seismic Imaging Improvement of Sparsely Acquired Data and Uncertainty Quantification Xingchen Shi et.al. 2407.21683 null
2024-07-31 Benchmarking AIGC Video Quality Assessment: A Dataset and Unified Model Zhichao Zhang et.al. 2407.21408 null
2024-07-31 An all-sky catalogue of stellar reddening values E. Paunzen et.al. 2407.21373 null
2024-07-31 ESIQA: Perceptual Quality Assessment of Vision-Pro-based Egocentric Spatial Images Xilei Zhu et.al. 2407.21363 null
2024-08-01 Outlier Detection in Large Radiological Datasets using UMAP Mohammad Tariqul Islam et.al. 2407.21263 link
2024-07-30 MP-You: A Web-based MPI Simulation Tool The-Vinh Tran-Luu et.al. 2407.21155 null
2024-07-30 Simultaneous Multi-Slice Diffusion Imaging using Navigator-free Multishot Spiral Acquisition Yuancheng Jiang et.al. 2407.20904 null
2024-07-30 Highly Efficient No-reference 4K Video Quality Assessment with Full-Pixel Covering Sampling and Training Strategy Xiaoheng Tan et.al. 2407.20766 null
2024-07-30 Questionnaires for Everyone: Streamlining Cross-Cultural Questionnaire Adaptation with GPT-Based Translation Quality Evaluation Otso Haavisto et.al. 2407.20608 link
2024-07-29 Mean Opinion Score as a New Metric for User-Evaluation of XAI Methods Hyeon Yu et.al. 2407.20427 null
2024-07-29 Sun Off, Lights On: Photorealistic Monocular Nighttime Simulation for Robust Semantic Perception Konstantinos Tzevelekakis et.al. 2407.20336 null
2024-07-29 DDAP: Dual-Domain Anti-Personalization against Text-to-Image Diffusion Models Jing Yang et.al. 2407.20141 null
2024-07-29 HeadsetOff: Enabling Photorealistic Video Conferencing on Economical VR Headsets Yili Jin et.al. 2407.19988 null
2024-07-29 Noise-Resilient Unsupervised Graph Representation Learning via Multi-Hop Feature Quality Estimation Shiyuan Li et.al. 2407.19944 null
2024-07-29 FreeLong: Training-Free Long Video Generation with SpectralBlend Temporal Attention Yu Lu et.al. 2407.19918 null
2024-07-29 ALEN: A Dual-Approach for Uniform and Non-Uniform Low-Light Image Enhancement Ezequiel Perez-Zarate et.al. 2407.19708 link
2024-07-29 UNQA: Unified No-Reference Quality Assessment for Audio, Image, Video, and Audio-Visual Content Yuqin Cao et.al. 2407.19704 null
2024-07-29 Semi-Supervised Teacher-Reference-Student Architecture for Action Quality Assessment Wulian Yun et.al. 2407.19675 null
2024-07-28 X-Fake: Juggling Utility Evaluation and Explanation of Simulated SAR Images Zhongling Huang et.al. 2407.19436 null
2024-07-27 Sewer Image Super-Resolution with Depth Priors and Its Lightweight Network Gang Pan et.al. 2407.19271 null
2024-07-27 Towards Clean-Label Backdoor Attacks in the Physical World Thinh Dao et.al. 2407.19203 null
2024-07-26 Regularized Multi-Decoder Ensemble for an Error-Aware Scene Representation Network Tianyu Xiong et.al. 2407.19082 null
2024-07-26 Correcting for objective sample refractive index mismatch in extended field of view selective plane illumination microscopy Steven J. Sheppard et.al. 2407.18862 null
2024-07-25 Joint RGB-Spectral Decomposition Model Guided Image Enhancement in Mobile Photography Kailai Zhou et.al. 2407.17996 link
2024-07-29 Invariance of deep image quality metrics to affine transformations Nuria Alabau-Bosque et.al. 2407.17927 link
2024-07-25 Artificial Immunofluorescence in a Flash: Rapid Synthetic Imaging from Brightfield Through Residual Diffusion Xiaodan Xing et.al. 2407.17882 null
2024-07-24 Final Alignment and Image Quality Test for the Acquisition and Guiding System of SOXS J. A. Araiza-Duran et.al. 2407.17382 null
2024-07-24 SOXS NIR: Optomechanical integration and alignment, optical performance verification before full instrument assembly M. Genoni et.al. 2407.17244 null
2024-07-24 Q-Ground: Image Quality Grounding with Large Multi-modality Models Chaofeng Chen et.al. 2407.17035 link
2024-07-24 3DAttGAN: A 3D Attention-based Generative Adversarial Network for Joint Space-Time Video Super-Resolution Congrui Fu et.al. 2407.16965 link
2024-07-24 SAR to Optical Image Translation with Color Supervised Diffusion Model Xinyu Bai et.al. 2407.16921 null
2024-07-23 QPT V2: Masked Image Modeling Advances Visual Scoring Qizhi Xie et.al. 2407.16541 link
2024-07-23 ToDER: Towards Colonoscopy Depth Estimation and Reconstruction with Geometry Constraint Adaptation Zhenhua Wu et.al. 2407.16508 null
2024-07-23 On Differentially Private 3D Medical Image Synthesis with Controllable Latent Diffusion Models Deniz Daum et.al. 2407.16405 link
2024-07-23 Improving multidimensional projection quality with user-specific metrics and optimal scaling Maniru Ibrahim et.al. 2407.16328 null
2024-07-23 A new visual quality metric for Evaluating the performance of multidimensional projections Maniru Ibrahim et.al. 2407.16309 null
2024-07-23 Integrating Meshes and 3D Gaussians for Indoor Scene Reconstruction with SAM Mask Guidance Jiyeop Kim et.al. 2407.16173 null
2024-07-23 Fréchet Video Motion Distance: A Metric for Evaluating Motion Consistency in Videos Jiahe Liu et.al. 2407.16124 link
2024-07-22 Enhancing Cell Instance Segmentation in Scanning Electron Microscopy Images via a Deep Contour Closing Operator Florian Robert et.al. 2407.15817 null
2024-07-22 SS-SFR: Synthetic Scenes Spatial Frequency Response on Virtual KITTI and Degraded Automotive Simulations for Object Detection Daniel Jakab et.al. 2407.15646 null
2024-07-22 Experimenting with Adaptive Bitrate Algorithms for Virtual Reality Streaming over Wi-Fi Ferran Maura et.al. 2407.15614 link
2024-07-22 SpotDiffusion: A Fast Approach For Seamless Panorama Generation Over Time Stanislav Frolov et.al. 2407.15507 link
2024-07-22 Enhancement of 3D Gaussian Splatting using Raw Mesh for Photorealistic Recreation of Architectures Ruizhe Wang et.al. 2407.15435 null
2024-07-21 Assessing Sample Quality via the Latent Space of Generative Models Jingyi Xu et.al. 2407.15171 link
2024-07-20 Non-Reference Quality Assessment for Medical Imaging: Application to Synthetic Brain MRIs Karl Van Eeden Risager et.al. 2407.14994 null
2024-07-20 Deep Learning CT Image Restoration using System Blur and Noise Models Yijie Yuan et.al. 2407.14983 null
2024-07-20 GreenStableYolo: Optimizing Inference Time and Image Quality of Text-to-Image Generation Jingzhi Gong et.al. 2407.14982 link
2024-07-20 Dual High-Order Total Variation Model for Underwater Image Restoration Yuemei Li et.al. 2407.14868 link
2024-07-20 CBCTLiTS: A Synthetic, Paired CBCT/CT Dataset For Segmentation And Style Transfer Maximilian E. Tschuchnig et.al. 2407.14853 null
2024-07-20 Realistic Surgical Image Dataset Generation Based On 3D Gaussian Splatting Tianle Zeng et.al. 2407.14846 null
2024-07-20 Difflare: Removing Image Lens Flare with Latent Diffusion Model Tianwen Zhou et.al. 2407.14746 link
2024-07-20 Polarimetric compressed sensing with hollow, self-assembled diffractive films Ji Feng et.al. 2407.14722 null
2024-07-19 A Minibatch Alternating Projections Algorithm for Robust and Efficient Magnitude Least-Squares RF Pulse Design in MRI Jonathan B. Martin et.al. 2407.14696 link
2024-07-19 A Benchmark for Gaussian Splatting Compression and Quality Assessment Study Qi Yang et.al. 2407.14197 link
2024-07-19 Shape and Style GAN-based Multispectral Data Augmentation for Crop/Weed Segmentation in Precision Farming Mulham Fawakherji et.al. 2407.14119 null
2024-07-19 DirectL: Efficient Radiance Fields Rendering for 3D Light Field Displays Zongyuan Yang et.al. 2407.14053 null
2024-07-19 Personalized Privacy Protection Mask Against Unauthorized Facial Recognition Ka-Ho Chow et.al. 2407.13975 link
2024-07-18 Streetscapes: Large-scale Consistent Street View Generation Using Autoregressive Video Diffusion Boyang Deng et.al. 2407.13759 null
2024-07-18 A Novel Freeform Slicer IFU for the Magellan InfraRed Multi-Object Spectrograph (MIRMOS) Maren Cosens et.al. 2407.13747 null
2024-07-18 HazeCLIP: Towards Language Guided Real-World Image Dehazing Ruiyi Wang et.al. 2407.13719 link
2024-07-18 Removing cloud shadows from ground-based solar imagery Amal Chaoui et.al. 2407.13379 null
2024-07-18 Any Image Restoration with Efficient Automatic Degradation Adaptation Bin Ren et.al. 2407.13372 link
2024-07-18 Heterogeneous Clinical Trial Outcomes via Multi-Output Gaussian Processes Owen Thomas et.al. 2407.13283 null
2024-07-18 Research on Image Super-Resolution Reconstruction Mechanism based on Convolutional Neural Network Hao Yan et.al. 2407.13211 null
2024-07-18 Learned HDR Image Compression for Perceptually Optimal Storage and Display Peibei Cao et.al. 2407.13179 null
2024-07-18 Image Inpainting Models are Effective Tools for Instruction-guided Image Editing Xuan Ju et.al. 2407.13139 null
2024-07-18 Enhanced Denoising of OCT Images Using Residual U-Net: A Cross-Modality Approach on PSOCT and ASOCT for Clinical Diagnostics Akkidas Noel Prakasha et.al. 2407.13090 null
2024-07-17 Hallucination Index: An Image Quality Metric for Generative Reconstruction Models Matthew Tivnan et.al. 2407.12780 null
2024-07-17 CoSIGN: Few-Step Guidance of ConSIstency Model to Solve General INverse Problems Jiankun Zhao et.al. 2407.12676 link
2024-07-17 High Frequency Matters: Uncertainty Guided Image Compression with Wavelet Diffusion Juan Song et.al. 2407.12538 link
2024-07-17 Fast Context-Based Low-Light Image Enhancement via Neural Implicit Representations Tomáš Chobola et.al. 2407.12511 link
2024-07-17 Enhancing Film Grain Coding in VVC: Improving Encoding Quality and Efficiency Vignesh V Menon et.al. 2407.12465 null
2024-07-17 Voltage-Controlled Magnetoelectric Devices for Neuromorphic Diffusion Process Yang Cheng et.al. 2407.12261 null
2024-07-16 Semantic Communication for the Internet of Sounds: Architecture, Design Principles, and Challenges Chengsi Liang et.al. 2407.12203 null
2024-07-16 Neural Passage Quality Estimation for Static Pruning Xuejun Chang et.al. 2407.12170 link
2024-07-16 MVG-Splatting: Multi-View Guided Gaussian Splatting with Adaptive Quantile-Based Geometric Consistency Densification Zhuoxiao Li et.al. 2407.11840 null
2024-07-16 LoFTI: Localization and Factuality Transfer to Indian Locales Sona Elza Simon et.al. 2407.11833 link
2024-07-16 XEdgeAI: A Human-centered Industrial Inspection Framework with Data-centric Explainable Edge AI Approach Truong Thanh Hung Nguyen et.al. 2407.11771 link
2024-07-16 ITI-IQA: a Toolbox for Heterogeneous Univariate and Multivariate Missing Data Imputation Quality Assessment Pedro Pons-Suñer et.al. 2407.11767 null
2024-07-16 Magnetogram-to-Magnetogram: Generative Forecasting of Solar Evolution Francesco Pio Ramunno et.al. 2407.11659 link
2024-07-16 ReLaX-VQA: Residual Fragment and Layer Stack Extraction for Enhancing Video Quality Assessment Xinyi Wang et.al. 2407.11496 link
2024-07-16 Cover-separable Fixed Neural Network Steganography via Deep Generative Models Guobiao Li et.al. 2407.11405 link
2024-07-16 Ev-GS: Event-based Gaussian splatting for Efficient and Accurate Radiance Field Rendering Jingqian Wu et.al. 2407.11343 null
2024-07-15 UFQA: Utility guided Fingerphoto Quality Assessment Amol S. Joshi et.al. 2407.11141 null
2024-07-15 Foundational Autoraters: Taming Large Language Models for Better Automatic Evaluation Tu Vu et.al. 2407.10817 null
2024-07-15 Melon Fruit Detection and Quality Assessment Using Generative AI-Based Image Data Augmentation Seungri Yoon et.al. 2407.10413 null
2024-07-15 Exploring the Impact of Moire Pattern on Deepfake Detectors Razaib Tariq et.al. 2407.10399 null
2024-07-14 Noise Calibration: Plug-and-play Content-Preserving Video Enhancement using Pre-trained Video Diffusion Models Qinyu Yang et.al. 2407.10285 link
2024-07-14 Low Sensitivity Hopsets Vikrant Ashvinkumar et.al. 2407.10249 null
2024-07-14 A Novel Approach to Ultrasound Beamforming using Synthetic Transmit Aperture with Low Complexity and High SNR for Medical Imaging Thenmozhi Elango et.al. 2407.10242 null
2024-07-13 Asynchronous Feedback Network for Perceptual Point Cloud Quality Assessment Yujie Zhang et.al. 2407.09806 link
2024-07-12 Quantum-dot-based Kitaev chains: Majorana quality measures and scaling with increasing chain length Viktor Svensson et.al. 2407.09211 null
2024-07-12 HPC: Hierarchical Progressive Coding Framework for Volumetric Video Zihan Zheng et.al. 2407.09026 null
2024-07-12 Task-driven single-image super-resolution reconstruction of document scans Maciej Zyrek et.al. 2407.08993 null
2024-07-12 LightenDiffusion: Unsupervised Low-Light Image Enhancement with Latent-Retinex Diffusion Models Hai Jiang et.al. 2407.08939 link
2024-07-12 15M Multimodal Facial Image-Text Dataset Dawei Dai et.al. 2407.08515 null
2024-07-11 Imitation Learning for Robotic Assisted Ultrasound Examination of Deep Venous Thrombosis using Kernelized Movement Primitives Diego Dall'Alba et.al. 2407.08506 null
2024-07-11 E2VIDiff: Perceptual Events-to-Video Reconstruction using Diffusion Priors Jinxiu Liang et.al. 2407.08231 null
2024-07-11 Explicit_NeRF_QA: A Quality Assessment Database for Explicit NeRF Model Compression Yuke Xing et.al. 2407.08165 null
2024-07-10 Coherent and Multi-modality Image Inpainting via Latent Space Optimization Lingzhi Pan et.al. 2407.08019 link
2024-07-10 Intensity-sensitive quality assessment of extended sources in astronomical images X. Li et.al. 2407.07863 link
2024-07-12 Towards Adaptive Pseudo-label Learning for Semi-Supervised Temporal Action Localization Feixiang Zhou et.al. 2407.07673 null
2024-07-10 Video In-context Learning Wentao Zhang et.al. 2407.07356 null
2024-07-10 Pairwise Distance Distillation for Unsupervised Real-World Image Super-Resolution Yuehan Zhang et.al. 2407.07302 link
2024-07-09 HAMIL-QA: Hierarchical Approach to Multiple Instance Learning for Atrial LGE MRI Quality Assessment K M Arefeen Sultan et.al. 2407.07254 null
2024-07-09 Scaling Up Personalized Aesthetic Assessment via Task Vector Customization Jooyeol Yun et.al. 2407.07176 link
2024-07-09 Microsoft Cloud-based Digitization Workflow with Rich Metadata Acquisition for Cultural Heritage Objects Krzysztof Kutt et.al. 2407.06972 null
2024-07-09 CoLA: Conditional Dropout and Language-driven Robust Dual-modal Salient Object Detection Shuang Hao et.al. 2407.06780 link
2024-07-09 Masked Video and Body-worn IMU Autoencoder for Egocentric Action Recognition Mingfang Zhang et.al. 2407.06628 null
2024-07-09 Sparse-DeRF: Deblurred Neural Radiance Fields from Sparse View Dogyoon Lee et.al. 2407.06613 null
2024-07-09 Low-dose, high-resolution CT of infant-sized lungs via propagation-based phase contrast James A. Pollock et.al. 2407.06527 null
2024-07-08 MiraData: A Large-Scale Video Dataset with Long Durations and Structured Captions Xuan Ju et.al. 2407.06358 null
2024-07-08 Video-STaR: Self-Training Enables Video Instruction Tuning with Any Supervision Orr Zohar et.al. 2407.06189 link
2024-07-08 PanDORA: Casual HDR Radiance Acquisition for Indoor Scenes Mohammad Reza Karimi Dastjerdi et.al. 2407.06150 null
2024-07-08 Accelerating Diffusion for SAR-to-Optical Image Translation via Adversarial Consistency Distillation Xinyu Bai et.al. 2407.06095 null
2024-07-08 Pan-denoising: Guided Hyperspectral Image Denoising via Weighted Represent Coefficient Total Variation Shuang Xu et.al. 2407.06064 link
2024-07-08 MobilePortrait: Real-Time One-Shot Neural Head Avatars on Mobile Devices Jianwen Jiang et.al. 2407.05712 null
2024-07-09 PCAC-GAN:ASparse-Tensor-Based Generative Adversarial Network for 3D Point Cloud Attribute Compression Xiaolong Mao et.al. 2407.05677 null
2024-07-08 GSBIQA: Green Saliency-guided Blind Image Quality Assessment Method Zhanxuan Mei et.al. 2407.05590 null
2024-07-08 Audio-driven High-resolution Seamless Talking Head Video Editing via StyleGAN Jiacheng Su et.al. 2407.05577 null
2024-07-06 Panopticon: a telescope for our times Will Saunders et.al. 2407.05103 null
2024-07-06 CLIPVQA:Video Quality Assessment via CLIP Fengchuang Xing et.al. 2407.04928 link
2024-07-06 OmChat: A Recipe to Train Multimodal Language Models with Strong Long Context and Video Understanding Tiancheng Zhao et.al. 2407.04923 null
2024-07-05 MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation? Zhaorun Chen et.al. 2407.04842 link
2024-07-05 Resource-Efficient Speech Quality Prediction through Quantization Aware Training and Binary Activation Maps Mattias Nilsson et.al. 2407.04578 link
2024-07-05 Rethinking Image Compression on the Web with Generative AI Shayan Ali Hassan et.al. 2407.04542 null
2024-07-05 Optimizing the image correction pipeline for pedestrian detection in the thermal-infrared domain Christophe Karam et.al. 2407.04484 null
2024-07-05 Unsupervised Video Summarization via Reinforcement Learning and a Trained Evaluator Mehryar Abbasi et.al. 2407.04258 null
2024-07-05 HCS-TNAS: Hybrid Constraint-driven Semi-supervised Transformer-NAS for Ultrasound Image Segmentation Renqi Chen et.al. 2407.04203 null
2024-07-04 Performance of Medical Image Fusion in High-level Analysis Tasks: A Mutual Enhancement Framework for Unaligned PAT and MRI Image Fusion Yutian Zhong et.al. 2407.03992 link
2024-07-04 DSMix: Distortion-Induced Sensitivity Map Based Pre-training for No-Reference Image Quality Assessment Jinsong Shi et.al. 2407.03886 link
2024-07-04 Perception-Guided Quality Metric of 3D Point Clouds Using Hybrid Strategy Yujie Zhang et.al. 2407.03885 link
2024-07-04 DiffRetouch: Using Diffusion to Retouch on the Shoulder of Experts Zheng-Peng Duan et.al. 2407.03757 null
2024-07-04 Adaptive sampling strategy for tolerance analysis of freeform optical surfaces based on critical ray aiming Rundong Fan et.al. 2407.03688 null
2024-07-04 Pathological Semantics-Preserving Learning for H&E-to-IHC Virtual Staining Fuqiang Chen et.al. 2407.03655 link
2024-07-04 Diff-Restorer: Unleashing Visual Prompts for Diffusion-based Universal Image Restoration Yuhong Zhang et.al. 2407.03636 null
2024-07-04 Orthogonal Constrained Minimization with Tensor $\ell_{2,p}$ Regularization for HSI Denoising and Destriping Xiaoxia Liu et.al. 2407.03605 null
2024-07-03 Semantic-Aware Power Allocation for Generative Semantic Communications with Foundation Models Chunmei Xu et.al. 2407.03050 null
2024-07-03 Single Image Rolling Shutter Removal with Diffusion Models Zhanglei Yang et.al. 2407.02906 null
2024-07-03 FedPot: A Quality-Aware Collaborative and Incentivized Honeypot-Based Detector for Smart Grid Networks Abdullatif Albaseer et.al. 2407.02845 null
2024-07-03 Data Overfitting for On-Device Super-Resolution with Dynamic Algorithm and Compiler Co-Design Gen Li et.al. 2407.02813 link
2024-07-03 SF-GNN: Self Filter for Message Lossless Propagation in Deep Graph Neural Network Yushan Zhu et.al. 2407.02762 null
2024-07-03 MentalAgora: A Gateway to Advanced Personalized Care in Mental Health through Multi-Agent Debating and Attribute Control Yeonji Lee et.al. 2407.02736 null
2024-07-02 Meta 3D Gen Raphael Bensadoun et.al. 2407.02599 null
2024-07-02 Off-Grid Ultrasound Imaging by Stochastic Optimization Vincent van de Schaft et.al. 2407.02285 link
2024-07-02 SwiftDiffusion: Efficient Diffusion Model Serving with Add-on Modules Suyi Li et.al. 2407.02031 null
2024-07-01 Free-text Rationale Generation under Readability Level Control Yi-Sheng Hsu et.al. 2407.01384 null
2024-07-01 GaussianStego: A Generalizable Stenography Pipeline for Generative 3D Gaussians Splatting Chenxin Li et.al. 2407.01301 null
2024-07-01 Optical turbulence vertical distribution at the Peak Terskol Observatory and Mt. Kurapdag A. Y. Shikhovtsev et.al. 2407.00960 null
2024-07-01 Diffusion Transformer Model With Compact Prior for Low-dose PET Reconstruction Bin Huang et.al. 2407.00944 link
2024-06-30 A Comparative Study of Quality Evaluation Methods for Text Summarization Huyen Nguyen et.al. 2407.00747 null
2024-06-30 DCI: An Accurate Quality Assessment Criteria for Protein Complex Structure Models Wenda Wang et.al. 2407.00560 null
2024-06-29 Dynamic Optimization of Video Streaming Quality Using Network Digital Twin Technology Zurh Farus et.al. 2407.00513 null
2024-07-02 RTGS: Enabling Real-Time Gaussian Splatting on Mobile Devices Using Efficiency-Guided Pruning and Foveated Rendering Weikai Lin et.al. 2407.00435 link
2024-06-29 Benchmark Evaluation of Image Fusion algorithms for Smartphone Camera Capture Lucas N. Kirsten et.al. 2407.00301 null
2024-06-28 PathGen-1.6M: 1.6 Million Pathology Image-text Pairs Generation through Multi-agent Collaboration Yuxuan Sun et.al. 2407.00203 null
2024-06-28 Quantitative Methods in Research Evaluation Citation Indicators, Altmetrics, and Artificial Intelligence Mike Thelwall et.al. 2407.00135 null
2024-06-28 MR-zero meets FLASH -- Controlling the transient signal decay in gradient- and rf-spoiled gradient echo sequences Simon Weinmüller et.al. 2406.19877 null
2024-06-28 Deep Fusion Model for Brain Tumor Classification Using Fine-Grained Gradient Preservation Niful Islam et.al. 2406.19690 null
2024-06-28 UltraGelBot: Autonomous Gel Dispenser for Robotic Ultrasound Deepak Raina et.al. 2406.19678 null
2024-06-28 PopAlign: Population-Level Alignment for Fair Text-to-Image Generation Shufan Li et.al. 2406.19668 link
2024-06-27 Robustness Testing of Black-Box Models Against CT Degradation Through Test-Time Augmentation Jack Highton et.al. 2406.19557 null
2024-06-27 Lightweight Predictive 3D Gaussian Splats Junli Cao et.al. 2406.19434 link
2024-06-27 Looking 3D: Anomaly Detection with 2D-3D Alignment Ankan Bhunia et.al. 2406.19393 link
2024-06-27 AI Data Readiness Inspector (AIDRIN) for Quantitative Assessment of Data Readiness for AI Kaveen Hiniduma et.al. 2406.19256 null
2024-06-27 Staggered Quantizers for Perfect Perceptual Quality: A Connection between Quantizers with Common Randomness and Without Ruida Zhou et.al. 2406.19248 null
2024-06-27 Local Manifold Learning for No-Reference Image Quality Assessment Timin Gao et.al. 2406.19247 null
2024-06-27 Complex-valued scatter compensation in nonlinear microscopy Maximilian Sohmen et.al. 2406.19031 null
2024-06-27 Using diffusion model as constraint: Empower Image Restoration Network Training with Diffusion Model Jiangtong Tan et.al. 2406.19030 link
2024-06-26 IDA-UIE: An Iterative Framework for Deep Network-based Degradation Aware Underwater Image Enhancement Pranjali Singh et.al. 2406.18628 null
2024-06-26 On Scaling Up 3D Gaussian Splatting Training Hexu Zhao et.al. 2406.18533 link
2024-06-26 MultiDiff: Consistent Novel View Synthesis from a Single Image Norman Müller et.al. 2406.18524 null
2024-06-26 ChronoMagic-Bench: A Benchmark for Metamorphic Evaluation of Text-to-Time-lapse Video Generation Shenghai Yuan et.al. 2406.18522 link
2024-06-26 MFDNet: Multi-Frequency Deflare Network for Efficient Nighttime Flare Removal Yiguo Jiang et.al. 2406.18079 link
2024-06-26 Leveraging Pre-trained Models for FF-to-FFPE Histopathological Image Translation Qilai Zhang et.al. 2406.18054 link
2024-06-25 Burst Image Super-Resolution with Base Frame Selection Sanghyun Kim et.al. 2406.17869 null
2024-06-25 Sparse-view Signal-domain Photoacoustic Tomography Reconstruction Method Based on Neural Representation Bowei Yao et.al. 2406.17578 null
2024-06-25 UHD-IQA Benchmark Database: Pushing the Boundaries of Blind Photo Quality Assessment Vlad Hosu et.al. 2406.17472 null
2024-06-25 Leveraging LLMs for Dialogue Quality Measurement Jinghan Jia et.al. 2406.17304 null
2024-06-25 HD snapshot diffractive spectral imaging and inferencing Apratim Majumder et.al. 2406.17302 null
2024-06-25 Image-Guided Outdoor LiDAR Perception Quality Assessment for Autonomous Driving Ce Zhang et.al. 2406.17265 null
2024-06-25 Disentangled Motion Modeling for Video Frame Interpolation Jaihyun Lew et.al. 2406.17256 link
2024-06-24 Evaluating the Quality of Hallucination Benchmarks for Large Vision-Language Models Bei Yan et.al. 2406.17115 link
2024-06-24 Fine-tuning Diffusion Models for Enhancing Face Quality in Text-to-image Generation Zhenyi Liao et.al. 2406.17100 link
2024-06-24 Reducing the Memory Footprint of 3D Gaussian Splatting Panagiotis Papantonakis et.al. 2406.17074 null
2024-06-24 3D distortion-free, reduced field of view diffusion-prepared GRE at 3T Sarah McElroy et.al. 2406.16809 null
2024-06-24 Beyond Thumbs Up/Down: Untangling Challenges of Fine-Grained Feedback for Text-to-Image Generation Katherine M. Collins et.al. 2406.16807 null
2024-06-24 Vision-Language Consistency Guided Multi-modal Prompt Learning for Blind AI Generated Image Quality Assessment Jun Fu et.al. 2406.16641 link
2024-06-24 DaLPSR: Leverage Degradation-Aligned Language Prompt for Real-World Image Super-Resolution Aiwen Jiang et.al. 2406.16477 link
2024-06-24 Approximate DCT and Quantization Techniques for Energy-Constrained Image Sensors Ming-Che Li et.al. 2406.16358 null
2024-06-24 Priorformer: A UGC-VQA Method with content and distortion priors Yajing Pei et.al. 2406.16297 null
2024-06-23 Listen and Move: Improving GANs Coherency in Agnostic Sound-to-Video Generation Rafael Redondo et.al. 2406.16155 null
2024-06-23 LGS: A Light-weight 4D Gaussian Splatting for Efficient Surgical Scene Reconstruction Hengyu Liu et.al. 2406.16073 link
2024-06-22 Quality-guided Skin Tone Enhancement for Portrait Photography Shiqi Gao et.al. 2406.15848 null
2024-06-21 Adaptive Self-Supervised Consistency-Guided Diffusion Model for Accelerated MRI Reconstruction Mojtaba Safari et.al. 2406.15656 null
2024-06-21 Contrastive Entity Coreference and Disambiguation for Historical Texts Abhishek Arora et.al. 2406.15576 null
2024-06-21 Masked Extended Attention for Zero-Shot Virtual Try-On In The Wild Nadav Orzech et.al. 2406.15331 null
2024-06-21 Towards Robust Training Datasets for Machine Learning with Ontologies: A Case Study for Emergency Road Vehicle Detection Lynn Vonderhaar et.al. 2406.15268 null
2024-06-24 VideoScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation Xuan He et.al. 2406.15252 null
2024-06-21 Unsupervised Bayesian Generation of Synthetic CT from CBCT Using Patient-Specific Score-Based Prior Junbo Peng et.al. 2406.15219 null
2024-06-21 Benchmarking Retinal Blood Vessel Segmentation Models for Cross-Dataset and Cross-Disease Generalization Jeremiah Fadugba et.al. 2406.14994 link
2024-06-21 Fair Text to Medical Image Diffusion Model with Subgroup Distribution Aligned Tuning Xu Han et.al. 2406.14847 null
2024-06-21 Is this a bad table? A Closer Look at the Evaluation of Table Generation from Text Pritika Ramu et.al. 2406.14829 null
2024-06-20 Holistic Evaluation for Interleaved Text-and-Image Generation Minqian Liu et.al. 2406.14643 null
2024-06-20 A Fuzzy Logic-Based Quality Model For Identifying Microservices With Low Maintainability Rahime Yilmaz et.al. 2406.14489 null
2024-06-20 Enhancing multivariate post-processed visibility predictions utilizing CAMS forecasts Mária Lakatos et.al. 2406.14159 null
2024-06-20 EnTruth: Enhancing the Traceability of Unauthorized Dataset Usage in Text-to-image Diffusion Models with Minimal and Robust Alterations Jie Ren et.al. 2406.13933 null
2024-06-19 IG-CFAT: An Improved GAN-Based Framework for Effectively Exploiting Transformers in Real-World Image Super-Resolution Alireza Aghelan et.al. 2406.13815 link
2024-06-19 Convex-hull Estimation using XPSNR for Versatile Video Coding Vignesh V Menon et.al. 2406.13712 null
2024-06-19 Assessing the 3D resolution of refocused correlation plenoptic images using a general-purpose image quality estimator Gianlorenzo Massaro et.al. 2406.13501 null
2024-06-19 ALiiCE: Evaluating Positional Fine-grained Citation Generation Yilong Xu et.al. 2406.13375 link
2024-06-19 AniFaceDiff: High-Fidelity Face Reenactment via Facial Parametric Conditioned Diffusion Models Ken Chen et.al. 2406.13272 null
2024-06-19 New methods for ALMA angular-scale based observation scheduling, quality assessment, and beam shaping II: refinements Dirk Petry et.al. 2406.13199 null
2024-06-18 NTIRE 2024 Challenge on Night Photography Rendering Egor Ershov et.al. 2406.13007 null
2024-06-18 Pattern or Artifact? Interactively Exploring Embedding Quality with TRACE Edith Heiter et.al. 2406.12953 link
2024-06-18 Automatic generation of insights from workers' actions in industrial workflows with explainable Machine Learning Francisco de Arriba-Pérez et.al. 2406.12732 null
2024-06-18 Learned Image Compression for HE-stained Histopathological Images via Stain Deconvolution Maximilian Fischer et.al. 2406.12623 null
2024-06-18 Training Diffusion Models with Federated Learning Matthijs de Goede et.al. 2406.12575 null
2024-06-18 Automated MRI Quality Assessment of Brain T1-weighted MRI in Clinical Data Warehouses: A Transfer Learning Approach Relying on Artefact Simulation Sophie Loizillon et.al. 2406.12448 link
2024-06-18 AI-Assisted Human Evaluation of Machine Translation Vilém Zouhar et.al. 2406.12419 link
2024-06-18 SDNIA-YOLO: A Robust Object Detection Model for Extreme Weather Conditions Yuexiong Ding et.al. 2406.12395 null
2024-06-17 A Hierarchical 3D Gaussian Representation for Real-Time Rendering of Very Large Datasets Bernhard Kerbl et.al. 2406.12080 null
2024-06-17 FinTruthQA: A Benchmark Dataset for Evaluating the Quality of Financial Information Disclosure Ziyue Xu et.al. 2406.12009 link
2024-06-17 RetinaGS: Scalable Training for Dense Scene Rendering with Billion-Scale 3D Gaussians Bingling Li et.al. 2406.11836 null
2024-06-17 Latent Denoising Diffusion GAN: Faster sampling, Higher image quality Luan Thanh Trinh et.al. 2406.11713 link
2024-06-17 Multimodal Learning To Improve Segmentation With Intraoperative CBCT & Preoperative CT Maximilian E. Tschuchnig et.al. 2406.11650 null
2024-06-17 Unveiling the Power of Source: Source-based Minimum Bayes Risk Decoding for Neural Machine Translation Boxuan Lyu et.al. 2406.11632 null
2024-06-17 Compressed Skinning for Facial Blendshapes Ladislav Kavan et.al. 2406.11597 null
2024-06-17 Energy Reduction Opportunities in HDR Video Encoding Christian Herglotz et.al. 2406.11492 null
2024-06-17 A Dictionary Based Approach for Removing Out-of-Focus Blur Uditangshu Aurangabadkar et.al. 2406.11330 link
2024-06-17 NLDF: Neural Light Dynamic Fields for Efficient 3D Talking Head Generation Niu Guanchen et.al. 2406.11259 null
2024-06-17 Incentivizing Quality Text Generation via Statistical Contracts Eden Saig et.al. 2406.11118 link
2024-06-16 Parameter Blending for Multi-Camera Harmonization for Automotive Surround View Systems Yuzhuo Ren et.al. 2406.11066 null
2024-06-16 SingMOS: An extensive Open-Source Singing Voice Dataset for MOS Prediction Yuxun Tang et.al. 2406.10911 null
2024-06-15 MDeRainNet: An Efficient Neural Network for Rain Streak Removal from Macro-pixel Images Tao Yan et.al. 2406.10652 null
2024-06-15 Exploring the Impact of AI-generated Image Tools on Professional and Non-professional Users in the Art and Design Fields Yuying Tang et.al. 2406.10640 null
2024-06-15 Full reference point cloud quality assessment using support vector regression Ryosuke Watanabe et.al. 2406.10520 link
2024-06-15 CoMM: A Coherent Interleaved Image-Text Dataset for Multimodal Understanding and Generation Wei Chen et.al. 2406.10462 null
2024-06-14 Consistency-diversity-realism Pareto fronts of conditional image generative models Pietro Astolfi et.al. 2406.10429 null
2024-06-14 PUP 3D-GS: Principled Uncertainty Pruning for 3D Gaussian Splatting Alex Hanson et.al. 2406.10219 link
2024-06-14 AlignNet: Learning dataset score alignment functions to enable better training of speech quality estimators Jaden Pieper et.al. 2406.10205 null
2024-06-14 D-NPC: Dynamic Neural Point Clouds for Non-Rigid View Synthesis from Monocular Video Moritz Kappel et.al. 2406.10078 null
2024-06-14 Vision Language Modeling of Content, Distortion and Appearance for Image Quality Assessment Fei Zhou et.al. 2406.09858 null
2024-06-14 Full-reference Point Cloud Quality Assessment Using Spectral Graph Wavelets Ryosuke Watanabe et.al. 2406.09762 null
2024-06-14 Compressed Video Quality Enhancement with Temporal Group Alignment and Fusion Qiang Zhu et.al. 2406.09693 null
2024-06-13 DSL-FIQA: Assessing Facial Image Quality via Dual-Set Degradation Learning and Landmark-Guided Transformer Wei-Ting Chen et.al. 2406.09622 null
2024-06-13 Q-Mamba: On First Exploration of Vision Mamba for Image Quality Assessment Fengbin Guan et.al. 2406.09546 null
2024-06-13 Modeling Ambient Scene Dynamics for Free-view Synthesis Meng-Li Shih et.al. 2406.09395 null
2024-06-14 WonderWorld: Interactive 3D Scene Generation from a Single Image Hong-Xing Yu et.al. 2406.09394 null
2024-06-13 LRM-Zero: Training Large Reconstruction Models with Synthesized Data Desai Xie et.al. 2406.09371 link
2024-06-13 CMC-Bench: Towards a New Paradigm of Visual Signal Compression Chunyi Li et.al. 2406.09356 link
2024-06-13 StableMaterials: Enhancing Diversity in Material Generation via Semi-Supervised Learning Giuseppe Vecchio et.al. 2406.09293 null
2024-06-13 SR-CACO-2: A Dataset for Confocal Fluorescence Microscopy Image Super-Resolution Soufiane Belharbi et.al. 2406.09168 link
2024-06-13 Adaptive Cooperative Streaming of Holographic Video Over Wireless Networks: A Proximal Policy Optimization Solution Wanli Wen et.al. 2406.08806 null
2024-06-13 Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation Mingwang Xu et.al. 2406.08801 null
2024-06-13 FouRA: Fourier Low Rank Adaptation Shubhankar Borse et.al. 2406.08798 null
2024-06-12 Exploring Fact Memorization and Style Imitation in LLMs Using QLoRA: An Experimental Study and Quality Assessment Methods Eugene Vyborov et.al. 2406.08582 null
2024-06-12 IMFL-AIGC: Incentive Mechanism Design for Federated Learning Empowered by Artificial Intelligence Generated Content Guangjing Huang et.al. 2406.08526 null
2024-06-12 DDR: Exploiting Deep Degradation Response as Flexible Image Descriptor Juncheng Wu et.al. 2406.08377 link
2024-06-12 WMAdapter: Adding WaterMark Control to Latent Diffusion Models Hai Ci et.al. 2406.08337 null
2024-06-12 Guiding In-Context Learning of LLMs through Quality Estimation for Machine Translation Javad Pourmostafa Roshan Sharami et.al. 2406.07970 link
2024-06-12 DemosaicFormer: Coarse-to-Fine Demosaicing Network for HybridEVS Camera Senyan Xu et.al. 2406.07951 link
2024-06-12 Emotional Conversation: Empowering Talking Faces with Cohesive Expression, Gaze and Pose Generation Jiadong Liang et.al. 2406.07895 null
2024-06-11 A PRISMA Driven Systematic Review of Publicly Available Datasets for Benchmark and Model Developments for Industrial Defect Detection Can Akbas et.al. 2406.07694 null
2024-06-11 Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense? Xingyu Fu et.al. 2406.07546 null
2024-06-11 Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance Kuan Heng Lin et.al. 2406.07540 null
2024-06-11 Trim 3D Gaussian Splatting for Accurate Geometry Representation Lue Fan et.al. 2406.07499 null
2024-06-11 Textual Similarity as a Key Metric in Machine Translation Quality Estimation Kun Sun et.al. 2406.07440 null
2024-06-11 Redefining Automotive Radar Imaging: A Domain-Informed 1D Deep Learning Approach for High-Resolution and Efficient Performance Ruxin Zheng et.al. 2406.07399 null
2024-06-11 DiffCom: Channel Received Signal is a Natural Condition to Guide Diffusion Posterior Sampling Sixian Wang et.al. 2406.07390 null
2024-06-11 Noise-Robust Voice Conversion by Conditional Denoising Training Using Latent Variables of Recording Quality and Environment Takuto Igarashi et.al. 2406.07280 null
2024-06-11 Accurate estimate of the ESPRESSO fiber-injection losses inferred from integrated field-stabilization images Tobias M. Schmidt et.al. 2406.07193 null
2024-06-11 Motion Consistency Model: Accelerating Video Diffusion with Disentangled Motion-Appearance Distillation Yuanhao Zhai et.al. 2406.06890 link
2024-06-11 A Subjective Quality Evaluation of 3D Mesh with Dynamic Level of Detail in Virtual Reality Duc Nguyen et.al. 2406.06888 null
2024-06-09 Latent Diffusion Model-Enabled Real-Time Semantic Communication Considering Semantic Ambiguities and Channel Noises Jianhua Pei et.al. 2406.06644 link
2024-06-10 Autoregressive Model Beats Diffusion: Llama for Scalable Image Generation Peize Sun et.al. 2406.06525 link
2024-06-10 MVGamba: Unify 3D Content Generation as State Space Sequence Modeling Xuanyu Yi et.al. 2406.06367 link
2024-06-10 Federated learning in food research Zuzanna Fendor et.al. 2406.06202 null
2024-06-10 Generalized Nested Latent Variable Models for Lossy Coding applied to Wind Turbine Scenarios Raül Pérez-Gonzalo et.al. 2406.06165 null
2024-06-10 JenGAN: Stacked Shifted Filters in GAN-Based Speech Synthesis Hyunjae Cho et.al. 2406.06111 null
2024-06-10 GAIA: Rethinking Action Quality Assessment for AI-Generated Videos Zijian Chen et.al. 2406.06087 link
2024-06-10 FRAG: Frequency Adapting Group for Diffusion Video Editing Sunjae Yoon et.al. 2406.06044 link
2024-06-12 MLCM: Multistep Consistency Distillation of Latent Diffusion Model Qingsong Xie et.al. 2406.05768 link
2024-06-08 Energy-Efficient Approximate Full Adders Applying Memristive Serial IMPLY Logic For Image Processing Seyed Erfan Fatemieh et.al. 2406.05525 null
2024-06-08 Blurry-Consistency Segmentation Framework with Selective Stacking on Differential Interference Contrast 3D Breast Cancer Spheroid Thanh-Huy Nguyen et.al. 2406.05349 null
2024-06-08 Deep convolutional demosaicking network for multispectral polarization filter array Tomoharu Ishiuchi et.al. 2406.05312 null
2024-06-08 YouTube SFV+HDR Quality Dataset Yilin Wang et.al. 2406.05305 null
2024-06-07 Spectral Codecs: Spectrogram-Based Audio Codecs for High Quality Speech Synthesis Ryan Langman et.al. 2406.05298 null
2024-06-07 GANetic Loss for Generative Adversarial Networks with a Focus on Medical Applications Shakhnaz Akhmedova et.al. 2406.05023 link
2024-06-07 Ada-VE: Training-Free Consistent Video Editing Using Adaptive Motion Prior Tanvir Mahmud et.al. 2406.04873 link
2024-06-07 SMC++: Masked Learning of Unsupervised Video Semantic Compression Yuan Tian et.al. 2406.04765 link
2024-06-07 The Active Optics System on the Vera C. Rubin Observatory: Optimal Control of Degeneracy Among the Large Number of Degrees of Freedom Guillem Megias Homar et.al. 2406.04656 null
2024-06-07 GenzIQA: Generalized Image Quality Assessment using Prompt-Guided Latent Diffusion Models Diptanu De et.al. 2406.04654 null
2024-06-07 StreamOptix: A Cross-layer Adaptive Video Delivery Scheme Mufan Liu et.al. 2406.04632 link
2024-06-07 Attention Fusion Reverse Distillation for Multi-Lighting Image Anomaly Detection Yiheng Zhang et.al. 2406.04573 null
2024-06-06 Improving Geo-diversity of Generated Images with Contextualized Vendi Score Guidance Reyhane Askari Hemmat et.al. 2406.04551 null
2024-06-06 A Versatile Collage Visualization Technique Zhenyu Wang et.al. 2406.04008 null
2024-06-06 JIGMARK: A Black-Box Approach for Enhancing Image Watermarks against Diffusion Model Edits Minzhou Pan et.al. 2406.03720 link
2024-06-06 Superpoint Gaussian Splatting for Real-Time High-Fidelity Dynamic Scene Reconstruction Diwen Wan et.al. 2406.03697 link
2024-06-05 Anatomy-based quality metric of diffusion-weighted MRI data for accurate derivation of muscle fiber orientation Nadya Shusharina et.al. 2406.03560 null
2024-06-05 Globally and Locally Optimized Pannini Projection for High FoV Rendering of 360-degree Images Falah Jabar et.al. 2406.03282 null
2024-06-05 FAPNet: An Effective Frequency Adaptive Point-based Eye Tracker Xiaopeng Lin et.al. 2406.03177 null
2024-06-05 Dynamic 3D Gaussian Fields for Urban Areas Tobias Fischer et.al. 2406.03175 null
2024-06-05 The new Herschel/PACS Point Source Catalogue Gábor Marton et.al. 2406.03116 null
2024-06-05 A-Bench: Are LMMs Masters at Evaluating AI-generated Images? Zicheng Zhang et.al. 2406.03070 link
2024-06-05 DifAttack++: Query-Efficient Black-Box Adversarial Attack via Hierarchical Disentangled Feature Space in Cross Domain Jun Liu et.al. 2406.03017 link
2024-06-05 Efficient Minimum Bayes Risk Decoding using Low-Rank Matrix Completion Algorithms Firas Trabelsi et.al. 2406.02832 null
2024-06-04 ViDiT-Q: Efficient and Accurate Quantization of Diffusion Transformers for Image and Video Generation Tianchen Zhao et.al. 2406.02540 link
2024-06-04 Guiding a Diffusion Model with a Bad Version of Itself Tero Karras et.al. 2406.02507 link
2024-06-04 Towards AI-Assisted Sustainable Adaptive Video Streaming Systems: Tutorial and Survey Reza Farahani et.al. 2406.02302 null
2024-06-04 I4VGen: Image as Stepping Stone for Text-to-Video Generation Xiefan Guo et.al. 2406.02230 null
2024-06-04 OTTAWA: Optimal TransporT Adaptive Word Aligner for Hallucination and Omission Translation Errors Detection Chenyang Huang et.al. 2406.01919 link
2024-06-04 Rank-based No-reference Quality Assessment for Face Swapping Xinghui Zhou et.al. 2406.01884 null
2024-06-03 Video Coding with Cross-Component Sample Offset Han Gao et.al. 2406.01795 null
2024-06-03 DEFT: Efficient Finetuning of Conditional Diffusion Models by Learning the Generalised $h$ -transform Alexander Denker et.al. 2406.01781 link
2024-06-03 Enhancing Dynamic CT Image Reconstruction with Neural Fields Through Explicit Motion Regularizers Pablo Arratia et.al. 2406.01299 null
2024-06-03 Capsule Enhanced Variational AutoEncoder for Underwater Image Reconstruction Rita Pucci et.al. 2406.01294 link
2024-06-03 Dimba: Transformer-Mamba Diffusion Models Zhengcong Fei et.al. 2406.01159 null
2024-06-03 Visual Car Brand Classification by Implementing a Synthetic Image Dataset Creation Pipeline Jan Lippemeier et.al. 2406.01071 null
2024-06-03 UniQA: Unified Vision-Language Pre-training for Image Quality and Aesthetic Assessment Hantao Zhou et.al. 2406.01069 link
2024-06-03 CLIP-Guided Attribute Aware Pretraining for Generalizable Image Quality Assessment Daekyu Kwon et.al. 2406.01020 null
2024-06-02 EchoNet-Synthetic: Privacy-preserving Video Generation for Safe Medical Data Sharing Hadrien Reynaud et.al. 2406.00808 link
2024-06-04 Unsupervised Contrastive Analysis for Salient Pattern Detection using Conditional Diffusion Models Cristiano Patrício et.al. 2406.00772 link
2024-06-02 W-Net: A Facial Feature-Guided Face Super-Resolution Network Hao Liu et.al. 2406.00676 null
2024-06-01 Bilateral Guided Radiance Field Processing Yuehao Wang et.al. 2406.00448 null
2024-06-01 Localize, Understand, Collaborate: Semantic-Aware Dragging via Intention Reasoner Xing Cui et.al. 2406.00432 link
2024-06-01 Hybrid attention structure preserving network for reconstruction of under-sampled OCT images Zezhao Guo et.al. 2406.00279 null
2024-05-31 Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis Chaoyou Fu et.al. 2405.21075 null
2024-05-31 Kaleido Diffusion: Improving Conditional Diffusion Models with Autoregressive Latent Modeling Jiatao Gu et.al. 2405.21048 null
2024-05-31 Tsang's resolution enhancement method for imaging with focused illumination Alexander Duplinskiy et.al. 2405.20979 null
2024-05-31 Fourier123: One Image to High-Quality 3D Object Generation with Hybrid Fourier Score Distillation Shuzhou Yang et.al. 2405.20669 link
2024-05-30 An Automatic Question Usability Evaluation Toolkit Steven Moore et.al. 2405.20529 link
2024-05-30 Can No-Reference Quality-Assessment Methods Serve as Perceptual Losses for Super-Resolution? Egor Kashkarov et.al. 2405.20392 null
2024-05-30 CoSy: Evaluating Textual Explanations of Neurons Laura Kopf et.al. 2405.20331 link
2024-05-31 NeRF View Synthesis: Subjective Quality Assessment and Objective Metrics Evaluation Pedro Martin et.al. 2405.20078 null
2024-05-30 Promptus: Can Prompts Streaming Replace Video Streaming with Stable Diffusion Jiangkai Wu et.al. 2405.20032 link
2024-06-03 DP-IQA: Utilizing Diffusion Prior for Blind Image Quality Assessment in the Wild Honghao Fu et.al. 2405.19996 link
2024-05-29 CLIPLoss and Norm-Based Data Selection Methods for Multimodal Contrastive Learning Yiping Wang et.al. 2405.19547 link
2024-05-29 A Full-duplex Speech Dialogue Scheme Based On Large Language Models Peng Wang et.al. 2405.19487 null
2024-05-29 VisTA-SR: Improving the Accuracy and Resolution of Low-Cost Thermal Imaging Cameras for Agriculture Heesup Yun et.al. 2405.19413 null
2024-05-29 Adaptive Image Quality Assessment via Teaching Large Multimodal Model to Compare Hanwei Zhu et.al. 2405.19298 link
2024-05-29 A study on the adequacy of common IQA measures for medical images Anna Breger et.al. 2405.19224 link
2024-05-29 A study of why we need to reassess full reference image quality assessment with medical images Anna Breger et.al. 2405.19097 null
2024-05-31 Benchmarking and Improving Detail Image Caption Hongyuan Dong et.al. 2405.19092 link
2024-05-29 Tuning-Free Alignment of Diffusion Models with Direct Noise Optimization Zhiwei Tang et.al. 2405.18881 link
2024-05-29 Descriptive Image Quality Assessment in the Wild Zhiyuan You et.al. 2405.18842 null
2024-05-29 Opinion-Unaware Blind Image Quality Assessment using Multi-Scale Deep Feature Statistics Zhangkai Ni et.al. 2405.18790 link
2024-05-28 Are Image Distributions Indistinguishable to Humans Indistinguishable to Classifiers? Zebin You et.al. 2405.18029 null
2024-05-30 Proof of Quality: A Costless Paradigm for Trustless Generative AI Model Inference on Blockchains Zhenjie Zhang et.al. 2405.17934 null
2024-05-30 MixDQ: Memory-Efficient Few-Step Text-to-Image Diffusion Models with Metric-Decoupled Mixed Precision Quantization Tianchen Zhao et.al. 2405.17873 null
2024-05-28 PTM-VQA: Efficient Video Quality Assessment Leveraging Diverse PreTrained Models from the Wild Kun Yuan et.al. 2405.17765 null
2024-05-28 AdapNet: Adaptive Noise-Based Network for Low-Quality Image Retrieval Sihe Zhang et.al. 2405.17718 null
2024-05-27 Enhancing Global Sensitivity and Uncertainty Quantification in Medical Image Reconstruction with Monte Carlo Arbitrary-Masked Mamba Jiahao Huang et.al. 2405.17659 null
2024-05-27 Partitioned Hankel-based Diffusion Models for Few-shot Low-dose CT Reconstruction Wenhao Zhang et.al. 2405.17167 null
2024-05-28 F-3DGS: Factorized Coordinates and Representations for 3D Gaussian Splatting Xiangyu Sun et.al. 2405.17083 null
2024-05-29 The Multi-Range Theory of Translation Quality Measurement: MQM scoring models and Statistical Quality Control Arle Lommel et.al. 2405.16969 null
2024-05-27 EM Distillation for One-step Diffusion Models Sirui Xie et.al. 2405.16852 null
2024-05-27 Controlling Rate, Distortion, and Realism: Towards a Single Comprehensive Neural Image Compression Model Shoma Iwai et.al. 2405.16817 link
2024-05-26 Coil Reweighting to Suppress Motion Artifacts in Real-Time Exercise Cine Imaging Chong Chen et.al. 2405.16715 null
2024-05-26 Deep learning improved autofocus for motion artifact reduction and its application in quantitative susceptibility mapping Chao Li et.al. 2405.16664 null
2024-05-26 Looks Too Good To Be True: An Information-Theoretic Analysis of Hallucinations in Generative Restoration Models Regev Cohen et.al. 2405.16475 null
2024-05-25 Enhancing Consistency-Based Image Generation via Adversarialy-Trained Classification and Energy-Based Discrimination Shelly Golan et.al. 2405.16260 link
2024-05-25 Maintaining and Managing Road Quality:Using MLP and DNN Makgotso Jacqueline Maotwana et.al. 2405.16196 null
2024-05-25 Accelerating Inference of Retrieval-Augmented Generation via Sparse Context Selection Yun Zhu et.al. 2405.16178 null
2024-05-24 Diff-DTI: Fast Diffusion Tensor Imaging Using A Feature-Enhanced Joint Diffusion Model Lang Zhang et.al. 2405.15830 null
2024-05-24 Erase to Enhance: Data-Efficient Machine Unlearning in MRI Reconstruction Yuyang Xue et.al. 2405.15517 link
2024-05-24 Benchmarking Pre-trained Large Language Models' Potential Across Urdu NLP tasks Munief Hassan Tahir et.al. 2405.15453 null
2024-05-24 Fieldscale: Locality-Aware Field-based Adaptive Rescaling for Thermal Infrared Image Hyeonjae Gil et.al. 2405.15395 link
2024-05-24 CPT-Interp: Continuous sPatial and Temporal Motion Modeling for 4D Medical Image Interpolation Xia Li et.al. 2405.15385 null
2024-05-24 Seeing the World through an Antenna's Eye: Reception Quality Visualization Using Incomplete Technical Signal Information Leif Bergerhoff et.al. 2405.15253 null
2024-05-24 Improved Distribution Matching Distillation for Fast Image Synthesis Tianwei Yin et.al. 2405.14867 link
2024-05-23 Physics-informed Score-based Diffusion Model for Limited-angle Reconstruction of Cardiac Computed Tomography Shuo Han et.al. 2405.14770 null
2024-05-23 Convolutional Neural Network Model Observers Discount Signal-like Anatomical Structures During Search in Virtual Digital Breast Tomosynthesis Phantoms Aditya Jonnalagadda et.al. 2405.14720 null
2024-05-23 OpFlowTalker: Realistic and Natural Talking Face Generation via Optical Flow Guidance Shuheng Ge et.al. 2405.14709 null
2024-05-24 Autoregressive Image Diffusion: Generation of Image Sequence and Application in MRI Guanxiong Luo et.al. 2405.14327 link
2024-05-23 Survey on Visual Signal Coding and Processing with Generative Models: Technologies, Standards and Optimization Zhibo Chen et.al. 2405.14221 null
2024-05-22 Uncertainty-aware Evaluation of Auxiliary Anomalies with the Expected Anomaly Posterior Lorenzo Perini et.al. 2405.13699 null
2024-05-22 Euclid: Early Release Observations -- Programme overview and pipeline for compact- and diffuse-emission photometry J. -C. Cuillandre et.al. 2405.13496 null
2024-05-25 Class-Conditional self-reward mechanism for improved Text-to-Image models Safouane El Ghazouali et.al. 2405.13473 link
2024-05-22 Comparative Analysis of Hyperspectral Image Reconstruction Using Deep Learning for Agricultural and Biological Applications Md. Toukir Ahmed et.al. 2405.13331 null
2024-05-21 Geometric Transformation Uncertainty for Improving 3D Fetal Brain Pose Prediction from Freehand 2D Ultrasound Videos Jayroop Ramesh et.al. 2405.13235 link
2024-05-24 Computational Tradeoffs in Image Synthesis: Diffusion, Masked-Token, and Next-Token Prediction Maciej Kilian et.al. 2405.13218 null
2024-05-21 NieR: Normal-Based Lighting Scene Rendering Hongsheng Wang et.al. 2405.13097 null
2024-05-21 MOSS: Motion-based 3D Clothed Human Synthesis from Monocular Video Hongsheng Wang et.al. 2405.12806 null
2024-05-21 Is Dataset Quality Still a Concern in Diagnosis Using Large Foundation Model? Ziqin Lin et.al. 2405.12584 null
2024-05-20 Paired Conditional Generative Adversarial Network for Highly Accelerated Liver 4D MRI Di Xu et.al. 2405.12357 null
2024-05-20 Deep learning-based hyperspectral image reconstruction for quality assessment of agro-product Md. Toukir Ahmed et.al. 2405.12313 null
2024-05-20 GGAvatar: Geometric Adjustment of Gaussian Head Avatar Xinyang Li et.al. 2405.11993 null
2024-05-20 On Efficient and Statistical Quality Estimation for Data Annotation Jan-Christoph Klie et.al. 2405.11919 null
2024-05-20 ViViD: Video Virtual Try-on using Diffusion Models Zixun Fang et.al. 2405.11794 null
2024-05-19 Solar image quality assessment: a proof of concept using Variance of Laplacian method and its application to optical atmospheric condition monitoring Chu Wing So et.al. 2405.11490 null
2024-05-18 Sampling Strategies for Mitigating Bias in Face Synthesis Methods Emmanouil Maragkoudakis et.al. 2405.11320 null
2024-05-18 Dreamer XL: Towards High-Resolution Text-to-3D Generation via Trajectory Score Matching Xingyu Miao et.al. 2405.11252 link
2024-05-18 Testing the Performance of Face Recognition for People with Down Syndrome Christian Rathgeb et.al. 2405.11240 null
2024-05-21 SPOR: A Comprehensive and Practical Evaluation Method for Compositional Generalization in Data-to-Text Generation Ziyao Xu et.al. 2405.10650 link
2024-05-17 Simultaneous Deep Learning of Myocardium Segmentation and T2 Quantification for Acute Myocardial Infarction MRI Yirong Zhou et.al. 2405.10570 null
2024-05-17 Enhancing Perception Quality in Remote Sensing Image Compression via Invertible Neural Network Junhui Li et.al. 2405.10518 null
2024-05-16 Diversity-Aware Sign Language Production through a Pose Encoding Variational Autoencoder Mohamed Ilyes Lakhal et.al. 2405.10423 null
2024-05-16 GS-Planner: A Gaussian-Splatting-based Planning Framework for Active High-Fidelity Reconstruction Rui Jin et.al. 2405.10142 null
2024-05-16 Semantic Communication via Rate Distortion Perception Bottleneck Zihe Zhao et.al. 2405.09995 null
2024-05-16 VirtualModel: Generating Object-ID-retentive Human-object Interaction Image by Diffusion Model for E-commerce Marketing Binghui Chen et.al. 2405.09985 null
2024-05-16 NTIRE 2024 Restore Any Image Model (RAIM) in the Wild Challenge Jie Liang et.al. 2405.09923 null
2024-05-16 DiffAM: Diffusion-based Adversarial Makeup Transfer for Facial Privacy Protection Yuhao Sun et.al. 2405.09882 link
2024-05-15 Perception- and Fidelity-aware Reduced-Reference Super-Resolution Image Quality Assessment Xinying Lin et.al. 2405.09472 null
2024-05-16 Global-Local Image Perceptual Score (GLIPS): Evaluating Photorealistic Quality of AI-Generated Images Memoona Aziz et.al. 2405.09426 null
2024-05-15 Application of Gated Recurrent Units for CT Trajectory Optimization Yuedong Yuan et.al. 2405.09333 null
2024-05-21 Deep Blur Multi-Model (DeepBlurMM) - a strategy to mitigate the impact of image blur on deep learning model performance in histopathology image analysis Yujie Xiang et.al. 2405.09298 null
2024-05-15 Sensitivity Decouple Learning for Image Compression Artifacts Reduction Li Ma et.al. 2405.09291 null
2024-05-15 Shacl4Bib: custom validation of library data Péter Király et.al. 2405.09177 null
2024-05-18 Scalable Image Coding for Humans and Machines Using Feature Fusion Network Takahiro Shindo et.al. 2405.09152 link
2024-05-15 RSHazeDiff: A Unified Fourier-aware Diffusion Model for Remote Sensing Image Dehazing Jiamei Xiong et.al. 2405.09083 link
2024-05-14 Chemically peculiar stars on the pre-main sequence L. Kueß et.al. 2405.08946 null
2024-05-14 Enhancing Blind Video Quality Assessment with Rich Quality-aware Features Wei Sun et.al. 2405.08745 link
2024-05-13 The Lost Melody: Empirical Observations on Text-to-Video Generation From A Storytelling Perspective Andrew Shin et.al. 2405.08720 null
2024-05-14 Using autoencoders and deep transfer learning to determine the stellar parameters of 286 CARMENES M dwarfs P. Mas-Buitrago et.al. 2405.08703 link
2024-05-15 RMT-BVQA: Recurrent Memory Transformer-based Blind Video Quality Assessment for Enhanced Video Content Tianhao Peng et.al. 2405.08621 null
2024-05-14 Dual-Branch Network for Portrait Image Quality Assessment Wei Sun et.al. 2405.08555 link
2024-05-14 WaterMamba: Visual State Space Model for Underwater Image Enhancement Meisheng Guan et.al. 2405.08419 null
2024-05-14 Perivascular space Identification Nnunet for Generalised Usage (PINGU) Benjamin Sinclair et.al. 2405.08337 link
2024-05-14 Progressive enhancement and restoration for mural images under low-light and defected conditions based on multi-receptive field strategy Xiameng Wei et.al. 2405.08245 link
2024-05-13 Quality of Experience Optimization for Real-time XR Video Transmission with Energy Constraints Guangjin Pan et.al. 2405.07689 null
2024-05-15 PRANK: a singular value based noise filtering approach Francesco Trainotti et.al. 2405.07578 null
2024-05-13 Dehazing Remote Sensing and UAV Imagery: A Review of Deep Learning, Prior-based, and Hybrid Approaches Gao Yu Lee et.al. 2405.07520 null
2024-05-12 Understanding and Evaluating Human Preferences for AI Generated Images with Instruction Tuning Jiarui Wang et.al. 2405.07346 link
2024-05-12 PotatoGANs: Utilizing Generative Adversarial Networks, Instance Segmentation, and Explainable AI for Enhanced Potato Disease Identification and Classification Mohammad Shafiul Alam et.al. 2405.07332 link
2024-05-12 Stable Signature is Unstable: Removing Image Watermark from Diffusion Models Yuepeng Hu et.al. 2405.07145 null
2024-05-11 Large Language Model-aided Edge Learning in Distribution System State Estimation Renyou Xie et.al. 2405.06999 null
2024-05-15 Generation of Granular-Balls for Clustering Based on the Principle of Justifiable Granularity Zihang Jia et.al. 2405.06904 null
2024-05-11 FineParser: A Fine-grained Spatio-temporal Action Parser for Human-centric Action Quality Assessment Jinglin Xu et.al. 2405.06887 link
2024-05-10 Multi-Object Tracking in the Dark Xinzhe Wang et.al. 2405.06600 link
2024-05-10 Compression-Realized Deep Structural Network for Video Quality Enhancement Hanchi Sun et.al. 2405.06342 null
2024-05-09 Perceptual Crack Detection for Rendered 3D Textured Meshes Armin Shafiee Sarvestani et.al. 2405.06143 link
2024-05-09 Distilling Diffusion Models into Conditional GANs Minguk Kang et.al. 2405.05967 null
2024-05-09 How Quality Affects Deep Neural Networks in Fine-Grained Image Classification Joseph Smith et.al. 2405.05742 null
2024-05-09 LatentColorization: Latent Diffusion-Based Speaker Video Colorization Rory Ward et.al. 2405.05707 null
2024-05-09 SwapTalk: Audio-Driven Talking Face Generation with One-Shot Customization in Latent Space Zeren Zhang et.al. 2405.05636 null
2024-05-09 Array SAR 3D Sparse Imaging Based on Regularization by Denoising Under Few Observed Data Yangyang Wang et.al. 2405.05565 null
2024-05-08 Imagine Flash: Accelerating Emu Diffusion Models with Backward Distillation Jonas Kohler et.al. 2405.05224 null
2024-05-08 Bridging the Gap Between Saliency Prediction and Image Quality Assessment Kirillov Alexey et.al. 2405.04997 link
2024-05-07 Remote Diffusion Kunal Sunil Kasodekar et.al. 2405.04717 null
2024-05-07 Novel View Synthesis with Neural Radiance Fields for Industrial Robot Applications Markus Hillemann et.al. 2405.04345 null
2024-05-07 Audio-Visual Speech Representation Expert for Enhanced Talking Face Video Generation and Evaluation Dogucan Yaman et.al. 2405.04327 null
2024-05-07 Cross-IQA: Unsupervised Learning for Image Quality Assessment Zhen Zhang et.al. 2405.04311 null
2024-05-07 Sora Detector: A Unified Hallucination Detection for Large Text-to-Video Models Zhixuan Chu et.al. 2405.04180 link
2024-05-07 Bridging the Synthetic-to-Authentic Gap: Distortion-Guided Unsupervised Domain Adaptation for Blind Image Quality Assessment Aobo Li et.al. 2405.04167 link
2024-05-07 Lossy Compression with Data, Perception, and Classification Constraints Yuhan Wang et.al. 2405.04144 null
2024-05-07 Joint Estimation of Identity Verification and Relative Pose for Partial Fingerprints Xiongjun Guan et.al. 2405.03959 link
2024-05-06 AI-Driven Frameworks for Enhancing Data Quality in Big Data Ecosystems: Error_Detection, Correction, and Metadata Integration Widad Elouataoui et.al. 2405.03870 null
2024-05-06 Accelerated MR Cholangiopancreatography with Deep Learning-based Reconstruction Jinho Kim et.al. 2405.03732 link
2024-05-06 All-in-One Deep Learning Framework for MR Image Reconstruction Geunu Jeong et.al. 2405.03684 null
2024-05-06 An Image Quality Evaluation and Masking Algorithm Based On Pre-trained Deep Neural Networks Peng Jia et.al. 2405.03408 null
2024-05-06 Retinexmamba: Retinex-based Mamba for Low-light Image Enhancement Jiesong Bai et.al. 2405.03349 link
2024-05-06 Light-VQA+: A Video Quality Assessment Model for Exposure Correction with Vision-Language Guidance Xunchu Zhou et.al. 2405.03333 link
2024-05-06 Multi-Modality Spatio-Temporal Forecasting via Self-Supervised Learning Jiewen Deng et.al. 2405.03255 link
2024-05-05 Matten: Video Generation with Mamba-Attention Yu Gao et.al. 2405.03025 null
2024-05-05 Design, analysis, and manufacturing of a glass-plastic hybrid minimalist aspheric panoramic annular lens Shaohua Gao et.al. 2405.02942 null
2024-05-05 Residual-Conditioned Optimal Transport: Towards Structure-preserving Unpaired and Paired Image Restoration Xiaole Tang et.al. 2405.02843 link
2024-05-04 Deep Image Restoration For Image Anti-Forensics Eren Tahir et.al. 2405.02751 link
2024-05-04 DiffuseTrace: A Transparent and Flexible Watermarking Scheme for Latent Diffusion Model Liangqi Lei et.al. 2405.02696 null
2024-05-03 On the test-time zero-shot generalization of vision-language models: Do we really need prompt learning? Maxime Zanella et.al. 2405.02266 link
2024-05-01 Reference-Free Image Quality Metric for Degradation and Reconstruction Artifacts Han Cui et.al. 2405.02208 null
2024-05-03 HoloGS: Instant Depth-based 3D Gaussian Splatting with Microsoft HoloLens 2 Miriam Jäger et.al. 2405.02005 null
2024-05-03 Report on the AAPM Grand Challenge on deep generative modeling for learning medical image statistics Rucha Deshpande et.al. 2405.01822 null
2024-05-07 Towards Inclusive Face Recognition Through Synthetic Ethnicity Alteration Praveen Kumar Chandaliya et.al. 2405.01273 null
2024-05-02 Singular Value and Frame Decomposition-based Reconstruction for Atmospheric Tomography Lukas Weissinger et.al. 2405.01079 null
2024-05-01 Brighteye: Glaucoma Screening with Color Fundus Photographs based on Vision Transformer Hui Lin et.al. 2405.00857 link
2024-05-01 Deep Reward Supervisions for Tuning Text-to-Image Diffusion Models Xiaoshi Wu et.al. 2405.00760 null
2024-05-01 Adapting Pretrained Networks for Image Quality Assessment on High Dynamic Range Displays Andrei Chubarau et.al. 2405.00670 link
2024-05-01 Monte Carlo Tree Search Boosts Reasoning via Iterative Preference Learning Yuxi Xie et.al. 2405.00451 link
2024-04-30 Fast MRI Reconstruction Using Deep Learning-based Compressed Sensing: A Systematic Review Mojtaba Safari et.al. 2405.00241 link
2024-04-30 Charting the Path Forward: CT Image Quality Assessment -- An In-Depth Review Siyi Xun et.al. 2405.00075 null
2024-04-30 Beyond MOS: Subjective Image Quality Score Preprocessing Method Based on Perceptual Similarity Lei Wang et.al. 2404.19666 null
2024-04-30 Perceptual Constancy Constrained Single Opinion Score Calibration for Image Quality Assessment Lei Wang et.al. 2404.19595 null
2024-04-30 Causal Perception Inspired Representation Learning for Trustworthy Image Quality Assessment Lei Wang et.al. 2404.19567 null
2024-05-04 Towards Real-world Video Face Restoration: A New Benchmark Ziyan Chen et.al. 2404.19500 null
2024-04-30 NeRF-Insert: 3D Local Editing with Multimodal Control Signals Benet Oriol Sabat et.al. 2404.19204 null
2024-04-30 Global Search Optics: Automatically Exploring Optimal Solutions to Compact Computational Imaging Systems Yao Gao et.al. 2404.19201 null
2024-04-30 Advancing low-field MRI with a universal denoising imaging transformer: Towards fast and high-quality imaging Zheren Zhu et.al. 2404.19167 link
2024-04-29 A Comprehensive Rubric for Annotating Pathological Speech Mario Corrales-Astorgano et.al. 2404.18851 null
2024-04-29 Autonomous Quality and Hallucination Assessment for Virtual Tissue Staining and Digital Pathology Luzhe Huang et.al. 2404.18458 null
2024-04-29 PKU-AIGIQA-4K: A Perceptual Quality Assessment Database for Both Text-to-Image and Image-to-Image AI-Generated Images Jiquan Yuan et.al. 2404.18409 link
2024-04-29 G-Refine: A General Quality Refiner for Text-to-Image Generation Chunyi Li et.al. 2404.18343 link
2024-04-28 An automated pipeline for computation and analysis of functional ventilation and perfusion lung MRI with matrix pencil decomposition: TrueLung Orso Pusterla et.al. 2404.18275 null
2024-04-28 LMM-PCQA: Assisting Point Cloud Quality Assessment with LMM Zic

About

🎓Automatically Update Interested Papers Daily using Github Actions (Update Every 12th hours)

Topics

Resources

License

Code of conduct

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 100.0%