[![Contributors][contributors-shield]][contributors-url] [![Forks][forks-shield]][forks-url] [![Stargazers][stars-shield]][stars-url] [![Issues][issues-shield]][issues-url]
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-02-26 | SPU-IMR: Self-supervised Arbitrary-scale Point Cloud Upsampling via Iterative Mask-recovery Network | Ziming Nie et.al. | 2502.19452 | null |
2025-02-25 | Deep-JGAC: End-to-End Deep Joint Geometry and Attribute Compression for Dense Colored Point Clouds | Yun Zhang et.al. | 2502.17939 | null |
2025-02-10 | Real-Time LiDAR Point Cloud Compression and Transmission for Resource-constrained Robots | Yuhao Cao et.al. | 2502.06123 | link |
2025-02-07 | DetVPCC: RoI-based Point Cloud Sequence Compression for 3D Object Detection | Mingxuan Yan et.al. | 2502.04804 | null |
2025-02-05 | Deep Learning-based Event Data Coding: A Joint Spatiotemporal and Polarity Solution | Abdelrahman Seleem et.al. | 2502.03285 | null |
2025-02-22 | Point Cloud Upsampling as Statistical Shape Model for Pelvic | Tongxu Zhang et.al. | 2501.16716 | null |
2025-01-25 | Efficient Point Clouds Upsampling via Flow Matching | Zhi-Song Liu et.al. | 2501.15286 | null |
2025-01-13 | Representation Learning of Point Cloud Upsampling in Global and Local Inputs | Tongxu Zhang et.al. | 2501.07076 | null |
2024-12-19 | Color Enhancement for V-PCC Compressed Point Cloud via 2D Attribute Map Optimization | Jingwei Bao et.al. | 2412.14449 | null |
2024-12-16 | EGP3D: Edge-guided Geometric Preserving 3D Point Cloud Super-resolution for RGB-D camera | Zheng Fang et.al. | 2412.11680 | null |
2024-12-11 | Implicit Neural Compression of Point Clouds | Hongning Ruan et.al. | 2412.10433 | null |
2024-12-07 | Rate-Distortion Optimized Skip Coding of Region Adaptive Hierarchical Transform Coefficients for MPEG G-PCC | Zehan Wang et.al. | 2412.05574 | null |
2025-01-09 | Rendering-Oriented 3D Point Cloud Attribute Compression using Sparse Tensor-based Transformer | Xiao Huo et.al. | 2411.07899 | null |
2024-11-09 | Linear Spherical Sliced Optimal Transport: A Fast Metric for Comparing Spherical Data | Xinran Liu et.al. | 2411.06055 | null |
2024-11-01 | PLATYPUS: Progressive Local Surface Estimator for Arbitrary-Scale Point Cloud Upsampling | Donghyun Kim et.al. | 2411.00432 | null |
2024-10-28 | Quality Analysis of the Coding Bitrate Tradeoff Between Geometry and Attributes for Colored Point Clouds | Joao Prazeres et.al. | 2410.21613 | null |
2024-10-09 | Point Cloud Compression with Bits-back Coding | Nguyen Quang Hieu et.al. | 2410.18115 | null |
2024-10-23 | Att2CPC: Attention-Guided Lossy Attribute Compression of Point Clouds | Kai Liu et.al. | 2410.17823 | link |
2024-10-22 | Joint Point Cloud Upsampling and Cleaning with Octree-based CNNs | Jihe Li et.al. | 2410.17001 | link |
2024-10-21 | MBPU: A Plug-and-Play State Space Model for Point Cloud Upsamping with Fast Point Rendering | Jiayi Song et.al. | 2410.15941 | null |
2024-10-13 | Towards Reproducible Learning-based Compression | Jiahao Pang et.al. | 2410.09872 | null |
2024-10-06 | Tensor-Train Point Cloud Compression and Efficient Approximate Nearest-Neighbor Search | Georgii Novikov et.al. | 2410.04462 | null |
2024-10-01 | Can We Remove the Ground? Obstacle-aware Point Cloud Compression for Remote Object Detection | Pengxi Zeng et.al. | 2410.00582 | null |
2024-09-19 | PVContext: Hybrid Context Model for Point Cloud Compression | Guoqing Zhang et.al. | 2409.12724 | null |
2024-09-12 | The JPEG Pleno Learning-based Point Cloud Coding Standard: Serving Man and Machine | André F. R. Guarda et.al. | 2409.08130 | null |
2024-09-08 | GET-UP: GEomeTric-aware Depth Estimation with Radar Points UPsampling | Huawei Sun et.al. | 2409.02720 | link |
2024-09-03 | GaussianPU: A Hybrid 2D-3D Upsampling Framework for Enhancing Color Point Clouds via 3D Gaussian Splatting | Zixuan Guo et.al. | 2409.01581 | null |
2024-08-20 | End-to-end learned Lossy Dynamic Point Cloud Attribute Compression | Dat Thanh Nguyen et.al. | 2408.10665 | null |
2024-08-20 | Diff-PCC: Diffusion-based Neural Compression for 3D Point Clouds | Kai Liu et.al. | 2408.10543 | null |
2024-08-16 | LLM-PCGC: Large Language Model-based Point Cloud Geometry Compression | Yuqi Ye et.al. | 2408.08682 | null |
2024-08-06 | Fast Point Cloud Geometry Compression with Context-based Residual Coding and INR-based Refinement | Hao Xu et.al. | 2408.02966 | null |
2024-08-01 | Learned Compression of Point Cloud Geometry and Attributes in a Single Model through Multimodal Rate-Control | Michael Rudolph et.al. | 2408.00599 | null |
2024-07-22 | Double Deep Learning-based Event Data Coding and Classification | Abdelrahman Seleem et.al. | 2407.15531 | null |
2024-07-11 | Enhancing octree-based context models for point cloud geometry compression with attention-based child node number prediction | Chang Sun et.al. | 2407.08528 | null |
2024-07-11 | Enhancing context models for point cloud geometry compression with context feature residuals and multi-loss | Chang Sun et.al. | 2407.08520 | null |
2024-07-19 | PCAC-GAN: A Sparse-Tensor-Based Generative Adversarial Network for 3D Point Cloud Attribute Compression | Xiaolong Mao et.al. | 2407.05677 | null |
2024-07-05 | Rethinking Data Input for Point Cloud Upsampling | Tongxu Zhang et.al. | 2407.04476 | null |
2024-08-26 | TSC-PCAC: Voxel Transformer and Sparse Convolution Based Point Cloud Attribute Compression for 3D Broadcasting | Zixi Guo et.al. | 2407.04284 | link |
2024-06-15 | Full reference point cloud quality assessment using support vector regression | Ryosuke Watanabe et.al. | 2406.10520 | link |
2024-09-25 | Bits-to-Photon: End-to-End Learned Scalable Point Cloud Compression for Direct Rendering | Yueyu Hu et.al. | 2406.05915 | null |
2024-06-02 | Towards Point Cloud Compression for Machine Perception: A Simple and Strong Baseline by Learning the Octree Depth Level Predictor | Lei Liu et.al. | 2406.00791 | null |
2024-05-23 | NeuroGauss4D-PCI: 4D Neural Fields and Gaussian Deformation Fields for Point Cloud Interpolation | Chaokang Jiang et.al. | 2405.14241 | link |
2024-05-19 | Point Cloud Compression with Implicit Neural Representations: A Unified Framework | Hongning Ruan et.al. | 2405.11493 | null |
2024-05-02 | PointCompress3D -- A Point Cloud Compression Framework for Roadside LiDARs in Intelligent Transportation Systems | Walter Zimmer et.al. | 2405.01750 | null |
2024-04-21 | Pointsoup: High-Performance and Extremely Low-Decoding-Latency Learned Geometry Codec for Large-Scale Point Cloud Scenes | Kang You et.al. | 2404.13550 | link |
2024-04-16 | Reward Learning from Suboptimal Demonstrations with Applications in Surgical Electrocautery | Zohre Karimi et.al. | 2404.07185 | null |
2024-04-10 | Efficient and Generic Point Model for Lossless Point Cloud Attribute Compression | Kang You et.al. | 2404.06936 | link |
2024-04-09 | Diffusion-Based Point Cloud Super-Resolution for mmWave Radar Data | Kai Luan et.al. | 2404.06012 | null |
2024-03-13 | Point Cloud Compression via Constrained Optimal Transport | Zezeng Li et.al. | 2403.08236 | link |
2024-03-08 | Arbitrary-Scale Point Cloud Upsampling by Voxel-Based Network with Latent Geometric-Consistent Learning | Hang Du et.al. | 2403.05117 | link |
2024-03-01 | Assessing objective quality metrics for JPEG and MPEG point cloud coding | Davi Lazzarotto et.al. | 2403.00410 | null |
2024-02-23 | Scalable Human-Machine Point Cloud Compression | Mateen Ulhaq et.al. | 2402.12532 | link |
2024-02-18 | 3D Point Cloud Compression with Recurrent Neural Network and Image Compression Methods | Till Beemelmanns et.al. | 2402.11680 | link |
2024-02-17 | Hierarchical Prior-based Super Resolution for Point Cloud Geometry Compression | Dingquan Li et.al. | 2402.11250 | link |
2024-02-11 | PIVOT-Net: Heterogeneous Point-Voxel-Tree-based Framework for Point Cloud Compression | Jiahao Pang et.al. | 2402.07243 | null |
2024-02-07 | Performance analysis of Deep Learning-based Lossy Point Cloud Geometry Compression Coding Solutions | Joao Prazeres et.al. | 2402.05192 | null |
2024-02-08 | Subjective performance evaluation of bitrate allocation strategies for MPEG and JPEG Pleno point cloud compression | Davi Lazzarotto et.al. | 2402.04760 | null |
2024-02-15 | LiDAR-Forest Dataset: LiDAR Point Cloud Simulation Dataset for Forestry Application | Yawen Lu et.al. | 2402.04546 | null |
2023-12-23 | Learning Continuous Implicit Field with Local Distance Indicator for Arbitrary-Scale Point Cloud Upsampling | Shujuan Li et.al. | 2312.15133 | null |
2024-03-13 | DiffPMAE: Diffusion Masked Autoencoders for Point Cloud Reconstruction | Yanlong Li et.al. | 2312.03298 | link |
2023-12-03 | A Conditional Denoising Diffusion Probabilistic Model for Point Cloud Upsampling | Wentao Qu et.al. | 2312.02719 | link |
2023-11-22 | Learned Nonlinear Predictor for Critically Sampled 3D Point Cloud Attribute Compression | Tam Thuc Do et.al. | 2311.13539 | null |
2023-11-22 | Volumetric 3D Point Cloud Attribute Compression: Learned polynomial bilateral filter for prediction | Tam Thuc Do et.al. | 2311.13533 | null |
2023-11-22 | Test-Time Augmentation for 3D Point Cloud Classification and Segmentation | Tuan-Anh Vu et.al. | 2311.13152 | null |
2023-11-03 | PDF: Point Diffusion Implicit Function for Large-scale Scene Neural Representation | Yuhan Ding et.al. | 2311.01773 | null |
2023-11-02 | Lightweight super resolution network for point cloud geometry compression | Wei Zhang et.al. | 2311.00970 | link |
2023-11-17 | Deep Learning-based Compressed Domain Multimedia for Man and Machine: A Taxonomy and Application to Point Cloud Classification | Abdelrahman Seleem et.al. | 2310.18849 | null |
2023-10-13 | iPUNet:Iterative Cross Field Guided Point Cloud Upsampling | Guangshun Wei et.al. | 2310.09092 | link |
2024-03-15 | PU-Ray: Domain-Independent Point Cloud Upsampling via Ray Marching on Neural Implicit Surface | Sangwon Lim et.al. | 2310.08755 | link |
2024-02-16 | Quasi-Monte Carlo for 3D Sliced Wasserstein | Khai Nguyen et.al. | 2309.11713 | link |
2023-09-08 | Poster: Making Edge-assisted LiDAR Perceptions Robust to Lossy Point Cloud Compression | Jin Heo et.al. | 2309.04549 | null |
2023-09-01 | Test-Time Adaptation for Point Cloud Upsampling Using Meta-Learning | Ahmed Hatem et.al. | 2308.16484 | null |
2024-02-08 | SCP: Spherical-Coordinate-based Learned Point Cloud Compression | Ao Luo et.al. | 2308.12535 | null |
2023-08-22 | Learning a More Continuous Zero Level Set in Unsigned Distance Fields through Level Set Projection | Junsheng Zhou et.al. | 2308.11441 | link |
2023-08-11 | Learned Point Cloud Compression for Classification | Mateen Ulhaq et.al. | 2308.05959 | link |
2023-07-27 | FLiCR: A Fast and Lightweight LiDAR Point Cloud Compression Based on Lossy RI | Jin Heo et.al. | 2307.15005 | null |
2023-07-20 | Aggressive saliency-aware point cloud compression | Eleftheria Psatha et.al. | 2307.10741 | null |
2023-07-18 | Arbitrary point cloud upsampling via Dual Back-Projection Network | Zhi-Song Liu et.al. | 2307.08992 | null |
2023-06-01 | 4DSR-GCN: 4D Video Point Cloud Upsampling using Graph Convolutional Networks | Lorenzo Berlincioni et.al. | 2306.01081 | null |
2023-05-16 | Learning Dynamic Point Cloud Compression via Hierarchical Inter-frame Block Matching | Shuting Xia et.al. | 2305.05356 | null |
2023-05-02 | Geometric Prior Based Deep Human Point Cloud Geometry Compression | Xinju Wu et.al. | 2305.01309 | null |
2023-05-02 | PU-EdgeFormer: Edge Transformer for Dense Prediction in Point Cloud Upsampling | Dohoon Kim et.al. | 2305.01148 | link |
2023-04-24 | Grad-PU: Arbitrary-Scale Point Cloud Upsampling via Gradient Descent with Learned Distance Functions | Yun He et.al. | 2304.11846 | link |
2023-04-01 | Volumetric Attribute Compression for 3D Point Clouds using Feedforward Network with Geometric Attention | Tam Thuc Do et.al. | 2304.00335 | null |
2023-03-27 | NeuralPCI: Spatio-temporal Neural Field for 3D Point Cloud Multi-frame Non-linear Interpolation | Zehan Zheng et.al. | 2303.15126 | link |
2023-11-07 | GQE-Net: A Graph-based Quality Enhancement Network for Point Cloud Color Attribute | Jinrui Xing et.al. | 2303.13764 | link |
2023-03-22 | Lossless Point Cloud Attribute Compression Using Cross-scale, Cross-group, and Cross-color Prediction | Jianqiang Wang et.al. | 2303.12917 | null |
2023-12-28 | Progressive Frame Patching for FoV-based Point Cloud Video Streaming | Tongyu Zong et.al. | 2303.08336 | null |
2023-12-03 | Parametric Surface Constrained Upsampler Network for Point Cloud | Pingping Cai et.al. | 2303.08240 | link |
2024-03-20 | Lossless Point Cloud Geometry and Attribute Compression Using a Learned Conditional Probability Model | Dat Thanh Nguyen et.al. | 2303.06519 | link |
2023-03-11 | Deep probabilistic model for lossless scalable point cloud attribute compression | Dat Thanh Nguyen et.al. | 2303.06517 | null |
2023-03-09 | BIRD-PCC: Bi-directional Range Image-based Deep LiDAR Point Cloud Compression | Chia-Sheng Liu et.al. | 2303.04027 | null |
2023-02-13 | gpcgc: a green point cloud geometry coding method | Qingyang Zhou et.al. | 2302.06062 | null |
2023-02-09 | BASICS: Broad quality Assessment of Static point clouds In Compression Scenarios | Ali Ak et.al. | 2302.04796 | null |
2023-04-27 | Linear Optimal Partial Transport Embedding | Yikun Bai et.al. | 2302.03232 | link |
2023-01-31 | Lidar Upsampling with Sliced Wasserstein Distance | Artem Savkin et.al. | 2301.13558 | null |
2023-01-28 | Dynamic Point Cloud Geometry Compression Using Multiscale Inter Conditional Coding | Jianqiang Wang et.al. | 2301.12165 | null |
2023-01-27 | Joint Geometry and Attribute Upsampling of Point Clouds Using Frequency-Selective Models with Overlapped Support | Viktoria Heimann et.al. | 2301.11630 | null |
2023-01-03 | Reduced Reference Quality Assessment for Point Cloud Compression | Yipeng Liu et.al. | 2301.01009 | null |
2023-04-06 | Neural Shape Compiler: A Unified Framework for Transforming between Text, Point Cloud, and Program | Tiange Luo et.al. | 2212.12952 | null |
2022-12-11 | Learning Neural Volumetric Field for Point Cloud Geometry Compression | Yueyu Hu et.al. | 2212.05589 | link |
2022-12-01 | Low-Rank Tensor Function Representation for Multi-Dimensional Data Recovery | Yisi Luo et.al. | 2212.00262 | null |
2023-12-09 | ECM-OPCC: Efficient Context Model for Octree-based Point Cloud Compression | Yiqi Jin et.al. | 2211.10916 | null |
2022-11-19 | Rate-Distortion Modeling for Bit Rate Constrained Point Cloud Compression | Pan Gao et.al. | 2211.10646 | null |
2022-10-21 | Motion Policy Networks | Adam Fishman et.al. | 2210.12209 | link |
2022-10-28 | Motion estimation and filtered prediction for dynamic point cloud attribute compression | Haoran Hong et.al. | 2210.08262 | null |
2022-10-08 | Point Cloud Upsampling via Cascaded Refinement Network | Hang Du et.al. | 2210.03942 | link |
2023-02-14 | Multiscale Latent-Guided Entropy Model for LiDAR Point Cloud Compression | Tingyu Fan et.al. | 2209.12512 | null |
2022-09-17 | CARNet:Compression Artifact Reduction for Point Cloud Attribute | Dandan Ding et.al. | 2209.08276 | null |
2022-11-16 | CU-Net: Real-Time High-Fidelity Color Upsampling for Point Clouds | Lingdong Wang et.al. | 2209.06112 | link |
2022-09-09 | GRASP-Net: Geometric Residual Analysis and Synthesis for Point Cloud Compression | Jiahao Pang et.al. | 2209.04401 | link |
2022-09-06 | Learning to Predict on Octree for Scalable Point Cloud Geometry Coding | Yixiang Mao et.al. | 2209.02226 | null |
2022-08-26 | Efficient LiDAR Point Cloud Geometry Compression Through Neighborhood Point Attention | Ruixiang Xue et.al. | 2208.12573 | null |
2022-08-17 | Efficient dynamic point cloud coding using Slice-Wise Segmentation | Faranak Tohidi et.al. | 2208.08061 | null |
2023-01-10 | Arbitrary Point Cloud Upsampling with Spherical Mixture of Gaussians | Anthony Dell'Eva et.al. | 2208.05274 | link |
2022-08-04 | IT/IST/IPLeiria Response to the Call for Proposals on JPEG Pleno Point Cloud Coding | André F. R. Guarda et.al. | 2208.02716 | null |
2022-08-04 | IPDAE: Improved Patch-Based Deep Autoencoder for Lossy Point Cloud Geometry Compression | Kang You et.al. | 2208.02519 | link |
2022-07-25 | Inter-Frame Compression for Dynamic Point Cloud Geometry Coding | Anique Akhtar et.al. | 2207.12554 | null |
2022-07-20 | GIPSO: Geometrically Informed Propagation for Online Adaptation in 3D LiDAR Segmentation | Cristiano Saltori et.al. | 2207.09763 | link |
2022-06-25 | BIMS-PU: Bi-Directional and Multi-Scale Point Cloud Upsampling | Yechao Bai et.al. | 2206.12648 | null |
2022-06-24 | Rate-Distortion Optimal Transform Coefficient Selection for Unoccupied Regions in Video-Based Point Cloud Compression | Christian Herglotz et.al. | 2206.12186 | null |
2022-05-24 | A Rate Control Algorithm for Video-based Point Cloud Compression | Fangyu Shen et.al. | 2205.11825 | null |
2022-05-19 | A Comparative Study of Feature Expansion Unit for 3D Point Cloud Upsampling | Qiang Li et.al. | 2205.09594 | null |
2022-05-02 | D-DPCC: Deep Dynamic Point Cloud Compression via 3D Motion Prediction | Tingyu Fan et.al. | 2205.01135 | link |
2022-05-02 | Point Cloud Compression with Sibling Context and Surface Priors | Zhili Chen et.al. | 2205.00760 | link |
2022-04-29 | Deep Geometry Post-Processing for Decompressed Point Clouds | Xiaoqing Fan et.al. | 2204.13952 | link |
2022-04-27 | Density-preserving Deep Point Cloud Compression | Yun He et.al. | 2204.12684 | null |
2022-04-25 | 4DAC: Learning Attribute Compression for Dynamic Point Clouds | Guangchi Fang et.al. | 2204.11723 | null |
2022-04-25 | Dynamic Point Cloud Compression with Cross-Sectional Approach | Faranak Tohidi et.al. | 2204.11409 | null |
2022-04-22 | PU-EVA: An Edge Vector based Approximation Solution for Flexible-scale Point Cloud Upsampling | Luqing Luo et.al. | 2204.10750 | null |
2022-04-18 | Self-Supervised Arbitrary-Scale Point Clouds Upsampling via Implicit Neural Representation | Wenbo Zhao et.al. | 2204.08196 | link |
2022-06-22 | Learning-based Lossless Point Cloud Geometry Coding using Sparse Tensors | Dat Thanh Nguyen et.al. | 2204.05043 | null |
2022-04-03 | Sparse Tensor-based Point Cloud Attribute Compression | Jianqiang Wang et.al. | 2204.01023 | link |
2022-03-22 | IDEA-Net: Dynamic 3D Point Cloud Interpolation via Deep Embedding Alignment | Yiming Zeng et.al. | 2203.11590 | link |
2022-03-21 | Upsampling Autoencoder for Self-Supervised Point Cloud Learning | Cheng Zhang et.al. | 2203.10768 | null |
2022-05-03 | Frequency-Selective Mesh-to-Mesh Resampling for Color Upsampling of Point Clouds | Viktoria Heimann et.al. | 2203.09224 | null |
2022-03-02 | PUFA-GAN: A Frequency-Aware Generative Adversarial Network for 3D Point Cloud Upsampling | Hao Liu et.al. | 2203.00914 | null |
2022-05-16 | Variable Rate Compression for Raw 3D Point Clouds | Md Ahmed Al Muzaddid et.al. | 2202.13862 | link |
2022-09-14 | Point cloud completion via structured feature maps using a feedback network | Zejia Su et.al. | 2202.08583 | null |
2022-05-08 | OctAttention: Octree-Based Large-Scale Contexts Model for Point Cloud Compression | Chunyang Fu et.al. | 2202.06028 | link |
2022-02-01 | Point Cloud Compression for Efficient Data Broadcasting: A Performance Comparison | Francesco Nardo et.al. | 2202.00719 | null |
2022-02-01 | Fractional Motion Estimation for Point Cloud Compression | Haoran Hong et.al. | 2202.00172 | null |
2022-01-17 | SimIPU: Simple 2D Image and 3D Point Cloud Unsupervised Pre-Training for Spatial-Aware Visual Representations | Zhenyu Li et.al. | 2112.04680 | link |
2022-03-31 | Neural Points: Point Cloud Representation with Neural Fields for Arbitrary Upsampling | Wanquan Feng et.al. | 2112.04148 | link |
2022-03-01 | Attribute Artifacts Removal for Geometry-based Point Cloud Compression | Xihua Sheng et.al. | 2112.00560 | null |
2022-10-03 | PU-Transformer: Point Cloud Upsampling Transformer | Shi Qiu et.al. | 2111.12242 | link |
2022-10-21 | Sparse Tensor-based Multiscale Representation for Point Cloud Geometry Compression | Jianqiang Wang et.al. | 2111.10633 | link |
2021-10-18 | Patch-Based Deep Autoencoder for Point Cloud Geometry Compression | Kang You et.al. | 2110.09109 | link |
2022-07-12 | PC | Chen Long et.al. | 2109.09337 | link |
2021-09-16 | R-PCC: A Baseline for Range Image-based Point Cloud Compression | Sukai Wang et.al. | 2109.07717 | link |
2021-09-15 | Which One is Better: Assessing Objective Metrics for Point Cloud Compression | Yipeng Liu et.al. | 2109.07158 | null |
2021-08-05 | Joint Geometry and Color Projection-based Point Cloud Quality Metric | Alireza Javaheri et.al. | 2108.02481 | link |
2021-08-03 | SSPU-Net: Self-Supervised Point Cloud Upsampling via Differentiable Rendering | Yifan Zhao et.al. | 2108.00454 | link |
2021-07-29 | Video-based Point Cloud Compression Artifact Removal | Anique Akhtar et.al. | 2107.14179 | null |
2024-02-28 | Score-Based Point Cloud Denoising | Shitong Luo et.al. | 2107.10981 | link |
2022-06-08 | PU-Flow: a Point Cloud Upsampling Network with Normalizing Flows | Aihua Mao et.al. | 2107.05893 | link |
2022-04-18 | "Zero-Shot" Point Cloud Upsampling | Kaiyue Zhou et.al. | 2106.13765 | link |
2021-06-23 | Lossless Point Cloud Attribute Compression with Normal-based Intra Prediction | Qian Yin et.al. | 2106.12236 | null |
2021-06-21 | Cylindrical coordinates for LiDAR point cloud compression | Shashank N. Sridhara et.al. | 2106.11237 | null |
2021-10-11 | Neural Network Modeling of Probabilities for Coding the Octree Representation of Point Clouds | Emre Can Kaya et.al. | 2106.06482 | link |
2021-06-09 | Point Cloud Upsampling via Disentangled Refinement | Ruihui Li et.al. | 2106.04779 | link |
2021-06-02 | DeepCompress: Efficient Point Cloud Geometry Compression | Ryan Killea et.al. | 2106.01504 | link |
2021-06-01 | RAI-Net: Range-Adaptive LiDAR Point Cloud Frame Interpolation Network | Lili Zhao et.al. | 2106.00496 | null |
2021-05-28 | An Unsupervised Optical Flow Estimation For LiDAR Image Sequences | Xuezhou Guo et.al. | 2105.13879 | null |
2021-05-05 | VoxelContext-Net: An Octree based Framework for Point Cloud Compression | Zizheng Que et.al. | 2105.02158 | null |
2021-04-20 | Multiscale deep context modeling for lossless point cloud geometry compression | Dat Thanh Nguyen et.al. | 2104.09859 | link |
2021-04-12 | Towards Efficient Graph Convolutional Networks for Point Cloud Handling | Yawei Li et.al. | 2104.05706 | null |
2021-03-11 | Advanced Geometry Surface Coding for Dynamic Point Cloud Compression | Jian Xiong et.al. | 2103.06549 | null |
2021-03-05 | Hybrid Point Cloud Semantic Compression for Automotive Sensors: A Performance Evaluation | Andrea Varischio et.al. | 2103.03819 | null |
2021-02-26 | Point Cloud Upsampling and Normal Estimation using Deep Learning for Robust Surface Reconstruction | Rajat Sharma et.al. | 2102.13391 | link |
2021-02-25 | A deep perceptual metric for 3D point clouds | Maurice Quach et.al. | 2102.12839 | link |
2021-02-08 | Meta-PU: An Arbitrary-Scale Upsampling Network for Point Cloud | Shuquan Ye et.al. | 2102.04317 | null |
2020-12-15 | NeuralQAAD: An Efficient Differentiable Framework for High Resolution Point Cloud Compression | Nicolas Wagner et.al. | 2012.08143 | null |
2022-06-11 | SPU-Net: Self-Supervised Point Cloud Upsampling by Coarse-to-Fine Reconstruction with Self-Projection Optimization | Xinhai Liu et.al. | 2012.04439 | link |
2021-11-18 | Vehicular Cooperative Perception Through Action Branching and Federated Reinforcement Learning | Mohamed K. Abdel-Aziz et.al. | 2012.03414 | null |
2020-12-05 | ParaNet: Deep Regular Representation for 3D Point Clouds | Qijian Zhang et.al. | 2012.03028 | null |
2020-11-27 | Spherical Interpolated Convolutional Network with Distance-Feature Density for 3D Semantic Segmentation of Point Clouds | Guangming Wang et.al. | 2011.13784 | null |
2020-11-25 | Reduced Reference Perceptual Quality Model and Application to Rate Control for 3D Point Cloud Compression | Qi Liu et.al. | 2011.12688 | null |
2020-11-07 | Multiscale Point Cloud Geometry Compression | Jianqiang Wang et.al. | 2011.03799 | link |
2020-10-29 | Point Cloud Attribute Compression via Successive Subspace Graph Transform | Yueru Chen et.al. | 2010.15302 | null |
2020-08-16 | Real-Time Spatio-Temporal LiDAR Point Cloud Compression | Yu Feng et.al. | 2008.06972 | link |
2021-08-03 | Subjective Quality Database and Objective Study of Compressed Point Clouds With 6DoF Head-Mounted Display | Xinju Wu et.al. | 2008.02501 | null |
2020-06-20 | Pseudo-LiDAR Point Cloud Interpolation Based on 3D Motion Representation and Spatial Supervision | Haojie Liu et.al. | 2006.11481 | null |
2020-06-24 | Improved Deep Point Cloud Geometry Compression | Maurice Quach et.al. | 2006.09043 | link |
2020-04-03 | Intrinsic Point Cloud Interpolation via Dual Latent Space Navigation | Marie-Julie Rakotosaona et.al. | 2004.01661 | link |
2020-03-30 | A generalized Hausdorff distance based quality metric for point cloud geometry | Alireza Javaheri et.al. | 2003.13669 | null |
2020-03-30 | Optimizing Geometry Compression using Quantum Annealing | Sebastian Feld et.al. | 2003.13253 | null |
2020-03-27 | Model-based Joint Bit Allocation between Geometry and Color for Video-based 3D Point Cloud Compression | Qi Liu et.al. | 2002.10798 | null |
2020-03-07 | PUGeo-Net: A Geometry-centric Network for 3D Point Cloud Upsampling | Yue Qian et.al. | 2002.10277 | null |
2020-06-22 | Folding-based compression of point cloud attributes | Maurice Quach et.al. | 2002.04439 | null |
2020-01-13 | Efficient 3D Road Map Data Exchange for Intelligent Vehicles in Vehicular Fog Networks | Ivan Wang-Hei Ho et.al. | 2001.04057 | null |
2020-01-12 | Linear Model based Geometry Coding for Lidar Acquired Point Clouds | Xiang Zhang et.al. | 2001.03871 | null |
2021-04-09 | PV-RCNN: Point-Voxel Feature Set Abstraction for 3D Object Detection | Shaoshuai Shi et.al. | 1912.13192 | link |
2019-12-20 | A Comprehensive Study and Comparison of Core Technologies for MPEG 3D Point Cloud Compression | Hao Liu et.al. | 1912.09674 | null |
2020-10-15 | Point Cloud Rendering after Coding: Impacts on Subjective and Objective Quality | Alireza Javaheri et.al. | 1912.09137 | null |
2021-03-29 | PU-GCN: Point Cloud Upsampling using Graph Convolutional Networks | Guocheng Qian et.al. | 1912.03264 | link |
2019-11-04 | Video-based compression for plenoptic point clouds | Li Li et.al. | 1911.01355 | null |
2019-09-26 | Learned Point Cloud Geometry Compression | Jianqiang Wang et.al. | 1909.12037 | link |
2019-09-16 | PLIN: A Network for Pseudo-LiDAR Point Cloud Interpolation | Haojie Liu et.al. | 1909.07137 | null |
2019-08-17 | 3D Point Cloud Super-Resolution via Graph Total Variation on Surface Normals | Chinthaka Dinesh et.al. | 1908.06261 | null |
2019-08-06 | Point Cloud Super Resolution with Adversarial Residual Graph Networks | Huikai Wu et.al. | 1908.02111 | link |
2020-08-10 | Predictive Generalized Graph Fourier Transform for Attribute Compression of Dynamic Point Clouds | Yiqun Xu et.al. | 1908.01970 | null |
2019-07-25 | PU-GAN: a Point Cloud Upsampling Adversarial Network | Ruihui Li et.al. | 1907.10844 | null |
2019-06-27 | A Convolutional Decoder for Point Clouds using Adaptive Instance Normalization | Isaak Lim et.al. | 1906.11478 | null |
2019-04-18 | Deep AutoEncoder-based Lossy Geometry Compression for Point Clouds | Wei Yan et.al. | 1905.03691 | null |
2019-05-22 | Learning Convolutional Transforms for Lossy Point Cloud Geometry Compression | Maurice Quach et.al. | 1903.08548 | link |
2019-09-30 | Variational Graph Methods for Efficient Point Cloud Sparsification | Daniel Tenbrinck et.al. | 1903.02858 | null |
2019-03-05 | Pose Estimation of Vehicles Over Uneven Terrain | Yingchong Ma et.al. | 1903.02052 | null |
2019-02-11 | Occupancy-map-based rate distortion optimization for video-based point cloud compression | Li Li et.al. | 1902.04169 | null |
2018-09-30 | A Volumetric Approach to Point Cloud Compression | Maja Krivokuća et.al. | 1810.00484 | null |
2018-05-29 | Surface Light Field Compression using a Point Cloud Codec | Xiang Zhang et.al. | 1805.11203 | null |
2018-05-23 | Comments on "Compression of 3D Point Clouds Using a Region-Adaptive Hierarchical Transform" | Gustavo Sandri et.al. | 1805.09146 | null |
2018-04-28 | Hybrid Point Cloud Attribute Compression Using Slice-based Layered Structure and Block-based Intra Prediction | Yiting Shao et.al. | 1804.10783 | null |
2018-03-26 | PU-Net: Point Cloud Upsampling Network | Lequan Yu et.al. | 1801.06761 | link |
2017-10-10 | Attribute Compression of 3D Point Clouds Using Laplacian Sparsity Optimized Graph Transform | Yiting Shao et.al. | 1710.03532 | null |
2017-03-08 | Dynamic Polygon Clouds: Representation and Compression for VR/AR | Philip A. Chou et.al. | 1610.00402 | null |

Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-02-27 | Balanced Rate-Distortion Optimization in Learned Image Compression | Yichi Zhang et.al. | 2502.20161 | null |
2025-02-27 | Transformer-Based Nonlinear Transform Coding for Multi-Rate CSI Compression in MIMO-OFDM Systems | Bumsu Park et.al. | 2502.19847 | null |
2025-02-26 | Zipping many-body quantum states: a scalable approach to diagonal entropy | Yu-Hsueh Chen et.al. | 2502.18898 | null |
2025-02-25 | Novel quantum circuit for image compression utilizing modified Toffoli gate and quantized transformed coefficient alongside a novel reset gate | Ershadul Haque et.al. | 2502.17815 | null |
2025-02-25 | Quantum neural compressive sensing for ghost imaging | Xinliang Zhai et.al. | 2502.17790 | null |
2025-02-24 | Optimized Memory System Architecture for VESA VDC-M Decoder with Multi-Slice Support | Hannah Yang et.al. | 2502.17729 | null |
2025-02-24 | Pleno-Generation: A Scalable Generative Face Video Compression Framework with Bandwidth Intelligence | Bolin Chen et.al. | 2502.17085 | null |
2025-02-24 | Hierarchical Semantic Compression for Consistent Image Semantic Restoration | Shengxi Li et.al. | 2502.16799 | null |
2025-02-24 | Continuous Patch Stitching for Block-wise Image Compression | Zifu Zhang et.al. | 2502.16795 | null |
2025-02-27 | Orchestrating Joint Offloading and Scheduling for Low-Latency Edge SLAM | Yao Zhang et.al. | 2502.16495 | null |
2025-02-22 | Large Language Model for Lossless Image Compression with Visual Prompts | Junhao Du et.al. | 2502.16163 | null |
2025-02-21 | Quantum autoencoders for image classification | Hinako Asaoka et.al. | 2502.15254 | null |
2025-02-21 | Interleaved Block-based Learned Image Compression with Feature Enhancement and Quantization Error Compensation | Shiqi Jiang et.al. | 2502.15188 | null |
2025-02-21 | FD-LSCIC: Frequency Decomposition-based Learned Screen Content Image Compression | Shiqi Jiang et.al. | 2502.15174 | null |
2025-02-20 | Compact Latent Representation for Image Compression (CLRIC) | Ayman A. Ameen et.al. | 2502.14937 | null |
2025-02-20 | Stereo Image Coding for Machines with Joint Visual Feature Compression | Dengchao Jin et.al. | 2502.14190 | null |
2025-02-19 | A General Framework for Augmenting Lossy Compressors with Topological Guarantees | Nathaniel Gorski et.al. | 2502.14022 | null |
2025-02-19 | A Lightweight Model for Perceptual Image Compression via Implicit Priors | Hao Wei et.al. | 2502.13988 | null |
2025-02-19 | Improving the Sparse Structure Learning of Spiking Neural Networks from the View of Compression Efficiency | Jiangrong Shen et.al. | 2502.13572 | null |
2025-02-18 | Guaranteed Conditional Diffusion: 3D Block-based Models for Scientific Data Compression | Jaemoon Lee et.al. | 2502.12951 | null |
2025-02-17 | Fully Dynamic LZ77 in Sublinear Time | Itai Boneh et.al. | 2502.12000 | null |
2025-02-17 | On Quantizing Neural Representation for Variable-Rate Video Coding | Junqi Shi et.al. | 2502.11729 | link |
2025-02-15 | AquaScope: Reliable Underwater Image Transmission on Mobile Devices | Beitong Tian et.al. | 2502.10891 | null |
2025-02-15 | ResiComp: Loss-Resilient Image Compression via Dual-Functional Masked Visual Token Modeling | Sixian Wang et.al. | 2502.10812 | null |
2025-02-15 | A Fast Quantum Image Compression Algorithm based on Taylor Expansion | Vu Tuan Hai et.al. | 2502.10684 | null |
2025-02-15 | Optimizing CNN Architectures for Advanced Thoracic Disease Classification | Tejas Mirthipati et.al. | 2502.10614 | null |
2025-02-14 | Conditional Latent Coding with Learnable Synthesized Reference for Deep Image Compression | Siqi Wu et.al. | 2502.09971 | null |
2025-02-13 | Differentially Private Compression and the Sensitivity of LZ77 | Jeremiah Blocki et.al. | 2502.09584 | null |
2025-02-13 | SQ-GAN: Semantic Image Communications Using Masked Vector Quantization | Francesco Pezone et.al. | 2502.09520 | link |
2025-02-13 | Large Images are Gaussians: High-Quality Large Image Representation with Levels of 2D Gaussian Splatting | Lingting Zhu et.al. | 2502.09039 | link |
2025-02-12 | Compression of Site-Specific Deep Neural Networks for Massive MIMO Precoding | Ghazal Kasalaee et.al. | 2502.08758 | null |
2025-02-11 | To clean or not to clean? Influence of pixel removal on event reconstruction using deep learning in CTAO | Tom François et.al. | 2502.07643 | null |
2025-02-19 | HDCompression: Hybrid-Diffusion Image Compression for Ultra-Low Bitrates | Lei Lu et.al. | 2502.07160 | null |
2025-02-12 | Lumina-Video: Efficient and Flexible Video Generation with Multi-scale Next-DiT | Dongyang Liu et.al. | 2502.06782 | null |
2025-02-10 | Solving Optimal Power Flow on a Data-Budget: Feature Selection on Smart Meter Data | Vassilis Kekatos et.al. | 2502.06683 | null |
2025-02-13 | CANeRV: Content Adaptive Neural Representation for Video Compression | Lv Tang et.al. | 2502.06181 | null |
2025-02-09 | Online Reward-Weighted Fine-Tuning of Flow Matching with Wasserstein Regularization | Jiajun Fan et.al. | 2502.06061 | null |
2025-02-09 | Constant sensitivity on the CDAWGs | Rikuya Hamai et.al. | 2502.05915 | null |
2025-02-09 | Linear Attention Modeling for Learned Image Compression | Donghui Feng et.al. | 2502.05741 | null |
2025-02-08 | Convolutional Deep Colorization for Image Compression: A Color Grid Based Approach | Ian Tassin et.al. | 2502.05402 | null |
2025-02-07 | CMamba: Learned Image Compression with State Space Models | Zhuojie Wu et.al. | 2502.04988 | null |
2025-02-06 | Semantic Feature Division Multiple Access for Digital Semantic Broadcast Channels | Shuai Ma et.al. | 2502.03949 | null |
2025-02-06 | Enhancing Online Learning Efficiency Through Heterogeneous Resource Integration with a Multi-Agent RAG System | Devansh Srivastav et.al. | 2502.03948 | null |
2025-02-05 | All-in-One Image Compression and Restoration | Huimin Zeng et.al. | 2502.03649 | null |
2025-02-05 | Towards characterizing dark matter subhalo perturbations in stellar streams with graph neural networks | Peter Xiangyuan Ma et.al. | 2502.03522 | null |
2025-02-05 | LED there be DoS: Exploiting variable bitrate IP cameras for network DoS | Emmanuel Goldberg et.al. | 2502.03177 | null |
2025-02-04 | On likelihood-based analysis of the gravitationally (de)lensed CMB | Julien Carron et.al. | 2502.02399 | null |
2025-02-04 | PALQA: A Novel Parameterized Position-Aware Lossy Quantum Autoencoder using LSB Control Qubit for Efficient Image Compression | Ershadul Haque et.al. | 2502.02188 | null |
2025-02-01 | Semantic Communication based on Generative AI: A New Approach to Image Compression and Edge Optimization | Francesco Pezone et.al. | 2502.01675 | null |
2025-02-10 | Compressed Image Generation with Denoising Diffusion Codebook Models | Guy Ohayon et.al. | 2502.01189 | null |
2025-02-02 | S2CFormer: Reorienting Learned Image Compression from Spatial Interaction to Channel Aggregation | Yunuo Chen et.al. | 2502.00700 | null |
2025-01-28 | Rate-Distortion under Neural Tracking of Speech: A Directed Redundancy Approach | Jan Østergaard et.al. | 2501.16762 | null |
2025-02-05 | Hybrid Quantum Neural Networks with Amplitude Encoding: Advancing Recovery Rate Predictions | Ying Chen et.al. | 2501.15828 | null |
2025-01-23 | The Redundancy of Non-Singular Channel Simulation | Gergely Flamich et.al. | 2501.14053 | null |
2025-02-01 | On Disentangled Training for Nonlinear Transform in Learned Image Compression | Han Li et.al. | 2501.13751 | link |
2025-01-23 | Diffusion-based Perceptual Neural Video Compression with Temporal Diffusion Information Reuse | Wenzhuo Ma et.al. | 2501.13528 | null |
2025-01-22 | Using simulation based inference on tidally perturbed dwarf galaxies: the dynamics of NGC205 | Axel Widmark et.al. | 2501.13148 | null |
2025-01-22 | Nonlinear reduction strategies for data compression: a comprehensive comparison from diffusion to advection problems | Isabella Carla Gonnella et.al. | 2501.12816 | null |
2025-01-22 | Entropy Polarization-Based Data Compression Without Frozen Set Construction | Zichang Ren et.al. | 2501.12584 | null |
2025-01-21 | The Gap Between Principle and Practice of Lossy Image Coding | Haotian Zhang et.al. | 2501.12330 | null |
2025-01-21 | RL-RC-DoT: A Block-level RL agent for Task-Aware Video Compression | Uri Gadot et.al. | 2501.12216 | null |
2025-01-20 | Efficient Bearing Sensor Data Compression via an Asymmetrical Autoencoder with a Lifting Wavelet Transform Layer | Xin Zhu et.al. | 2501.11737 | null |
2025-01-20 | Towards Loss-Resilient Image Coding for Unstable Satellite Networks | Hongwei Sha et.al. | 2501.11263 | null |
2025-01-18 | Mathematical model of parameters relevance in adaptive level-crossing sampling for electrocardiogram signals | Silvio Zanoli et.al. | 2501.10829 | null |
2025-01-30 | Lossless data compression at pragmatic rates | Andreas Theocharous et.al. | 2501.10103 | null |
2025-01-17 | Multi-Modal Attention Networks for Enhanced Segmentation and Depth Estimation of Subsurface Defects in Pulse Thermography | Mohammed Salah et.al. | 2501.09994 | link |
2025-01-31 | A Simple Aerial Detection Baseline of Multimodal Language Models | Qingyun Li et.al. | 2501.09720 | link |
2025-01-16 | Split Fine-Tuning for Large Language Models in Wireless Networks | Songge Zhang et.al. | 2501.09237 | null |
2025-01-13 | Motion Tracks: A Unified Representation for Human-Robot Transfer in Few-Shot Imitation Learning | Juntao Ren et.al. | 2501.06994 | null |
2025-01-12 | A General Framework for Error-controlled Unstructured Scientific Data Compression | Qian Gong et.al. | 2501.06910 | null |
2025-01-10 | From My View to Yours: Ego-Augmented Learning in Large Vision Language Models for Understanding Exocentric Daily Living Activities | Dominick Reilly et.al. | 2501.05711 | link |
2025-01-09 | Neural Architecture Codesign for Fast Physics Applications | Jason Weitz et.al. | 2501.05515 | link |
2025-01-09 | Principles and Metrics of Extreme Learning Machines Using a Highly Nonlinear Fiber | Mathilde Hary et.al. | 2501.05233 | null |
2025-01-09 | Emergence of Painting Ability via Recognition-Driven Evolution | Yi Lin et.al. | 2501.04966 | null |
2025-01-08 | GaussianVideo: Efficient Video Representation via Hierarchical Gaussian Splatting | Andrew Bond et.al. | 2501.04782 | null |
2025-01-08 | Unified Coding for Both Human Perception and Generalized Machine Analytics with CLIP Supervision | Kangsheng Yin et.al. | 2501.04579 | link |
2025-01-08 | An Efficient Adaptive Compression Method for Human Perception and Machine Vision Tasks | Lei Liu et.al. | 2501.04329 | null |
2025-01-03 | Listening and Seeing Again: Generative Error Correction for Audio-Visual Speech Recognition | Rui Liu et.al. | 2501.04038 | link |
2024-12-24 | MERCURY: A fast and versatile multi-resolution based global emulator of compound climate hazards | Shruti Nath et.al. | 2501.04018 | null |
2025-01-06 | A Novel Structure-Agnostic Multi-Objective Approach for Weight-Sharing Compression in Deep Neural Networks | Rasa Khosrowshahli et.al. | 2501.03095 | null |
2025-01-06 | Region of Interest based Medical Image Compression | Utkarsh Prakash Srivastava et.al. | 2501.02895 | null |
2025-01-06 | Constructing 4D Radio Map in LEO Satellite Networks with Limited Samples | Haoxuan Yuan et.al. | 2501.02775 | null |
2025-01-06 | Artificial Intelligence in Creative Industries: Advances Prior to 2025 | Nantheera Anantrasirichai et.al. | 2501.02725 | null |
2025-01-05 | Remote Inference over Dynamic Links via Adaptive Rate Deep Task-Oriented Vector Quantization | Eyal Fishel et.al. | 2501.02521 | link |
2025-01-17 | MetaNeRV: Meta Neural Representations for Videos with Spatial-Temporal Guidance | Jialong Guo et.al. | 2501.02427 | null |
2025-01-03 | Compressed Domain Prior-Guided Video Super-Resolution for Cloud Gaming Content | Qizhe Wang et.al. | 2501.01773 | null |
2025-01-01 | CoordFlow: Coordinate Flow for Pixel-wise Neural Video Representation | Daniel Silver et.al. | 2501.00975 | null |
2025-01-01 | Gradient Compression and Correlation Driven Federated Learning for Wireless Traffic Prediction | Chuanting Zhang et.al. | 2501.00732 | link |
2025-01-07 | Rapid, High-resolution and Distortion-free | Xiaoqing Wang et.al. | 2501.00256 | null |
2024-12-29 | Distributed Hybrid Sketching for | Neophytos Charalambides et.al. | 2412.20301 | null |
2024-12-19 | Quantum Implicit Neural Compression | Takuya Fujihashi et.al. | 2412.19828 | null |
2024-12-25 | Adaptive Rate Control for Deep Video Compression with Rate-Distortion Prediction | Bowen Gu et.al. | 2412.18834 | null |
2024-12-24 | Ultra-Low Complexity On-Orbit Compression for Remote Sensing Imagery via Block Modulated Imaging | Zhibin Wang et.al. | 2412.18417 | link |
2024-12-24 | Semantics Disentanglement and Composition for Versatile Codec toward both Human-eye Perception and Machine Vision Task | Jinming Liu et.al. | 2412.18158 | null |
2024-12-23 | CALLIC: Content Adaptive Learning for Lossless Image Compression | Daxin Li et.al. | 2412.17464 | null |
2024-12-23 | AsymLLIC: Asymmetric Lightweight Learned Image Compression | Shen Wang et.al. | 2412.17270 | null |
2024-12-22 | Foundation Model for Lossy Compression of Spatiotemporal Scientific Data | Xiao Li et.al. | 2412.17184 | null |
2024-12-24 | L3TC: Leveraging RWKV for Learned Lossless Low-Complexity Text Compression | Junxuan Zhang et.al. | 2412.16642 | link |
2024-12-20 | Schmidt quantum compressor | Israel F. Araujo et.al. | 2412.16337 | null |
2024-12-20 | Sparse Point Clouds Assisted Learned Image Compression | Yiheng Jiang et.al. | 2412.15752 | null |
2024-12-18 | Super-Resolution Generative Adversarial Network for Data Compression of Direct Numerical Simulations | Ludovico Nista et.al. | 2412.14150 | null |
2024-12-18 | Efficient high performance computing with the ALICE Event Processing Nodes GPU-based farm | Federico Ronchetti et.al. | 2412.13755 | null |
2024-12-18 | Robust UAV Jittering and Task Scheduling in Mobile Edge Computing with Data Compression | Bin Li et.al. | 2412.13676 | null |
2024-12-18 | DarkIR: Robust Low-Light Image Restoration | Daniel Feijoo et.al. | 2412.13443 | link |
2024-12-17 | Identifying Bias in Deep Neural Networks Using Image Transforms | Sai Teja Erukude et.al. | 2412.13079 | link |
2024-12-17 | Stable Diffusion is a Natural Cross-Modal Decoder for Layered AI-generated Image Compression | Ruijie Chen et.al. | 2412.12982 | null |
2024-12-17 | Invisible Watermarks: Attacks and Robustness | Dongjun Hwang et.al. | 2412.12511 | link |
2024-12-16 | Representation learning for fast radio burst dynamic spectra | Dirk Kuiper et.al. | 2412.12394 | link |
2024-12-16 | Point Cloud-Assisted Neural Image Compression | Ziqun Li et.al. | 2412.11771 | null |
2024-12-16 | Whisper-GPT: A Hybrid Representation Audio Large Language Model | Prateek Verma et.al. | 2412.11449 | null |
2024-12-16 | Controllable Distortion-Perception Tradeoff Through Latent Diffusion for Neural Image Compression | Chuqin Zhou et.al. | 2412.11379 | null |
2024-12-16 | VRVVC: Variable-Rate NeRF-Based Volumetric Video Compression | Qiang Hu et.al. | 2412.11362 | null |
2024-12-14 | Progressive Compression with Universally Quantized Diffusion Models | Yibo Yang et.al. | 2412.10935 | null |
2024-12-14 | Learned Data Compression: Challenges and Opportunities for the Future | Qiyu Liu et.al. | 2412.10770 | null |
2024-12-11 | Implicit Neural Compression of Point Clouds | Hongning Ruan et.al. | 2412.10433 | null |
2024-12-12 | Video Seal: Open and Efficient Video Watermarking | Pierre Fernandez et.al. | 2412.09492 | link |
2024-12-12 | Learned Compression for Compressed Learning | Dan Jacobellis et.al. | 2412.09405 | link |
2024-12-12 | Versatile Volumetric Medical Image Coding for Human-Machine Vision | Jietao Chen et.al. | 2412.09231 | null |
2024-12-11 | Unicorn: Unified Neural Image Compression with One Number Reconstruction | Qi Zheng et.al. | 2412.08210 | null |
2024-12-09 | Splatter-360: Generalizable 360 | Zheng Chen et.al. | 2412.06250 | link |
2024-12-08 | Vision Transformer-based Semantic Communications With Importance-Aware Quantization | Joohyuk Park et.al. | 2412.06038 | null |
2024-12-08 | Matrix Pre-orthogonal-Matching Pursuit as a Fundamental AI Algorithm | Wei Qu et.al. | 2412.05878 | null |
2024-12-09 | UniMIC: Towards Universal Multi-modality Perceptual Image Compression | Yixin Gao et.al. | 2412.04912 | null |
2024-12-05 | Solving High-dimensional Inverse Problems Using Amortized Likelihood-free Inference with Noisy and Incomplete Data | Jice Zeng et.al. | 2412.04565 | null |
2024-12-05 | Diagnosing Systematic Effects Using the Inferred Initial Power Spectrum | Tristan Hoellinger et.al. | 2412.04443 | null |
2024-12-05 | Multi-Scale Node Embeddings for Graph Modeling and Generation | Riccardo Milocco et.al. | 2412.04354 | null |
2024-12-05 | Feature Coding in the Era of Large Models: Dataset, Test Conditions, and Benchmark | Changsheng Gao et.al. | 2412.04307 | link |
2024-12-05 | LL-ICM: Image Compression for Low-level Machine Vision via Large Vision-Language Model | Yuan Xue et.al. | 2412.03841 | null |
2024-12-04 | Electrocardiogram-based diagnosis of liver diseases: an externally validated and explainable machine learning approach | Juan Miguel Lopez Alcaraz et.al. | 2412.03717 | link |
2024-12-04 | Is JPEG AI going to change image forensics? | Edoardo Daniele Cannas et.al. | 2412.03261 | null |
2024-12-03 | Efficient Algorithms for Low Tubal Rank Tensor Approximation with Applications to Image Compression, Super-Resolution and Deep Learning | Salman Ahmadi-Asl et.al. | 2412.02598 | null |
2024-12-03 | Randomized algorithms for Kroncecker tensor decomposition and applications | Salman Ahmadi-Asl et.al. | 2412.02597 | null |
2024-12-03 | Efficient Model Compression Techniques with FishLeg | Jamie McGowan et.al. | 2412.02328 | null |
2024-12-02 | Efficient Compression of Sparse Accelerator Data Using Implicit Neural Representations and Importance Sampling | Xihaier Luo et.al. | 2412.01754 | link |
2024-12-02 | Robust and Transferable Backdoor Attacks Against Deep Image Compression With Selective Frequency Prior | Yi Yu et.al. | 2412.01646 | null |
2024-12-01 | Construction of generalized samplets in Banach spaces | Peter Balazs et.al. | 2412.00954 | null |
2024-11-30 | Good, Cheap, and Fast: Overfitted Image Compression with Wasserstein Distortion | Jona Ballé et.al. | 2412.00505 | null |
2024-11-30 | Hybrid Local-Global Context Learning for Neural Video Compression | Yongqi Zhai et.al. | 2412.00446 | null |
2024-11-30 | DeepFGS: Fine-Grained Scalable Coding for Learned Image Compression | Yongqi Zhai et.al. | 2412.00437 | null |
2024-11-29 | AIDetx: a compression-based method for identification of machine-learning generated text | Leonardo Almeida et.al. | 2411.19869 | link |
2024-11-29 | Memristive Nanowire Network for Energy Efficient Audio Classification: Pre-Processing-Free Reservoir Computing with Reduced Latency | Akshaya Rajesh et.al. | 2411.19611 | null |
2024-11-29 | MCUCoder: Adaptive Bitrate Learned Video Compression for IoT Devices | Ali Hojjat et.al. | 2411.19442 | link |
2024-11-28 | Generalized Gaussian Model for Learned Image Compression | Haotian Zhang et.al. | 2411.19320 | null |
2024-11-28 | Upsampling Improvement for Overfitted Neural Coding | Pierrick Philippe et.al. | 2411.19249 | null |
2024-11-27 | Learning Optimal Linear Block Transform by Rate Distortion Minimization | Alessandro Gnutti et.al. | 2411.18494 | null |
2024-11-27 | HEMGS: A Hybrid Entropy Model for 3D Gaussian Splatting Data Compression | Lei Liu et.al. | 2411.18473 | null |
2024-11-26 | Evaluating the Overhead of the Performance Profiler Cloudprofiler With MooBench | Shinhyung Yang et.al. | 2411.17413 | null |
2024-11-26 | Motion Free B-frame Coding for Neural Video Compression | Van Thang Nguyen et.al. | 2411.17160 | null |
2024-11-30 | An Information-Theoretic Regularizer for Lossy Neural Image Compression | Yingwen Zhang et.al. | 2411.16727 | null |
2024-11-25 | WTDUN: Wavelet Tree-Structured Sampling and Deep Unfolding Network for Image Compressed Sensing | Kai Han et.al. | 2411.16336 | null |
2024-11-25 | Learning Optimal Lattice Vector Quantizers for End-to-end Neural Image Compression | Xi Zhang et.al. | 2411.16119 | null |
2024-11-25 | TransCompressor: LLM-Powered Multimodal Data Compression for Smart Transportation | Huanqi Yang et.al. | 2411.16020 | null |
2024-11-24 | Variable-size Symmetry-based Graph Fourier Transforms for image compression | Alessandro Gnutti et.al. | 2411.15824 | null |
2024-11-24 | M3-CVC: Controllable Video Compression with Multimodal Generative Models | Rui Wan et.al. | 2411.15798 | null |
2024-11-24 | Advanced Learning-Based Inter Prediction for Future Video Coding | Yanchen Zhao et.al. | 2411.15759 | null |
2024-11-24 | PEnG: Pose-Enhanced Geo-Localisation | Tavis Shore et.al. | 2411.15742 | null |
2024-11-21 | U-Motion: Learned Point Cloud Video Compression with U-Structured Motion Estimation | Tingyu Fan et.al. | 2411.14501 | null |
2024-11-21 | Differentiable SVD based on Moore-Penrose Pseudoinverse for Inverse Imaging Problems | Yinghao Zhang et.al. | 2411.14141 | link |
2024-11-21 | Compact Visual Data Representation for Green Multimedia -- A Human Visual System Perspective | Peilin Chen et.al. | 2411.14135 | null |
2024-11-27 | Image Compression Using Novel View Synthesis Priors | Luyuan Peng et.al. | 2411.13862 | null |
2024-11-20 | Sparse Input View Synthesis: 3D Representations and Reliable Priors | Nagabhushan Somraj et.al. | 2411.13631 | null |
2024-11-20 | Benchmarking Quantum Convolutional Neural Networks for Classification and Data Compression Tasks | Jun Yong Khoo et.al. | 2411.13468 | null |
2024-11-20 | Practical Compact Deep Compressed Sensing | Bin Chen et.al. | 2411.13081 | link |
2024-11-20 | LMM-driven Semantic Image-Text Coding for Ultra Low-bitrate Learned Image Compression | Shimon Murai et.al. | 2411.13033 | link |
2024-11-22 | Large Language Models for Lossless Image Compression: Next-Pixel Prediction in Language Space is All You Need | Kecheng Chen et.al. | 2411.12448 | null |
2024-11-19 | Breathless: An 8-hour Performance Contrasting Human and Robot Expressiveness | Catie Cuan et.al. | 2411.12361 | null |
2024-11-18 | Variable Rate Neural Compression for Sparse Detector Data | Yi Huang et.al. | 2411.11942 | link |
2024-11-18 | Exploring adversarial robustness of JPEG AI: methodology, comparison and new methods | Egor Kovalev et.al. | 2411.11795 | null |
2024-11-18 | Additional Tests for TV 3.0 | Eduardo Peixoto et.al. | 2411.11755 | null |
2024-11-18 | Towards fast DBSCAN via Spectrum-Preserving Data Compression | Yongyu Wang et.al. | 2411.11421 | null |
2024-11-17 | BVI-CR: A Multi-View Human Dataset for Volumetric Video Compression | Ge Gao et.al. | 2411.11199 | link |
2024-11-16 | An End-to-End Real-World Camera Imaging Pipeline | Kepeng Xu et.al. | 2411.10773 | null |
2024-11-16 | Deep Learning-Based Image Compression for Wireless Communications: Impacts on Reliability,Throughput, and Latency | Mostafa Naseri et.al. | 2411.10650 | link |
2024-11-15 | Efficient Progressive Image Compression with Variance-aware Masking | Alberto Presta et.al. | 2411.10185 | link |
2024-11-15 | A Multi-Scale Spatial-Temporal Network for Wireless Video Transmission | Xinyi Zhou et.al. | 2411.09936 | null |
2024-11-14 | Application of signal separation to diffraction image compression and serial crystallography | Jérôme Kieffer et.al. | 2411.09515 | link |
2024-11-14 | DT-JRD: Deep Transformer based Just Recognizable Difference Prediction Model for Video Coding for Machines | Junqi Liu et.al. | 2411.09308 | null |
2024-11-14 | Towards efficient compression and communication for prototype-based decentralized learning | Pablo Fernández-Piñeiro et.al. | 2411.09267 | null |
2024-11-13 | Learning Optimal and Interpretable Summary Statistics of Galaxy Catalogs with SBI | Kai Lehman et.al. | 2411.08957 | null |
2024-11-13 | LSH-MoE: Communication-efficient MoE Training via Locality-Sensitive Hashing | Xiaonan Nie et.al. | 2411.08446 | null |
2024-11-18 | Rendering-Oriented 3D Point Cloud Attribute Compression using Sparse Tensor-based Transformer | Xiao Huo et.al. | 2411.07899 | null |
2024-11-11 | Accelerating radio astronomy imaging with RICK | Emanuele De Rubeis et.al. | 2411.07321 | link |
2024-11-11 | Low Complexity Learning-based Lossless Event-based Compression | Ahmadreza Sezavar et.al. | 2411.07155 | null |
2024-11-11 | JPEG AI Image Compression Visual Artifacts: Detection Methods and Dataset | Daria Tsereh et.al. | 2411.06810 | null |
2024-11-11 | Machine vision-aware quality metrics for compressed image and video assessment | Mikhail Dremin et.al. | 2411.06776 | null |
2024-11-11 | High-Frequency Enhanced Hybrid Neural Representation for Video Compression | Li Yu et.al. | 2411.06685 | null |
2024-11-09 | HiHa: Introducing Hierarchical Harmonic Decomposition to Implicit Neural Compression for Atmospheric Data | Zhewen Xu et.al. | 2411.06155 | null |
2024-11-08 | A method based on Generative Adversarial Networks for disentangling physical and chemical properties of stars in astronomical spectra | Raúl Santoveña et.al. | 2411.05960 | null |
2024-11-07 | Don't Look Twice: Faster Video Transformers with Run-Length Tokenization | Rohan Choudhury et.al. | 2411.05222 | null |
2024-11-05 | Tuning into spatial frequency space: Satellite and space debris detection in the ZTF alert stream | J. P. Carvajal et.al. | 2411.03258 | null |
2024-11-15 | ZipCache: A DRAM/SSD Cache with Built-in Transparent Compression | Rui Xie et.al. | 2411.03174 | null |
2024-11-05 | Learning-based Lossless Event Data Compression | Ahmadreza Sezavar et.al. | 2411.03010 | null |
2024-11-04 | Neural optical flow for planar and stereo PIV | Andrew I. Masker et.al. | 2411.02373 | null |
2024-11-04 | The evolution of volumetric video: A survey of smart transcoding and compression approaches | Preetish Kakkar et.al. | 2411.02095 | null |
2024-11-03 | Efficient Deep Learning Infrastructures for Embedded Computing Systems: A Comprehensive Survey and Future Envision | Xiangzhong Luo et.al. | 2411.01431 | null |
2024-11-02 | Autoencoders for At-Source Data Reduction and Anomaly Detection in High Energy Particle Detectors | Alexander Yue et.al. | 2411.01118 | null |
2024-11-01 | SANN-PSZ: Spatially Adaptive Neural Network for Head-Tracked Personal Sound Zones | Yue Qiao et.al. | 2411.00772 | null |
2024-10-28 | MultiTok: Variable-Length Tokenization for Efficient LLMs Adapted from LZW Compression | Noel Elias et.al. | 2410.21548 | link |
2024-10-29 | Enhancing Learned Image Compression via Cross Window-based Attention | Priyanka Mudgal et.al. | 2410.21144 | link |
2024-10-26 | Cross-Platform Neural Video Coding: A Case Study | Ruhan Conceição et.al. | 2410.20145 | null |
2024-10-25 | Conditional Hallucinations for Image Compression | Till Aczel et.al. | 2410.19493 | null |
2024-10-29 | Integration of Communication and Computational Imaging | Zhenming Yu et.al. | 2410.19415 | null |
2024-10-24 | DMVC: Multi-Camera Video Compression Network aimed at Improving Deep Learning Accuracy | Huan Cui et.al. | 2410.18400 | null |
2024-10-23 | Predicting total time to compress a video corpus using online inference systems | Xin Shu et.al. | 2410.18260 | null |
2024-10-23 | FIPER: Generalizable Factorized Fields for Joint Image Compression and Super-Resolution | Yang-Che Sun et.al. | 2410.18083 | null |
2024-10-23 | Learning Lossless Compression for High Bit-Depth Volumetric Medical Image | Kai Wang et.al. | 2410.17814 | null |
2024-10-21 | Variable Rate Learned Wavelet Video Coding with Temporal Layer Adaptivity | Anna Meyer et.al. | 2410.15873 | link |
2024-10-20 | Extensions on low-complexity DCT approximations for larger blocklengths based on minimal angle similarity | A. P. Radünz et.al. | 2410.15244 | null |
2024-10-19 | Standardizing Generative Face Video Compression using Supplemental Enhancement Information | Bolin Chen et.al. | 2410.15105 | null |
2024-10-16 | MatryoshkaKV: Adaptive KV Compression via Trainable Orthogonal Projection | Bokai Lin et.al. | 2410.14731 | null |
2024-10-18 | Design and Prototype of a Unified Framework for Error-robust Compression and Encryption in IoT | Gajraj Kuldeep et.al. | 2410.14396 | null |
2024-10-18 | Compression using Discrete Multi-Level Divisor Transform for Heterogeneous Sensor Data | Gajraj Kuldeep et.al. | 2410.14287 | null |
2024-10-17 | In-context learning and Occam's razor | Eric Elmoznino et.al. | 2410.14086 | link |
2024-10-17 | Co-Segmentation without any Pixel-level Supervision with Application to Large-Scale Sketch Classification | Nikolaos-Antonios Ypsilantis et.al. | 2410.13582 | null |
2024-10-16 | Test-time adaptation for image compression with distribution regularization | Kecheng Chen et.al. | 2410.12191 | null |
2024-10-16 | Joint Data Compression, Secure Multi-Part Collaborative Task Offloading and Resource Assignment in Ultra-Dense Networks | Tianqing Zhou et.al. | 2410.12186 | null |
2024-10-14 | Large Language Model Evaluation via Matrix Nuclear-Norm | Yahan Li et.al. | 2410.10672 | link |
2024-10-14 | QIANets: Quantum-Integrated Adaptive Networks for Reduced Latency and Improved Inference Times in CNN Models | Zhumazhan Balapanov et.al. | 2410.10318 | link |
2024-10-14 | Generative Human Video Compression with Multi-granularity Temporal Trajectory Factorization | Shanzhi Yin et.al. | 2410.10171 | null |
2024-10-13 | Towards Reproducible Learning-based Compression | Jiahao Pang et.al. | 2410.09872 | null |
2024-10-13 | Compressing Scene Dynamics: A Generative Approach | Shanzhi Yin et.al. | 2410.09768 | link |
2024-10-13 | ECVC: Exploiting Non-Local Correlations in Multiple Frames for Contextual Video Compression | Wei Jiang et.al. | 2410.09706 | link |
2024-10-12 | Fine-grained subjective visual quality assessment for high-fidelity compressed images | Michela Testolina et.al. | 2410.09501 | link |
2024-10-11 | Fast Data-independent KLT Approximations Based on Integer Functions | A. P. Radünz et.al. | 2410.09227 | null |
2024-10-10 | Compressing high-resolution data through latent representation encoding for downscaling large-scale AI weather forecast model | Qian Liu et.al. | 2410.09109 | null |
2024-10-11 | Data-Driven Neural Estimation of Indirect Rate-Distortion Function | Zichao Yu et.al. | 2410.09018 | null |
2024-10-11 | Compressing regularised dynamics improves link prediction in sparse networks | Maja Lindström et.al. | 2410.08777 | link |
2024-10-11 | Beyond GFVC: A Progressive Face Video Compression Framework with Adaptive Visual Tokens | Bolin Chen et.al. | 2410.08485 | link |
2024-10-10 | What is Left After Distillation? How Knowledge Transfer Impacts Fairness and Bias | Aida Mohammadshahi et.al. | 2410.08407 | null |
2024-10-16 | Delta-ICM: Entropy Modeling with Delta Function for Learned Image Compression | Takahiro Shindo et.al. | 2410.07669 | null |
2024-10-10 | MotionAura: Generating High-Quality and Motion Consistent Videos using Discrete Diffusion | Onkar Susladkar et.al. | 2410.07659 | null |
2024-10-10 | R-Adaptive Mesh Optimization to Enhance Finite Element Basis Compression | Graham Harper et.al. | 2410.07646 | null |
2024-10-09 | JPEG Inspired Deep Learning | Ahmed H. Salamah et.al. | 2410.07081 | link |
2024-10-09 | SHRINK: Data Compression by Semantic Extraction and Residuals Encoding | Guoyou Sun et.al. | 2410.06713 | null |
2024-10-09 | Convex Distillation: Efficient Compression of Deep Networks via Convex Optimization | Prateek Varshney et.al. | 2410.06567 | null |
2024-10-09 | Efficient and Robust Knowledge Distillation from A Stronger Teacher Based on Correlation Matching | Wenqi Niu et.al. | 2410.06561 | null |
2024-10-08 | Covering Numbers for Deep ReLU Networks with Applications to Function Approximation and Nonparametric Regression | Weigutian Ou et.al. | 2410.06378 | null |
2024-10-08 | Toward Scalable Image Feature Compression: A Content-Adaptive and Diffusion-Based Approach | Sha Guo et.al. | 2410.06149 | null |
2024-10-08 | Resolution limit of the eye: how many pixels can we see? | Maliha Ashraf et.al. | 2410.06068 | null |
2024-10-07 | Transformers learn variable-order Markov chains in-context | Ruida Zhou et.al. | 2410.05493 | null |
2024-10-07 | Salient Store: Enabling Smart Storage for Continuous Learning Edge Servers | Cyan Subhra Mishra et.al. | 2410.05435 | null |
2024-10-07 | Causal Context Adjustment Loss for Learned Image Compression | Minghao Han et.al. | 2410.04847 | link |
2024-10-06 | Channel-Aware Throughput Maximization for Cooperative Data Fusion in CAV | Haonan An et.al. | 2410.04320 | null |
2024-10-05 | Robust Task-Oriented Communication Framework for Real-Time Collaborative Vision Perception | Zhengru Fang et.al. | 2410.04168 | null |
2024-10-04 | On the Rate-Distortion-Complexity Trade-offs of Neural Video Coding | Yi-Hsin Chen et.al. | 2410.03898 | null |
2024-10-04 | A Framework for Automatic Validation and Application of Lossy Data Compression in Ensemble Data Assimilation | Kai Keller et.al. | 2410.03184 | null |
2024-10-03 | GABIC: Graph-based Attention Block for Image Compression | Gabriele Spadaro et.al. | 2410.02981 | link |
2024-10-03 | Diffusion-based Extreme Image Compression with Compressed Feature Initialization | Zhiyuan Li et.al. | 2410.02640 | link |
2024-10-03 | High-Efficiency Neural Video Compression via Hierarchical Predictive Learning | Ming Lu et.al. | 2410.02598 | link |
2024-10-02 | A Spark of Vision-Language Intelligence: 2-Dimensional Autoregressive Transformer for Efficient Finegrained Image Generation | Liang Chen et.al. | 2410.01912 | link |
2024-10-02 | COSMIC: Compress Satellite Images Efficiently via Diffusion Compensation | Ziyuan Zhang et.al. | 2410.01698 | link |
2024-10-03 | Releasing the Parameter Latency of Neural Representation for High-Efficiency Video Compression | Gai Zhang et.al. | 2410.01654 | null |
2024-10-02 | Task-Oriented Edge-Assisted Cooperative Data Compression, Communications and Computing for UGV-Enhanced Warehouse Logistics | Jiaming Yang et.al. | 2410.01515 | null |
2024-10-01 | STanH : Parametric Quantization for Variable Rate Learned Image Compression | Alberto Presta et.al. | 2410.00557 | null |
2024-09-30 | LaMMA-P: Generalizable Multi-Agent Long-Horizon Task Allocation and Planning with LM-Driven PDDL Planner | Xiaopan Zhang et.al. | 2409.20560 | null |
2024-09-30 | PerCo (SD): Open Perceptual Compression | Nikolai Körber et.al. | 2409.20255 | link |
2024-09-29 | All-in-One Image Coding for Joint Human-Machine Vision with Multi-Path Aggregation | Xu Zhang et.al. | 2409.19660 | link |
2024-09-28 | Fast Encoding and Decoding for Implicit Video Representation | Hao Chen et.al. | 2409.19429 | null |
2024-09-27 | Learning-Based Image Compression for Machines | Kartik Gupta et.al. | 2409.19184 | link |
2024-09-27 | Effectiveness of learning-based image codecs on fingerprint storage | Daniele Mari et.al. | 2409.18730 | link |
2024-09-27 | Decoding Complexity-Rate-Quality Pareto-Front for Adaptive VVC Streaming | Angeliki Katsenou et.al. | 2409.18713 | null |
2024-09-27 | Neural Video Representation for Redundancy Reduction and Consistency Preservation | Taiga Hayami et.al. | 2409.18497 | null |
2024-09-20 | Blockchain-Enabled Variational Information Bottleneck for Data Extraction Based on Mutual Information in Internet of Vehicles | Cui Zhang et.al. | 2409.17287 | null |
2024-09-25 | Streaming Neural Images | Marcos V. Conde et.al. | 2409.17134 | null |
2024-09-25 | PhD Forum: Efficient Privacy-Preserving Processing via Memory-Centric Computing | Mpoki Mwaisela et.al. | 2409.16777 | null |
2024-09-25 | The Effect of Lossy Compression on 3D Medical Images Segmentation with Deep Learning | Anvar Kurmukov et.al. | 2409.16733 | null |
2024-09-24 | AIM 2024 Challenge on UHD Blind Photo Quality Assessment | Vlad Hosu et.al. | 2409.16271 | null |
2024-09-25 | COHERENT: Collaboration of Heterogeneous Multi-Robot System with Large Language Models | Kehui Liu et.al. | 2409.15146 | link |
2024-09-23 | AlphaZip: Neural Network-Enhanced Lossless Text Compression | Swathi Shree Narashiman et.al. | 2409.15046 | link |
2024-09-23 | Anomaly Detection from a Tensor Train Perspective | Alejandro Mata Ali et.al. | 2409.15030 | null |
2024-09-23 | AIM 2024 Challenge on Video Saliency Prediction: Methods and Results | Andrey Moskalenko et.al. | 2409.14827 | link |
2024-09-21 | Window-based Channel Attention for Wavelet-enhanced Learned Image Compression | Heng Xu et.al. | 2409.14090 | null |
2024-09-20 | Reduced bit median quantization: A middle process for Efficient Image Compression | Fikresilase Wondmeneh Abebayew et.al. | 2409.13789 | null |
2024-09-20 | Data Compression using Rank-1 Lattices for Parameter Estimation in Machine Learning | Michael Gnewuch et.al. | 2409.13453 | null |
2024-09-19 | Breaking the Barriers of One-to-One Usage of Implicit Neural Representation in Image Compression: A Linear Combination Approach with Performance Guarantees | Sai Sanjeet et.al. | 2409.13117 | link |
2024-09-19 | Optimal Coding for Randomized Kolmogorov Complexity and Its Applications | Shuichi Hirahara et.al. | 2409.12744 | null |
2024-09-19 | Multi-Scale Feature Prediction with Auxiliary-Info for Neural Image Compression | Chajin Shin et.al. | 2409.12719 | null |
2024-09-18 | One Map to Find Them All: Real-time Open-Vocabulary Mapping for Zero-shot Multi-Object Navigation | Finn Lukas Busch et.al. | 2409.11764 | null |
2024-09-18 | LFIC-DRASC: Deep Light Field Image Compression Using Disentangled Representation and Asymmetrical Strip Convolution | Shiyu Feng et.al. | 2409.11711 | null |
2024-09-18 | k-mer-based approaches to bridging pangenomics and population genetics | Miles D. Roberts et.al. | 2409.11683 | null |
2024-09-17 | Few-Shot Domain Adaptation for Learned Image Compression | Tianyu Zhang et.al. | 2409.11111 | null |
2024-09-17 | Edge-based Denoising Image Compression | Ryugo Morita et.al. | 2409.10978 | null |
2024-09-16 | Frequency-Guided Masking for Enhanced Vision Self-Supervised Learning | Amin Karimi Monsefi et.al. | 2409.10362 | null |
2024-09-14 | Lossy Image Compression with Stochastic Quantization | Anton Kozyriev et.al. | 2409.09488 | null |
2024-09-13 | Fast DCT+: A Family of Fast Transforms Based on Rank-One Updates of the Path Graph | Samuel Fernández-Menduiña et.al. | 2409.08970 | null |
2024-09-13 | On the Computation of BD-Rate over a Set of Videos for Fair Assessment of Performance of Learned Video Codecs | M. Akin Yilmaz et.al. | 2409.08772 | null |
2024-09-13 | USTC-TD: A Test Dataset and Benchmark for Image and Video Coding in 2020s | Zhuoyuan Li et.al. | 2409.08481 | null |
2024-09-12 | Learned Compression for Images and Point Clouds | Mateen Ulhaq et.al. | 2409.08376 | link |
2024-09-11 | NVRC: Neural Video Representation Compression | Ho Man Kwan et.al. | 2409.07414 | null |
2024-09-11 | Dynamic Error-Bounded Hierarchical Matrices in Neural Network Compression | John Mango et.al. | 2409.07028 | null |
2024-09-10 | Universal End-to-End Neural Network for Lossy Image Compression | Bouzid Arezki et.al. | 2409.06586 | null |
2024-09-10 | Rate-Constrained Quantization for Communication-Efficient Federated Learning | Shayan Mohajer Hamidi et.al. | 2409.06319 | null |
2024-09-09 | Design and Implementation of TAO DAQ System | Shuihan Zhang et.al. | 2409.05522 | null |
2024-09-09 | A Taxonomy of Miscompressions: Preparing Image Forensics for Neural Compression | Nora Hofer et.al. | 2409.05490 | null |
2024-09-09 | Attention Based Machine Learning Methods for Data Reduction with Guaranteed Error Bounds | Xiao Li et.al. | 2409.05357 | null |
2024-09-06 | Convolutional Transformer-Based Image Compression | Bouzid Arezki et.al. | 2409.04118 | null |
2024-09-06 | 3D-GP-LMVIC: Learning-based Multi-View Image Coding with 3D Gaussian Geometric Priors | Yujun Huang et.al. | 2409.04013 | link |
2024-09-05 | TropNNC: Structured Neural Network Compression Using Tropical Geometry | Konstantinos Fotopoulos et.al. | 2409.03945 | null |
2024-09-05 | Unified Framework for Neural Network Compression via Decomposition and Optimal Rank Selection | Ali Aghababaei-Harandi et.al. | 2409.03555 | null |
2024-09-05 | Efficient Image Compression Using Advanced State Space Models | Bouzid Arezki et.al. | 2409.02743 | null |
2024-09-10 | FrameCorr: Adaptive, Autoencoder-based Neural Compression for Video Reconstruction in Resource and Timing Constrained Network Settings | John Li et.al. | 2409.02453 | null |
2024-09-03 | Compressed learning based onboard semantic compression for remote sensing platforms | Protim Bhattacharjee et.al. | 2409.01988 | link |
2024-09-03 | Map-Assisted Remote-Sensing Image Compression at Extremely Low Bitrates | Yixuan Ye et.al. | 2409.01935 | link |
2024-09-03 | Privacy-Preserving Multimedia Mobile Cloud Computing Using Protective Perturbation | Zhongze Tang et.al. | 2409.01710 | null |
2024-09-02 | Multi-Reference Generative Face Video Compression with Contrastive Learning | Goluck Konuko et.al. | 2409.01029 | link |
2024-09-02 | Accelerating block-level rate control for learned image compression | Muchen Dong et.al. | 2409.01009 | null |
2024-09-02 | PNVC: Towards Practical INR-based Video Compression | Ge Gao et.al. | 2409.00953 | null |
2024-09-01 | BWT construction and search at the terabase scale | Heng Li et.al. | 2409.00613 | link |
2024-08-30 | Prioritized Information Bottleneck Theoretic Framework with Distributed Online Learning for Edge Video Analytics | Zhengru Fang et.al. | 2409.00146 | link |
2024-08-28 | Quantum Kernel Principal Components Analysis for Compact Readout of Chemiresistive Sensor Arrays | Zeheng Wang et.al. | 2409.00115 | null |
2024-08-30 | NDP: Next Distribution Prediction as a More Broad Target | Junhao Ruan et.al. | 2408.17377 | null |
2024-08-30 | Approximately Invertible Neural Network for Learned Image Compression | Yanbo Gao et.al. | 2408.17073 | null |
2024-08-29 | UAV-Based Human Body Detector Selection and Fusion for Geolocated Saliency Map Generation | Piotr Rudol et.al. | 2408.16501 | null |
2024-08-29 | Convolutional Neural Network Compression Based on Low-Rank Decomposition | Yaping He et.al. | 2408.16289 | null |
2024-08-27 | Bandwidth-Aware and Overlap-Weighted Compression for Communication-Efficient Federated Learning | Zichen Tang et.al. | 2408.14736 | null |
2024-08-25 | Condensed Sample-Guided Model Inversion for Knowledge Distillation | Kuluhan Binici et.al. | 2408.13850 | null |
2024-08-12 | Semantic Variational Bayes Based on a Semantic Information Theory for Solving Latent Variables | Chenguang Lu et.al. | 2408.13122 | null |
2024-08-22 | Quantization-free Lossy Image Compression Using Integer Matrix Factorization | Pooya Ashtari et.al. | 2408.12691 | link |
2024-08-22 | DeepHQ: Learned Hierarchical Quantizer for Progressive Deep Image Coding | Jooyoung Lee et.al. | 2408.12150 | null |
2024-08-28 | AIM 2024 Challenge on Compressed Video Quality Assessment: Methods and Results | Maksim Smirnov et.al. | 2408.11982 | link |
2024-08-20 | Trustworthy Compression? Impact of AI-based Codecs on Biometrics for Law Enforcement | Sandra Bergmann et.al. | 2408.10823 | null |
2024-08-20 | Diff-PCC: Diffusion-based Neural Compression for 3D Point Clouds | Kai Liu et.al. | 2408.10543 | null |
2024-08-16 | LLM-PCGC: Large Language Model-based Point Cloud Geometry Compression | Yuqi Ye et.al. | 2408.08682 | null |
2024-08-16 | Bi-Directional Deep Contextual Video Compression | Xihua Sheng et.al. | 2408.08604 | null |
2024-08-16 | Tell Codec What Worth Compressing: Semantically Disentangled Image Coding for Machine with LMMs | Jinming Liu et.al. | 2408.08575 | null |
2024-08-15 | Algebraic Vertex Ordering of a Sparse Graph for Adjacency Access Locality and Graph Compression | Dimitris Floros et.al. | 2408.08439 | null |
2024-08-15 | When Video Coding Meets Multimodal Large Language Models: A Unified Paradigm for Video Coding | Pingping Zhang et.al. | 2408.08093 | null |
2024-08-15 | DM2RM: Dual-Mode Multimodal Ranking for Target Objects and Receptacles Based on Open-Vocabulary Instructions | Ryosuke Korekata et.al. | 2408.07910 | null |
2024-08-14 | Towards Real-time Video Compressive Sensing on Mobile Devices | Miao Cao et.al. | 2408.07530 | link |
2024-08-14 | Encoding and Decoding Algorithms of ANS Variants and Evaluation of Their Average Code Lengths | Hirosuke Yamamoto et.al. | 2408.07322 | null |
2024-08-13 | Subjective and Objective Quality Assessment of Rendered Human Avatar Videos in Virtual Reality | Yu-Chih Chen et.al. | 2408.07041 | null |
2024-08-13 | Feature-Preserving Rate-Distortion Optimization in Image Coding for Machines | Samuel Fernández Menduiña et.al. | 2408.07028 | null |
2024-08-19 | Joint Source-Channel Optimization for UAV Video Coding and Transmission | Kesong Wu et.al. | 2408.06667 | null |
2024-08-08 | Flow-Lenia.png: Evolving Multi-Scale Complexity by Means of Compression | Tadashi Adachi et.al. | 2408.06374 | null |
2024-08-09 | Benchmarking Conventional and Learned Video Codecs with a Low-Delay Configuration | Siyue Teng et.al. | 2408.05042 | null |
2024-08-08 | SG-JND: Semantic-Guided Just Noticeable Distortion Predictor For Image Compression | Linhan Cao et.al. | 2408.04273 | null |
2024-08-07 | Bi-Level Spatial and Channel-aware Transformer for Learned Image Compression | Hamidreza Soltani et.al. | 2408.03842 | null |
2024-08-07 | BVI-AOM: A New Training Dataset for Deep Video Compression Optimization | Jakub Nawała et.al. | 2408.03265 | link |
2024-08-06 | Enabling High-Throughput Parallel I/O in Particle-in-Cell Monte Carlo Simulations with openPMD and Darshan I/O Monitoring | Jeremy J. Williams et.al. | 2408.02869 | null |
2024-08-05 | Dimensionality Reduction and Nearest Neighbors for Improving Out-of-Distribution Detection in Medical Image Segmentation | McKell Woodland et.al. | 2408.02761 | link |
2024-08-04 | CACE-Net: Co-guidance Attention and Contrastive Enhancement for Effective Audio-Visual Event Localization | Xiang He et.al. | 2408.01952 | link |
2024-08-03 | Channel-Aware Distributed Transmission Control and Video Streaming in UAV Networks | Masoud Ghazikor et.al. | 2408.01885 | null |
2024-08-02 | An Adaptive Tensor-Train Decomposition Approach for Efficient Deep Neural Network Compression | Shiyi Luo et.al. | 2408.01534 | null |
2024-07-31 | Exploiting Change Blindness for Video Coding: Perspectives from a Less Promising User Study | Mitra Amiri et.al. | 2408.00052 | null |
2024-07-31 | Tora: Trajectory-oriented Diffusion Transformer for Video Generation | Zhenghao Zhang et.al. | 2407.21705 | link |
2024-07-30 | Edge Learning Based Collaborative Automatic Modulation Classification for Hierarchical Cognitive Radio Networks | Peihao Dong et.al. | 2407.20772 | link |
2024-07-30 | Understanding the Impact of Synchronous, Asynchronous, and Hybrid In-Situ Techniques in Computational Fluid Dynamics Applications | Yi Ju et.al. | 2407.20717 | null |
2024-07-29 | Homomorphic data compression for real time photon correlation analysis | Sebastian Strempfer et.al. | 2407.20356 | null |
2024-07-24 | Accelerating the Low-Rank Decomposed Models | Habib Hajimolahoseini et.al. | 2407.20266 | null |
2024-07-29 | ComNeck: Bridging Compressed Image Latents and Multimodal LLMs via Universal Transform-Neck | Chia-Hao Kao et.al. | 2407.19651 | null |
2024-07-28 | NVC-1B: A Large Neural Video Coding Model | Xihua Sheng et.al. | 2407.19402 | null |
2024-07-18 | Generative AI Augmented Induction-based Formal Verification | Aman Kumar et.al. | 2407.18965 | null |
2024-07-25 | The seismic purifier: An unsupervised approach to seismic signal detection via representation learning | Onur Efe et.al. | 2407.18402 | link |
2024-07-25 | Adaptable Deep Joint Source-and-Channel Coding for Small Satellite Applications | Olga Kondrateva et.al. | 2407.18146 | null |
2024-07-25 | Scaling Training Data with Lossy Image Compression | Katherine L. Mentzer et.al. | 2407.17954 | link |
2024-07-25 | Towards the Spectral bias Alleviation by Normalizations in Coordinate Networks | Zhicheng Cai et.al. | 2407.17834 | link |
2024-07-24 | Lossy Data Compression By Adaptive Mesh Coarsening | N. Böing et.al. | 2407.17316 | null |
2024-07-24 | High Efficiency Image Compression for Large Visual-Language Models | Binzhe Li et.al. | 2407.17060 | null |
2024-07-23 | Accelerating Learned Video Compression via Low-Resolution Representation Learning | Zidian Qiu et.al. | 2407.16418 | null |
2024-07-24 | FCNR: Fast Compressive Neural Representation of Visualization Images | Yunfei Lu et.al. | 2407.16369 | link |
2024-07-19 | Shapley Pruning for Neural Network Compression | Kamil Adamczewski et.al. | 2407.15875 | null |
2024-07-18 | CIC: Circular Image Compression | Honggui Li et.al. | 2407.15870 | null |
2024-07-22 | Online String Attractors | Philip Whittington et.al. | 2407.15599 | null |
2024-07-22 | Spectral properties of bright deposits in permanently shadowed craters on Ceres | Stefan Schröder et.al. | 2407.15327 | null |
2024-07-21 | Lessons Learned on the Path to Guaranteeing the Error Bound in Lossy Quantizers | Alex Fallin et.al. | 2407.15037 | null |
2024-07-19 | A Benchmark for Gaussian Splatting Compression and Quality Assessment Study | Qi Yang et.al. | 2407.14197 | link |
2024-07-18 | Training Foundation Models as Data Compression: On Information, Model Weights and Copyright Law | Giorgio Franceschelli et.al. | 2407.13493 | null |
2024-07-18 | Learned HDR Image Compression for Perceptually Optimal Storage and Display | Peibei Cao et.al. | 2407.13179 | null |
2024-07-17 | High Frequency Matters: Uncertainty Guided Image Compression with Wavelet Diffusion | Juan Song et.al. | 2407.12538 | link |
2024-07-17 | Enhancing Film Grain Coding in VVC: Improving Encoding Quality and Efficiency | Vignesh V Menon et.al. | 2407.12465 | null |
2024-07-17 | Reliability Function of Classical-Quantum Channels | Ke Li et.al. | 2407.12403 | null |
2024-07-17 | Exploiting Inter-Image Similarity Prior for Low-Bitrate Remote Sensing Image Compression | Junhui Li et.al. | 2407.12295 | null |
2024-07-16 | Tiled Bit Networks: Sub-Bit Neural Network Compression Through Reuse of Learnable Binary Vectors | Matt Gorbett et.al. | 2407.12075 | null |
2024-07-17 | Rate-Distortion-Cognition Controllable Versatile Neural Image Compression | Jinming Liu et.al. | 2407.11700 | null |
2024-07-16 | MINI-LLM: Memory-Efficient Structured Pruning for Large Language Models | Hongrong Cheng et.al. | 2407.11681 | null |
2024-07-17 | Neural Compression of Atmospheric States | Piotr Mirowski et.al. | 2407.11666 | null |
2024-07-16 | Rethinking Learned Image Compression: Context is All You Need | Jixiang Luo et.al. | 2407.11590 | null |
2024-07-16 | The impact of lossy data compression on the power spectrum of the high redshift 21-cm signal with LOFAR | J. K. Chege et.al. | 2407.11557 | null |
2024-07-21 | Uniformly Accelerated Motion Model for Inter Prediction | Zhuoyuan Li et.al. | 2407.11541 | null |
2024-07-15 | M18K: A Comprehensive RGB-D Dataset and Benchmark for Mushroom Detection and Instance Segmentation | Abdollah Zakeri et.al. | 2407.11275 | link |
2024-07-15 | Enhancing Electrocardiogram Signal Analysis Using NLP-Inspired Techniques: A Novel Approach with Embedding and Self-Attention | Prapti Ganguly et.al. | 2407.11102 | null |
2024-07-15 | In-Loop Filtering via Trained Look-Up Tables | Zhuoyuan Li et.al. | 2407.10926 | null |
2024-07-15 | Bidirectional Stereo Image Compression with Cross-Dimensional Entropy Model | Zhening Liu et.al. | 2407.10632 | link |
2024-07-14 | UMI on Legs: Making Manipulation Policies Mobile with Manipulation-Centric Whole-body Controllers | Huy Ha et.al. | 2407.10353 | null |
2024-07-13 | WeConvene: Learned Image Compression with Wavelet-Domain Convolution and Entropy Model | Haisheng Fu et.al. | 2407.09983 | null |
2024-07-13 | Zero-Shot Image Compression with Diffusion-Based Posterior Sampling | Noam Elata et.al. | 2407.09896 | link |
2024-07-13 | Image Compression for Machine and Human Vision with Spatial-Frequency Adaptation | Han Li et.al. | 2407.09853 | link |
2024-07-13 | Infinite families of optimal and minimal codes over rings using simplicial complexes | Yanan Wu et.al. | 2407.09783 | null |
2024-07-12 | HPC: Hierarchical Progressive Coding Framework for Volumetric Video | Zihan Zheng et.al. | 2407.09026 | null |
2024-07-12 | Hybrid Temporal Computing for Lower Power Hardware Accelerators | Maliha Tasnim et.al. | 2407.08975 | null |
2024-07-11 | Manipulating a Tetris-Inspired 3D Video Representation | Mihir Godbole et.al. | 2407.08885 | null |
2024-07-11 | OMR-NET: a two-stage octave multi-scale residual network for screen content image compression | Shiqi Jiang et.al. | 2407.08545 | null |
2024-07-11 | CADC: Encoding User-Item Interactions for Compressing Recommendation Model Training Data | Hossein Entezari Zarch et.al. | 2407.08108 | null |
2024-07-10 | Using Low-Discrepancy Points for Data Compression in Machine Learning: An Experimental Comparison | Simone Göttlich et.al. | 2407.07450 | null |
2024-07-10 | Standard compliant video coding using low complexity, switchable neural wrappers | Yueyu Hu et.al. | 2407.07395 | null |
2024-07-10 | MNeRV: A Multilayer Neural Representation for Videos | Qingling Chang et.al. | 2407.07347 | link |
2024-07-11 | Entropy Law: The Story Behind Data Compression and LLM Performance | Mingjia Yin et.al. | 2407.06645 | link |
2024-07-08 | A Hybrid Algorithm for Computing a Partial Singular Value Decomposition Satisfying a Given Threshold | James Baglama et.al. | 2407.06306 | link |
2024-07-08 | TAPVid-3D: A Benchmark for Tracking Any Point in 3D | Skanda Koppula et.al. | 2407.05921 | link |
2024-07-05 | The Impact of Quantization and Pruning on Deep Reinforcement Learning Models | Heng Lu et.al. | 2407.04803 | null |
2024-07-05 | An autoencoder for compressing angle-resolved photoemission spectroscopy data | Steinn Ymir Agustsson et.al. | 2407.04631 | link |
2024-07-05 | Rethinking Image Compression on the Web with Generative AI | Shayan Ali Hassan et.al. | 2407.04542 | null |
2024-07-11 | A High-Quality Workflow for Multi-Resolution Scientific Data Reduction and Visualization | Daoce Wang et.al. | 2407.04267 | null |
2024-07-04 | Autoencoded Image Compression for Secure and Fast Transmission | Aryan Kashyap Naveen et.al. | 2407.03990 | link |
2024-07-03 | Value-Penalized Auxiliary Control from Examples for Learning without Rewards or Demonstrations | Trevor Ablett et.al. | 2407.03311 | link |
2024-07-03 | KeyVideoLLM: Towards Large-scale Video Keyframe Selection | Hao Liang et.al. | 2407.03104 | null |
2024-07-01 | Statistical Analysis of ZFP: Understanding Bias | Alyson Fox et.al. | 2407.01826 | null |
2024-07-01 | An AI-based, Error-bounded Compression Scheme for High-frequency Power Quality Disturbance Data | Markus Stroot et.al. | 2407.01112 | null |
2024-06-28 | Wavelets Are All You Need for Autoregressive Image Generation | Wael Mattar et.al. | 2406.19997 | null |
2024-06-28 | Optimal Video Compression using Pixel Shift Tracking | Hitesh Saai Mananchery Panneerselvam et.al. | 2406.19630 | link |
2024-06-27 | MCNC: Manifold Constrained Network Compression | Chayne Thrash et.al. | 2406.19301 | null |
2024-06-27 | Staggered Quantizers for Perfect Perceptual Quality: A Connection between Quantizers with Common Randomness and Without | Ruida Zhou et.al. | 2406.19248 | null |
2024-06-25 | Asymptotically Minimax Regret by Bayes Mixtures | Jun'ichi Takeuchi et.al. | 2406.17929 | null |
2024-06-24 | Hierarchical B-frame Video Coding for Long Group of Pictures | Ivan Kirillov et.al. | 2406.16544 | null |
2024-06-20 | Ranking LLMs by compression | Peijia Guo et.al. | 2406.14171 | null |
2024-06-21 | Measuring Sample Importance in Data Pruning for Training LLMs from a Data Compression Perspective | Minsang Kim et.al. | 2406.14124 | null |
2024-06-20 | Prediction and Reference Quality Adaptation for Learned Video Compression | Xihua Sheng et.al. | 2406.14118 | null |
2024-06-19 | Convex-hull Estimation using XPSNR for Versatile Video Coding | Vignesh V Menon et.al. | 2406.13712 | null |
2024-06-19 | A Study on the Effect of Color Spaces in Learned Image Compression | Srivatsa Prativadibhayankaram et.al. | 2406.13709 | null |
2024-06-19 | Stability and Generalizability in SDE Diffusion Models with Measure-Preserving Dynamics | Weitong Zhang et.al. | 2406.13652 | null |
2024-06-18 | Learned Image Compression for HE-stained Histopathological Images via Stain Deconvolution | Maximilian Fischer et.al. | 2406.12623 | null |
2024-06-18 | Competitive Learning for Achieving Content-specific Filters in Video Coding for Machines | Honglei Zhang et.al. | 2406.12367 | null |
2024-06-15 | How Should We Extract Discrete Audio Tokens from Self-Supervised Models? | Pooneh Mousavi et.al. | 2406.10735 | null |
2024-06-15 | Object-Attribute-Relation Representation based Video Semantic Communication | Qiyuan Du et.al. | 2406.10469 | null |
2024-06-14 | On Efficient Neural Network Architectures for Image Compression | Yichi Zhang et.al. | 2406.10361 | link |
2024-06-14 | Information Compression in the AI Era: Recent Advances and Future Challenges | Jun Chen et.al. | 2406.10036 | null |
2024-06-13 | CMC-Bench: Towards a New Paradigm of Visual Signal Compression | Chunyi Li et.al. | 2406.09356 | link |
2024-06-13 | Neural NeRF Compression | Tuan Pham et.al. | 2406.08943 | null |
2024-06-14 | Beyond LLaVA-HD: Diving into High-Resolution Large Multimodal Models | Yi-Fan Zhang et.al. | 2406.08487 | link |
2024-06-12 | On Annotation-free Optimization of Video Coding for Machines | Marc Windsheimer et.al. | 2406.07938 | null |
2024-06-11 | SSNVC: Single Stream Neural Video Compression with Implicit Temporal Information | Feng Wang et.al. | 2406.07645 | null |
2024-06-11 | Image and Video Tokenization with Binary Spherical Quantization | Yue Zhao et.al. | 2406.07548 | link |
2024-06-11 | Optimal Matrix-Mimetic Tensor Algebras via Variable Projection | Elizabeth Newman et.al. | 2406.06942 | link |
2024-06-10 | Deep Generative Modeling Reshapes Compression and Transmission: From Efficiency to Resiliency | Jincheng Dai et.al. | 2406.06446 | null |
2024-06-10 | Image Compression with Isotropic and Anisotropic Shepard Inpainting | Rahul Mohideen Kaja Mohideen et.al. | 2406.06247 | null |
2024-06-10 | Efficient Neural Compression with Inference-time Decoding | C. Metz et.al. | 2406.06237 | null |
2024-06-10 | Fiducial-Cosmology-dependent systematics for the DESI 2024 BAO Analysis | A. Pérez-Fernández et.al. | 2406.06085 | null |
2024-06-10 | Quantum Sparse Coding and Decoding Based on Quantum Network | Xun Ji et.al. | 2406.06012 | null |
2024-06-09 | Region of Interest Loss for Anonymizing Learned Image Compression | Christoph Liebender et.al. | 2406.05726 | link |
2024-06-08 | Regularized Training with Generated Datasets for Name-Only Transfer of Vision-Language Models | Minho Park et.al. | 2406.05432 | link |
2024-06-07 | PatchSVD: A Non-uniform SVD-based Image Compression Algorithm | Zahra Golpayegani et.al. | 2406.05129 | link |
2024-06-07 | SMC++: Masked Learning of Unsupervised Video Semantic Compression | Yuan Tian et.al. | 2406.04765 | link |
2024-06-06 | LDM-RSIC: Exploring Distortion Prior with Latent Diffusion Models for Remote Sensing Image Compression | Junhui Li et.al. | 2406.03961 | link |
2024-06-05 | Lossless Image Compression Using Multi-level Dictionaries: Binary Images | Samar Agnihotri et.al. | 2406.03087 | null |
2024-06-05 | On Jacob Ziv's Individual-Sequence Approach to Information Theory | Neri Merhav et.al. | 2406.02904 | null |
2024-06-04 | Towards AI-Assisted Sustainable Adaptive Video Streaming Systems: Tutorial and Survey | Reza Farahani et.al. | 2406.02302 | null |
2024-06-03 | Video Coding with Cross-Component Sample Offset | Han Gao et.al. | 2406.01795 | null |
2024-06-05 | Once-for-All: Controllable Generative Image Compression with Dynamic Granularity Adaption | Anqi Li et.al. | 2406.00758 | link |
2024-06-01 | Efficient Massive Black Hole Binary parameter estimation for LISA using Sequential Neural Likelihood | Iván Martín Vílchez et.al. | 2406.00565 | null |
2024-06-01 | A Review of Pulse-Coupled Neural Network Applications in Computer Vision and Image Processing | Nurul Rafi et.al. | 2406.00239 | null |
2024-05-31 | ContextGS: Compact 3D Gaussian Splatting with Anchor Level Context Model | Yufei Wang et.al. | 2405.20721 | link |
2024-05-30 | Quantum encoder for fixed Hamming-weight subspaces | Renato M. S. Farias et.al. | 2405.20408 | null |
2024-05-29 | Implicit Neural Image Field for Biological Microscopy Image Compression | Gaole Dai et.al. | 2405.19012 | link |
2024-05-28 | Deep Network Pruning: A Comparative Study on CNNs in Face Recognition | Fernando Alonso-Fernandez et.al. | 2405.18302 | null |
2024-05-28 | Channel Reciprocity Based Attack Detection for Securing UWB Ranging by Autoencoder | Wenlong Gou et.al. | 2405.18255 | null |
2024-05-27 | Evaluation of Resource-Efficient Crater Detectors on Embedded Systems | Simon Vellas et.al. | 2405.16953 | link |
2024-05-27 | UniCompress: Enhancing Multi-Data Medical Image Compression with Knowledge Distillation | Runzhao Yang et.al. | 2405.16850 | null |
2024-05-27 | Controlling Rate, Distortion, and Realism: Towards a Single Comprehensive Neural Image Compression Model | Shoma Iwai et.al. | 2405.16817 | link |
2024-05-25 | N-BVH: Neural ray queries with bounding volume hierarchies | Philippe Weier et.al. | 2405.16237 | link |
2024-05-25 | A 7K Parameter Model for Underwater Image Enhancement based on Transmission Map Prior | Fuheng Zhou et.al. | 2405.16197 | link |
2024-05-24 | Analytical proxy to families of numerical solutions: the case study of spherical mini-boson stars | Jianzhi Yang et.al. | 2405.15651 | null |
2024-05-24 | SATSense: Multi-Satellite Collaborative Framework for Spectrum Sensing | Haoxuan Yuan et.al. | 2405.15542 | null |
2024-05-24 | Meta-meshing and triangulating lattice structures at a large scale | Qiang Zou et.al. | 2405.15197 | null |
2024-05-23 | NeCGS: Neural Compression for 3D Geometry Sets | Siyu Ren et.al. | 2405.15034 | link |
2024-05-23 | An augmented Lagrangian trust-region method with inexact gradient evaluations to accelerate constrained optimization problems using model hyperreduction | Tianshu Wen et.al. | 2405.14827 | null |
2024-05-23 | Motion-based video compression for resource-constrained camera traps | Malika Nisal Ratnayake et.al. | 2405.14419 | null |
2024-06-01 | I | Meiqin Liu et.al. | 2405.14336 | link |
2024-05-23 | Sparse | Matthias Chung et.al. | 2405.14270 | null |
2024-05-22 | "Turing Tests" For An AI Scientist | Xiaoxin Yin et.al. | 2405.13352 | null |
2024-05-21 | Efficient Learned Wavelet Image and Video Coding | Anna Meyer et.al. | 2405.12631 | null |
2024-05-24 | Accelerating Relative Entropy Coding with Space Partitioning | Jiajun He et.al. | 2405.12203 | null |
2024-05-20 | Refining Coded Image in Human Vision Layer Using CNN-Based Post-Processing | Takahiro Shindo et.al. | 2405.11894 | null |
2024-05-19 | Effective In-Context Example Selection through Data Compression | Zhongxiang Sun et.al. | 2405.11465 | null |
2024-05-18 | InfRS: Incremental Few-Shot Object Detection in Remote Sensing Images | Wuzhou Li et.al. | 2405.11293 | link |
2024-05-17 | Dark Energy Survey Year 3 results: simulation-based cosmological inference with wavelet harmonics, scattering transforms, and moments of weak lensing mass maps II. Cosmological results | M. Gatti et.al. | 2405.10881 | null |
2024-05-17 | Reduced storage direct tensor ring decomposition for convolutional neural networks compression | Mateusz Gabor et.al. | 2405.10802 | link |
2024-05-17 | Enhancing Perception Quality in Remote Sensing Image Compression via Invertible Neural Network | Junhui Li et.al. | 2405.10518 | null |
2024-05-15 | Properties that allow or prohibit transferability of adversarial attacks among quantized networks | Abhishek Shrestha et.al. | 2405.09598 | link |
2024-05-15 | Sensitivity Decouple Learning for Image Compression Artifacts Reduction | Li Ma et.al. | 2405.09291 | null |
2024-05-18 | Scalable Image Coding for Humans and Machines Using Feature Fusion Network | Takahiro Shindo et.al. | 2405.09152 | link |
2024-05-14 | Parameter-Efficient Instance-Adaptive Neural Video Compression | Hyunmo Yang et.al. | 2405.08530 | link |
2024-05-13 | Goal-oriented compression for | Yifei Sun et.al. | 2405.07808 | null |
2024-05-13 | Neural Network Compression for Reinforcement Learning Tasks | Dmitry A. Ivanov et.al. | 2405.07748 | null |
2024-05-13 | On the Adversarial Robustness of Learning-based Image Compression Against Rate-Distortion Attacks | Chenhao Wu et.al. | 2405.07717 | null |
2024-05-21 | An Efficient Compression Method for Sign Information of DCT Coefficients via Sign Retrieval | Chihiro Tsutake et.al. | 2405.07487 | link |
2024-05-10 | Time-of-arrival Estimation and Phase Unwrapping of Head-related Transfer Functions With Integer Linear Programming | Chin-Yun Yu et.al. | 2405.06804 | link |
2024-05-08 | Urban Boundary Delineation from Commuting Data with Bayesian Stochastic Blockmodeling: Scale, Contiguity, and Hierarchy | Sebastian Morel-Balbi et.al. | 2405.04911 | link |
2024-05-14 | Some Notes on the Sample Complexity of Approximate Channel Simulation | Gergely Flamich et.al. | 2405.04363 | null |
2024-05-07 | Group-aware Parameter-efficient Updating for Content-Adaptive Neural Video Compression | Zhenghao Chen et.al. | 2405.04274 | null |
2024-05-08 | Verified Neural Compressed Sensing | Rudy Bunel et.al. | 2405.04260 | null |
2024-05-15 | Lossy Compression with Data, Perception, and Classification Constraints | Yuhan Wang et.al. | 2405.04144 | null |
2024-05-07 | DMOFC: Discrimination Metric-Optimized Feature Compression | Changsheng Gao et.al. | 2405.04044 | null |
2024-05-06 | Computational ghost imaging with hybrid transforms by integrating Hadamard, discrete cosine, and Haar matrices | Yi-Ning Zhao et.al. | 2405.03729 | null |
2024-05-06 | A Rate-Distortion-Classification Approach for Lossy Image Compression | Yuefeng Zhang et.al. | 2405.03500 | null |
2024-05-06 | Structure-Preserving Network Compression Via Low-Rank Induced Training Through Linear Layers Composition | Xitong Zhang et.al. | 2405.03089 | link |
2024-05-04 | Deep Pulse-Signal Magnification for remote Heart Rate Estimation in Compressed Videos | Joaquim Comas et.al. | 2405.02652 | null |
2024-05-06 | Torch2Chip: An End-to-end Customizable Deep Neural Network Compression and Deployment Toolkit for Prototype Hardware Accelerator Design | Jian Meng et.al. | 2405.01775 | link |
2024-05-02 | PointCompress3D -- A Point Cloud Compression Framework for Roadside LiDARs in Intelligent Transportation Systems | Walter Zimmer et.al. | 2405.01750 | null |
2024-04-28 | Lightweight Conceptual Dictionary Learning for Text Classification Using Information Compression | Li Wan et.al. | 2405.01584 | null |
2024-05-02 | GroupedMixer: An Entropy Model with Group-wise Token-Mixers for Learned Image Compression | Daxin Li et.al. | 2405.01170 | null |
2024-04-30 | Analysis and Enhancement of Lossless Image Compression in JPEG-XL | Rustam Mamedov et.al. | 2404.19755 | null |
2024-04-30 | EfficientASR: Speech Recognition Network Compression via Attention Redundancy and Chunk-Level FFN Optimization | Jianzong Wang et.al. | 2404.19214 | null |
2024-04-29 | Towards Extreme Image Compression with Latent Feature Guidance and Diffusion Prior | Zhiyuan Li et.al. | 2404.18820 | link |
2024-04-28 | Joint Reference Frame Synthesis and Post Filter Enhancement for Versatile Video Coding | Weijie Bao et.al. | 2404.18058 | null |
2024-04-25 | Learning Visuotactile Skills with Two Multifingered Hands | Toru Lin et.al. | 2404.16823 | link |
2024-04-24 | Domain Adaptation for Learned Image Compression with Supervised Adapters | Alberto Presta et.al. | 2404.15591 | link |
2024-04-23 | One-Pass Randomized Algorithm with Practical Rangefinder for Low-Rank Approximation to Quaternion Matrices | Chao Chang et.al. | 2404.14783 | link |
2024-04-22 | Neural Compress-and-Forward for the Relay Channel | Ezgi Ozyilkan et.al. | 2404.14594 | null |
2024-04-22 | Taming Server Memory TCO with Multiple Software-Defined Compressed Tiers | Sandeep Kumar et.al. | 2404.13886 | null |
2024-04-20 | HybridFlow: Infusing Continuity into Masked Codebook for Extreme Low-Bitrate Image Compression | Lei Lu et.al. | 2404.13372 | null |
2024-04-18 | Image Compression and Reconstruction Based on Quantum Network | Xun Ji et.al. | 2404.11994 | null |
2024-04-17 | Spatio-Temporal Motion Retargeting for Quadruped Robots | Taerim Yoon et.al. | 2404.11557 | null |
2024-04-17 | Multi-resolution Rescored ByteTrack for Video Object Detection on Ultra-low-power Embedded Systems | Luca Bompani et.al. | 2404.11488 | link |
2024-04-17 | Image Generative Semantic Communication with Multi-Modal Similarity Estimation for Resource-Limited Networks | Eri Hosonuma et.al. | 2404.11280 | null |
2024-04-16 | Tripod: Three Complementary Inductive Biases for Disentangled Representation Learning | Kyle Hsu et.al. | 2404.10282 | link |
2024-04-16 | Compressible and Searchable: AI-native Multi-Modal Retrieval System with Learned Image Compression | Jixiang Luo et.al. | 2404.10234 | null |
2024-04-15 | One-Click Upgrade from 2D to 3D: Sandwiched RGB-D Video Compression for Stereoscopic Teleconferencing | Yueyu Hu et.al. | 2404.09979 | null |
2024-04-15 | Quantization of Large Language Models with an Overdetermined Basis | Daniil Merkulov et.al. | 2404.09737 | null |
2024-04-18 | Post-Training Network Compression for 3D Medical Image Segmentation: Reducing Computational Efforts via Tucker Decomposition | Tobias Weber et.al. | 2404.09683 | link |
2024-04-15 | MarsQE: Semantic-Informed Quality Enhancement for Compressed Martian Image | Chengfeng Liu et.al. | 2404.09433 | null |
2024-04-17 | Incremental data compression for PDE-constrained optimization with a data assimilation application | Xuejian Li et.al. | 2404.09323 | null |
2024-04-14 | A Joint Data Compression and Time-Delay Estimation Method For Distributed Systems via Extremum Encoding | Amir Weiss et.al. | 2404.09244 | null |
2024-04-12 | Lossy Image Compression with Foundation Diffusion Models | Lucas Relic et.al. | 2404.08580 | null |
2024-04-12 | Mitigating Challenges of the Space Environment for Onboard Artificial Intelligence: Design Overview of the Imaging Payload on SpIRIT | Miguel Ortiz del Castillo et.al. | 2404.08399 | null |
2024-04-11 | Video Compression Beyond VVC: Quantitative Analysis of Intra Coding Tools in Enhanced Compression Model (ECM) | Mohsen Abdoli et.al. | 2404.07872 | null |
2024-04-11 | Learning to Classify New Foods Incrementally Via Compressed Exemplars | Justin Yang et.al. | 2404.07507 | null |
2024-04-14 | A comparison between Shapefit compression and Full-Modelling method with PyBird for DESI 2024 and beyond | Y. Lai et.al. | 2404.07283 | link |
2024-04-10 | Exploring Repetitiveness Measures for Two-Dimensional Strings | Giuseppe Romana et.al. | 2404.07030 | null |
2024-04-10 | Fine color guidance in diffusion models and its application to image compression at extremely low bitrates | Tom Bordin et.al. | 2404.06865 | null |
2024-04-09 | Encoder-Quantization-Motion-based Video Quality Metrics | Yixu Chen et.al. | 2404.06620 | null |
2024-04-09 | DiffHarmony: Latent Diffusion Model Meets Image Harmonization | Pengfei Zhou et.al. | 2404.06139 | link |
2024-04-09 | Communication-Efficient Large-Scale Distributed Deep Learning: A Comprehensive Survey | Feng Liang et.al. | 2404.06114 | null |
2024-04-09 | Image and Video Compression using Generative Sparse Representation with Fidelity Controls | Wei Jiang et.al. | 2404.06076 | null |
2024-04-07 | Correcting Diffusion-Based Perceptual Image Compression with Privileged End-to-End Decoder | Yiyang Ma et.al. | 2404.04916 | null |
2024-04-07 | Task-Aware Encoder Control for Deep Video Compression | Xingtong Ge et.al. | 2404.04848 | null |
2024-04-06 | Power-Efficient Image Storage: Leveraging Super Resolution Generative Adversarial Network for Sustainable Compression and Reduced Carbon Footprint | Ashok Mondal et.al. | 2404.04642 | null |
2024-04-05 | ClickDiffusion: Harnessing LLMs for Interactive Precise Image Editing | Alec Helbling et.al. | 2404.04376 | link |
2024-04-03 | Convolutional variational autoencoders for secure lossy image compression in remote sensing | Alessandro Giuliano et.al. | 2404.03696 | null |
2024-03-25 | RL for Consistency Models: Faster Reward Guided Text-to-Image Generation | Owen Oertell et.al. | 2404.03673 | link |
2024-04-04 | Training LLMs over Neurally Compressed Text | Brian Lester et.al. | 2404.03626 | null |
2024-04-04 | Leveraging Interpolation Models and Error Bounds for Verifiable Scientific Machine Learning | Tyler Chang et.al. | 2404.03586 | link |
2024-04-04 | Semantic Compression with Information Lattice Learning | Haizi Yu et.al. | 2404.03131 | null |
2024-04-01 | Accounting for contact network uncertainty in epidemic inferences with Approximate Bayesian Computation | Maxwell H. Wang et.al. | 2404.02924 | null |
2024-04-03 | Building test batteries based on analysing random number generator tests within the framework of algorithmic information theory | Boris Ryabko et.al. | 2404.02708 | null |
2024-04-03 | Optimizing traffic signs and lights visibility for the teleoperation of autonomous vehicles through ROI compression | I. Dror et.al. | 2404.02481 | null |
2024-04-03 | MOPAR: A Model Partitioning Framework for Deep Learning Inference Services on Serverless Platforms | Jiaang Duan et.al. | 2404.02445 | null |
2024-04-02 | NeRFCodec: Neural Feature Compression Meets Neural Radiance Fields for Memory-Efficient Scene Representation | Sicheng Li et.al. | 2404.02185 | null |
2024-04-01 | The Rate-Distortion-Perception Trade-off: The Role of Private Randomness | Yassine Hamdi et.al. | 2404.01111 | null |
2024-03-31 | Metric dimensions of generalized Sierpiński graphs over squares | Savari Prabhu et.al. | 2404.00771 | null |
2024-03-27 | Computationally and Memory-Efficient Robust Predictive Analytics Using Big Data | Daniel Menges et.al. | 2403.19721 | null |
2024-03-28 | RootInteractive tool for multidimensional statistical analysis, machine learning and analytical model validation | Marian Invanov et.al. | 2403.19330 | null |
2024-03-28 | Uncertainty-Aware Deep Video Compression with Ensembles | Wufei Ma et.al. | 2403.19158 | null |
2024-04-08 | Neural Embedding Compression For Efficient Multi-Task Earth Observation Modelling | Carlos Gomes et.al. | 2403.17886 | link |
2024-03-26 | Low-Latency Neural Stereo Streaming | Qiqi Hou et.al. | 2403.17879 | null |
2024-03-26 | Fully-fused Multi-Layer Perceptrons on Intel Data Center GPUs | Kai Yuan et.al. | 2403.17607 | link |
2024-03-25 | Neural Image Compression with Quantization Rectifier | Wei Luo et.al. | 2403.17236 | null |
2024-03-25 | Invertible Diffusion Models for Compressed Sensing | Bin Chen et.al. | 2403.17006 | link |
2024-03-25 | Virtual Cylindrical PET for Efficient DOI Image Reconstruction with Sub-millimetre Resolution | Francisco E Enríquez-Mier-y-Terán et.al. | 2403.16465 | null |
2024-03-25 | Impact of Video Compression Artifacts on Fisheye Camera Visual Perception Tasks | Madhumitha Sakthi et.al. | 2403.16338 | null |
2024-03-24 | Laplacian-guided Entropy Model in Neural Codec with Blur-dissipated Synthesis | Atefeh Khoshkhahtinat et.al. | 2403.16258 | null |
2024-03-23 | Understanding The Effectiveness of Lossy Compression in Machine Learning Training Sets | Robert Underwood et.al. | 2403.15953 | null |
2024-03-23 | Droplet shape representation using Fourier series and autoencoders | Mihir Durve et.al. | 2403.15797 | null |
2024-03-21 | S2LIC: Learned Image Compression with the SwinV2 Block, Adaptive Channel-wise and Global-inter Attention Context | Yongqiang Wang et.al. | 2403.14471 | link |
2024-03-21 | Tensor network compressibility of convolutional models | Sukhbinder Singh et.al. | 2403.14379 | null |
2024-03-26 | Powerful Lossy Compression for Noisy Images | Shilv Cai et.al. | 2403.14135 | null |
2024-03-20 | String attractors and bi-infinite words | Pierre Béaur et.al. | 2403.13449 | null |
2024-03-19 | Super-High-Fidelity Image Compression via Hierarchical-ROI and Adaptive Quantization | Jixiang Luo et.al. | 2403.13030 | null |
2024-03-19 | Privacy-Preserving Face Recognition Using Trainable Feature Subtraction | Yuxi Mi et.al. | 2403.12457 | link |
2024-03-19 | VQ-NeRV: A Vector Quantized Neural Representation for Videos | Yunjie Xu et.al. | 2403.12401 | link |
2024-03-18 | Encoding of linear kinetic plasma problems in quantum circuits via data compression | Ivan Novikau et.al. | 2403.11989 | null |
2024-03-18 | Object Segmentation-Assisted Inter Prediction for Versatile Video Coding | Zhuoyuan Li et.al. | 2403.11694 | null |
2024-03-18 | Overfitted image coding at reduced complexity | Théophile Blard et.al. | 2403.11651 | link |
2024-03-18 | Hierarchical Frequency-based Upsampling and Refining for Compressed Video Quality Enhancement | Qianyu Zhang et.al. | 2403.11556 | null |
2024-03-18 | Earth+: on-board satellite imagery compression leveraging historical earth observations | Kuntai Du et.al. | 2403.11434 | null |
2024-03-17 | Fidelity-preserving Learning-Based Image Compression: Loss Function and Subjective Evaluation Methodology | Shima Mohammadi et.al. | 2403.11241 | link |
2024-03-16 | Channel-wise Feature Decorrelation for Enhanced Learned Image Compression | Farhad Pakdaman et.al. | 2403.10936 | null |
2024-03-16 | NARRATE: Versatile Language Architecture for Optimal Control in Robotics | Seif Ismail et.al. | 2403.10762 | link |
2024-03-15 | Process-and-Forward: Deep Joint Source-Channel Coding Over Cooperative Relay Networks | Chenghong Bian et.al. | 2403.10613 | null |
2024-03-15 | CPGA: Coding Priors-Guided Aggregation Network for Compressed Video Quality Enhancement | Qiang Zhu et.al. | 2403.10362 | link |
2024-03-15 | Interactive Distance Field Mapping and Planning to Enable Human-Robot Collaboration | Usama Ali et.al. | 2403.09988 | link |
2024-03-14 | SketchINR: A First Look into Sketches as Implicit Neural Representations | Hmrishav Bandyopadhyay et.al. | 2403.09344 | link |
2024-03-14 | Noise Dimension of GAN: An Image Compression Perspective | Ziran Zhu et.al. | 2403.09196 | null |
2024-03-20 | Content-aware Masked Image Modeling Transformer for Stereo Image Compression | Xinjie Zhang et.al. | 2403.08505 | link |
2024-03-12 | Approaching Rate-Distortion Limits in Neural Compression with Lattice Transform Coding | Eric Lei et.al. | 2403.07320 | null |
2024-03-11 | Grid Monitoring and Protection with Continuous Point-on-Wave Measurements and Generative AI | Lang Tong et.al. | 2403.06942 | null |
2024-03-16 | Enhancing Adversarial Training with Prior Knowledge Distillation for Robust Image Compression | Zhi Cao et.al. | 2403.06700 | null |
2024-03-13 | FSViewFusion: Few-Shots View Generation of Novel Objects | Rukhshanda Hussain et.al. | 2403.06394 | null |
2024-03-10 | Probing Image Compression For Class-Incremental Learning | Justin Yang et.al. | 2403.06288 | null |
2024-03-10 | Blockchain-Enabled Variational Information Bottleneck for IoT Networks | Qiong Wu et.al. | 2403.06129 | link |
2024-03-09 | Wavelet-Like Transform-Based Technology in Response to the Call for Proposals on Neural Network-Based Image Coding | Cunhui Dong et.al. | 2403.05937 | null |
2024-03-07 | Complexity-constrained quantum thermodynamics | Anthony Munson et.al. | 2403.04828 | null |
2024-03-07 | Image Coding for Machines with Edge Information Learning Using Segment Anything | Takahiro Shindo et.al. | 2403.04173 | link |
2024-03-06 | 3D Diffusion Policy | Yanjie Ze et.al. | 2403.03954 | link |
2024-03-06 | Unifying Generation and Compression: Ultra-low bitrate Image Coding Via Multi-stage Transformer | Naifu Xue et.al. | 2403.03736 | null |
2024-03-06 | ZF Beamforming Tensor Compression for Massive MIMO Fronthaul | Libin Zheng et.al. | 2403.03675 | null |
2024-03-06 | Space Complexity of Euclidean Clustering | Xiaoyi Zhu et.al. | 2403.02971 | null |
2024-03-05 | Neural Image Compression with Text-guided Encoding for both Pixel-level and Perceptual Fidelity | Hagyeong Lee et.al. | 2403.02944 | link |
2024-03-05 | Enhancing the Rate-Distortion-Perception Flexibility of Learned Image Codecs with Conditional Diffusion Decoders | Daniele Mari et.al. | 2403.02887 | null |
2024-03-04 | Dark Energy Survey Year 3 results: likelihood-free, simulation-based | N. Jeffrey et.al. | 2403.02314 | null |
2024-03-04 | Neural Network Assisted Lifting Steps For Improved Fully Scalable Lossy Image Compression in JPEG 2000 | Xinyue Li et.al. | 2403.01647 | link |
2024-03-03 | On the Compressibility of Quantized Large Language Models | Yu Mao et.al. | 2403.01384 | null |
2024-03-02 | Towards Accurate Lip-to-Speech Synthesis in-the-Wild | Sindhu Hegde et.al. | 2403.01087 | null |
2024-03-01 | Region-Adaptive Transform with Segmentation Prior for Image Compression | Yuxi Liu et.al. | 2403.00628 | link |
2024-03-07 | ODVista: An Omnidirectional Video Dataset for super-resolution and Quality Enhancement Tasks | Ahmed Telili et.al. | 2403.00604 | link |
2024-02-29 | Towards Explaining Deep Neural Network Compression Through a Probabilistic Latent Space | Mahsa Mozafari-Nia et.al. | 2403.00155 | null |
2024-02-29 | Deep Network for Image Compressed Sensing Coding Using Local Structural Sampling | Wenxue Cui et.al. | 2402.19111 | null |
2024-02-29 | Variable-Rate Learned Image Compression with Multi-Objective Optimization and Quantization-Reconstruction Offsets | Fatih Kamisli et.al. | 2402.18930 | link |
2024-02-29 | Towards Backward-Compatible Continual Learning of Image Compression | Zhihao Duan et.al. | 2402.18862 | link |
2024-02-29 | Exploration of Learned Lifting-Based Transform Structures for Fully Scalable and Accessible Wavelet-Like Image Compression | Xinyue Li et.al. | 2402.18761 | null |
2024-01-10 | Motion Guided Token Compression for Efficient Masked Video Modeling | Yukun Feng et.al. | 2402.18577 | null |
2024-02-28 | Tokenization Is More Than Compression | Craig W. Schmidt et.al. | 2402.18376 | link |
2024-02-28 | NERV++: An Enhanced Implicit Neural Video Representation | Ahmed Ghorbel et.al. | 2402.18305 | null |
2024-02-28 | Computing Minimal Absent Words and Extended Bispecial Factors with CDAWG Space | Shunsuke Inenaga et.al. | 2402.18090 | null |
2024-03-03 | Towards Optimal Learning of Language Models | Yuxian Gu et.al. | 2402.17759 | null |
2024-02-27 | | Gaoyuan Wang et.al. | 2402.17749 | null |
2024-02-27 | Bit Rate Matching Algorithm Optimization in JPEG-AI Verification Model | Panqi Jia et.al. | 2402.17487 | null |
2024-02-27 | Bit Distribution Study and Implementation of Spatial Quality Map in the JPEG-AI Standardization | Panqi Jia et.al. | 2402.17470 | null |
2024-02-29 | Neural Video Compression with Feature Modulation | Jiahao Li et.al. | 2402.17414 | link |
2024-01-19 | MB-RACS: Measurement-Bounds-based Rate-Adaptive Image Compressed Sensing Network | Yujun Huang et.al. | 2402.16855 | null |
2024-02-29 | MISC: Ultra-low Bitrate Image Semantic Compression Driven by Large Multimodal Model | Chunyi Li et.al. | 2402.16749 | link |
2024-02-26 | Enabling robust sensor network design with data processing and optimization making use of local beehive image and video files | Ephrance Eunice Namugenyi et.al. | 2402.16655 | null |
2024-02-26 | Resolution-Agnostic Neural Compression for High-Fidelity Portrait Video Conferencing via Implicit Radiance Fields | Yifei Li et.al. | 2402.16599 | null |
2024-02-26 | Distortion-Controlled Dithering with Reduced Recompression Rate | Morriel Kasher et.al. | 2402.16447 | null |
2024-02-26 | Adaptive Online Learning of Separable Path Graph Transforms for Intra-prediction | Wen-Yang Lu et.al. | 2402.16371 | null |
2024-02-26 | SPC-NeRF: Spatial Predictive Compression for Voxel Based Radiance Field | Zetian Song et.al. | 2402.16366 | null |
2024-02-24 | Traditional Transformation Theory Guided Model for Learned Image Compression | Zhiyuan Li et.al. | 2402.15744 | null |
2024-02-22 | Distributed Radiance Fields for Edge Video Compression and Metaverse Integration in Autonomous Driving | Eugen Šlapak et.al. | 2402.14642 | null |
2024-02-21 | Exploring the Limits of Semantic Image Compression at Micro-bits per Pixel | Jordan Dotzel et.al. | 2402.13536 | null |
2024-02-20 | Compressing the two-particle Green's function using wavelets: Theory and application to the Hubbard atom | Emin Moghadas et.al. | 2402.13030 | null |
2024-02-20 | RealCompo: Dynamic Equilibrium between Realism and Compositionality Improves Text-to-Image Diffusion Models | Xinchen Zhang et.al. | 2402.12908 | link |
2024-02-20 | Transformer-based Learned Image Compression for Joint Decoding and Denoising | Yi-Hsin Chen et.al. | 2402.12888 | null |
2024-02-19 | Weakly Supervised Object Detection in Chest X-Rays with Differentiable ROI Proposal Networks and Soft ROI Pooling | Philip Müller et.al. | 2402.11985 | link |
2024-02-18 | 3D Point Cloud Compression with Recurrent Neural Network and Image Compression Methods | Till Beemelmanns et.al. | 2402.11680 | link |
2024-02-18 | Learning to Learn Faster from Human Feedback with Language Model Predictive Control | Jacky Liang et.al. | 2402.11450 | null |
2024-02-17 | TinyLIC-High efficiency lossy image compression method | Gaocheng Ma et.al. | 2402.11164 | null |
2024-02-15 | Analysis of Neural Video Compression Networks for 360-Degree Video Coding | Andy Regensky et.al. | 2402.10257 | null |
2024-02-14 | Reducing Texture Bias of Deep Neural Networks via Edge Enhancing Diffusion | Edgar Heinert et.al. | 2402.09530 | link |
2024-02-14 | A Comprehensive Review of Software and Hardware Energy Efficiency of Video Decoders | Matthias Kränzler et.al. | 2402.09001 | null |
2024-02-14 | Extreme Video Compression with Pre-trained Diffusion Models | Bohan Li et.al. | 2402.08934 | link |
2024-02-14 | Saliency-aware End-to-end Learned Variable-Bitrate 360-degree Image Compression | Oguzhan Gungordu et.al. | 2402.08862 | null |
2024-02-13 | Learned Image Compression with Text Quality Enhancement | Chih-Yu Lai et.al. | 2402.08643 | null |
2024-02-13 | Motion-Adaptive Inference for Flexible Learned B-Frame Compression | M. Akin Yilmaz et.al. | 2402.08550 | null |
2024-02-21 | A Neural-network Enhanced Video Coding Framework beyond ECM | Yanchen Zhao et.al. | 2402.08397 | null |
2024-02-13 | Improving Image Coding for Machines through Optimizing Encoder via Auxiliary Loss | Kei Iino et.al. | 2402.08267 | null |
2024-02-12 | Distributed Compression in the Era of Machine Learning: A Review of Recent Advances | Ezgi Ozyilkan et.al. | 2402.07997 | null |
2024-02-13 | Towards Meta-Pruning via Optimal Transport | Alexander Theus et.al. | 2402.07839 | link |
2024-02-09 | Parameter estimation for quantum jump unraveling | Marco Radaelli et.al. | 2402.06556 | link |
2024-02-07 | RAGE for the Machine: Image Compression with Low-Cost Random Access for Embedded Applications | Christian D. Rask et.al. | 2402.05974 | null |
2024-02-08 | Sandwiched Compression: Repurposing Standard Codecs with Neural Network Wrappers | Onur G. Guleryuz et.al. | 2402.05887 | link |
2024-02-08 | Joint End-to-End Image Compression and Denoising: Leveraging Contrastive Learning and Multi-Scale Self-ONNs | Yuxin Xie et.al. | 2402.05582 | null |
2024-02-05 | TexShape: Information Theoretic Sentence Embedding for Language Models | H. Kaan Kale et.al. | 2402.05132 | link |
2024-02-07 | Compression of Structured Data with Autoencoders: Provable Benefit of Nonlinearities and Depth | Kevin Kögler et.al. | 2402.05013 | null |
2024-02-06 | A Novel Local and Hyper-Local Multicast Services Transmission Scheme for Beyond 5G Networks | Sweta Singh et.al. | 2402.03963 | null |
2024-02-06 | Cool-chic video: Learned video coding with 800 parameters | Thomas Leguay et.al. | 2402.03179 | link |
2024-02-05 | Perceptual Learned Image Compression via End-to-End JND-Based Optimization | Farhad Pakdaman et.al. | 2402.02836 | null |
2024-02-04 | Discovering More Effective Tensor Network Structure Search Algorithms via Large Language Models (LLMs) | Junhua Zeng et.al. | 2402.02456 | link |
2024-03-04 | RecNet: An Invertible Point Cloud Encoding through Range Image Embeddings for Multi-Robot Map Sharing and Reconstruction | Nikolaos Stathoulopoulos et.al. | 2402.02192 | null |
2024-02-03 | Generative Visual Compression: A Review | Bolin Chen et.al. | 2402.02140 | null |
2024-02-23 | Immersive Video Compression using Implicit Neural Representations | Ho Man Kwan et.al. | 2402.01596 | link |
2024-02-02 | Efficient Dynamic-NeRF Based Volumetric Video Coding with Rate Distortion Optimization | Zhiyu Zhang et.al. | 2402.01380 | null |
2024-02-02 | UCVC: A Unified Contextual Video Compression Framework with Joint P-frame and B-frame Coding | Jiayu Yang et.al. | 2402.01289 | null |
2024-02-02 | Flexible Variational Information Bottleneck: Achieving Diverse Compression with a Single Training | Sota Kudo et.al. | 2402.01238 | link |
2024-02-02 | The O2 software framework and GPU usage in ALICE online and offline reconstruction in Run 3 | Giulio Eulisse et.al. | 2402.01205 | null |
2024-02-01 | Compressed image quality assessment using stacking | S. Farhad Hosseini-Benvidi et.al. | 2402.00993 | null |
2024-02-04 | Evaluating Large Language Models for Generalization and Robustness via Data Compression | Yucheng Li et.al. | 2402.00861 | link |
2024-03-11 | LVC-LGMC: Joint Local and Global Motion Compensation for Learned Video Compression | Wei Jiang et.al. | 2402.00680 | null |
2024-02-01 | Gain of Grain: A Film Grain Handling Toolchain for VVC-based Open Implementations | Vignesh V Menon et.al. | 2402.00622 | null |
2024-01-31 | EPSD: Early Pruning with Self-Distillation for Efficient Model Compression | Dong Chen et.al. | 2402.00084 | null |
2024-01-31 | A Neural Enhancement Post-Processor with a Dynamic AV1 Encoder Configuration Strategy for CLIC 2024 | Darren Ramsook et.al. | 2401.18021 | null |
2024-01-31 | Robustly overfitting latents for flexible neural image compression | Yura Perugachi-Diaz et.al. | 2401.17789 | null |
2024-01-30 | A Group Theoretic Metric for Robot State Estimation Leveraging Chebyshev Interpolation | Varun Agrawal et.al. | 2401.17463 | null |
2024-01-30 | SLIC: A Learned Image Codec Using Structure and Color | Srivatsa Prativadibhayankaram et.al. | 2401.17246 | link |
2024-01-30 | Large Language Model Evaluation via Matrix Entropy | Lai Wei et.al. | 2401.17139 | link |
2024-01-30 | Local integrals of motion in dipole-conserving models with Hilbert space fragmentation | Patrycja Łydżba et.al. | 2401.17097 | null |
2024-01-29 | On Channel Simulation with Causal Rejection Samplers | Daniel Goc et.al. | 2401.16579 | null |
2024-01-29 | Spatial Decomposition and Temporal Fusion based Inter Prediction for Learned Video Compression | Xihua Sheng et.al. | 2401.15864 | null |
2024-01-29 | Bayesian one- and two-sided inference on the local effective dimension | Eduard Belitser et.al. | 2401.15816 | null |
2024-01-28 | Towards Arbitrary-Scale Histopathology Image Super-resolution: An Efficient Dual-branch Framework via Implicit Self-texture Enhancement | Minghong Duan et.al. | 2401.15613 | null |
2024-01-26 | Shadow simulation of quantum processes | Xuanqiang Zhao et.al. | 2401.14934 | null |
2024-01-26 | Study of the gOMP Algorithm for Recovery of Compressed Sensed Hyperspectral Images | Jon Alvarez Justo et.al. | 2401.14786 | null |
2024-01-26 | A Comparative Study of Compressive Sensing Algorithms for Hyperspectral Imaging Reconstruction | Jon Alvarez Justo et.al. | 2401.14762 | null |
2024-01-26 | Residual Quantization with Implicit Neural Codebooks | Iris Huijben et.al. | 2401.14732 | link |
2024-01-25 | Semantic Ensemble Loss and Latent Refinement for High-Fidelity Neural Image Compression | Daxin Li et.al. | 2401.14007 | null |
2024-02-07 | Perceptual-oriented Learned Image Compression with Dynamic Kernel | Nianxiang Fu et.al. | 2401.13967 | null |
2024-01-25 | Conditional Neural Video Coding with Spatial-Temporal Super-Resolution | Henan Wang et.al. | 2401.13959 | null |
2024-01-24 | FLLIC: Functionally Lossless Image Compression | Xi Zhang et.al. | 2401.13616 | null |
2024-01-23 | Fast Implicit Neural Representation Image Codec in Resource-limited Devices | Xiang Liu et.al. | 2401.12587 | null |
2024-01-22 | PairwiseHist: Fast, Accurate and Space-Efficient Approximate Query Processing with Data Compression | Aaron Hurst et.al. | 2401.12018 | null |
2024-01-22 | A Training-Free Defense Framework for Robust Learned Image Compression | Myungseo Song et.al. | 2401.11902 | null |
2024-01-21 | Another Way to the Top: Exploit Contextual Clustering in Learned Image Coding | Yichi Zhang et.al. | 2401.11615 | null |
2024-01-21 | ColorVideoVDP: A visual difference predictor for image, video and display distortions | Rafal K. Mantiuk et.al. | 2401.11485 | link |
2024-01-21 | Data-driven compression of electron-phonon interactions | Yao Luo et.al. | 2401.11393 | null |
2024-01-20 | Learned Image Compression with Dual-Branch Encoder and Conditional Information Coding | Haisheng Fu et.al. | 2401.11093 | null |
2024-01-19 | NN-VVC: Versatile Video Coding boosted by self-supervisedly learned image coding for machines | Jukka I. Ahonen et.al. | 2401.10761 | null |
2024-01-19 | Bridging the gap between image coding for machines and humans | Nam Le et.al. | 2401.10732 | null |
2024-01-18 | Attack and Defense Analysis of Learned Image Compression | Tianyu Zhu et.al. | 2401.10345 | null |
2024-01-18 | Explaining the Implicit Neural Canvas: Connecting Pixels to Neurons by Tracing their Contributions | Namitha Padmanabhan et.al. | 2401.10217 | null |
2024-01-18 | Depth Over RGB: Automatic Evaluation of Open Surgery Skills Using Depth Camera | Ido Zuckerman et.al. | 2401.10037 | null |
2024-01-18 | Memory Efficient Corner Detection for Event-driven Dynamic Vision Sensors | Pao-Sheng Vincent Sun et.al. | 2401.09797 | null |
2024-01-18 | Compressing MIMO Channel Submatrices with Tucker Decomposition: Enabling Efficient Storage and Reducing SINR Computation Overhead | Yuanwei Zhang et.al. | 2401.09792 | null |
2024-01-17 | Idempotence and Perceptual Image Compression | Tongda Xu et.al. | 2401.08920 | link |
2024-01-16 | End-to-End Optimized Image Compression with the Frequency-Oriented Transform | Yuefeng Zhang et.al. | 2401.08194 | null |
2024-01-17 | Learned Image Compression with ROI-Weighted Distortion and Bit Allocation | Wei Jiang et.al. | 2401.08154 | null |
2024-01-15 | Convolutional Neural Network Compression via Dynamic Parameter Rank Pruning | Manish Sharma et.al. | 2401.08014 | null |
2024-01-15 | Machine Perceptual Quality: Evaluating the Impact of Severe Lossy Compression on Audio and Image Models | Dan Jacobellis et.al. | 2401.07957 | link |
2024-01-14 | Exploring Compressed Image Representation as a Perceptual Proxy: A Study | Chen-Hsiu Huang et.al. | 2401.07200 | link |
2024-01-13 | Progressive Feature Fusion Network for Enhancing Image Quality Assessment | Kaiqun Wu et.al. | 2401.06992 | null |
2024-01-12 | Efficient Parallel Algorithms for Inpainting-Based Representations of 4K Images -- Part II: Spatial and Tonal Data Optimization | Niklas Kämper et.al. | 2401.06747 | null |
2024-03-18 | LiDAR Depth Map Guided Image Compression Model | Alessandro Gnutti et.al. | 2401.06517 | null |
2024-01-11 | Transformer Masked Autoencoders for Next-Generation Wireless Communications: Architecture and Opportunities | Abdullah Zayat et.al. | 2401.06274 | null |
2024-01-11 | MGARD: A multigrid framework for high-performance, error-controlled data compression and refactoring | Qian Gong et.al. | 2401.05994 | null |
2024-01-10 | SnapCap: Efficient Snapshot Compressive Video Captioning | Jianqiao Sun et.al. | 2401.04903 | null |
2024-01-09 | Modified Levenberg-Marquardt Algorithm For Tensor CP Decomposition in Image Compression | Ramin Goudarzi Karim et.al. | 2401.04670 | null |
2024-01-09 | Optimal Transcoding Resolution Prediction for Efficient Per-Title Bitrate Ladder Estimation | Jinhai Yang et.al. | 2401.04405 | null |
2024-01-08 | Low-light Image Enhancement via CLIP-Fourier Guided Wavelet Diffusion | Minglong Xue et.al. | 2401.03788 | link |
2024-01-08 | A Video Coding Method Based on Neural Network for CLIC2024 | Zhengang Li et.al. | 2401.03623 | null |
2024-01-06 | Spatiotemporally adaptive compression for scientific dataset with feature preservation -- a case study on simulation data with extreme climate events analysis | Qian Gong et.al. | 2401.03317 | null |
2024-01-06 | Comparison of spectrum models as applied to single-particle | Thomas A. Trainor et.al. | 2401.03290 | null |
2024-01-06 | Transferable Learned Image Compression-Resistant Adversarial Perturbations | Yang Sui et.al. | 2401.03115 | null |
2024-01-05 | MsDC-DEQ-Net: Deep Equilibrium Model (DEQ) with Multi-scale Dilated Convolution for Image Compressive Sensing (CS) | Youhao Yu et.al. | 2401.02884 | null |
2024-03-08 | Importance Matching Lemma for Lossy Compression with Side Information | Buu Phan et.al. | 2401.02609 | null |
2024-01-04 | Cool-Chic: Perceptually Tuned Low Complexity Overfitted Image Coder | Théo Ladune et.al. | 2401.02156 | link |
2024-01-04 | ED: Perceptually tuned Enhanced Compression Model | Pierrick Philippe et.al. | 2401.02145 | null |
2024-01-02 | NU-Class Net: A Novel Deep Learning-based Approach for Video Quality Enhancement | Parham Zilouchian Moghaddam et.al. | 2401.01163 | null |
2024-01-28 | Higher-Order Cellular Automata Generated Symmetry-Protected Topological Phases and Detection Through Multi-Point Strange Correlators | Jie-Yu Zhang et.al. | 2401.00505 | null |
2023-12-28 | Selective Run-Length Encoding | Xutan Peng et.al. | 2312.17024 | null |
2023-12-29 | FFCA-Net: Stereo Image Compression via Fast Cascade Alignment of Side Information | Yichong Xia et.al. | 2312.16963 | null |
2023-12-26 | Range Entropy Queries and Partitioning | Sanjay Krishnan et.al. | 2312.15959 | null |
2023-12-25 | MaskCRT: Masked Conditional Residual Transformer for Learned Video Compression | Yi-Hsin Chen et.al. | 2312.15829 | null |
2023-12-25 | On Robust Wasserstein Barycenter: The Model and Algorithm | Xu Wang et.al. | 2312.15762 | null |
2023-12-25 | Scalable Face Image Coding via StyleGAN Prior: Towards Compression for Human-Machine Collaborative Vision | Qi Mao et.al. | 2312.15622 | null |
2023-12-22 | The Rate-Distortion-Perception-Classification Tradeoff: Joint Source Coding and Modulation via Inverse-Domain GANs | Junli Fang et.al. | 2312.14792 | null |
2024-01-09 | Enhanced Color Palette Modeling for Lossless Screen Content Compression | Hannah Och et.al. | 2312.14491 | null |
2023-12-30 | Efficient Communication in Federated Learning Using Floating-Point Lossy Compression | Grant Wilkins et.al. | 2312.13461 | null |
2023-12-19 | A Huffman based short message service compression technique using adjacent distance array | Pranta Sarker et.al. | 2312.12495 | null |
2023-12-19 | Full-reference Video Quality Assessment for User Generated Content Transcoding | Zihao Qi et.al. | 2312.12317 | null |
2023-12-19 | Low-Consumption Partial Transcoding by HEVC | Mohsen Abdoli et.al. | 2312.12174 | link |
2023-12-19 | Comparative Study of Hardware and Software Power Measurements in Video Compression | Angeliki Katsenou et.al. | 2312.12150 | null |
2023-12-18 | Blind-Touch: Homomorphic Encryption-Based Distributed Neural Network Inference for Privacy-Preserving Fingerprint Authentication | Hyunmin Choi et.al. | 2312.11575 | link |
2024-01-11 | Quantized Decoder in Learned Image Compression for Deterministic Reconstruction | Esin Koyuncu et.al. | 2312.11209 | null |
2023-12-19 | A Computationally Efficient Neural Video Compression Accelerator Based on a Sparse CNN-Transformer Hybrid Network | Siyu Zhang et.al. | 2312.10716 | null |
2023-12-17 | IntraSeismic: a coordinate-based learning approach to seismic inversion | Juan Romero et.al. | 2312.10568 | null |
2023-12-17 | Light-weight CNN-based VVC Inter Partitioning Acceleration | Yiqun Liu et.al. | 2312.10567 | null |
2023-12-16 | Statistical Analysis of Inter Coding in VVC Test Model (VTM) | Yiqun Liu et.al. | 2312.10406 | null |
2023-12-15 | IQNet: Image Quality Assessment Guided Just Noticeable Difference Prefiltering For Versatile Video Coding | Yu-Han Sun et.al. | 2312.09799 | null |
2023-12-15 | Towards Neuromorphic Compression based Neural Sensing for Next-Generation Wireless Implantable Brain Machine Interface | Vivek Mohan et.al. | 2312.09503 | null |
2023-12-14 | Geometry-Corrected Geodesic Motion Modeling with Per-Frame Camera Motion for 360-Degree Video Compression | Andy Regensky et.al. | 2312.09266 | link |
2023-12-14 | Efficient Online Learning of Contact Force Models for Connector Insertion | Kevin Tracy et.al. | 2312.09190 | null |
2023-12-13 | Balanced and Deterministic Weight-sharing Helps Network Performance | Oscar Chang et.al. | 2312.08401 | null |
2023-12-13 | Preparing VVC for Streaming: A Fast Multi-Rate Encoding Approach | Yiqun Liu et.al. | 2312.08330 | null |
2023-12-13 | CenterGrasp: Object-Aware Implicit Representation Learning for Simultaneous Shape Reconstruction and 6-DoF Grasp Estimation | Eugenio Chisari et.al. | 2312.08240 | null |
2023-12-13 | Explainable Trajectory Representation through Dictionary Learning | Yuanbo Tang et.al. | 2312.08052 | null |
2023-12-12 | Deep Hierarchical Video Compression | Ming Lu et.al. | 2312.07126 | null |
2023-12-12 | Communication Cost Reduction for Subgraph Counting under Local Differential Privacy via Hash Functions | Quentin Hillebrand et.al. | 2312.07055 | link |
2023-12-11 | RAFIC: Retrieval-Augmented Few-shot Image Classification | Hangfei Lin et.al. | 2312.06868 | link |
2023-12-11 | A New Projection Pursuit Index for Big Data | Yajie Duan et.al. | 2312.06465 | null |
2023-12-11 | Variational Auto-Encoder Based Deep Learning Technique For Filling Gaps in Reacting PIV Data | Shashank Yellapantula et.al. | 2312.06461 | null |
2023-12-07 | Analysis of Coding Gain Due to In-Loop Reshaping | Chau-Wai Wong et.al. | 2312.04022 | null |
2023-12-05 | C3: High-performance and low-complexity neural compression from a single image or video | Hyunjik Kim et.al. | 2312.02753 | null |
2023-12-05 | Unified learning-based lossy and lossless JPEG recompression | Jianghui Zhang et.al. | 2312.02705 | null |
2023-12-05 | Accelerating Learnt Video Codecs with Gradient Decay and Layer-wise Distillation | Tianhao Peng et.al. | 2312.02605 | null |
2023-12-04 | Hyperspectral Image Compression Using Sampling and Implicit Neural Representations | Shima Rezasoltani et.al. | 2312.01558 | null |
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-02-27 | FlexVAR: Flexible Visual Autoregressive Modeling without Residual Prediction | Siyu Jiao et.al. | 2502.20313 | null |
2025-02-27 | Mobius: Text to Seamless Looping Video Generation via Latent Shift | Xiuli Bi et.al. | 2502.20307 | null |
2025-02-27 | Low-rank tensor completion via a novel minimax | Hongbing Zhang et.al. | 2502.19979 | null |
2025-02-27 | Alleviating Distribution Shift in Synthetic Data for Machine Translation Quality Estimation | Xiang Geng et.al. | 2502.19941 | null |
2025-02-27 | Picking the Cream of the Crop: Visual-Centric Data Selection with Collaborative Agents | Zhenyu Liu et.al. | 2502.19917 | null |
2025-02-27 | High-Fidelity Relightable Monocular Portrait Animation with Lighting-Controllable Video Diffusion Model | Mingtao Guo et.al. | 2502.19894 | null |
2025-02-27 | Striving for Faster and Better: A One-Layer Architecture with Auto Re-parameterization for Low-Light Image Enhancement | Nan An et.al. | 2502.19867 | null |
2025-02-27 | LMHLD: A Large-scale Multi-source High-resolution Landslide Dataset for Landslide Detection based on Deep Learning | Guanting Liu et.al. | 2502.19866 | null |
2025-02-27 | Adaptive Score Alignment Learning for Continual Perceptual Quality Assessment of 360-Degree Videos in Virtual Reality | Kanglei Zhou et.al. | 2502.19644 | null |
2025-02-26 | 3D Nephrographic Image Synthesis in CT Urography with the Diffusion Model and Swin Transformer | Hongkun Yu et.al. | 2502.19623 | null |
2025-02-26 | Distill Not Only Data but Also Rewards: Can Smaller Language Models Surpass Larger Ones? | Yudi Zhang et.al. | 2502.19557 | null |
2025-02-26 | CLIP-Optimized Multimodal Image Enhancement via ISP-CNN Fusion for Coal Mine IoVT under Uneven Illumination | Shuai Wang et.al. | 2502.19450 | null |
2025-02-26 | Does 3D Gaussian Splatting Need Accurate Volumetric Rendering? | Adam Celarek et.al. | 2502.19318 | null |
2025-02-27 | RetinaRegen: A Hybrid Model for Readability and Detail Restoration in Fundus Images | Yuhan Tang et.al. | 2502.19153 | null |
2025-02-26 | Max360IQ: Blind Omnidirectional Image Quality Assessment with Multi-axis Attention | Jiebin Yan et.al. | 2502.19046 | null |
2025-02-26 | InternVQA: Advancing Compressed Video Quality Assessment with Distilling Large Foundation Model | Fengbin Guan et.al. | 2502.19026 | null |
2025-02-26 | Hyperspectral image reconstruction by deep learning with super-Rayleigh speckles | Ziyan Chen et.al. | 2502.18777 | null |
2025-02-25 | Is OpenAlex Suitable for Research Quality Evaluation and Which Citation Indicator is Best? | Mike Thelwall et.al. | 2502.18427 | null |
2025-02-25 | LAG: LLM agents for Leaderboard Auto Generation on Demanding | Jian Wu et.al. | 2502.18209 | null |
2025-02-25 | OpenFly: A Versatile Toolchain and Large-scale Benchmark for Aerial Vision-Language Navigation | Yunpeng Gao et.al. | 2502.18041 | null |
2025-02-25 | Towards Better Understanding of Program-of-Thought Reasoning in Cross-Lingual and Multilingual Environments | Patomporn Payoungkhamdee et.al. | 2502.17956 | null |
2025-02-25 | Integrating Boosted learning with Differential Evolution (DE) Optimizer: A Prediction of Groundwater Quality Risk Assessment in Odisha | Sonalika Subudhi et.al. | 2502.17929 | null |
2025-02-24 | Optimized Memory System Architecture for VESA VDC-M Decoder with Multi-Slice Support | Hannah Yang et.al. | 2502.17729 | null |
2025-02-24 | Requirements for Quality Assurance of AI Models for Early Detection of Lung Cancer | Horst K. Hahn et.al. | 2502.17639 | null |
2025-02-25 | KV-Edit: Training-Free Image Editing for Precise Background Preservation | Tianrui Zhu et.al. | 2502.17363 | link |
2025-02-24 | Motion-Robust T2* Quantification from Gradient Echo MRI with Physics-Informed Deep Learning | Hannah Eichhorn et.al. | 2502.17209 | null |
2025-02-24 | SFLD: Reducing the content bias for AI-generated Image Detection | Seoyeon Gye et.al. | 2502.17105 | null |
2025-02-24 | Pleno-Generation: A Scalable Generative Face Video Compression Framework with Bandwidth Intelligence | Bolin Chen et.al. | 2502.17085 | null |
2025-02-24 | PQDAST: Depth-Aware Arbitrary Style Transfer for Games via Perceptual Quality-Guided Distillation | Eleftherios Ioannou et.al. | 2502.16996 | null |
2025-02-24 | Multi-Dimensional Quality Assessment for Text-to-3D Assets: Dataset and Model | Kang Fu et.al. | 2502.16915 | null |
2025-02-24 | CRTrack: Low-Light Semi-Supervised Multi-object Tracking Based on Consistency Regularization | Zijing Zhao et.al. | 2502.16809 | null |
2025-02-23 | Automatic Input Rewriting Improves Translation with Large Language Models | Dayeon Ki et.al. | 2502.16682 | link |
2025-02-23 | AdverX-Ray: Ensuring X-Ray Integrity Through Frequency-Sensitive Adversarial VAEs | Francisco Caetano et.al. | 2502.16610 | null |
2025-02-22 | Multi-Party Data Pricing for Complex Data Trading Markets: A Rubinstein Bargaining Approach | Bing Mi et.al. | 2502.16363 | null |
2025-02-21 | Improved Partial Differential Equation and Fast Approximation Algorithm for Hazy/Underwater/Dust Storm Image Enhancement | Uche A. Nnolim et.al. | 2502.15986 | null |
2025-02-21 | Evaluate with the Inverse: Efficient Approximation of Latent Explanation Quality Distribution | Carlos Eiras-Franco et.al. | 2502.15403 | null |
2025-02-21 | Super-Resolution for Interferometric Imaging: Model Comparisons and Performance Analysis | Hasan Berkay Abdioglu et.al. | 2502.15397 | null |
2025-02-21 | Ultrasound Phase Aberrated Point Spread Function Estimation with Convolutional Neural Network: Simulation Study | Wei-Hsiang Shen et.al. | 2502.15298 | null |
2025-02-21 | Omnidirectional Image Quality Captioning: A Large-scale Database and A New Model | Jiebin Yan et.al. | 2502.15271 | link |
2025-02-21 | Lung-DDPM: Semantic Layout-guided Diffusion Models for Thoracic CT Image Synthesis | Yifan Jiang et.al. | 2502.15204 | link |
2025-02-21 | LUMINA-Net: Low-light Upgrade through Multi-stage Illumination and Noise Adaptation Network for Image Enhancement | Namrah Siddiqua et.al. | 2502.15186 | null |
2025-02-21 | M3-AGIQA: Multimodal, Multi-Round, Multi-Aspect AI-Generated Image Quality Assessment | Chuan Cui et.al. | 2502.15167 | null |
2025-02-21 | Optimized Pap Smear Image Enhancement: Hybrid PMD Filter-CLAHE Using Spider Monkey Optimization | Ach Khozaimi et.al. | 2502.15156 | null |
2025-02-20 | Hardware-Friendly Static Quantization Method for Video Diffusion Transformers | Sanghyun Yi et.al. | 2502.15077 | null |
2025-02-20 | Multi-Source Static CT with Adaptive Fluence Modulation to Minimize Hallucinations in Generative Reconstructions | Matthew Tivnan et.al. | 2502.15060 | null |
2025-02-20 | GS-Cache: A GS-Cache Inference Framework for Large-scale Gaussian Splatting Models | Miao Tao et.al. | 2502.14938 | null |
2025-02-20 | Compact Latent Representation for Image Compression (CLRIC) | Ayman A. Ameen et.al. | 2502.14937 | null |
2025-02-20 | Benchmarking Multimodal RAG through a Chart-based Document Question-Answering Generation Framework | Yuming Yang et.al. | 2502.14864 | null |
2025-02-20 | Towards a Perspectivist Turn in Argument Quality Assessment | Julia Romberg et.al. | 2502.14501 | null |
2025-02-20 | Early-Exit and Instant Confidence Translation Quality Estimation | Vilém Zouhar et.al. | 2502.14429 | null |
2025-02-20 | NeRF-3DTalker: Neural Radiance Field with 3D Prior Aided Audio Disentanglement for Talking Head Synthesis | Xiaoxing Liu et.al. | 2502.14178 | null |
2025-02-19 | A Baseline Method for Removing Invisible Image Watermarks using Deep Image Prior | Hengyue Liang et.al. | 2502.13998 | link |
2025-02-19 | Remote Sensing Semantic Segmentation Quality Assessment based on Vision Language Model | Huiying Shi et.al. | 2502.13990 | null |
2025-02-19 | A Lightweight Model for Perceptual Image Compression via Implicit Priors | Hao Wei et.al. | 2502.13988 | null |
2025-02-19 | An Overall Real-Time Mechanism for Classification and Quality Evaluation of Rice | Wanke Xia et.al. | 2502.13764 | null |
2025-02-19 | HawkBench: Investigating Resilience of RAG Methods on Stratified Information-Seeking Tasks | Hongjin Qian et.al. | 2502.13465 | null |
2025-02-19 | OGBoost: A Python Package for Ordinal Gradient Boosting | Mansour T. A. Sharabiani et.al. | 2502.13456 | null |
2025-02-18 | VUS: Effective and Efficient Accuracy Measures for Time-Series Anomaly Detection | Paul Boniol et.al. | 2502.13318 | link |
2025-02-18 | Optimal covering of rectangular grid graphs with tours of constrained length | Sergey Bereg et.al. | 2502.13306 | null |
2025-02-18 | Application of Context-dependent Interpretation of Biosignals Recognition to Control a Bionic Multifunctional Hand Prosthesis | Pawel Trajdos et.al. | 2502.13301 | null |
2025-02-18 | Enhancing Machine Learning Performance through Intelligent Data Quality Assessment: An Unsupervised Data-centric Framework | Manal Rahal et.al. | 2502.13198 | null |
2025-02-18 | GS-QA: Comprehensive Quality Assessment Benchmark for Gaussian Splatting View Synthesis | Pedro Martin et.al. | 2502.13196 | null |
2025-02-18 | Language Barriers: Evaluating Cross-Lingual Performance of CNN and Transformer Architectures for Speech Quality Estimation | Wafaa Wardah et.al. | 2502.13004 | null |
2025-02-18 | VidCapBench: A Comprehensive Benchmark of Video Captioning for Controllable Text-to-Video Generation | Xinlong Chen et.al. | 2502.12782 | null |
2025-02-18 | Efficient Machine Translation Corpus Generation: Integrating Human-in-the-Loop Post-Editing with Large Language Models | Kamer Ali Yuksel et.al. | 2502.12755 | link |
2025-02-18 | 3D Shape-to-Image Brownian Bridge Diffusion for Brain MRI Synthesis from Cortical Surfaces | Fabian Bongratz et.al. | 2502.12742 | null |
2025-02-18 | Translate Smart, not Hard: Cascaded Translation Systems with Quality-Aware Deferral | António Farinhas et.al. | 2502.12701 | null |
2025-02-19 | Spherical Dense Text-to-Image Synthesis | Timon Winter et.al. | 2502.12691 | null |
2025-02-18 | Design and Implementation of a Dual Uncrewed Surface Vessel Platform for Bathymetry Research under High-flow Conditions | Dinesh Kumar et.al. | 2502.12539 | null |
2025-02-18 | Comprehensive Assessment and Analysis for NSFW Content Erasure in Text-to-Image Diffusion Models | Die Chen et.al. | 2502.12527 | null |
2025-02-18 | Local Flaw Detection with Adaptive Pyramid Image Fusion Across Spatial Sampling Resolution for SWRs | Siyu You et.al. | 2502.12512 | null |
2025-02-17 | Token Communications: A Unified Framework for Cross-modal Context-aware Semantic Communications | Li Qiao et.al. | 2502.12096 | null |
2025-02-17 | Low-Rank Thinning | Annabelle Michael Carrell et.al. | 2502.12063 | null |
2025-02-17 | MultiFlow: A unified deep learning framework for multi-vessel classification, segmentation and clustering of phase-contrast MRI validated on a multi-site single ventricle patient cohort | Tina Yao et.al. | 2502.11993 | null |
2025-02-17 | Deep Spatio-Temporal Neural Network for Air Quality Reanalysis | Ammar Kheder et.al. | 2502.11941 | link |
2025-02-17 | No-reference geometry quality assessment for colorless point clouds via list-wise rank learning | Zheng Li et.al. | 2502.11726 | link |
2025-02-17 | The Worse The Better: Content-Aware Viewpoint Generation Network for Projection-related Point Cloud Quality Assessment | Zhiyong Su et.al. | 2502.11710 | link |
2025-02-17 | Assessing Correctness in LLM-Based Code Generation via Uncertainty Estimation | Arindam Sharma et.al. | 2502.11620 | null |
2025-02-17 | Syllables to Scenes: Literary-Guided Free-Viewpoint 3D Scene Synthesis from Japanese Haiku | Chunan Yu et.al. | 2502.11586 | null |
2025-02-18 | AI-Assisted Thin Section Image Processing for Pore-Throat Characterization in Tight Clastic Rocks | Muhammad Risha et.al. | 2502.11523 | null |
2025-02-17 | Semantically Robust Unsupervised Image Translation for Paired Remote Sensing Images | Sheng Fang et.al. | 2502.11468 | null |
2025-02-17 | HellaSwag-Pro: A Large-Scale Bilingual Benchmark for Evaluating the Robustness of LLMs in Commonsense Reasoning | Xiaoyuan Li et.al. | 2502.11393 | null |
2025-02-17 | A Physics-Informed Blur Learning Framework for Imaging Systems | Liqun Chen et.al. | 2502.11382 | null |
2025-02-17 | LLMs can Perform Multi-Dimensional Analytic Writing Assessments: A Case Study of L2 Graduate-Level Academic English Writing | Zhengxiang Wang et.al. | 2502.11368 | null |
2025-02-16 | Generating Skyline Datasets for Data Science Models | Mengying Wang et.al. | 2502.11262 | null |
2025-02-16 | Exploiting network optimization stability for enhanced PET image denoising using deep image prior | Fumio Hashimoto et.al. | 2502.11259 | null |
2025-02-16 | Are Generative Models Underconfident? An Embarrassingly Simple Quality Estimation Approach | Tu Anh Dinh et.al. | 2502.11115 | null |
2025-02-16 | Imaging current flow and injection in scalable graphene devices through NV-magnetometry | Kaj Dockx et.al. | 2502.11076 | null |
2025-02-15 | Automatic Quality Assessment of First Trimester Crown-Rump-Length Ultrasound Images | Sevim Cengiz et.al. | 2502.10908 | null |
2025-02-15 | AquaScope: Reliable Underwater Image Transmission on Mobile Devices | Beitong Tian et.al. | 2502.10891 | null |
2025-02-15 | E-3DGS: Event-Based Novel View Rendering of Large-Scale Scenes Using 3D Gaussian Splatting | Sohaib Zahid et.al. | 2502.10827 | null |
2025-02-14 | Large Language Models and Synthetic Data for Monitoring Dataset Mentions in Research Papers | Aivin V. Solatorio et.al. | 2502.10263 | null |
2025-02-14 | Step-Video-T2V Technical Report: The Practice, Challenges, and Future of Video Foundation Model | Guoqing Ma et.al. | 2502.10248 | link |
2025-02-14 | ProReco: A Process Discovery Recommender System | Tsung-Hao Huang et.al. | 2502.10230 | null |
2025-02-14 | RealCam-I2V: Real-World Image-to-Video Generation with Interactive Complex Camera Control | Teng Li et.al. | 2502.10059 | null |
2025-02-14 | AffectSRNet : Facial Emotion-Aware Super-Resolution Network | Syed Sameen Ahmad Rizvi et.al. | 2502.09932 | null |
2025-02-14 | A Deep Learning Approach to Interface Color Quality Assessment in HCI | Shixiao Wang et.al. | 2502.09914 | null |
2025-02-14 | Compression-Aware One-Step Diffusion Model for JPEG Artifact Removal | Jinpei Guo et.al. | 2502.09873 | null |
2025-02-14 | Optimizing GPT for Video Understanding: Zero-Shot Performance and Prompt Engineering | Mark Beliaev et.al. | 2502.09573 | null |
2025-02-13 | Learned Correction Methods for Ultrasound Computed Tomography Imaging Using Simplified Physics Models | Luke Lozenski et.al. | 2502.09546 | null |
2025-02-13 | SQ-GAN: Semantic Image Communications Using Masked Vector Quantization | Francesco Pezone et.al. | 2502.09520 | link |
2025-02-13 | A Physics-Informed Deep Learning Model for MRI Brain Motion Correction | Mojtaba Safari et.al. | 2502.09296 | link |
2025-02-13 | ConsistentDreamer: View-Consistent Meshes Through Balanced Multi-View Gaussian Optimization | Onat Şahin et.al. | 2502.09278 | null |
2025-02-13 | PixLift: Accelerating Web Browsing via AI Upscaling | Yonas Atinafu et.al. | 2502.08995 | null |
2025-02-13 | Some problems of developing astrophysical equipment and combining it with optical telescopes | Edward Emelianov et.al. | 2502.08992 | null |
2025-02-13 | Dynamic watermarks in images generated by diffusion models | Yunzhuo Chen et.al. | 2502.08927 | null |
2025-02-12 | A procedure for assessing of machine health index data prediction quality | Daniel Kuzio et.al. | 2502.08837 | null |
2025-02-12 | Ultrasound imaging of cortical bone: cortex geometry and measurement of porosity based on wave speed for bone remodeling estimation | Amadou S. Dia et.al. | 2502.08824 | null |
2025-02-12 | Skrr: Skip and Re-use Text Encoder Layers for Memory Efficient Text-to-Image Generation | Hoigi Seo et.al. | 2502.08690 | null |
2025-02-12 | Light-A-Video: Training-free Video Relighting via Progressive Light Fusion | Yujie Zhou et.al. | 2502.08590 | link |
2025-02-12 | Quality-Aware Decoding: Unifying Quality Estimation and Decoding | Sai Koneru et.al. | 2502.08561 | null |
2025-02-12 | A Survey on Image Quality Assessment: Insights, Analysis, and Future Outlook | Chengqian Ma et.al. | 2502.08540 | null |
2025-02-12 | TuMag: the tunable magnetograph for the Sunrise III mission | J. C. del Toro Iniesta et.al. | 2502.08268 | null |
2025-02-12 | Forward and Inverse Problems in Nonlinear Acoustics | Barbara Kaltenbacher et.al. | 2502.08194 | null |
2025-02-11 | Automatic Prostate Volume Estimation in Transabdominal Ultrasound Images | Tiziano Natali et.al. | 2502.07859 | null |
2025-02-11 | Magic 1-For-1: Generating One Minute Video Clips within One Minute | Hongwei Yi et.al. | 2502.07701 | link |
2025-02-11 | An Improved Optimal Proximal Gradient Algorithm for Non-Blind Image Deblurring | Qingsong Wang et.al. | 2502.07602 | null |
2025-02-13 | Enhance-A-Video: Better Generated Video for Free | Yang Luo et.al. | 2502.07508 | link |
2025-02-11 | Compound Mask for Divergent Wave Imaging in Medical Ultrasound | Zahraa Alzein et.al. | 2502.07453 | null |
2025-02-11 | On Iterative Evaluation and Enhancement of Code Quality Using GPT-4o | Rundong Liu et.al. | 2502.07399 | link |
2025-02-11 | USRNet: Unified Scene Recovery Network for Enhancing Traffic Imaging under Multiple Adverse Weather Conditions | Yuxu Lu et.al. | 2502.07372 | link |
2025-02-11 | Multi-Task-oriented Nighttime Haze Imaging Enhancer for Vision-driven Measurement Systems | Ai Chen et.al. | 2502.07351 | link |
2025-02-11 | Playmate: Flexible Control of Portrait Animation via 3D-Implicit Space Guided Diffusion | Xingpei Ma et.al. | 2502.07203 | null |
2025-02-11 | HDCompression: Hybrid-Diffusion Image Compression for Ultra-Low Bitrates | Lei Lu et.al. | 2502.07160 | null |
2025-02-10 | Evaluation of Multilingual Image Captioning: How far can we get with CLIP models? | Gonçalo Gomes et.al. | 2502.06600 | link |
2025-02-10 | Image Intrinsic Scale Assessment: Bridging the Gap Between Quality and Resolution | Vlad Hosu et.al. | 2502.06476 | null |
2025-02-10 | How Humans Help LLMs: Assessing and Incentivizing Human Preference Annotators | Shang Liu et.al. | 2502.06387 | null |
2025-02-10 | Guidance-base Diffusion Models for Improving Photoacoustic Image Quality | Tatsuhiro Eguchi et.al. | 2502.06354 | null |
2025-02-10 | LANTERN++: Enhanced Relaxed Speculative Decoding with Static Tree Drafting for Visual Auto-regressive Models | Sihwan Park et.al. | 2502.06352 | null |
2025-02-10 | A CT Geometry With Multiple Centers Of Rotation For Solving Sparse View Problem | Jiayu Duan et.al. | 2502.06125 | null |
2025-02-10 | Token-Domain Multiple Access: Exploiting Semantic Orthogonality for Collision Mitigation | Li Qiao et.al. | 2502.06118 | null |
2025-02-09 | Dual Caption Preference Optimization for Diffusion Models | Amir Saeidi et.al. | 2502.06023 | null |
2025-02-09 | A Comprehensive Survey on Image Signal Processing Approaches for Low-Illumination Image Enhancement | Muhammad Turab et.al. | 2502.05995 | null |
2025-02-09 | Multi-Branch Collaborative Learning Network for Video Quality Assessment in Industrial Video Search | Hengzhu Tang et.al. | 2502.05924 | null |
2025-02-09 | Devil is in the Details: Density Guidance for Detail-Aware Generation with Flow Models | Rafał Karczewski et.al. | 2502.05807 | null |
2025-02-08 | Semantic-Aware Adaptive Video Streaming Using Latent Diffusion Models for Wireless Networks | Zijiang Yan et.al. | 2502.05695 | null |
2025-02-08 | FreeBlend: Advancing Concept Blending with Staged Feedback-Driven Interpolation Diffusion | Yufan Zhou et.al. | 2502.05606 | null |
2025-02-07 | Distillation and Pruning for Scalable Self-Supervised Representation-Based Speech Quality Assessment | Benjamin Stahl et.al. | 2502.05356 | link |
2025-02-07 | AuraFusion360: Augmented Unseen Region Alignment for Reference-based 360° Unbounded Scene Inpainting | Chung-Ho Wu et.al. | 2502.05176 | null |
2025-02-07 | Meta Audiobox Aesthetics: Unified Automatic Quality Assessment for Speech, Music, and Sound | Andros Tjandra et.al. | 2502.05139 | link |
2025-02-07 | Cached Multi-Lora Composition for Multi-Concept Image Generation | Xiandong Zou et.al. | 2502.04923 | link |
2025-02-07 | Integration Concept of the CBM Micro Vertex Detector | Franz Matejcek et.al. | 2502.04858 | null |
2025-02-06 | ADIFF: Explaining audio difference using natural language | Soham Deshmukh et.al. | 2502.04476 | link |
2025-02-05 | DreamDPO: Aligning Text-to-3D Generation with Human Preferences via Direct Preference Optimization | Zhenglin Zhou et.al. | 2502.04370 | null |
2025-02-06 | BOUQuET: dataset, Benchmark and Open initiative for Universal Quality Evaluation in Translation | The Omnilingual MT Team et.al. | 2502.04314 | null |
2025-02-06 | Content-Rich AIGC Video Quality Assessment via Intricate Text Alignment and Motion-Aware Consistency | Shangkun Sun et.al. | 2502.04076 | link |
2025-02-06 | DICE: Distilling Classifier-Free Guidance into Text Embeddings | Zhenyu Zhou et.al. | 2502.03726 | null |
2025-02-05 | Quasi-Monte Carlo Methods: What, Why, and How? | Fred J. Hickernell et.al. | 2502.03644 | null |
2025-02-05 | Efficient Image Restoration via Latent Consistency Flow Matching | Elad Cohen et.al. | 2502.03500 | null |
2025-02-05 | A new method for structural diagnostics with muon tomography and deep learning | Lorenzo Pezzotti et.al. | 2502.03339 | null |
2025-02-05 | A Framework for Measuring the Quality of Infrastructure-as-Code Scripts | Pandu Ranga Reddy Konala et.al. | 2502.03127 | null |
2025-02-05 | Poisson Flow Joint Model for Multiphase contrast-enhanced CT | Rongjun Ge et.al. | 2502.03079 | null |
2025-02-05 | A Decade of Action Quality Assessment: Largest Systematic Survey of Trends, Challenges, and Future Directions | Hao Yin et.al. | 2502.02817 | null |
2025-02-04 | Muographic Image Upsampling with Machine Learning for Built Infrastructure Applications | William O'Donnell et.al. | 2502.02624 | null |
2025-02-04 | A comparison of translation performance between DeepL and Supertext | Alex Flückiger et.al. | 2502.02577 | link |
2025-02-04 | Privacy Attacks on Image AutoRegressive Models | Antoni Kowalczuk et.al. | 2502.02514 | link |
2025-02-04 | VideoJAM: Joint Appearance-Motion Representations for Enhanced Motion Generation in Video Models | Hila Chefer et.al. | 2502.02492 | null |
2025-02-04 | High-Fidelity Human Avatars from Laptop Webcams using Edge Compute | Akash Haridas et.al. | 2502.02468 | null |
2025-02-04 | Exploring the Feasibility of AI-Assisted Spine MRI Protocol Optimization Using DICOM Image Metadata | Alice Vian et.al. | 2502.02351 | null |
2025-02-04 | When Dimensionality Hurts: The Role of LLM Embedding Compression for Noisy Regression Tasks | Felix Drinkall et.al. | 2502.02199 | link |
2025-02-04 | PALQA: A Novel Parameterized Position-Aware Lossy Quantum Autoencoder using LSB Control Qubit for Efficient Image Compression | Ershadul Haque et.al. | 2502.02188 | null |
2025-02-05 | IPO: Iterative Preference Optimization for Text-to-Video Generation | Xiaomeng Yang et.al. | 2502.02088 | null |
2025-02-03 | Spectra of He isotopes and the $^3$He/$^4$He ratio | M. J. Boschini et.al. | 2502.01887 | null |
2025-02-03 | Sparse Measurement Medical CT Reconstruction using Multi-Fused Block Matching Denoising Priors | Maliha Hossain et.al. | 2502.01832 | null |
2025-02-03 | Generating Multi-Image Synthetic Data for Text-to-Image Customization | Nupur Kumari et.al. | 2502.01720 | null |
2025-02-03 | CLIP-DQA: Blindly Evaluating Dehazed Images from Global and Local Perspectives Using CLIP | Yirui Zeng et.al. | 2502.01707 | null |
2025-02-03 | Proposal and Evaluation of a Practical CBCT Dose Optimization Method | S. Gros et.al. | 2502.01509 | null |
2025-02-03 | Human Body Restoration with One-Step Diffusion Model and A New Benchmark | Jue Gong et.al. | 2502.01411 | null |
2025-02-03 | Explainability-Driven Quality Assessment for Rule-Based Systems | Oshani Seneviratne et.al. | 2502.01253 | null |
2025-02-03 | Imaging simulation of a dual-panel PET geometry with ultrafast TOF detectors | Taiyo Ishikawa et.al. | 2502.01006 | null |
2025-02-02 | Weak Supervision Dynamic KL-Weighted Diffusion Models Guided by Large Language Models | Julian Perry et.al. | 2502.00826 | null |
2025-02-02 | EmoTalkingGaussian: Continuous Emotion-conditioned Talking Head Synthesis | Junuk Cha et.al. | 2502.00654 | null |
2025-02-01 | Deep Task-Based Beamforming and Channel Data Augmentations for Enhanced Ultrasound Imaging | Ariel Amar et.al. | 2502.00524 | null |
2025-02-01 | A framework for river connectivity classification using temporal image processing and attention based neural networks | Timothy James Becker et.al. | 2502.00474 | null |
2025-01-31 | Trust and Trustworthiness from Human-Centered Perspective in HRI -- A Systematic Literature Review | Debora Firmino de Souza et.al. | 2501.19323 | null |
2025-01-31 | Inference-Time Text-to-Video Alignment with Diffusion Latent Beam Search | Yuta Oshima et.al. | 2501.19252 | null |
2025-01-31 | Ambient Denoising Diffusion Generative Adversarial Networks for Establishing Stochastic Object Models from Noisy Image Data | Xichen Xu et.al. | 2501.19094 | null |
2025-01-31 | OmniPhysGS: 3D Constitutive Gaussians for General Physics-Based Dynamics Generation | Yuchen Lin et.al. | 2501.18982 | null |
2025-01-31 | Distorting Embedding Space for Safety: A Defense Mechanism for Adversarially Robust Diffusion Models | Jaesin Ahn et.al. | 2501.18877 | link |
2025-01-29 | Fake News Detection After LLM Laundering: Measurement and Explanation | Rupak Kumar Das et.al. | 2501.18649 | link |
2025-01-31 | Task-based Regularization in Penalized Least-Squares for Binary Signal Detection Tasks in Medical Image Denoising | Wentao Chen et.al. | 2501.18418 | null |
2025-01-30 | Adaptive Video Streaming with AI-Based Optimization for Dynamic Network Conditions | Mohammad Tarik et.al. | 2501.18332 | null |
2025-01-30 | AGAV-Rater: Adapting Large Multimodal Model for AI-Generated Audio-Visual Quality Assessment | Yuqin Cao et.al. | 2501.18314 | null |
2025-02-03 | Efficient Feature Fusion for UAV Object Detection | Xudong Wang et.al. | 2501.17983 | null |
2025-01-29 | Discrete Dielectric Coatings for Length Control and Tunability of Half-Wave Dipole Antennas at 300 MHz Magnetic Resonance Imaging Applications | Aditya A Bhosale et.al. | 2501.17954 | null |
2025-01-29 | Leveraging In-Context Learning and Retrieval-Augmented Generation for Automatic Question Generation in Educational Domains | Subhankar Maity et.al. | 2501.17397 | null |
2025-01-29 | On the Coexistence and Ensembling of Watermarks | Aleksandar Petrov et.al. | 2501.17356 | link |
2025-01-28 | Giving the Old a Fresh Spin: Quality Estimation-Assisted Constrained Decoding for Automatic Post-Editing | Sourabh Deoghare et.al. | 2501.17265 | null |
2025-01-27 | Audio Large Language Models Can Be Descriptive Speech Quality Evaluators | Chen Chen et.al. | 2501.17202 | null |
2025-01-31 | IC-Portrait: In-Context Matching for View-Consistent Personalized Portrait | Han Yang et.al. | 2501.17159 | null |
2025-01-28 | Three-Dimensional Diffusion-Weighted Multi-Slab MRI With Slice Profile Compensation Using Deep Energy Model | Reza Ghorbani et.al. | 2501.17152 | null |
2025-01-28 | Evaluating CrowdSplat: Perceived Level of Detail for Gaussian Crowds | Xiaohan Sun et.al. | 2501.17085 | null |
2025-01-28 | EdgeMLOps: Operationalizing ML models with Cumulocity IoT and thin-edge.io for Visual quality Inspection | Kanishk Chaturvedi et.al. | 2501.17062 | null |
2025-01-28 | EZOA: Nançay HI follow-up observations in the Zone of Avoidance | A. C. Schröder et.al. | 2501.17038 | null |
2025-01-28 | Image-Space Gridding for Nonrigid Motion-Corrected MR Image Reconstruction | Kwang Eun Jang et.al. | 2501.16713 | null |
2025-01-25 | MambaTron: Efficient Cross-Modal Point Cloud Enhancement using Aggregate Selective State Space Modeling | Sai Tarun Inaganti et.al. | 2501.16384 | null |
2025-01-27 | Adaptive Iterative Compression for High-Resolution Files: an Approach Focused on Preserving Visual Quality in Cinematic Workflows | Leonardo Melo et.al. | 2501.16319 | null |
2025-01-27 | UDBE: Unsupervised Diffusion-based Brightness Enhancement in Underwater Images | Tatiana Taís Schein et.al. | 2501.16211 | link |
2025-01-27 | Skeleton-Guided-Translation: A Benchmarking Framework for Code Repository Translation with Fine-Grained Quality Evaluation | Xing Zhang et.al. | 2501.16050 | null |
2025-01-30 | Can Location Embeddings Enhance Super-Resolution of Satellite Imagery? | Daniel Panangian et.al. | 2501.15847 | null |
2025-01-26 | Advancing quantum imaging through learning theory | Yunkai Wang et.al. | 2501.15685 | null |
2025-01-26 | Radiologist-in-the-Loop Self-Training for Generalizable CT Metal Artifact Reduction | Chenglong Ma et.al. | 2501.15610 | link |
2025-01-26 | Differentiable Low-computation Global Correlation Loss for Monotonicity Evaluation in Quality Assessment | Yipeng Liu et.al. | 2501.15485 | null |
2025-01-25 | Image formation theory of optical coherence tomography with optical aberrations and its application for computational aberration correction | Shuichi Makita et.al. | 2501.15011 | null |
2025-01-24 | SyncAnimation: A Real-Time End-to-End Framework for Audio-Driven Human Pose and Talking Head Animation | Yujian Liu et.al. | 2501.14646 | null |
2025-01-24 | WanJuanSiLu: A High-Quality Open-Source Webtext Dataset for Low-Resource Languages | Jia Yu et.al. | 2501.14506 | link |
2025-01-24 | Enhancing Intelligibility for Generative Target Speech Extraction via Joint Optimization with Target Speaker ASR | Hao Ma et.al. | 2501.14477 | null |
2025-01-24 | Deep Learning-Powered Classification of Thoracic Diseases in Chest X-Rays | Yiming Lei et.al. | 2501.14279 | null |
2025-01-24 | CDI: Blind Image Restoration Fidelity Evaluation based on Consistency with Degraded Image | Xiaojun Tang et.al. | 2501.14264 | null |
2025-01-24 | GreedyPixel: Fine-Grained Black-Box Adversarial Attack Via Greedy Algorithm | Hanrui Wang et.al. | 2501.14230 | null |
2025-01-24 | Sparse Mixture-of-Experts for Non-Uniform Noise Reduction in MRI Images | Zeyun Deng et.al. | 2501.14198 | null |
2025-01-24 | VideoShield: Regulating Diffusion-based Video Generation Models via Watermarking | Runyi Hu et.al. | 2501.14195 | link |
2025-01-23 | AdEval: Alignment-based Dynamic Evaluation to Mitigate Data Contamination in Large Language Models | Yang Fan et.al. | 2501.13983 | null |
2025-01-23 | Improving Video Generation with Human Feedback | Jie Liu et.al. | 2501.13918 | null |
2025-01-23 | VARFVV: View-Adaptive Real-Time Interactive Free-View Video Streaming with Edge Computing | Qiang Hu et.al. | 2501.13630 | null |
2025-01-23 | Diffusion-based Perceptual Neural Video Compression with Temporal Diffusion Information Reuse | Wenzhuo Ma et.al. | 2501.13528 | null |
2025-01-23 | LDR-Net: A Novel Framework for AI-generated Image Detection via Localized Discrepancy Representation | JiaXin Chen et.al. | 2501.13475 | null |
2025-01-23 | From Images to Point Clouds: An Efficient Solution for Cross-media Blind Quality Assessment without Annotated Training | Yipeng Liu et.al. | 2501.13387 | null |
2025-01-23 | Enhanced Extractor-Selector Framework and Symmetrization Weighted Binary Cross-Entropy for Edge Detections | Hao Shu et.al. | 2501.13365 | null |
2025-01-22 | UniRestore: Unified Perceptual and Task-Oriented Image Restoration Model Using Diffusion Prior | I-Hsiang Chen et.al. | 2501.13134 | null |
2025-01-23 | Accelerate High-Quality Diffusion Models with Inner Loop Feedback | Matthew Gwilliam et.al. | 2501.13107 | null |
2025-01-22 | Real-time Terahertz Compressive Optical-Digital Neural Network Imaging | Shao-Hsuan Wu et.al. | 2501.13065 | null |
2025-01-22 | Sketch and Patch: Efficient 3D Gaussian Representation for Man-Made Scenes | Yuang Shi et.al. | 2501.13045 | null |
2025-01-22 | Characterizing Collective Efforts in Content Sharing and Quality Control for ADHD-relevant Content on Video-sharing Platforms | Hanxiu 'Hazel' Zhu et.al. | 2501.13020 | null |
2025-01-22 | Paper Quality Assessment based on Individual Wisdom Metrics from Open Peer Review | Andrii Zahorodnii et.al. | 2501.13014 | null |
2025-01-22 | SoundSpring: Loss-Resilient Audio Transceiver with Dual-Functional Masked Language Modeling | Shengshi Yao et.al. | 2501.12696 | null |
2025-01-22 | Approximate Puzzlepiece Compositing | Xuan Huang et.al. | 2501.12581 | null |
2025-01-21 | Interaction Dataset of Autonomous Vehicles with Traffic Lights and Signs | Zheng Li et.al. | 2501.12536 | null |
2025-01-21 | Bidirectional Brain Image Translation using Transfer Learning from Generic Pre-trained Models | Fatima Haimour et.al. | 2501.12488 | null |
2025-01-21 | DiffDoctor: Diagnosing Image Diffusion Models Before Treating | Yiyang Wang et.al. | 2501.12382 | null |
2025-01-21 | Regressor-Guided Image Editing Regulates Emotional Response to Reduce Online Engagement | Christoph Gebhardt et.al. | 2501.12289 | null |
2025-01-21 | A Dynamic Programming Framework for Generating Approximately Diverse and Optimal Solutions | Waldo Gálvez et.al. | 2501.12261 | null |
2025-01-21 | Joint Reconstruction and Motion Estimation in Sparse-View 4DCT Using Diffusion Models within a Blind Inverse Problem Framework | Antoine De Paepe et.al. | 2501.12249 | null |
2025-01-21 | DLEN: Dual Branch of Transformer for Low-Light Image Enhancement in Dual Domains | Junyu Xia et.al. | 2501.12235 | null |
2025-01-21 | RL-RC-DoT: A Block-level RL agent for Task-Aware Video Compression | Uri Gadot et.al. | 2501.12216 | null |
2025-01-21 | Fast-RF-Shimming: Accelerate RF Shimming in 7T MRI using Deep Learning | Zhengyi Lu et.al. | 2501.12157 | null |
2025-01-21 | A Multi-annotated and Multi-modal Dataset for Wide-angle Video Quality Assessment | Bo Hu et.al. | 2501.12082 | null |
2025-01-22 | GSVC: Efficient Video Representation and Compression Through 2D Gaussian Splatting | Longan Wang et.al. | 2501.12060 | null |
2025-01-21 | Power Amplifier-Aware Transmit Power Optimization for OFDM and SC-FDMA Systems | Pawel Kryszkiewicz et.al. | 2501.11994 | null |
2025-01-21 | Bayesian Despeckling of Structured Sources | Ali Zafari et.al. | 2501.11860 | null |
2025-01-20 | EfficientVITON: An Efficient Virtual Try-On Model using Optimized Diffusion Process | Mostafa Atef et.al. | 2501.11776 | null |
2025-01-20 | Teaching Large Language Models to Regress Accurate Image Quality Scores using Score Distribution | Zhiyuan You et.al. | 2501.11561 | null |
2025-01-20 | Fundus Image Quality Assessment and Enhancement: a Systematic Review | Heng Li et.al. | 2501.11520 | null |
2025-01-20 | Multitask Auxiliary Network for Perceptual Quality Assessment of Non-Uniformly Distorted Omnidirectional Images | Jiebin Yan et.al. | 2501.11512 | link |
2025-01-20 | Subjective and Objective Quality Assessment of Non-Uniformly Distorted Omnidirectional Images | Jiebin Yan et.al. | 2501.11511 | link |
2025-01-20 | See In Detail: Enhancing Sparse-view 3D Gaussian Splatting with Local Depth and Semantic Regularization | Zongqi He et.al. | 2501.11508 | null |
2025-01-20 | Advancing Oyster Phenotype Segmentation with Multi-Network Ensemble and Multi-Scale mechanism | Wenli Yang et.al. | 2501.11203 | null |
2025-01-19 | Unit Region Encoding: A Unified and Compact Geometry-aware Representation for Floorplan Applications | Huichao Zhang et.al. | 2501.11097 | null |
2025-01-18 | EMO2: End-Effector Guided Audio-Driven Avatar Video Generation | Linrui Tian et.al. | 2501.10687 | null |
2025-01-17 | Fundamental mode power estimation through a | Filipp Lausch et.al. | 2501.10345 | null |
2025-01-17 | DiffStereo: High-Frequency Aware Diffusion Model for Stereo Image Restoration | Huiyun Cao et.al. | 2501.10325 | null |
2025-01-17 | CSHNet: A Novel Information Asymmetric Image Translation Method | Xi Yang et.al. | 2501.10197 | link |
2025-01-17 | DiffVSR: Enhancing Real-World Video Super-Resolution with Diffusion Models for Advanced Visual Quality and Temporal Consistency | Xiaohui Li et.al. | 2501.10110 | null |
2025-01-17 | CLIP-PCQA: Exploring Subjective-Aligned Vision-Language Modeling for Point Cloud Quality Assessment | Yating Liu et.al. | 2501.10071 | link |
2025-01-17 | One-D-Piece: Image Tokenizer Meets Quality-Controllable Compression | Keita Miwa et.al. | 2501.10064 | null |
2025-01-17 | CaFA: Cost-aware, Feasible Attacks With Database Constraints Against Neural Tabular Classifiers | Matan Ben-Tov et.al. | 2501.10013 | link |
2025-01-17 | IE-Bench: Advancing the Measurement of Text-Driven Image Editing for Human Perception Alignment | Shangkun Sun et.al. | 2501.09927 | null |
2025-01-17 | Decoding Patterns of Data Generation Teams for Clinical and Scientific Success: Insights from the Bridge2AI Talent Knowledge Graph | Jiawei Xu et.al. | 2501.09897 | null |
2025-01-16 | EraseBench: Understanding The Ripple Effects of Concept Erasure Techniques | Ibtihel Amara et.al. | 2501.09833 | null |
2025-01-16 | Scan-Adaptive MRI Undersampling Using Neighbor-based Optimization (SUNO) | Siddhant Gautam et.al. | 2501.09799 | link |
2025-01-16 | Evaluating Conversational Recommender Systems with Large Language Models: A User-Centric Evaluation Framework | Nuo Chen et.al. | 2501.09493 | null |
2025-01-16 | Joint Transmission and Deblurring: A Semantic Communication Approach Using Events | Pujing Yang et.al. | 2501.09396 | null |
2025-01-16 | PATCHEDSERVE: A Patch Management Framework for SLO-Optimized Hybrid Resolution Diffusion Serving | Desen Sun et.al. | 2501.09253 | null |
2025-01-16 | Estimating Task-based Performance Bounds for Accelerated MRI Image Reconstruction Methods by Use of Learned-Ideal Observers | Kaiyan Li et.al. | 2501.09224 | null |
2025-01-15 | UNIR-Net: A Novel Approach for Restoring Underwater Images with Non-Uniform Illumination Using Synthetic Data | Ezequiel Perez-Zarate et.al. | 2501.09053 | link |
2025-01-15 | Lights, Camera, Matching: The Role of Image Illumination in Fair Face Recognition | Gabriella Pangelinan et.al. | 2501.08910 | null |
2025-01-15 | XMusic: Towards a Generalized and Controllable Symbolic Music Generation Framework | Sida Tian et.al. | 2501.08809 | null |
2025-01-16 | Holoview: Interactive 3D visualization of medical data in AR | Pankaj Kaushik et.al. | 2501.08736 | null |
2025-01-15 | DynamicFace: High-Quality and Consistent Video Face Swapping using Composable 3D Facial Priors | Runqi Wang et.al. | 2501.08553 | null |
2025-01-15 | Comprehensive Subjective and Objective Evaluation Method for Text-generated Video | Zelu Qi et.al. | 2501.08545 | null |
2025-01-14 | Head Motion Degrades Machine Learning Classification of Alzheimer's Disease from Positron Emission Tomography | Eléonore V. Lieffrig et.al. | 2501.08459 | null |
2025-01-14 | Vchitect-2.0: Parallel Transformer for Scaling Up Video Diffusion Models | Weichen Fan et.al. | 2501.08453 | null |
2025-01-14 | Cross-Modal Transferable Image-to-Video Attack on Video Quality Metrics | Georgii Gotin et.al. | 2501.08415 | link |
2025-01-14 | Rolling phase modulation regime for dynamic full field OCT | Tual Monfort et.al. | 2501.08359 | null |
2025-01-15 | Optical information encryption using general temporal ghost imaging with practical experimental condition | Juan Wu et.al. | 2501.08136 | null |
2025-01-13 | Evaluating Human Perception of Novel View Synthesis: Subjective Quality Assessment of Gaussian Splatting and NeRF in Dynamic Scenes | Yuhang Zhang et.al. | 2501.08072 | null |
2025-01-14 | VENOM: Text-driven Unrestricted Adversarial Example Generation with Diffusion Models | Hui Kuurila-Zhang et.al. | 2501.07922 | link |
2025-01-14 | Demographic Variability in Face Image Quality Measures | Wassim Kabbani et.al. | 2501.07898 | null |
2025-01-14 | State-of-the-Art Transformer Models for Image Super-Resolution: Techniques, Challenges, and Applications | Debasish Dutta et.al. | 2501.07855 | null |
2025-01-13 | FaceOracle: Chat with a Face Image Oracle | Wassim Kabbani et.al. | 2501.07202 | null |
2025-01-13 | Radial Distortion in Face Images: Detection and Impact | Wassim Kabbani et.al. | 2501.07179 | null |
2025-01-13 | Eye Sclera for Fair Face Image Quality Assessment | Wassim Kabbani et.al. | 2501.07158 | null |
2025-01-13 | Privacy-Preserving Data Quality Assessment for Time-Series IoT Sensors | Novoneel Chakraborty et.al. | 2501.07154 | null |
2025-01-13 | Video Quality Assessment for Online Processing: From Spatial to Temporal Sampling | Jiebin Yan et.al. | 2501.07087 | null |
2025-01-12 | Real-Time Neural-Enhancement for Online Cloud Gaming | Shan Jiang et.al. | 2501.06880 | null |
2025-01-14 | Generalized and Efficient 2D Gaussian Splatting for Arbitrary-scale Super-Resolution | Du Chen et.al. | 2501.06838 | null |
2025-01-11 | NVS-SQA: Exploring Self-Supervised Quality Representation Learning for Neurally Synthesized Scenes without References | Qiang Qu et.al. | 2501.06488 | link |
2025-01-10 | VideoAuteur: Towards Long Narrative Video Generation | Junfei Xiao et.al. | 2501.06173 | null |
2025-01-10 | CamCtrl3D: Single-Image Scene Exploration with Precise 3D Camera Control | Stefan Popov et.al. | 2501.06006 | null |
2025-01-10 | Universal-2-TF: Robust All-Neural Text Formatting for ASR | Yash Khare et.al. | 2501.05948 | null |
2025-01-10 | UltraRay: Full-Path Ray Tracing for Enhancing Realism in Ultrasound Simulation | Felix Duelmer et.al. | 2501.05828 | null |
2025-01-13 | AI-Driven Diabetic Retinopathy Screening: Multicentric Validation of AIDRSS in India | Amit Kr Dey et.al. | 2501.05826 | null |
2025-01-10 | Conditional Diffusion Model for Electrical Impedance Tomography | Duanpeng Shi et.al. | 2501.05769 | null |
2025-01-10 | LLVD: LSTM-based Explicit Motion Modeling in Latent Space for Blind Video Denoising | Loay Rashid et.al. | 2501.05744 | null |
2025-01-10 | FIRM: Federated Image Reconstruction using Multimodal Tomographic Data | Geunyeong Byeon et.al. | 2501.05642 | null |
2025-01-09 | Interpretable deep learning illuminates multiple structures fluorescence imaging: a path toward trustworthy artificial intelligence in microscopy | Mingyang Chen et.al. | 2501.05490 | null |
2025-01-09 | Consistent Flow Distillation for Text-to-3D Generation | Runjie Yan et.al. | 2501.05445 | null |
2025-01-09 | Scaffold-SLAM: Structured 3D Gaussians for Simultaneous Localization and Photorealistic Mapping | Wen Tianci et.al. | 2501.05242 | null |
2025-01-09 | 3DIS-FLUX: simple and efficient multi-instance generation with DiT rendering | Dewei Zhou et.al. | 2501.05131 | null |
2025-01-09 | TipSegNet: Fingertip Segmentation in Contactless Fingerprint Imaging | Laurenz Ruzicka et.al. | 2501.05076 | null |
2025-01-09 | Towards Fingerprint Mosaicking Artifact Detection: A Self-Supervised Deep Learning Approach | Laurenz Ruzicka et.al. | 2501.05034 | null |
2025-01-08 | Enhancing Virtual Try-On with Synthetic Pairs and Error-Aware Noise Scheduling | Nannan Li et.al. | 2501.04666 | null |
2025-01-08 | Enhancing Low-Cost Video Editing with Lightweight Adaptors and Temporal-Aware Inversion | Yangfan He et.al. | 2501.04606 | link |
2025-01-08 | When LLMs Struggle: Reference-less Translation Evaluation for Low-resource Languages | Archchana Sindhujan et.al. | 2501.04473 | null |
2025-01-08 | Enhancing kidney quality assessment: Power Doppler during normothermic machine perfusion | Yitian Fang et.al. | 2501.04457 | null |
2025-01-08 | iFADIT: Invertible Face Anonymization via Disentangled Identity Transform | Lin Yuan et.al. | 2501.04390 | null |
2025-01-08 | DGQ: Distribution-Aware Group Quantization for Text-to-Image Diffusion Models | Hyogon Ryu et.al. | 2501.04304 | link |
2025-01-07 | Spatiotemporal Gaussian Optimization for 4D Cone Beam CT Reconstruction from Sparse Projections | Yabo Fu et.al. | 2501.04140 | link |
2025-01-07 | Motion-Aware Generative Frame Interpolation | Guozhen Zhang et.al. | 2501.03699 | null |
2025-01-07 | Action Quality Assessment via Hierarchical Pose-guided Multi-stage Contrastive Regression | Mengshi Qi et.al. | 2501.03674 | link |
2025-01-07 | Deep Learning-based Compression Detection for explainable Face Image Quality Assessment | Laurin Jonientz et.al. | 2501.03619 | link |
2025-01-07 | A generative approach for lensless imaging in low-light conditions | Ziyang Liu et.al. | 2501.03511 | null |
2025-01-07 | Can Deep Learning Trigger Alerts from Mobile-Captured Images? | Pritisha Sarkar et.al. | 2501.03499 | null |
2025-01-06 | A Trust-Guided Approach to MR Image Reconstruction with Side Information | Arda Atalık et.al. | 2501.03021 | link |
2025-01-06 | Quality Estimation based Feedback Training for Improving Pronoun Translation | Harshit Dhankhar et.al. | 2501.03008 | null |
2025-01-06 | GLFC: Unified Global-Local Feature and Contrast Learning with Mamba-Enhanced UNet for Synthetic CT Generation from CBCT | Xianhao Zhou et.al. | 2501.02992 | link |
2025-01-06 | Region of Interest based Medical Image Compression | Utkarsh Prakash Srivastava et.al. | 2501.02895 | null |
2025-01-06 | COph100: A comprehensive fundus image registration dataset from infants constituting the "RIDIRP" database | Yan Hu et.al. | 2501.02800 | null |
2025-01-06 | Ultrasound-QBench: Can LLMs Aid in Quality Assessment of Ultrasound Imaging? | Hongyi Miao et.al. | 2501.02751 | null |
2025-01-06 | Brick-Diffusion: Generating Long Videos with Brick-to-Wall Denoising | Yunlong Yuan et.al. | 2501.02741 | null |
2025-01-06 | Artificial Intelligence in Creative Industries: Advances Prior to 2025 | Nantheera Anantrasirichai et.al. | 2501.02725 | null |
2025-01-06 | Multilevel Semantic-Aware Model for AI-Generated Video Quality Assessment | Jiaze Li et.al. | 2501.02706 | null |
2025-01-05 | DepthMaster: Taming Diffusion Models for Monocular Depth Estimation | Ziyang Song et.al. | 2501.02576 | link |
2025-01-05 | Multi-LLM Collaborative Caption Generation in Scientific Documents | Jaeyoung Kim et.al. | 2501.02552 | link |
2025-01-05 | Pixel-Wise Feature Selection for Perceptual Edge Detection without post-processing | Hao Shu et.al. | 2501.02534 | null |
2025-01-07 | ACE++: Instruction-Based Image Creation and Editing via Context-Aware Content Filling | Chaojie Mao et.al. | 2501.02487 | null |
2025-01-05 | Reducing the Gap Between Pretrained Speech Enhancement and Recognition Models Using a Real Speech-Trained Bridging Module | Zhongjian Cui et.al. | 2501.02452 | null |
2025-01-05 | Journey into Automation: Image-Derived Pavement Texture Extraction and Evaluation | Bingjie Lu et.al. | 2501.02414 | null |
2025-01-04 | Optimizing Audio Compression Through Entropy-Controlled Dithering | Ellison Murray et.al. | 2501.02293 | null |
2025-01-04 | TDM: Temporally-Consistent Diffusion Model for All-in-One Real-World Video Restoration | Yizhou Li et.al. | 2501.02269 | null |
2025-01-04 | Exploring Secure Machine Learning Through Payload Injection and FGSM Attacks on ResNet-50 | Umesh Yadav et.al. | 2501.02147 | null |
2025-01-03 | JoyGen: Audio-Driven 3D Depth-Aware Talking-Face Video Editing | Qili Wang et.al. | 2501.01798 | link |
2025-01-03 | Multi-modal classification of forest biodiversity potential from 2D orthophotos and 3D airborne laser scanning point clouds | Simon B. Jensen et.al. | 2501.01728 | null |
2025-01-03 | Aesthetic Matters in Music Perception for Image Stylization: A Emotion-driven Music-to-Visual Manipulation | Junjie Xu et.al. | 2501.01700 | null |
2025-01-02 | A Metasemantic-Metapragmatic Framework for Taxonomizing Multimodal Communicative Alignment | Eugene Yu Ji et.al. | 2501.01535 | null |
2025-01-02 | Embedding Similarity Guided License Plate Super Resolution | Abderrezzaq Sendjasni et.al. | 2501.01483 | null |
2024-12-31 | Estimation of 3T MR images from 1.5T images regularized with Physics based Constraint | Prabhjot Kaur et.al. | 2501.01464 | null |
2024-12-31 | GDSR: Global-Detail Integration through Dual-Branch Network with Wavelet Losses for Remote Sensing Image Super-Resolution | Qiwei Zhu et.al. | 2501.01460 | null |
2025-01-02 | ScarNet: A Novel Foundation Model for Automated Myocardial Scar Quantification from LGE in Cardiac MRI | Neda Tavakoli et.al. | 2501.01372 | link |
2025-01-02 | TexAVi: Generating Stereoscopic VR Video Clips from Text Descriptions | Vriksha Srihari et.al. | 2501.01156 | null |
2025-01-02 | HarmonyIQA: Pioneering Benchmark and Model for Image Harmonization Quality Assessment | Zitong Xu et.al. | 2501.01116 | null |
2025-01-02 | Generalized Task-Driven Medical Image Quality Enhancement with Gradient Promotion | Dong Zhang et.al. | 2501.01114 | null |
2025-01-02 | EliGen: Entity-Level Controlled Image Generation with Regional Attention | Hong Zhang et.al. | 2501.01097 | link |
2025-01-02 | Enhancing Precision of Automated Teller Machines Network Quality Assessment: Machine Learning and Multi Classifier Fusion Approaches | Alireza Safarzadeh et.al. | 2501.01067 | null |
2025-01-01 | Deconstructing the emission order of protons, neutrons and | Rohit Kumar et.al. | 2501.00963 | null |
2025-01-01 | Enhancing Early Diabetic Retinopathy Detection through Synthetic DR1 Image Generation: A StyleGAN3 Approach | Sagarnil Das et.al. | 2501.00954 | null |
2025-01-01 | SPADE: Enhancing Adaptive Cyber Deception Strategies with Generative AI and Structured Prompt Engineering | Shihab Ahmed et.al. | 2501.00940 | null |
2025-01-01 | Hierarchical Vision-Language Alignment for Text-to-Image Generation via Diffusion Models | Emily Johnson et.al. | 2501.00917 | null |
2025-01-01 | Text2Earth: Unlocking Text-driven Remote Sensing Image Generation with a Global-Scale Dataset and a Foundation Model | Chenyang Liu et.al. | 2501.00895 | null |
2025-01-01 | RORem: Training a Robust Object Remover with Human-in-the-Loop | Ruibin Li et.al. | 2501.00740 | link |
2024-12-31 | Token Pruning for Caching Better: 9 Times Acceleration on Stable Diffusion for Free | Evelyn Zhang et.al. | 2501.00375 | link |
2024-12-31 | SG-Splatting: Accelerating 3D Gaussian Splatting with Spherical Gaussians | Yiwen Wang et.al. | 2501.00342 | null |
2024-12-31 | Improving image quality of the Solar Disk Imager (SDI) of the Lyman-alpha Solar Telescope (LST) onboard the ASO-S mission | Hui Liu et.al. | 2501.00231 | null |
2024-12-30 | What Makes for a Good Stereoscopic Image? | Netanel Y. Tamir et.al. | 2412.21127 | null |
2024-12-30 | VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation | Jiazheng Xu et.al. | 2412.21059 | link |
2024-12-30 | DDIM sampling for Generative AIBIM, a faster intelligent structural design framework | Zhili He et.al. | 2412.20899 | null |
2024-12-30 | Acquisition-Independent Deep Learning for Quantitative MRI Parameter Estimation using Neural Controlled Differential Equations | Daan Kuppens et.al. | 2412.20844 | null |
2024-12-30 | 4D Gaussian Splatting: Modeling Dynamic Scenes with Native 4D Primitives | Zeyu Yang et.al. | 2412.20720 | null |
2024-12-29 | Single-image reflection removal via self-supervised diffusion models | Zhengyang Lu et.al. | 2412.20466 | null |
2024-12-29 | ESVQA: Perceptual Quality Assessment of Egocentric Spatial Videos | Xilei Zhu et.al. | 2412.20423 | null |
2024-12-29 | Bringing Objects to Life: 4D generation from 3D objects | Ohad Rahamim et.al. | 2412.20422 | null |
2024-12-28 | An Ordinary Differential Equation Sampler with Stochastic Start for Diffusion Bridge Models | Yuang Wang et.al. | 2412.19992 | null |
2024-12-27 | Structural Similarity in Deep Features: Image Quality Assessment Robust to Geometrically Disparate Reference | Keke Zhang et.al. | 2412.19553 | null |
2024-12-30 | DrivingWorld: Constructing World Model for Autonomous Driving via Video GPT | Xiaotao Hu et.al. | 2412.19505 | link |
2024-12-27 | RAIN: Real-time Animation of Infinite Video Stream | Zhilei Shu et.al. | 2412.19489 | null |
2024-12-27 | Generative Adversarial Network on Motion-Blur Image Restoration | Zhengdong Li et.al. | 2412.19479 | null |
2024-12-27 | Adrenaline: Adaptive Rendering Optimization System for Scalable Cloud Gaming | Jin Heo et.al. | 2412.19446 | null |
2024-12-27 | The Hobby-Eberly Telescope Dark Energy Experiment Survey (HETDEX) Active Galactic Nuclei Catalog: the Fourth Data Release | Chenxu Liu et.al. | 2412.19414 | null |
2024-12-26 | Reflective Gaussian Splatting | Yuxuan Yao et.al. | 2412.19282 | null |
2024-12-26 | FineVQ: Fine-Grained User Generated Content Video Quality Assessment | Huiyu Duan et.al. | 2412.19238 | null |
2024-12-26 | FACEMUG: A Multimodal Generative and Fusion Framework for Local Facial Editing | Wanglong Lu et.al. | 2412.19009 | null |
2024-12-25 | TINQ: Temporal Inconsistency Guided Blind Video Quality Assessment | Yixiao Li et.al. | 2412.18933 | link |
2024-12-25 | ArtNVG: Content-Style Separated Artistic Neighboring-View Gaussian Stylization | Zixiao Gu et.al. | 2412.18783 | null |
2024-12-25 | Embodied Image Quality Assessment for Robotic Intelligence | Jianbo Zhang et.al. | 2412.18774 | link |
2024-12-25 | MRI Reconstruction with Regularized 3D Diffusion Model (R3DM) | Arya Bangun et.al. | 2412.18723 | null |
2024-12-24 | ZenSVI: An Open-Source Software for the Integrated Acquisition, Processing and Analysis of Street View Imagery Towards Scalable Urban Science | Koichi Ito et.al. | 2412.18641 | link |
2024-12-24 | Long-Form Speech Generation with Spoken Language Models | Se Jin Park et.al. | 2412.18603 | link |
2024-12-24 | LatentCRF: Continuous CRF for Efficient Latent Diffusion | Kanchana Ranasinghe et.al. | 2412.18596 | null |
2024-12-24 | Agreement of Image Quality Metrics with Radiological Evaluation in the Presence of Motion Artifacts | Elisa Marchetto et.al. | 2412.18389 | null |
2024-12-24 | RSGaussian:3D Gaussian Splatting with LiDAR for Aerial Remote Sensing Novel View Synthesis | Yiling Yao et.al. | 2412.18380 | null |
2024-12-24 | Expand VSR Benchmark for VLLM to Expertize in Spatial Rules | Peijin Xie et.al. | 2412.18224 | link |
2024-12-24 | Image Quality Assessment: Exploring Regional Heterogeneity via Response of Adaptive Multiple Quality Factors in Dictionary Space | Xuting Lan et.al. | 2412.18160 | null |
2024-12-24 | DepthLab: From Partial to Complete | Zhiheng Liu et.al. | 2412.18153 | null |
2024-12-24 | AEIOU: A Unified Defense Framework against NSFW Prompts in Text-to-Image Models | Yiming Wang et.al. | 2412.18123 | null |
2024-12-24 | SAR Despeckling via Log-Yeo-Johnson Transformation and Sparse Representation | Xuran Hu et.al. | 2412.18121 | null |
2024-12-24 | An Ensemble Approach to Short-form Video Quality Assessment Using Multimodal LLM | Wen Wen et.al. | 2412.18060 | null |
2024-12-23 | ANID: How Far Are We? Evaluating the Discrepancies Between AI-synthesized Images and Natural Images through Multimodal Guidance | Renyang Liu et.al. | 2412.17632 | link |
2024-12-23 | HumanVBench: Exploring Human-Centric Video Understanding Capabilities of MLLMs with Synthetic Benchmark Data | Ting Zhou et.al. | 2412.17574 | link |
2024-12-24 | An Evaluation Framework for Product Images Background Inpainting based on Human Feedback and Product Consistency | Yuqi Liang et.al. | 2412.17504 | null |
2024-12-23 | Predicting Satisfied User and Machine Ratio for Compressed Images: A Unified Approach | Qi Zhang et.al. | 2412.17477 | null |
2024-12-23 | Assessment of Deep-Learning Methods for the Enhancement of Experimental Low Dose Dental CBCT Volumes | Louise Friot-Giroux et.al. | 2412.17423 | null |
2024-12-23 | Balanced 3DGS: Gaussian-wise Parallelism Rendering with Fine-Grained Tiling | Hao Gui et.al. | 2412.17378 | null |
2024-12-23 | FFA Sora, video generation as fundus fluorescein angiography simulator | Xinyuan Wu et.al. | 2412.17346 | null |
2024-12-23 | GCS-M3VLT: Guided Context Self-Attention based Multi-modal Medical Vision Language Transformer for Retinal Image Captioning | Teja Krishna Cherukuri et.al. | 2412.17251 | null |
2024-12-22 | Deep Joint Source Channel Coding for Secure End-to-End Image Transmission | Mehdi Letafati et.al. | 2412.17110 | null |
2024-12-24 | ErasableMask: A Robust and Erasable Privacy Protection Scheme against Black-box Face Recognition Models | Sipeng Shen et.al. | 2412.17038 | null |
2024-12-22 | PromptDresser: Improving the Quality and Controllability of Virtual Try-On via Generative Textual Prompt and Prompt-aware Mask | Jeongho Kim et.al. | 2412.16978 | link |
2024-12-22 | Image Quality Assessment: Investigating Causal Perceptual Effects with Abductive Counterfactual Inference | Wenhao Shen et.al. | 2412.16939 | null |
2024-12-22 | Time-Graph Frequency Representation with Singular Value Decomposition for Neural Speech Enhancement | Tingting Wang et.al. | 2412.16823 | link |
2024-12-21 | RoomPainter: View-Integrated Diffusion for Consistent Indoor Scene Texturing | Zhipeng Huang et.al. | 2412.16778 | null |
2024-12-21 | VAST 1.0: A Unified Framework for Controllable and Consistent Video Generation | Chi Zhang et.al. | 2412.16677 | null |
2024-12-21 | Complementary Advantages: Exploiting Cross-Field Frequency Correlation for NIR-Assisted Image Denoising | Yuchen Wang et.al. | 2412.16645 | null |
2024-12-21 | OmniSplat: Taming Feed-Forward 3D Gaussian Splatting for Omnidirectional Images with Editable Capabilities | Suyoung Lee et.al. | 2412.16604 | null |
2024-12-21 | A Generalizable 3D Diffusion Framework for Low-Dose and Few-View Cardiac SPECT | Huidong Xie et.al. | 2412.16573 | null |
2024-12-21 | Federal Learning Framework for Quality Evaluation of Blastomere Cleavage | Jung-Hua Wang et.al. | 2412.16567 | null |
2024-12-21 | Positive2Negative: Breaking the Information-Lossy Barrier in Self-Supervised Single Image Denoising | Tong Li et.al. | 2412.16460 | null |
2024-12-20 | IMPLY-based Approximate Full Adders for Efficient Arithmetic Operations in Image Processing and Machine Learning | Melanie Qiu et.al. | 2412.15888 | null |
2024-12-20 | Image Quality Assessment: Enhancing Perceptual Exploration and Interpretation with Collaborative Feature Refinement and Hausdorff distance | Xuekai Wei et.al. | 2412.15847 | null |
2024-12-20 | DOLLAR: Few-Step Video Generation via Distillation and Latent Reward Optimization | Zihan Ding et.al. | 2412.15689 | null |
2024-12-20 | AI-generated Image Quality Assessment in Visual Communication | Yu Tian et.al. | 2412.15677 | link |
2024-12-20 | Underwater Image Quality Assessment: A Perceptual Framework Guided by Physical Imaging | Weizhi Xian et.al. | 2412.15527 | null |
2024-12-19 | Log-Time K-Means Clustering for 1D Data: Novel Approaches with Proof and Implementation | Jake Hyun et.al. | 2412.15295 | link |
2024-12-18 | A Systematic Examination of Preference Learning through the Lens of Instruction-Following | Joongwon Kim et.al. | 2412.15282 | null |
2024-12-19 | SqueezeMe: Efficient Gaussian Avatars for VR | Shunsuke Saito et.al. | 2412.15171 | null |
2024-12-19 | OnlineVPO: Align Video Diffusion Model with Online Video-Centric Preference Optimization | Jiacheng Zhang et.al. | 2412.15159 | null |
2024-12-19 | Jet: A Modern Transformer-Based Normalizing Flow | Alexander Kolesnikov et.al. | 2412.15129 | null |
2024-12-19 | Joint estimation of activity, attenuation and motion in respiratory-self-gated time-of-flight PET | Masoud Elhamiasl et.al. | 2412.15018 | null |
2024-12-19 | Unified Image Restoration and Enhancement: Degradation Calibrated Cycle Reconstruction Diffusion Model | Minglong Xue et.al. | 2412.14630 | link |
2024-12-19 | Qua | Keith G. Mills et.al. | 2412.14628 | null |
2024-12-19 | Successive optimization of optics and post-processing with differentiable coherent PSF operator and field information | Zheng Ren et.al. | 2412.14603 | link |
2024-12-19 | Enhancing Diffusion Models for High-Quality Image Generation | Jaineet Shah et.al. | 2412.14422 | null |
2024-12-18 | Improving diabetic retinopathy screening using Artificial Intelligence: design, evaluation and before-and-after study of a custom development | Imanol Pinto et.al. | 2412.14221 | null |
2024-12-19 | E-CAR: Efficient Continuous Autoregressive Image Generation via Multistage Modeling | Zhihang Yuan et.al. | 2412.14170 | null |
2024-12-18 | VideoDPO: Omni-Preference Alignment for Video Diffusion Generation | Runtao Liu et.al. | 2412.14167 | null |
2024-12-18 | AKiRa: Augmentation Kit on Rays for optical video generation | Xi Wang et.al. | 2412.14158 | null |
2024-12-18 | Real-Time Position-Aware View Synthesis from Single-View Input | Manu Gond et.al. | 2412.14005 | null |
2024-12-18 | Data-Efficient Inference of Neural Fluid Fields via SciML Foundation Model | Yuqiu Liu et.al. | 2412.13897 | null |
2024-12-18 | VIIS: Visible and Infrared Information Synthesis for Severe Low-light Image Enhancement | Chen Zhao et.al. | 2412.13655 | link |
2024-12-18 | PASCO (PArallel Structured COarsening): an overlay to speed up graph clustering algorithms | Etienne Lasalle et.al. | 2412.13592 | link |
2024-12-18 | T | Zhenhong Sun et.al. | 2412.13486 | link |
2024-12-18 | Real-time One-Step Diffusion-based Expressive Portrait Videos Generation | Hanzhong Guo et.al. | 2412.13479 | link |
2024-12-17 | Optimisation of Magnetic Field Sensing with Optically Pumped Magnetometers for Magnetic Detection Electrical Impedance Tomography | Kai Mason et.al. | 2412.13354 | null |
2024-12-17 | Real-time Free-view Human Rendering from Sparse-view RGB Videos using Double Unprojected Textures | Guoxing Sun et.al. | 2412.13183 | null |
2024-12-17 | F-Bench: Rethinking Human Preference Evaluation Metrics for Benchmarking Face Generation, Customization, and Restoration | Lu Liu et.al. | 2412.13155 | null |
2024-12-17 | Unlocking the Potential of Digital Pathology: Novel Baselines for Compression | Maximilian Fischer et.al. | 2412.13137 | null |
2024-12-18 | AIR-Bench: Automated Heterogeneous Information Retrieval Benchmark | Jianlyu Chen et.al. | 2412.13102 | link |
2024-12-17 | Smartphone-based Iris Recognition through High-Quality Visible Spectrum Iris Capture | Naveenkumar G Venkataswamy et.al. | 2412.13063 | null |
2024-12-17 | Experimental Study of Low-Latency Video Streaming in an ORAN Setup with Generative AI | Andreas Casparsen et.al. | 2412.12751 | null |
2024-12-17 | Subspace Implicit Neural Representations for Real-Time Cardiac Cine MR Imaging | Wenqi Huang et.al. | 2412.12742 | link |
2024-12-17 | Complex extension of optical flow and its practical evaluation for undersampled dynamic MRI | Matthias J. Ehrhardt et.al. | 2412.12711 | null |
2024-12-17 | A Two-Fold Patch Selection Approach for Improved 360-Degree Image Quality Assessment | Abderrezzaq Sendjasni et.al. | 2412.12667 | link |
2024-12-17 | RDPI: A Refine Diffusion Probability Generation Method for Spatiotemporal Data Imputation | Zijin Liu et.al. | 2412.12642 | link |
2024-12-17 | Consistent Diffusion: Denoising Diffusion Model with Data-Consistent Training for Image Restoration | Xinlong Cheng et.al. | 2412.12550 | null |
2024-12-17 | Invisible Watermarks: Attacks and Robustness | Dongjun Hwang et.al. | 2412.12511 | link |
2024-12-16 | PanSplat: 4K Panorama Synthesis with Feed-Forward Gaussian Splatting | Cheng Zhang et.al. | 2412.12096 | link |
2024-12-16 | Wonderland: Navigating 3D Scenes from a Single Image | Hanwen Liang et.al. | 2412.12091 | null |
2024-12-16 | SPADE: Spectroscopic Photoacoustic Denoising using an Analytical and Data-free Enhancement Framework | Fangzhou Lin et.al. | 2412.12068 | null |
2024-12-16 | Industrial-scale Prediction of Cement Clinker Phases using Machine Learning | Sheikh Junaid Fayaz et.al. | 2412.11981 | link |
2024-12-16 | Towards Physically-Based Sky-Modeling | Ian J. Maquignaz et.al. | 2412.11883 | null |
2024-12-16 | Impact of Face Alignment on Face Image Quality | Eren Onaran et.al. | 2412.11779 | null |
2024-12-16 | Formal Quality Measures for Predictors in Markov Decision Processes | Christel Baier et.al. | 2412.11754 | null |
2024-12-16 | Comparison of three reconstruction algorithms for low-dose phase-contrast computed tomography of the breast with synchrotron radiation | Sandro Donato et.al. | 2412.11641 | null |
2024-12-16 | MT-LENS: An all-in-one Toolkit for Better Machine Translation Evaluation | Javier García Gilabert et.al. | 2412.11615 | link |
2024-12-16 | Block-Based Multi-Scale Image Rescaling | Jian Li et.al. | 2412.11468 | null |
2024-12-16 | Controllable Distortion-Perception Tradeoff Through Latent Diffusion for Neural Image Compression | Chuqin Zhou et.al. | 2412.11379 | null |
2024-12-15 | VividFace: A Diffusion-Based Hybrid Framework for High-Fidelity Video Face Swapping | Hao Shao et.al. | 2412.11279 | null |
2024-12-15 | CATER: Leveraging LLM to Pioneer a Multidimensional, Reference-Independent Paradigm in Translation Quality Evaluation | Kurando IIDA et.al. | 2412.11261 | null |
2024-12-15 | Benchmarking and Learning Multi-Dimensional Quality Evaluator for Text-to-3D Generation | Yujie Zhang et.al. | 2412.11170 | null |
2024-12-15 | A Comprehensive Survey of Action Quality Assessment: Method and Benchmark | Kanglei Zhou et.al. | 2412.11149 | null |
2024-12-14 | Zigzag Diffusion Sampling: The Path to Success Is Zigzag | Lichen Bai et.al. | 2412.10891 | link |
2024-12-14 | Unbiased General Annotated Dataset Generation | Dengyang Jiang et.al. | 2412.10831 | null |
2024-12-14 | Rapid Reconstruction of Extremely Accelerated Liver 4D MRI via Chained Iterative Refinement | Di Xu et.al. | 2412.10629 | null |
2024-12-13 | RAID-Database: human Responses to Affine Image Distortions | Paula Daudén-Oliver et.al. | 2412.10211 | null |
2024-12-13 | GT23D-Bench: A Comprehensive General Text-to-3D Generation Benchmark | Sitong Su et.al. | 2412.09997 | null |
2024-12-13 | EP-CFG: Energy-Preserving Classifier-Free Guidance | Kai Zhang et.al. | 2412.09966 | null |
2024-12-13 | | Jiawei Li et.al. | 2412.09954 | link |
2024-12-13 | Prompt2Perturb (P2P): Text-Guided Diffusion-Based Adversarial Attacks on Breast Ultrasound Images | Yasamin Medghalchi et.al. | 2412.09910 | link |
2024-12-13 | LinGen: Towards High-Resolution Minute-Length Text-to-Video Generation with Linear Computational Complexity | Hongjie Wang et.al. | 2412.09856 | null |
2024-12-13 | A Single-Frame and Multi-Frame Cascaded Image Super-Resolution Method | Jing Sun et.al. | 2412.09846 | null |
2024-12-13 | Super-Resolution for Remote Sensing Imagery via the Coupling of a Variational Model and Deep Learning | Jing Sun et.al. | 2412.09841 | null |
2024-12-13 | Prospects for Systematic Planetary Nebulae Detection with the Census of the Local Universe Narrowband Survey | Rong Du et.al. | 2412.09836 | null |
2024-12-13 | Speech-based Multimodel Pipeline for Vietnamese Services Quality Assessment | Quang-Anh N. D. et.al. | 2412.09829 | null |
2024-12-12 | OFTSR: One-Step Flow for Image Super-Resolution with Tunable Fidelity-Realism Trade-offs | Yuanzhi Zhu et.al. | 2412.09465 | link |
2024-12-12 | UFO: Enhancing Diffusion-Based Video Generation with a Uniform Frame Organizer | Delong Liu et.al. | 2412.09389 | link |
2024-12-13 | Are Conditional Latent Diffusion Models Effective for Image Restoration? | Yunchen Yuan et.al. | 2412.09324 | null |
2024-12-12 | Towards Understanding the Robustness of LLM-based Evaluations under Perturbations | Manav Chaudhary et.al. | 2412.09269 | null |
2024-12-12 | Elevating Flow-Guided Video Inpainting with Reference Generation | Suhwan Cho et.al. | 2412.08975 | link |
2024-12-12 | Reversing the Damage: A QP-Aware Transformer-Diffusion Approach for 8K Video Restoration under Codec Compression | Ali Mollaahmadi Dehaghi et.al. | 2412.08912 | link |
2024-12-11 | DeepNose: An Equivariant Convolutional Neural Network Predictive Of Human Olfactory Percepts | Sergey Shuvaev et.al. | 2412.08747 | null |
2024-12-13 | Utilizing Multi-step Loss for Single Image Reflection Removal | Abdelrahman Elnenaey et.al. | 2412.08582 | link |
2024-12-11 | PointTalk: Audio-Driven Dynamic Lip Point Cloud for 3D Gaussian-based Talking Head Synthesis | Yifan Xie et.al. | 2412.08504 | null |
2024-12-12 | Learning Flow Fields in Attention for Controllable Person Image Generation | Zijian Zhou et.al. | 2412.08486 | link |
2024-12-11 | Visible and Infrared Image Fusion Using Encoder-Decoder Network | Ferhat Can Ataman et.al. | 2412.08073 | link |
2024-12-11 | NeRF-NQA: No-Reference Quality Assessment for Scenes Generated by NeRF and Neural View Synthesis Methods | Qiang Qu et.al. | 2412.08029 | link |
2024-12-10 | Graph convolutional networks enable fast hemorrhagic stroke monitoring with electrical impedance tomography | J. Toivanen et.al. | 2412.07888 | null |
2024-12-10 | PETALface: Parameter Efficient Transfer Learning for Low-resolution Face Recognition | Kartik Narayan et.al. | 2412.07771 | null |
2024-12-10 | 3DTrajMaster: Mastering 3D Trajectory for Multi-Entity Motion in Video Generation | Xiao Fu et.al. | 2412.07759 | null |
2024-12-10 | PortraitTalk: Towards Customizable One-Shot Audio-to-Talking Face Generation | Fatemeh Nazarieh et.al. | 2412.07754 | null |
2024-12-10 | Multi-Shot Character Consistency for Text-to-Video Generation | Yuval Atzmon et.al. | 2412.07750 | null |
2024-12-11 | Direct Low-Dose CT Image Reconstruction on GPU using Out-Of-Core: Precision and Quality Study | M. Chillarón et.al. | 2412.07631 | null |
2024-12-10 | OmniDocBench: Benchmarking Diverse PDF Document Parsing with Comprehensive Annotations | Linke Ouyang et.al. | 2412.07626 | link |
2024-12-10 | CoMA: Compositional Human Motion Generation with Multi-modal Agents | Shanlin Sun et.al. | 2412.07320 | null |
2024-12-10 | Backdoor Attacks against No-Reference Image Quality Assessment Models via A Scalable Trigger | Yi Yu et.al. | 2412.07277 | link |
2024-12-10 | Moderating the Generalization of Score-based Generative Model | Wan Jiang et.al. | 2412.07229 | null |
2024-12-11 | Rate-In: Information-Driven Adaptive Dropout Rates for Improved Inference-Time Uncertainty Estimation | Tal Zeevi et.al. | 2412.07169 | link |
2024-12-10 | QCResUNet: Joint Subject-level and Voxel-level Segmentation Quality Prediction | Peijie Qiu et.al. | 2412.07156 | link |
2024-12-10 | Light Field Image Quality Assessment With Auxiliary Learning Based on Depthwise and Anglewise Separable Convolutions | Qiang Qu et.al. | 2412.07079 | null |
2024-12-11 | Diff-GO | Suchinthaka Wanninayaka et.al. | 2412.06980 | null |
2024-12-09 | Edge-SD-SR: Low Latency and Parameter Efficient On-device Super-Resolution with Stable Diffusion via Bidirectional Conditioning | Mehdi Noroozi et.al. | 2412.06978 | null |
2024-12-09 | Ranking-aware adapter for text-driven image ordering with CLIP | Wei-Hsiang Yu et.al. | 2412.06760 | link |
2024-12-09 | AutoDCWorkflow: LLM-based Data Cleaning Workflow Auto-Generation and Benchmark | Lan Li et.al. | 2412.06724 | link |
2024-12-10 | A No-Reference Medical Image Quality Assessment Method Based on Automated Distortion Recognition Technology: Application to Preprocessing in MRI-guided Radiotherapy | Zilin Wang et.al. | 2412.06599 | null |
2024-12-09 | How Certain are Uncertainty Estimates? Three Novel Earth Observation Datasets for Benchmarking Uncertainty Quantification in Machine Learning | Yuanyuan Wang et.al. | 2412.06451 | null |
2024-12-09 | Sound2Vision: Generating Diverse Visuals from Audio through Cross-Modal Latent Alignment | Kim Sung-Bin et.al. | 2412.06209 | null |
2024-12-09 | One-shot Human Motion Transfer via Occlusion-Robust Flow Prediction and Neural Texturing | Yuzhu Ji et.al. | 2412.06174 | null |
2024-12-09 | A CT Image Denoising Method Based on Projection Domain Feature | Mengyu Sun et.al. | 2412.06135 | null |
2024-12-08 | Latent-Reframe: Enabling Camera Control for Video Diffusion Model without Training | Zhenghong Zhou et.al. | 2412.06029 | null |
2024-12-08 | Enhancing Content Representation for AR Image Quality Assessment Using Knowledge Distillation | Aymen Sekhri et.al. | 2412.06003 | null |
2024-12-08 | Nested Diffusion Models Using Hierarchical Latent Priors | Xiao Zhang et.al. | 2412.05984 | null |
2024-12-08 | Unsupervised Multi-Parameter Inverse Solving for Reducing Ring Artifacts in 3D X-Ray CBCT | Qing Wu et.al. | 2412.05853 | null |
2024-12-08 | SizeGS: Size-aware Compression of 3D Gaussians with Hierarchical Mixed Precision Quantization | Shuzhao Xie et.al. | 2412.05808 | null |
2024-12-07 | Emulating Clinical Quality Muscle B-mode Ultrasound Images from Plane Wave Images Using a Two-Stage Machine Learning Model | Reed Chen et.al. | 2412.05758 | link |
2024-12-07 | A Tiered GAN Approach for Monet-Style Image Generation | FNU Neha et.al. | 2412.05724 | null |
2024-12-07 | Temporally Compressed 3D Gaussian Splatting for Dynamic Scenes | Saqib Javed et.al. | 2412.05700 | null |
2024-12-07 | Enhancing Research Methodology and Academic Publishing: A Structured Framework for Quality and Integrity | Md. Jalil Piran et.al. | 2412.05683 | null |
2024-12-07 | Deep Reinforcement Learning-Based Resource Allocation for Hybrid Bit and Generative Semantic Communications in Space-Air-Ground Integrated Networks | Chong Huang et.al. | 2412.05647 | null |
2024-12-06 | LoRA.rar: Learning to Merge LoRAs via Hypernetworks for Subject-Style Conditioned Image Generation | Donald Shenaj et.al. | 2412.05148 | link |
2024-12-06 | Comprehensive Analysis and Improvements in Pansharpening Using Deep Learning | Mahek Kantharia et.al. | 2412.04896 | null |
2024-12-06 | Building a Family of Data Augmentation Models for Low-cost LLM Fine-tuning on the Cloud | Yuanhao Yue et.al. | 2412.04871 | null |
2024-12-05 | Motion-Guided Deep Image Prior for Cardiac MRI | Marc Vornehm et.al. | 2412.04639 | null |
2024-12-05 | MetaFormer: High-fidelity Metalens Imaging via Aberration Correcting Transformers | Byeonghyeon Lee et.al. | 2412.04591 | null |
2024-12-05 | 4Real-Video: Learning Generalizable Photo-Realistic 4D Video Diffusion | Chaoyang Wang et.al. | 2412.04462 | null |
2024-12-05 | LayerFusion: Harmonized Multi-Layer Text-to-Image Generation with Generative Priors | Yusuf Dalva et.al. | 2412.04460 | null |
2024-12-05 | Multi-Subject Image Synthesis as a Generative Prior for Single-Subject PET Image Reconstruction | George Webber et.al. | 2412.04324 | null |
2024-12-05 | T2I-FactualBench: Benchmarking the Factuality of Text-to-Image Models with Knowledge-Intensive Concepts | Ziwei Huang et.al. | 2412.04300 | null |
2024-12-05 | IF-MDM: Implicit Face Motion Diffusion Model for High-Fidelity Realtime Talking Head Generation | Sejong Yang et.al. | 2412.04000 | null |
2024-12-05 | Blind Underwater Image Restoration using Co-Operational Regressor Networks | Ozer Can Devecioglu et.al. | 2412.03995 | null |
2024-12-05 | LL-ICM: Image Compression for Low-level Machine Vision via Large Vision-Language Model | Yuan Xue et.al. | 2412.03841 | null |
2024-12-04 | Advancing Auto-Regressive Continuation for Video Frames | Ruibo Ming et.al. | 2412.03758 | null |
2024-12-04 | MV-Adapter: Multi-view Consistent Image Generation Made Easy | Zehuan Huang et.al. | 2412.03632 | null |
2024-12-04 | Style3D: Attention-guided Multi-view Style Transfer for 3D Object Generation | Bingjie Song et.al. | 2412.03571 | null |
2024-12-04 | NODE-AdvGAN: Improving the transferability and perceptual similarity of adversarial examples by dynamic-system-driven adversarial generative model | Xinheng Xie et.al. | 2412.03539 | null |
2024-12-04 | SGSST: Scaling Gaussian Splatting StyleTransfer | Bruno Galerne et.al. | 2412.03371 | link |
2024-12-04 | Is JPEG AI going to change image forensics? | Edoardo Daniele Cannas et.al. | 2412.03261 | null |
2024-12-04 | Task-driven Image Fusion with Learnable Fusion Loss | Haowen Bai et.al. | 2412.03240 | null |
2024-12-04 | Parametric Enhancement of PerceptNet: A Human-Inspired Approach for Image Quality Assessment | Jorge Vila-Tomás et.al. | 2412.03210 | link |
2024-12-04 | Unsupervised Network for Single Image Raindrop Removal | Huijiao Wang et.al. | 2412.03019 | null |
2024-12-04 | Pixel-level and Semantic-level Adjustable Super-resolution: A Dual-LoRA Approach | Lingchen Sun et.al. | 2412.03017 | link |
2024-12-04 | Partially Conditioned Patch Parallelism for Accelerated Diffusion Model Inference | XiuYu Zhang et.al. | 2412.02962 | null |
2024-12-04 | Surrogate distributed radiological sources III: quantitative distributed source reconstructions | Jayson R. Vavrek et.al. | 2412.02926 | null |
2024-12-04 | Assessing the performance of CT image denoisers using Laguerre-Gauss Channelized Hotelling Observer for lesion detection | Prabhat Kc et.al. | 2412.02920 | null |
2024-12-03 | Improving Dynamic Object Interactions in Text-to-Video Generation with AI Feedback | Hiroki Furuta et.al. | 2412.02617 | null |
2024-12-03 | High-Quality Passive Acoustic Mapping with the Cross-Correlated Angular Spectrum Method | Yi Zeng et.al. | 2412.02413 | null |
2024-12-03 | Switchable deep beamformer for high-quality and real-time passive acoustic mapping | Yi Zeng et.al. | 2412.02327 | null |
2024-12-03 | Initial Study On Improving Segmentation By Combining Preoperative CT And Intraoperative CBCT Using Synthetic Data | Maximilian E. Tschuchnig et.al. | 2412.02294 | null |
2024-12-02 | NitroFusion: High-Fidelity Single-Step Diffusion through Dynamic Adversarial Training | Dar-Yen Chen et.al. | 2412.02030 | null |
2024-12-02 | HybridMQA: Exploring Geometry-Texture Interactions for Colored Mesh Quality Assessment | Armin Shafiee Sarvestani et.al. | 2412.01986 | null |
2024-12-02 | IQA-Adapter: Exploring Knowledge Transfer from Image Quality Assessment to Diffusion-based Generative Models | Khaled Abud et.al. | 2412.01794 | link |
2024-12-02 | OmniGuard: Hybrid Manipulation Localization via Augmented Versatile Deep Image Watermarking | Xuanyu Zhang et.al. | 2412.01615 | null |
2024-12-02 | Negative Token Merging: Image-based Adversarial Feature Guidance | Jaskirat Singh et.al. | 2412.01339 | null |
2024-12-02 | Data Uncertainty-Aware Learning for Multimodal Aspect-based Sentiment Analysis | Hao Yang et.al. | 2412.01249 | null |
2024-12-02 | Schedule On the Fly: Diffusion Time Prediction for Faster and Better Image Generation | Zilyu Ye et.al. | 2412.01243 | null |
2024-12-02 | PainterNet: Adaptive Image Inpainting with Actual-Token Attention and Diverse Mask Control | Ruichen Wang et.al. | 2412.01223 | null |
2024-12-02 | Assessing GPT Model Uncertainty in Mathematical OCR Tasks via Entropy Analysis | Alexei Kaltchenko et.al. | 2412.01221 | link |
2024-12-02 | LoyalDiffusion: A Diffusion Model Guarding Against Data Replication | Chenghao Li et.al. | 2412.01118 | null |
2024-12-02 | FLOAT: Generative Motion Latent Flow Matching for Audio-driven Talking Portrait | Taekyung Ki et.al. | 2412.01064 | null |
2024-12-02 | Evaluating Automated Radiology Report Quality through Fine-Grained Phrasal Grounding of Clinical Findings | Razi Mahmood et.al. | 2412.01031 | null |
2024-12-01 | Optimal Algorithms for Augmented Testing of Discrete Distributions | Maryam Aliakbarpour et.al. | 2412.00974 | null |
2024-12-01 | Generating AI Literacy MCQs: A Multi-Agent LLM Approach | Jiayi Wang et.al. | 2412.00970 | null |
2024-12-01 | Playable Game Generation | Mingyu Yang et.al. | 2412.00887 | link |
2024-11-30 | Multi-resolution Guided 3D GANs for Medical Image Translation | Juhyung Ha et.al. | 2412.00575 | null |
2024-11-29 | INCLUDE: Evaluating Multilingual Language Understanding with Regional Knowledge | Angelika Romanou et.al. | 2411.19799 | null |
2024-11-29 | ChineseWebText 2.0: Large-Scale High-quality Chinese Web Text with Multi-dimensional and fine-grained information | Wanyue Zhang et.al. | 2411.19668 | link |
2024-11-29 | Tortho-Gaussian: Splatting True Digital Orthophoto Maps | Xin Wang et.al. | 2411.19594 | null |
2024-11-29 | Self-Supervised Denoiser Framework | Emilien Valat et.al. | 2411.19593 | null |
2024-11-29 | Contextual Checkerboard Denoise -- A Novel Neural Network-Based Approach for Classification-Aware OCT Image Denoising | Md. Touhidul Islam et.al. | 2411.19549 | link |
2024-11-29 | Subjective and Objective Quality Assessment Methods of Stereoscopic Videos with Visibility Affecting Distortions | Sria Biswas et.al. | 2411.19522 | null |
2024-11-29 | Retrieval-guided Cross-view Image Synthesis | Hongji Yang et.al. | 2411.19510 | null |
2024-11-29 | Fleximo: Towards Flexible Text-to-Human Motion Video Generation | Yuhang Zhang et.al. | 2411.19459 | null |
2024-11-28 | AMO Sampler: Enhancing Text Rendering with Overshooting | Xixi Hu et.al. | 2411.19415 | null |
2024-11-28 | 3D Wasserstein generative adversarial network with dense U-Net based discriminator for preclinical fMRI denoising | Sima Soltanpour et.al. | 2411.19345 | null |
2024-11-28 | Timestep Embedding Tells: It's Time to Cache for Video Diffusion Model | Feng Liu et.al. | 2411.19108 | null |
2024-11-28 | SPAgent: Adaptive Task Decomposition and Model Selection for General Video Generation and Editing | Rong-Cheng Tu et.al. | 2411.18983 | null |
2024-11-28 | Deep Plug-and-Play HIO Approach for Phase Retrieval | Cagatay Isil et.al. | 2411.18967 | null |
2024-12-02 | AC3D: Analyzing and Improving 3D Camera Control in Video Diffusion Transformers | Sherwin Bahmani et.al. | 2411.18673 | null |
2024-11-27 | HoliSDiP: Image Super-Resolution via Holistic Semantics and Diffusion Prior | Li-Yuan Tsao et.al. | 2411.18662 | link |
2024-11-27 | Textured Gaussians for Enhanced 3D Scene Appearance Modeling | Brian Chao et.al. | 2411.18625 | null |
2024-11-27 | Uncertainty-driven Sampling for Efficient Pairwise Comparison Subjective Assessment | Shima Mohammadi et.al. | 2411.18372 | link |
2024-11-29 | HUPE: Heuristic Underwater Perceptual Enhancement with Semantic Collaborative Learning | Zengxi Zhang et.al. | 2411.18296 | link |
2024-11-27 | Deep End-to-end Adaptive k-Space Sampling, Reconstruction, and Registration for Dynamic MRI | George Yiasemis et.al. | 2411.18249 | null |
2024-11-27 | Towards Improved Objective Perceptual Audio Quality Assessment -- Part 1: A Novel Data-Driven Cognitive Model | Pablo M. Delgado et.al. | 2411.18222 | null |
2024-11-27 | KAN See Your Face | Dong Han et.al. | 2411.18165 | null |
2024-11-27 | Type-R: Automatically Retouching Typos for Text-to-Image Generation | Wataru Shimoda et.al. | 2411.18159 | null |
2024-11-26 | MapEval: Towards Unified, Robust and Efficient SLAM Map Evaluation Framework | Xiangcheng Hu et.al. | 2411.17928 | link |
2024-11-26 | SVGDreamer++: Advancing Editability and Diversity in Text-Guided SVG Generation | Ximing Xing et.al. | 2411.17832 | null |
2024-11-26 | Collaborative Decoding Makes Visual Auto-Regressive Modeling Efficient | Zigeng Chen et.al. | 2411.17787 | link |
2024-11-27 | Diffusion Autoencoders for Few-shot Image Generation in Hyperbolic Space | Lingxiao Li et.al. | 2411.17784 | null |
2024-11-26 | Perceptually Optimized Super Resolution | Volodymyr Karpenko et.al. | 2411.17513 | null |
2024-11-26 | Puzzle Similarity: A Perceptually-guided No-Reference Metric for Artifact Detection in 3D Scene Reconstructions | Nicolai Hermann et.al. | 2411.17489 | null |
2024-11-26 | Structure-Guided MR-to-CT Synthesis with Spatial and Semantic Alignments for Attenuation Correction of Whole-Body PET/MR Imaging | Jiaxu Zheng et.al. | 2411.17488 | null |
2024-11-26 | Dual-Representation Interaction Driven Image Quality Assessment with Restoration Assistance | Jingtong Yue et.al. | 2411.17390 | link |
2024-11-26 | InsightEdit: Towards Better Instruction Following for Image Editing | Yingjing Xu et.al. | 2411.17323 | null |
2024-11-26 | Reward Incremental Learning in Text-to-Image Generation | Maorong Wang et.al. | 2411.17310 | null |
2024-11-26 | Grounding-IQA: Multimodal Language Grounding Model for Image Quality Assessment | Zheng Chen et.al. | 2411.17237 | link |
2024-11-26 | AIGV-Assessor: Benchmarking and Evaluating the Perceptual Quality of Text-to-Video Generation with LMM | Jiarui Wang et.al. | 2411.17221 | link |
2024-11-26 | ChatGen: Automatic Text-to-Image Generation From FreeStyle Chatting | Chengyou Jia et.al. | 2411.17176 | null |
2024-11-26 | OSDFace: One-Step Diffusion Model for Face Restoration | Jingkai Wang et.al. | 2411.17163 | link |
2024-11-26 | Motion Free B-frame Coding for Neural Video Compression | Van Thang Nguyen et.al. | 2411.17160 | null |
2024-11-26 | 4D Scaffold Gaussian Splatting for Memory Efficient Dynamic Scene Reconstruction | Woong Oh Cho et.al. | 2411.17044 | null |
2024-11-26 | TED-VITON: Transformer-Empowered Diffusion Models for Virtual Try-On | Zhenchen Wan et.al. | 2411.17017 | link |
2024-11-25 | G2SDF: Surface Reconstruction from Explicit Gaussians with Implicit SDFs | Kunyi Li et.al. | 2411.16898 | null |
2024-11-25 | Fully Automatic Deep Learning Pipeline for Whole Slide Image Quality Assessment | Falah Jabar et.al. | 2411.16885 | null |
2024-11-25 | LegoPET: Hierarchical Feature Guided Conditional Diffusion for PET Image Reconstruction | Yiran Sun et.al. | 2411.16629 | link |
2024-11-25 | Human-Activity AGV Quality Assessment: A Benchmark Dataset and an Objective Evaluation Metric | Zhichao Zhang et.al. | 2411.16619 | null |
2024-11-25 | Coherence Based Sound Speed Aberration Correction -- with clinical validation in obstetric ultrasound | Anders Emil Vrålstad et.al. | 2411.16551 | null |
2024-11-25 | Synthesising Handwritten Music with GANs: A Comprehensive Evaluation of CycleWGAN, ProGAN, and DCGAN | Elona Shatri et.al. | 2411.16405 | null |
2024-11-25 | Human-Calibrated Automated Testing and Validation of Generative Language Models | Agus Sudjianto et.al. | 2411.16391 | null |
2024-11-25 | Bounds for the maximum modulus of polynomial roots with nearly optimal worst-case overestimation | Prashant Batra et.al. | 2411.16385 | null |
2024-11-25 | Privacy-Preserving Federated Foundation Model for Generalist Ultrasound Artificial Intelligence | Yuncheng Jiang et.al. | 2411.16380 | null |
2024-11-25 | Sonic: Shifting Focus to Global Audio Perception in Portrait Animation | Xiaozhong Ji et.al. | 2411.16331 | null |
2024-11-25 | EPS: Efficient Patch Sampling for Video Overfitting in Deep Super-Resolution Model Training | Yiying Wei et.al. | 2411.16312 | null |
2024-11-25 | Weakly supervised image segmentation for defect-based grading of fresh produce | Manuel Knott et.al. | 2411.16219 | link |
2024-11-25 | VIRES: Video Instance Repainting with Sketch and Text Guidance | Shuchen Weng et.al. | 2411.16199 | null |
2024-11-25 | Image Generation Diversity Issues and How to Tame Them | Mischa Dombrowski et.al. | 2411.16171 | link |
2024-11-25 | ENCLIP: Ensembling and Clustering-Based Contrastive Language-Image Pretraining for Fashion Multimodal Search with Limited Data and Low-Quality Images | Prithviraj Purushottam Naik et.al. | 2411.16096 | null |
2024-11-25 | AI-Generated Image Quality Assessment Based on Task-Specific Prompt and Multi-Granularity Similarity | Jili Xia et.al. | 2411.16087 | null |
2024-11-24 | Distribution models of antennas in radio astronomy: Efficiency comparison of the golden spiral interferometry | Elio Quiroga Rodriguez et.al. | 2411.15904 | null |
2024-11-24 | A review on Machine Learning based User-Centric Multimedia Streaming Techniques | Monalisa Ghosh et.al. | 2411.15801 | null |
2024-11-24 | LTCF-Net: A Transformer-Enhanced Dual-Channel Fourier Framework for Low-Light Image Restoration | Gaojing Zhang et.al. | 2411.15740 | null |
2024-11-23 | SPA: Efficient User-Preference Alignment against Uncertainty in Medical Image Segmentation | Jiayuan Zhu et.al. | 2411.15513 | null |
2024-11-23 | Automatic Evaluation for Text-to-image Generation: Task-decomposed Framework, Distilled Training, and Meta-evaluation Benchmark | Rong-Cheng Tu et.al. | 2411.15488 | link |
2024-11-22 | HeadRouter: A Training-free Image Editing Framework for MM-DiTs by Adaptively Routing Attention Heads | Yu Xu et.al. | 2411.15034 | null |
2024-11-22 | FloAt: Flow Warping of Self-Attention for Clothing Animation Generation | Swasti Shreya Mishra et.al. | 2411.15028 | null |
2024-11-22 | Information Extraction from Heterogenous Documents without Ground Truth Labels using Synthetic Label Generation and Knowledge Distillation | Aniket Bhattacharyya et.al. | 2411.14957 | null |
2024-11-22 | Evaluating Vision Transformer Models for Visual Quality Control in Industrial Manufacturing | Miriam Alber et.al. | 2411.14953 | link |
2024-11-22 | Fast High-Quality Enhanced Imaging Algorithm for Layered Dielectric Targets Based on MMW MIMO-SAR System | Xu Chen et.al. | 2411.14837 | null |
2024-11-22 | BrightVAE: Luminosity Enhancement in Underexposed Endoscopic Images | Farzaneh Koohestani et.al. | 2411.14663 | null |
2024-11-22 | VQalAttent: a Transparent Speech Generation Pipeline based on Transformer-learned VQ-VAE Latent Space | Armani Rodriguez et.al. | 2411.14642 | null |
2024-11-21 | Unveiling the Hidden: A Comprehensive Evaluation of Underwater Image Enhancement and Its Impact on Object Detection | Ali Awad et.al. | 2411.14626 | null |
2024-11-21 | Optimal Transcoding Preset Selection for Live Video Streaming | Zahra Nabizadeh et.al. | 2411.14613 | null |
2024-11-21 | Roadmap on Advances in Visual and Physiological Optics | Jesús E. Gómez-Correa et.al. | 2411.14606 | null |
2024-11-21 | Night-to-Day Translation via Illumination Degradation Disentanglement | Guanzhou Lan et.al. | 2411.14504 | null |
2024-11-21 | Regional Attention for Shadow Removal | Hengxing Liu et.al. | 2411.14201 | link |
2024-11-21 | Image Compression Using Novel View Synthesis Priors | Luyuan Peng et.al. | 2411.13862 | null |
2024-11-21 | Detecting Human Artifacts from Text-to-Image Models | Kaihong Wang et.al. | 2411.13842 | link |
2024-11-21 | Robust Steganography with Boundary-Preserving Overflow Alleviation and Adaptive Error Correction | Yu Cheng et.al. | 2411.13819 | null |
2024-11-21 | Edge-Cloud Routing for Text-to-Image Model with Token-Level Multi-Metric Prediction | Zewei Xin et.al. | 2411.13787 | null |
2024-11-20 | What You See Is What Matters: A Novel Visual and Physics-Based Metric for Evaluating Video Generation Quality | Zihan Wang et.al. | 2411.13609 | null |
2024-11-20 | HF-Diff: High-Frequency Perceptual Loss and Distribution Matching for One-Step Diffusion-Based Image Super-Resolution | Shoaib Meraj Sami et.al. | 2411.13548 | null |
2024-11-20 | RTSR: A Real-Time Super-Resolution Model for AV1 Compressed Content | Yuxuan Jiang et.al. | 2411.13362 | null |
2024-11-20 | OceanLens: An Adaptive Backscatter and Edge Correction using Deep Learning Model for Enhanced Underwater Imaging | Rajini Makam et.al. | 2411.13230 | link |
2024-11-20 | ESARM: 3D Emotional Speech-to-Animation via Reward Model from Automatically-Ranked Demonstrations | Xulong Zhang et.al. | 2411.13089 | null |
2024-11-20 | LMM-driven Semantic Image-Text Coding for Ultra Low-bitrate Learned Image Compression | Shimon Murai et.al. | 2411.13033 | link |
2024-11-19 | HyperGAN-CLIP: A Unified Framework for Domain Adaptation, Image Synthesis and Manipulation | Abdul Basit Anees et.al. | 2411.12832 | link |
2024-11-19 | Mitigating Perception Bias: A Training-Free Approach to Enhance LMM for Image Quality Assessment | Siyi Pan et.al. | 2411.12791 | null |
2024-11-19 | Stochastic BIQA: Median Randomized Smoothing for Certified Blind Image Quality Assessment | Ekaterina Shumitskaya et.al. | 2411.12575 | null |
2024-11-19 | PR-ENDO: Physically Based Relightable Gaussian Splatting for Endoscopy | Joanna Kaleta et.al. | 2411.12510 | link |
2024-11-19 | A | Abdul Halim et.al. | 2411.12457 | null |
2024-11-19 | Frequency-Aware Guidance for Blind Image Restoration via Diffusion Models | Jun Xiao et.al. | 2411.12450 | null |
2024-11-19 | Acquire Precise and Comparable Fundus Image Quality Score: FTHNet and FQS Dataset | Zheng Gong et.al. | 2411.12273 | null |
2024-11-19 | Performance of Large Language Models in Technical MRI Question Answering: A Comparative Study | Alan B McMillan et.al. | 2411.12238 | null |
2024-11-19 | Tangential Randomization in Linear Bandits (TRAiL): Guaranteed Inference and Regret Bounds | Arda Güçlü et.al. | 2411.12154 | null |
2024-11-18 | FruitNinja: 3D Object Interior Texture Generation with Gaussian Splatting | Fangyu Wu et.al. | 2411.12089 | null |
2024-11-18 | Edge-Enhanced Dilated Residual Attention Network for Multimodal Medical Image Fusion | Meng Zhou et.al. | 2411.11799 | link |
2024-11-18 | Additional Tests for TV 3.0 | Eduardo Peixoto et.al. | 2411.11755 | null |
2024-11-18 | Towards Degradation-Robust Reconstruction in Generalizable NeRF | Chan Ho Park et.al. | 2411.11691 | null |
2024-11-18 | CLUE-MARK: Watermarking Diffusion Models using CLWE | Kareem Shehata et.al. | 2411.11434 | null |
2024-11-17 | BVI-CR: A Multi-View Human Dataset for Volumetric Video Compression | Ge Gao et.al. | 2411.11199 | link |
2024-11-17 | Enhanced Anime Image Generation Using USE-CMHSA-GAN | J. Lu et.al. | 2411.11179 | null |
2024-11-17 | Pitch-and-Spectrum-Aware Singing Quality Assessment with Bias Correction and Model Fusion | Yu-Fei Shi et.al. | 2411.11123 | null |
2024-11-17 | MolParser: End-to-end Visual Recognition of Molecule Structures in the Wild | Xi Fang et.al. | 2411.11098 | null |
2024-11-17 | Spectral Subspace Clustering for Attributed Graphs | Xiaoyang Lin et.al. | 2411.11074 | link |
2024-11-17 | Skeleton-Guided Spatial-Temporal Feature Learning for Video-Based Visible-Infrared Person Re-Identification | Wenjia Jiang et.al. | 2411.11069 | null |
2024-11-17 | Hyperspectral Imaging-Based Grain Quality Assessment With Limited Labelled Data | Priyabrata Karmakar et.al. | 2411.10924 | null |
2024-11-16 | HJ-Ky-0.1: an Evaluation Dataset for Kyrgyz Word Embeddings | Anton Alekseev et.al. | 2411.10724 | link |
2024-11-15 | M-VAR: Decoupled Scale-wise Autoregressive Modeling for High-Quality Image Generation | Sucheng Ren et.al. | 2411.10433 | link |
2024-11-15 | On the Foundation Model for Cardiac MRI Reconstruction | Chi Zhang et.al. | 2411.10403 | null |
2024-11-15 | Modification Takes Courage: Seamless Image Stitching via Reference-Driven Inpainting | Ziqi Xie et.al. | 2411.10309 | link |
2024-11-15 | The Unreasonable Effectiveness of Guidance for Diffusion Models | Tim Kaiser et.al. | 2411.10257 | null |
2024-11-15 | Block based Adaptive Compressive Sensing with Sampling Rate Control | Kosuke Iwama et.al. | 2411.10200 | null |
2024-11-15 | Visual question answering based evaluation metrics for text-to-image generation | Mizuki Miyamoto et.al. | 2411.10183 | null |
2024-11-15 | SEAGULL: No-reference Image Quality Assessment for Regions of Interest via Vision-Language Instruction Tuning | Zewen Chen et.al. | 2411.10161 | link |
2024-11-15 | Towards Multi-View Consistent Style Transfer with One-Step Diffusion via Vision Conditioning | Yushen Zuo et.al. | 2411.10130 | null |
2024-11-15 | EveGuard: Defeating Vibration-based Side-Channel Eavesdropping with Audio Adversarial Perturbations | Jung-Woo Chang et.al. | 2411.10034 | null |
2024-11-14 | Video Denoising in Fluorescence Guided Surgery | Trevor Seets et.al. | 2411.09798 | null |
2024-11-14 | Research evaluation with ChatGPT: Is it age, country, length, or field biased? | Mike Thelwall et.al. | 2411.09768 | null |
2024-11-14 | Evaluating the Predictive Capacity of ChatGPT for Academic Peer Review Outcomes Across Multiple Platforms | Mike Thelwall et.al. | 2411.09763 | null |
2024-11-14 | MFTIQ: Multi-Flow Tracker with Independent Matching Quality Estimation | Jonas Serych et.al. | 2411.09551 | link |
2024-11-14 | GAN-Based Architecture for Low-dose Computed Tomography Imaging Denoising | Yunuo Wang et.al. | 2411.09512 | null |
2024-11-14 | Iterative tomographic reconstruction with TV prior for low-dose CBCT dental imaging | Louise Friot-Giroux et.al. | 2411.09306 | null |
2024-11-14 | LLV-FSR: Exploiting Large Language-Vision Prior for Face Super-resolution | Chenyang Wang et.al. | 2411.09293 | null |
2024-11-14 | LES-Talker: Fine-Grained Emotion Editing for Talking Head Generation in Linear Emotion Space | Guanwen Feng et.al. | 2411.09268 | null |
2024-11-14 | JoyVASA: Portrait and Animal Image Animation with Diffusion-Based Audio-Driven Facial Dynamics and Head Motion Generation | Xuyang Cao et.al. | 2411.09209 | link |
2024-11-14 | Orthogonal Linear Array based Product Beamforming for Real Time Underwater 3D Acoustical Imaging | Mimisha M Menakath et.al. | 2411.09197 | null |
2024-11-14 | Advancing Diffusion Models: Alias-Free Resampling and Enhanced Rotational Equivariance | Md Fahim Anjum et.al. | 2411.09174 | null |
2024-11-13 | Scale Contrastive Learning with Selective Attentions for Blind Image Quality Assessment | Zihao Huang et.al. | 2411.09007 | null |
2024-11-13 | Causal Explanations for Image Classifiers | Hana Chockler et.al. | 2411.08875 | link |
2024-11-13 | A novel imaging setup for hybrid radiotherapy tailored PET/MR in patients with head and neck cancer | R. M. Winter et.al. | 2411.08783 | null |
2024-11-13 | Robust Divergence Learning for Missing-Modality Segmentation | Runze Cheng et.al. | 2411.08305 | null |
2024-11-13 | Numerical Analysis of Lensless Imaging with Active Metasurfaces and Single-Pixel Detectors | Julie Belleville et.al. | 2411.08282 | null |
2024-11-12 | DuoLift-GAN: Reconstructing CT from Single-view and Biplanar X-Rays with Generative Adversarial Networks | Zhaoxi Zhang et.al. | 2411.07941 | null |
2024-11-12 | Learning Disentangled Representations for Perceptual Point Cloud Quality Assessment via Mutual Information Minimization | Ziyu Shan et.al. | 2411.07936 | null |
2024-11-12 | CT-Mamba: A Hybrid Convolutional State Space Model for Low-Dose CT Denoising | Linxuan Li et.al. | 2411.07930 | link |
2024-11-12 | Joint multi-dimensional dynamic attention and transformer for general image restoration | Huan Zhang et.al. | 2411.07893 | link |
2024-11-12 | No-Reference Point Cloud Quality Assessment via Graph Convolutional Network | Wu Chen et.al. | 2411.07728 | null |
2024-11-12 | SegQC: a segmentation network-based framework for multi-metric segmentation quality control and segmentation error detection in volumetric medical images | Bella Specktor-Fadida et.al. | 2411.07601 | null |
2024-11-12 | IR image databases generation under target intrinsic thermal variability constraints | Jerome Gilles et.al. | 2411.07577 | null |
2024-11-12 | Multi-task Feature Enhancement Network for No-Reference Image Quality Assessment | Li Yu et.al. | 2411.07556 | null |
2024-11-12 | A Novel Automatic Real-time Motion Tracking Method for Magnetic Resonance Imaging-guided Radiotherapy: Leveraging the Enhanced Tracking-Learning-Detection Framework with Automatic Segmentation | Shengqi Chen et.al. | 2411.07503 | null |
2024-11-12 | An Exploration of Parallel Imaging System for Very-low Field (50mT) MRI Scanner | Lei Yang et.al. | 2411.07489 | null |
2024-11-11 | Evaluating Detection Thresholds: The Impact of False Positives and Negatives on Super-Resolution Ultrasound Localization Microscopy | Sepideh K. Gharamaleki et.al. | 2411.07426 | null |
2024-11-11 | Exploring Variational Autoencoders for Medical Image Generation: A Comprehensive Study | Khadija Rais et.al. | 2411.07348 | null |
2024-11-11 | Artificial Intelligence-Informed Handheld Breast Ultrasound for Screening: A Systematic Review of Diagnostic Test Accuracy | Arianna Bunnell et.al. | 2411.07322 | null |
2024-11-11 | GPU-Accelerated Inverse Lithography Towards High Quality Curvy Mask Generation | Haoyu Yang et.al. | 2411.07311 | null |
2024-11-11 | A Hierarchical Compression Technique for 3D Gaussian Splatting Compression | He Huang et.al. | 2411.06976 | null |
2024-11-11 | Multi-scale Frequency Enhancement Network for Blind Image Deblurring | Yawen Xiang et.al. | 2411.06893 | null |
2024-11-11 | Wavehax: Aliasing-Free Neural Waveform Synthesis Based on 2D Convolution and Harmonic Prior for Reliable Complex Spectrogram Estimation | Reo Yoneyama et.al. | 2411.06807 | null |
2024-11-11 | Machine vision-aware quality metrics for compressed image and video assessment | Mikhail Dremin et.al. | 2411.06776 | null |
2024-11-11 | Loss-tolerant neural video codec aware congestion control for real time video communication | Zhengxu Xia et.al. | 2411.06742 | null |
2024-11-11 | 360-Degree Video Super Resolution and Quality Enhancement Challenge: Methods and Results | Ahmed Telili et.al. | 2411.06738 | null |
2024-11-11 | Accelerating Low-field MRI: Compressed Sensing and AI for fast noise-robust imaging | Efrat Shimron et.al. | 2411.06704 | link |
2024-11-10 | CASC: Condition-Aware Semantic Communication with Latent Diffusion Models | Weixuan Chen et.al. | 2411.06552 | null |
2024-11-08 | A Modular Conditional Diffusion Framework for Image Reconstruction | Magauiya Zhussip et.al. | 2411.05993 | null |
2024-11-08 | Fine-Grained Reward Optimization for Machine Translation using Error Severity Mappings | Miguel Moura Ramos et.al. | 2411.05986 | null |
2024-11-08 | Dictionary Learning with Convolutional Structure for Seismic Data Denoising and Interpolation | Murad Almadani et.al. | 2411.05956 | null |
2024-11-08 | Alternative Learning Paradigms for Image Quality Transfer | Ahmed Karam Eldaly et.al. | 2411.05885 | null |
2024-11-08 | Benchmarking 3D multi-coil NC-PDNet MRI reconstruction | Asma Tanabene et.al. | 2411.05883 | null |
2024-11-08 | Evaluating Large Language Model Capability in Vietnamese Fact-Checking Data Generation | Long Truong To et.al. | 2411.05641 | null |
2024-11-08 | DeepArUco++: Improved detection of square fiducial markers in challenging lighting conditions | Rafael Berral-Soler et.al. | 2411.05552 | link |
2024-11-08 | Improving image synthesis with diffusion-negative sampling | Alakh Desai et.al. | 2411.05473 | null |
2024-11-08 | RED: Residual Estimation Diffusion for Low-Dose PET Sinogram Reconstruction | Xingyu Ai et.al. | 2411.05354 | null |
2024-11-08 | Enhancing Depth Image Estimation for Underwater Robots by Combining Image Processing and Machine Learning | Quang Truong Nguyen et.al. | 2411.05344 | null |
2024-11-08 | A Quality-Centric Framework for Generic Deepfake Detection | Wentang Song et.al. | 2411.05335 | null |
2024-11-08 | Adaptive Whole-Body PET Image Denoising Using 3D Diffusion Models with ControlNet | Boxiao Yu et.al. | 2411.05302 | null |
2024-11-07 | Quantum Imaging and Metrology with Undetected squeezed Photons: Noise Canceling and Noise Based Imaging | S. Samimi et.al. | 2411.05175 | null |
2024-11-08 | SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models | Muyang Li et.al. | 2411.05007 | link |
2024-11-07 | Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation Models | Weixin Liang et.al. | 2411.04996 | null |
2024-11-07 | SG-I2V: Self-Guided Trajectory Control in Image-to-Video Generation | Koichi Namekata et.al. | 2411.04989 | null |
2024-11-07 | Uncovering Hidden Subspaces in Video Diffusion Models Using Re-Identification | Mischa Dombrowski et.al. | 2411.04956 | null |
2024-11-07 | MVSplat360: Feed-Forward 360 Scene Synthesis from Sparse Views | Yuedong Chen et.al. | 2411.04924 | link |
2024-11-07 | Differentiable Gaussian Representation for Incomplete CT Reconstruction | Shaokai Wu et.al. | 2411.04844 | null |
2024-11-07 | Controlling Human Shape and Pose in Text-to-Image Diffusion Models via Domain Adaptation | Benito Buchheim et.al. | 2411.04724 | null |
2024-11-06 | Multi-Reward as Condition for Instruction-based Image Editing | Xin Gu et.al. | 2411.04713 | null |
2024-11-06 | SEE-DPO: Self Entropy Enhanced Direct Preference Optimization | Shivanshu Shekhar et.al. | 2411.04712 | null |
2024-11-07 | Generative Semantic Communications with Foundation Models: Perception-Error Analysis and Semantic-Aware Power Allocation | Chunmei Xu et.al. | 2411.04575 | null |
2024-11-07 | Bayesian Calibration of Win Rate Estimation with LLM Evaluators | Yicheng Gao et.al. | 2411.04424 | link |
2024-11-07 | A Pre-training Framework that Encodes Noise Information for Speech Quality Assessment | Subrina Sultana et.al. | 2411.04379 | null |
2024-11-06 | X-ray Single-Pixel Imaging with MPGD-based detectors | M. Simões et.al. | 2411.03907 | null |
2024-11-06 | VQA | Ziheng Jia et.al. | 2411.03795 | link |
2024-11-06 | MOS-Bench: Benchmarking Generalization Abilities of Subjective Speech Quality Assessment Models | Wen-Chin Huang et.al. | 2411.03715 | link |
2024-11-06 | Evaluating Eye Tracking Signal Quality with Real-time Gaze Interaction Simulation | Mehedi Hasan Raju et.al. | 2411.03708 | null |
2024-11-06 | Investigation of Inward-Outward Ring Permanent Magnet Array for Portable Magnetic Resonance Imaging (MRI) | Ting-Ou Liang et.al. | 2411.03249 | null |
2024-11-05 | The Impact of Medicaid Expansion on Medicare Quality Measures | Hala Algrain et.al. | 2411.03140 | null |
2024-11-05 | Investigating the Applicability of a Snapshot Computed Tomography Imaging Spectrometer for the Prediction of Brix and pH of Grapes | Mads Svanborg Peters et.al. | 2411.03114 | null |
2024-11-05 | Advances in Photoacoustic Imaging Reconstruction and Quantitative Analysis for Biomedical Applications | Lei Wang et.al. | 2411.02843 | null |
2024-11-04 | Interaction Design with Generative AI: An Empirical Study of Emerging Strategies Across the Four Phases of Design | Marie Muehlhaus et.al. | 2411.02662 | null |
2024-11-04 | Euclid: High-precision imaging astrometry and photometry from Early Release Observations. I. Internal kinematics of NGC 6397 by combining Euclid and Gaia data | M. Libralato et.al. | 2411.02487 | null |
2024-11-02 | Cross-D Conv: Cross-Dimensional Transferable Knowledge Base via Fourier Shifting Operation | Mehmet Can Yavuz et.al. | 2411.02441 | link |
2024-11-04 | Physically Based Neural Bidirectional Reflectance Distribution Function | Chenliang Zhou et.al. | 2411.02347 | null |
2024-11-04 | Diffusion-based Generative Multicasting with Intent-aware Semantic Decomposition | Xinkai Liu et.al. | 2411.02334 | null |
2024-11-03 | Degradation-Aware Residual-Conditioned Optimal Transport for Unified Image Restoration | Xiaole Tang et.al. | 2411.01656 | link |
2024-11-03 | Optical Flow Representation Alignment Mamba Diffusion Model for Medical Video Generation | Zhenbin Wang et.al. | 2411.01647 | null |
2024-11-03 | TPOT: Topology Preserving Optimal Transport in Retinal Fundus Image Enhancement | Xuanzhao Dong et.al. | 2411.01403 | null |
2024-11-02 | Interacting Large Language Model Agents. Interpretable Models and Social Learning | Adit Jain et.al. | 2411.01271 | null |
2024-11-02 | The impact of MRI image quality on statistical and predictive analysis on voxel based morphology | Felix Hoffstaedter et.al. | 2411.01268 | link |
2024-11-02 | Enhancing Diabetic Retinopathy Detection with CNN-Based Models: A Comparative Study of UNET and Stacked UNET Architectures | Ameya Uppina et.al. | 2411.01251 | null |
2024-11-02 | Real-Time Spatio-Temporal Reconstruction of Dynamic Endoscopic Scenes with 4D Gaussian Splatting | Fengze Li et.al. | 2411.01218 | null |
2024-11-01 | Evaluation Metric for Quality Control and Generative Models in Histopathology Images | Pranav Jeevan et.al. | 2411.01034 | null |
2024-11-01 | Re-thinking Richardson-Lucy without Iteration Cutoffs: Physically Motivated Bayesian Deconvolution | Zachary H. Hendrix et.al. | 2411.00991 | null |
2024-11-01 | Inter-Feature-Map Differential Coding of Surveillance Video | Kei Iino et.al. | 2411.00984 | null |
2024-11-01 | Scalable AI Framework for Defect Detection in Metal Additive Manufacturing | Duy Nhat Phan et.al. | 2411.00960 | null |
2024-11-01 | Intensity Field Decomposition for Tissue-Guided Neural Tomography | Meng-Xun Li et.al. | 2411.00900 | null |
2024-11-01 | CityGaussianV2: Efficient and Geometrically Accurate Reconstruction for Large-Scale Scenes | Yang Liu et.al. | 2411.00771 | null |
2024-11-01 | Face Anonymization Made Simple | Han-Wei Kung et.al. | 2411.00762 | link |
2024-11-01 | Demystifying the use of Compression in Virtual Production | Anil Kokaram et.al. | 2411.00547 | null |
2024-11-01 | MV-Adapter: Enhancing Underwater Instance Segmentation via Adaptive Channel Attention | Lianjun Liu et.al. | 2411.00472 | null |
2024-10-31 | IO Transformer: Evaluating SwinV2-Based Reward Models for Computer Vision | Maxwell Meyer et.al. | 2411.00252 | null |
2024-10-31 | Denoising study of Fluoroscopic Images in real time tumor tracking System based on Statistical model of noise | Yongxuan Yan et.al. | 2411.00199 | null |
2024-10-31 | Enhancing Motion in Text-to-Video Generation with Decomposed Encoding and Conditioning | Penghui Ruan et.al. | 2410.24219 | link |
2024-10-31 | AIDOVECL: AI-generated Dataset of Outpainted Vehicles for Eye-level Classification and Localization | Amir Kazemi et.al. | 2410.24116 | null |
2024-10-31 | Parameter choices in HaarPSI for IQA with medical images | Clemens Karner et.al. | 2410.24098 | link |
2024-10-31 | Advanced Predictive Quality Assessment for Ultrasonic Additive Manufacturing with Deep Learning Model | Lokendra Poudel et.al. | 2410.24055 | null |
2024-10-31 | Image Synthesis with Class-Aware Semantic Diffusion Models for Surgical Scene Segmentation | Yihang Zhou et.al. | 2410.23962 | null |
2024-10-29 | Temporal and Spatial Super Resolution with Latent Diffusion Model in Medical MRI images | Vishal Dubey et.al. | 2410.23898 | null |
2024-10-31 | Cycle-Constrained Adversarial Denoising Convolutional Network for PET Image Denoising: Multi-Dimensional Validation on Large Datasets with Reader Study and Real Low-Dose Data | Yucun Hou et.al. | 2410.23628 | null |
2024-10-31 | LBurst: Learning-Based Robotic Burst Feature Extraction for 3D Reconstruction in Low Light | Ahalya Ravendran et.al. | 2410.23522 | null |
2024-10-30 | Plug-and-play superiorization | Jon Henshaw et.al. | 2410.23401 | null |
2024-10-30 | Redundant Cross-Correlation for Drift Correction in SEM Nanoparticle Imaging | Iago Bischoff Montenegro et.al. | 2410.23390 | link |
2024-10-30 | Variable Resolution Sampling and Deep Learning Image Recovery for Accelerated Multi-Spectral MRI Near Metal Implants | Azadeh Sharafi et.al. | 2410.23329 | null |
2024-10-30 | AdaptiveISP: Learning an Adaptive Image Signal Processor for Object Detection | Yujin Wang et.al. | 2410.22939 | null |
2024-10-30 | Prune and Repaint: Content-Aware Image Retargeting for any Ratio | Feihong Shen et.al. | 2410.22865 | link |
2024-10-30 | Latent Diffusion, Implicit Amplification: Efficient Continuous-Scale Super-Resolution for Remote Sensing Images | Hanlin Wu et.al. | 2410.22830 | null |
2024-10-30 | Diffusion Beats Autoregressive: An Evaluation of Compositional Generation in Text-to-Image Models | Arash Marioriyad et.al. | 2410.22775 | null |
2024-10-30 | st-DTPM: Spatial-Temporal Guided Diffusion Transformer Probabilistic Model for Delayed Scan PET Image Prediction | Ran Hong et.al. | 2410.22732 | null |
2024-10-30 | FlowDCN: Exploring DCN-like Architectures for Fast Image Generation with Arbitrary Resolution | Shuai Wang et.al. | 2410.22655 | null |
2024-10-31 | Consistency Diffusion Bridge Models | Guande He et.al. | 2410.22637 | null |
2024-10-29 | Deep Priors for Video Quality Prediction | Siddharath Narayan Shakya et.al. | 2410.22566 | null |
2024-10-29 | Enhancing Code Annotation Reliability: Generative AI's Role in Comment Quality Assessment Models | Seetharam Killivalavan et.al. | 2410.22323 | null |
2024-10-29 | Multimodal Semantic Communication for Generative Audio-Driven Video Conferencing | Haonan Tong et.al. | 2410.22112 | null |
2024-10-29 | Data Generation for Hardware-Friendly Post-Training Quantization | Lior Dikstein et.al. | 2410.22110 | link |
2024-10-29 | Adapting Diffusion Models for Improved Prompt Compliance and Controllable Image Synthesis | Deepak Sridhar et.al. | 2410.21638 | link |
2024-10-28 | Exploring the Design Space of Diffusion Bridge Models via Stochasticity Control | Shaorong Zhang et.al. | 2410.21553 | null |
2024-10-28 | SpeechQE: Estimating the Quality of Direct Speech Translation | HyoJung Han et.al. | 2410.21485 | link |
2024-10-28 | Kandinsky 3: Text-to-Image Synthesis for Multifunctional Generative Framework | Vladimir Arkhipkin et.al. | 2410.21061 | link |
2024-10-28 | A Simple Yet Effective Corpus Construction Framework for Indonesian Grammatical Error Correction | Nankai Lin et.al. | 2410.20838 | link |
2024-10-28 | FreqMark: Invisible Image Watermarking via Frequency Based Optimization in Latent Space | Yiyang Guo et.al. | 2410.20824 | null |
2024-10-28 | Grid4D: 4D Decomposed Hash Encoding for High-fidelity Dynamic Gaussian Splatting | Jiawei Xu et.al. | 2410.20815 | null |
2024-10-28 | LoDAvatar: Hierarchical Embedding and Adaptive Levels of Detail with Gaussian Splatting for Enhanced Human Avatars | Xiaonuo Dongye et.al. | 2410.20789 | null |
2024-10-28 | CompGS: Unleashing 2D Compositionality for Compositional Text-to-3D via Dynamically Optimizing 3D Gaussians | Chongjian Ge et.al. | 2410.20723 | null |
2024-10-28 | ODGS: 3D Scene Reconstruction from Omnidirectional Images with 3D Gaussian Splattings | Suyoung Lee et.al. | 2410.20686 | link |
2024-10-27 | Normal-GS: 3D Gaussian Splatting with Normal-Involved Rendering | Meng Wei et.al. | 2410.20593 | null |
2024-10-27 | Sebica: Lightweight Spatial and Efficient Bidirectional Channel Attention Super Resolution Network | Chongxiao Liu et.al. | 2410.20546 | link |
2024-10-27 | Enhancing Community Vision Screening -- AI Driven Retinal Photography for Early Disease Detection and Patient Trust | Xiaofeng Lei et.al. | 2410.20309 | null |
2024-10-27 | GUMBEL-NERF: Representing Unseen Objects as Part-Compositional Neural Radiance Fields | Yusuke Sekikawa et.al. | 2410.20306 | null |
2024-10-26 | OAR-Weighted Dice Score: A spatially aware, radiosensitivity aware metric for target structure contour quality assessment | Lucas McCullum et.al. | 2410.20243 | null |
2024-10-26 | Cross-Platform Neural Video Coding: A Case Study | Ruhan Conceição et.al. | 2410.20145 | null |
2024-10-26 | Super-resolved virtual staining of label-free tissue using diffusion models | Yijie Zhang et.al. | 2410.20073 | null |
2024-10-25 | The Galaxy Zoo Catalogs for the Galaxy And Mass Assembly (GAMA) Survey | Benne W. Holwerda et.al. | 2410.19985 | null |
2024-10-25 | FasterCache: Training-Free Video Diffusion Model Acceleration with High Quality | Zhengyao Lv et.al. | 2410.19355 | null |
2024-10-25 | Simpler Diffusion (SiD2): 1.5 FID on ImageNet512 with pixel-space diffusion | Emiel Hoogeboom et.al. | 2410.19324 | null |
2024-10-24 | Optimising image capture for low-light widefield quantitative fluorescence microscopy | Zane Peterkovic et.al. | 2410.19210 | null |
2024-10-24 | Sort-free Gaussian Splatting via Weighted Sum Rendering | Qiqi Hou et.al. | 2410.18931 | null |
2024-10-24 | SafeBench: A Safety Evaluation Framework for Multimodal Large Language Models | Zonghao Ying et.al. | 2410.18927 | null |
2024-10-24 | Robust Watermarking Using Generative Priors Against Image Editing: From Benchmarking to Advances | Shilin Lu et.al. | 2410.18775 | link |
2024-10-24 | Advancements in Image Resolution: Super-Resolution Algorithm for Enhanced EOS-06 OCM-3 Data | Ankur Garg et.al. | 2410.18690 | null |
2024-10-24 | ODDN: Addressing Unpaired Data Challenges in Open-World Deepfake Detection on Online Social Networks | Renshuai Tao et.al. | 2410.18687 | null |
2024-10-24 | Knowledge Distillation Using Frontier Open-source LLMs: Generalizability and the Role of Synthetic Data | Anup Shirgaonkar et.al. | 2410.18588 | null |
2024-10-24 | ToolFlow: Boosting LLM Tool-Calling Through Natural and Coherent Dialogue Synthesis | Zezhong Wang et.al. | 2410.18447 | null |
2024-10-24 | FreCaS: Efficient Higher-Resolution Image Generation via Frequency-aware Cascaded Sampling | Zhengqiang Zhang et.al. | 2410.18410 | link |
2024-10-23 | Neural Cover Selection for Image Steganography | Karl Chahine et.al. | 2410.18216 | link |
2024-10-23 | In-Pixel Foreground and Contrast Enhancement Circuits with Customizable Mapping | Md Rahatul Islam Udoy et.al. | 2410.18052 | null |
2024-10-23 | Scalable Ranked Preference Optimization for Text-to-Image Generation | Shyamgopal Karthik et.al. | 2410.18013 | null |
2024-10-23 | Together We Can: Multilingual Automatic Post-Editing for Low-Resource Languages | Sourabh Deoghare et.al. | 2410.17973 | null |
2024-10-23 | Non-intrusive Speech Quality Assessment with Diffusion Models Trained on Clean Speech | Danilo de Oliveira et.al. | 2410.17834 | null |
2024-10-23 | TopoQA: a topological deep learning-based approach for protein complex structure interface quality assessment | Bingqing Han et.al. | 2410.17815 | null |
2024-10-23 | An Intelligent Agentic System for Complex Image Restoration Problems | Kaiwen Zhu et.al. | 2410.17809 | link |
2024-10-24 | Testing Deep Learning Recommender Systems Models on Synthetic GAN-Generated Datasets | Jesús Bobadilla et.al. | 2410.17651 | null |
2024-10-25 | Comprehensive Evaluation of Matrix Factorization Models for Collaborative Filtering Recommender Systems | Jesús Bobadilla et.al. | 2410.17644 | null |
2024-10-23 | Bilateral Hippocampi Segmentation in Low Field MRIs Using Mutual Feature Learning via Dual-Views | Himashi Peiris et.al. | 2410.17502 | link |
2024-10-21 | MvDrag3D: Drag-based Creative 3D Editing via Multi-view Generation-Reconstruction Priors | Honghua Chen et.al. | 2410.16272 | null |
2024-10-21 | Multispectral Texture Synthesis using RGB Convolutional Neural Networks | Sélim Ollivier et.al. | 2410.16019 | null |
2024-10-22 | Wireless Link Quality Estimation Using LSTM Model | Yuki Kanto et.al. | 2410.15357 | null |
2024-10-19 | A Survey on All-in-One Image Restoration: Taxonomy, Evaluation and Future Trends | Junjun Jiang et.al. | 2410.15067 | link |
2024-10-18 | DRACO: Differentiable Reconstruction for Arbitrary CBCT Orbits | Chengze Ye et.al. | 2410.14900 | link |
2024-10-18 | Dynamic Negative Guidance of Diffusion Models | Felix Koulischer et.al. | 2410.14398 | link |
2024-10-18 | Gaia Data Release 3: spectroscopic binary-star orbital solutions and the SB1 processing chain | E. Gosset et.al. | 2410.14372 | null |
2024-10-18 | 2D-3D Deformable Image Registration of Histology Slide and Micro-CT with ML-based Initialization | Junan Chen et.al. | 2410.14343 | null |
2024-10-18 | Advanced Underwater Image Quality Enhancement via Hybrid Super-Resolution Convolutional Neural Networks and Multi-Scale Retinex-Based Defogging Techniques | Yugandhar Reddy Gogireddy et.al. | 2410.14285 | null |
2024-10-18 | Takin-ADA: Emotion Controllable Audio-Driven Animation with Canonical and Landmark Loss Optimization | Bin Lin et.al. | 2410.14283 | null |
2024-10-18 | Combining Hough Transform and Deep Learning Approaches to Reconstruct ECG Signals From Printouts | Felix Krones et.al. | 2410.14185 | null |
2024-10-18 | Unlabeled Action Quality Assessment Based on Multi-dimensional Adaptive Constrained Dynamic Time Warping | Renguang Chen et.al. | 2410.14161 | null |
2024-10-17 | Generating Signed Language Instructions in Large-Scale Dialogue Systems | Mert İnan et.al. | 2410.14026 | null |
2024-10-17 | Fluid: Scaling Autoregressive Text-to-image Generative Models with Continuous Tokens | Lijie Fan et.al. | 2410.13863 | null |
2024-10-15 | Comparison of Image Preprocessing Techniques for Vehicle License Plate Recognition Using OCR: Performance and Accuracy Evaluation | Renato Augusto Tavares et.al. | 2410.13622 | null |
2024-10-17 | L3DG: Latent 3D Gaussian Diffusion | Barbara Roessle et.al. | 2410.13530 | null |
2024-10-17 | Enhancing Crowdsourced Audio for Text-to-Speech Models | José Giraldo et.al. | 2410.13357 | null |
2024-10-17 | Active inference and deep generative modeling for cognitive ultrasound | Ruud JG van Sloun et.al. | 2410.13310 | null |
2024-10-17 | Latent Image and Video Resolution Prediction using Convolutional Neural Networks | Rittwika Kansabanik et.al. | 2410.13227 | null |
2024-10-17 | Anchored Alignment for Self-Explanations Enhancement | Luis Felipe Villa-Arenas et.al. | 2410.13216 | null |
2024-10-17 | Using RLHF to align speech enhancement approaches to mean-opinion quality scores | Anurag Kumar et.al. | 2410.13182 | null |
2024-10-16 | Super-resolving Real-world Image Illumination Enhancement: A New Dataset and A Conditional Diffusion Model | Yang Liu et.al. | 2410.12961 | null |
2024-10-16 | Embedding an Ethical Mind: Aligning Text-to-Image Synthesis via Lightweight Value Optimization | Xingqi Wang et.al. | 2410.12700 | link |
2024-10-16 | SWIM: An Attention-Only Model for Speech Quality Assessment Under Subjective Variance | Imran E Kibria et.al. | 2410.12675 | null |
2024-10-16 | MambaPainter: Neural Stroke-Based Rendering in a Single Step | Tomoya Sawada et.al. | 2410.12524 | link |
2024-10-16 | Conditional Outcome Equivalence: A Quantile Alternative to CATE | Josh Givens et.al. | 2410.12454 | link |
2024-10-16 | Triplet: Triangle Patchlet for Mesh-Based Inverse Rendering and Scene Parameters Approximation | Jiajie Yang et.al. | 2410.12414 | link |
2024-10-14 | Learnable Optimization-Based Algorithms for Low-Dose CT Reconstruction | Daisy Chen et.al. | 2410.11903 | null |
2024-10-15 | Generative Image Steganography Based on Point Cloud | Zhong Yangjie et.al. | 2410.11673 | null |
2024-10-15 | Fast Local Neural Regression for Low-Cost, Path Traced Lambertian Global Illumination | Arturo Salmi et.al. | 2410.11625 | null |
2024-10-15 | Rician Denoising Diffusion Probabilistic Models For Sodium Breast MRI Enhancement | Shuaiyu Yuan et.al. | 2410.11511 | null |
2024-10-15 | Visual-Geometric Collaborative Guidance for Affordance Learning | Hongchen Luo et.al. | 2410.11363 | link |
2024-10-15 | Evolutionary Retrofitting | Mathurin Videau et.al. | 2410.11330 | null |
2024-10-14 | Watching the Watchers: Exposing Gender Disparities in Machine Translation Quality Estimation | Emmanouil Zaranis et.al. | 2410.10995 | null |
2024-10-14 | LVD-2M: A Long-take Video Dataset with Temporally Dense Captions | Tianwei Xiong et.al. | 2410.10816 | link |
2024-10-14 | Cavia: Camera-controllable Multi-view Video Diffusion with View-Integrated Attention | Dejia Xu et.al. | 2410.10774 | null |
2024-10-14 | LISAC: Learned Coded Waveform Design for ISAC with OFDM | Chenghong Bian et.al. | 2410.10711 | null |
2024-10-14 | A Novel No-Reference Image Quality Metric For Assessing Sharpness In Satellite Imagery | Lucas Gonzalo Antonel et.al. | 2410.10488 | null |
2024-10-14 | Two-Stage Approach for Brain MR Image Synthesis: 2D Image Synthesis and 3D Refinement | Jihoon Cho et.al. | 2410.10269 | null |
2024-10-14 | Saliency Guided Optimization of Diffusion Latents | Xiwen Wang et.al. | 2410.10257 | null |
2024-10-14 | QE-EBM: Using Quality Estimators as Energy Loss for Machine Translation | Gahyun Yoo et.al. | 2410.10228 | null |
2024-10-14 | Automated Filtering of Human Feedback Data for Aligning Text-to-Image Diffusion Models | Yongjin Yang et.al. | 2410.10166 | null |
2024-10-14 | StegaINR4MIH: steganography by implicit neural representation for multi-image hiding | Weina Dong et.al. | 2410.10117 | link |
2024-10-13 | Crowd IQ -- Aggregating Opinions to Boost Performance | Michal Kosinski et.al. | 2410.10004 | null |
2024-10-13 | Combining Generative and Geometry Priors for Wide-Angle Portrait Correction | Lan Yao et.al. | 2410.09911 | link |
2024-10-13 | Two-Stage Human Verification using HandCAPTCHA and Anti-Spoofed Finger Biometrics with Feature Selection | Asish Bera et.al. | 2410.09866 | null |
2024-10-12 | Preserving Old Memories in Vivid Detail: Human-Interactive Photo Restoration Framework | Seung-Yeon Back et.al. | 2410.09529 | null |
2024-10-12 | Fine-grained subjective visual quality assessment for high-fidelity compressed images | Michela Testolina et.al. | 2410.09501 | link |
2024-10-12 | Enhancing Single Image to 3D Generation using Gaussian Splatting and Hybrid Diffusion Priors | Hritam Basak et.al. | 2410.09467 | null |
2024-10-11 | TD-Paint: Faster Diffusion Inpainting Through Time Aware Pixel Conditioning | Tsiry Mayet et.al. | 2410.09306 | null |
2024-10-11 | SceneCraft: Layout-Guided 3D Scene Generation | Xiuyu Yang et.al. | 2410.09049 | link |
2024-10-11 | Learning Interaction-aware 3D Gaussian Splatting for One-shot Hand Avatars | Xuan Huang et.al. | 2410.08840 | link |
2024-10-11 | Towards virtual painting recolouring using Vision Transformer on X-Ray Fluorescence datacubes | Alessandro Bombini et.al. | 2410.08826 | null |
2024-10-11 | A Theoretical Framework for AI-driven data quality monitoring in high-volume data environments | Nikhil Bangad et.al. | 2410.08576 | null |
2024-10-11 | Context-Aware Full Body Anonymization using Text-to-Image Diffusion Models | Pascal Zwick et.al. | 2410.08551 | link |
2024-10-11 | Quality Prediction of AI Generated Images and Videos: Emerging Trends and Opportunities | Abhijay Ghildyal et.al. | 2410.08534 | null |
2024-10-10 | Koala-36M: A Large-scale Video Dataset Improving Consistency between Fine-grained Conditions and Video Content | Qiuheng Wang et.al. | 2410.08260 | null |
2024-10-10 | Exploring ASR-Based Wav2Vec2 for Automated Speech Disorder Assessment: Insights and Analysis | Tuan Nguyen et.al. | 2410.08250 | null |
2024-10-10 | ZeroComp: Zero-shot Object Compositing from Image Intrinsics via Diffusion | Zitian Zhang et.al. | 2410.08168 | link |
2024-10-10 | Efficient Perspective-Correct 3D Gaussian Splatting Using Hybrid Transparency | Florian Hahlbohm et.al. | 2410.08129 | null |
2024-10-10 | Medical Image Quality Assessment based on Probability of Necessity and Sufficiency | Boyu Chen et.al. | 2410.08118 | null |
2024-10-10 | High-redshift LBG selection from broadband and wide photometric surveys using a Random Forest algorithm | C. Payerne et.al. | 2410.08062 | null |
2024-10-10 | Modeling User Preferences with Automatic Metrics: Creating a High-Quality Preference Dataset for Machine Translation | Sweta Agrawal et.al. | 2410.07779 | null |
2024-10-10 | Synthesizing Multi-Class Surgical Datasets with Anatomy-Aware Diffusion Models | Danush Kumar Venkatesh et.al. | 2410.07753 | link |
2024-10-10 | Multi-Facet Counterfactual Learning for Content Quality Evaluation | Jiasheng Zheng et.al. | 2410.07693 | null |
2024-10-10 | DPL: Cross-quality DeepFake Detection via Dual Progressive Learning | Dongliang Zhang et.al. | 2410.07633 | null |
2024-10-10 | Rank Aggregation in Crowdsourcing for Listwise Annotations | Wenshui Luo et.al. | 2410.07538 | null |
2024-10-10 | A 3D-Printed Table for Hybrid X-ray CT and Optical Imaging of a Live Mouse | Wenxuan Xue et.al. | 2410.07517 | null |
2024-10-09 | An undetectable watermark for generative image models | Sam Gunn et.al. | 2410.07369 | link |
2024-10-09 | Secure Video Quality Assessment Resisting Adversarial Attacks | Ao-Xiang Zhang et.al. | 2410.06866 | null |
2024-10-09 | Diff-FMT: Diffusion Models for Fluorescence Molecular Tomography | Qianqian Xue et.al. | 2410.06757 | null |
2024-10-09 | MimicTalk: Mimicking a personalized and expressive 3D talking face in minutes | Zhenhui Ye et.al. | 2410.06734 | null |
2024-10-09 | Perceptual Quality Assessment of Octree-RAHT Encoded 3D Point Clouds | Dongshuai Duan et.al. | 2410.06729 | link |
2024-10-09 | Perceptual Quality Assessment of Trisoup-Lifting Encoded 3D Point Clouds | Juncheng Long et.al. | 2410.06689 | link |
2024-10-09 | SCOREQ: Speech Quality Assessment with Contrastive Regression | Alessandro Ragano et.al. | 2410.06675 | link |
2024-10-09 | InstantIR: Blind Image Restoration with Instant Generative Reference | Jen-Yuan Huang et.al. | 2410.06551 | null |
2024-10-08 | Are Large Language Models State-of-the-art Quality Estimators for Machine Translation of User-generated Content? | Shenbin Qian et.al. | 2410.06338 | link |
2024-10-08 | Automated quality assessment using appearance-based simulations and hippocampus segmentation on low-field paediatric brain MR images | Vaanathi Sundaresan et.al. | 2410.06161 | link |
2024-10-08 | Toward Scalable Image Feature Compression: A Content-Adaptive and Diffusion-Based Approach | Sha Guo et.al. | 2410.06149 | null |
2024-10-08 | AP-LDM: Attentive and Progressive Latent Diffusion Model for Training-Free High-Resolution Image Generation | Boyuan Cao et.al. | 2410.06055 | link |
2024-10-08 | Is the MMI Criterion Necessary for Interpretability? Degenerating Non-causal Features to Plain Noise for Self-Rationalization | Wei Liu et.al. | 2410.06003 | link |
2024-10-08 | Integrating Online Learning and Connectivity Maintenance for Communication-Aware Multi-Robot Coordination | Yupeng Yang et.al. | 2410.05798 | link |
2024-10-08 | T2V-Turbo-v2: Enhancing Video Generation Model Post-Training through Data, Reward, and Conditional Guidance Design | Jiachen Li et.al. | 2410.05677 | null |
2024-10-08 | Holistic Unlearning Benchmark: A Multi-Faceted Evaluation for Text-to-Image Diffusion Model Unlearning | Saemi Moon et.al. | 2410.05664 | null |
2024-10-08 | Rethinking Reward Model Evaluation: Are We Barking up the Wrong Tree? | Xueru Wen et.al. | 2410.05584 | null |
2024-10-07 | Image Watermarks are Removable Using Controllable Regeneration from Clean Noise | Yepeng Liu et.al. | 2410.05470 | null |
2024-10-07 | SharpSLAM: 3D Object-Oriented Visual SLAM with Deblurring for Agile Drones | Denis Davletshin et.al. | 2410.05405 | null |
2024-10-07 | Towards a Modern and Lightweight Rendering Engine for Dynamic Robotic Simulations | Christopher John Allison et.al. | 2410.05095 | null |
2024-10-07 | Real-time cardiac cine MRI -- A comparison of a diffusion probabilistic model with alternative state-of-the-art image reconstruction techniques for undersampled spiral acquisitions | Oliver Schad et.al. | 2410.04843 | null |
2024-10-07 | Learning Efficient and Effective Trajectories for Differential Equation-based Image Restoration | Zhiyu Zhu et.al. | 2410.04811 | link |
2024-10-07 | Transforming Color: A Novel Image Colorization Method | Hamza Shafiq et.al. | 2410.04799 | null |
2024-10-07 | CAR: Controllable Autoregressive Modeling for Visual Generation | Ziyu Yao et.al. | 2410.04671 | link |
2024-10-07 | Federated Learning Nodes Can Reconstruct Peers' Image Data | Ethan Wilson et.al. | 2410.04661 | null |
2024-10-06 | Towards Unsupervised Blind Face Restoration using Diffusion Prior | Tianshu Kuai et.al. | 2410.04618 | null |
2024-10-06 | How Does the Disclosure of AI Assistance Affect the Perceptions of Writing? | Zhuoyan Li et.al. | 2410.04545 | null |
2024-10-06 | VideoGuide: Improving Video Diffusion Models without Training Through a Teacher's Guide | Dohun Lee et.al. | 2410.04364 | null |
2024-10-05 | Persona Knowledge-Aligned Prompt Tuning Method for Online Debate | Chunkit Chan et.al. | 2410.04239 | link |
2024-10-05 | AIM 2024 Challenge on Video Super-Resolution Quality Assessment: Methods and Results | Ivan Molodetskikh et.al. | 2410.04225 | null |
2024-10-05 | Deep Transfer Learning Based Peer Review Aggregation and Meta-review Generation for Scientific Articles | Md. Tarek Hasan et.al. | 2410.04202 | null |
2024-10-05 | Overcoming False Illusions in Real-World Face Restoration with Multi-Modal Guided Diffusion Model | Keda Tao et.al. | 2410.04161 | null |
2024-10-05 | Can the Variation of Model Weights be used as a Criterion for Self-Paced Multilingual NMT? | Àlex R. Atrio et.al. | 2410.04147 | null |
2024-10-05 | Beyond Imperfections: A Conditional Inpainting Approach for End-to-End Artifact Removal in VTON and Pose Transfer | Aref Tabatabaei et.al. | 2410.04052 | null |
2024-10-04 | LANTERN: Accelerating Visual Autoregressive Models with Relaxed Speculative Decoding | Doohyuk Jang et.al. | 2410.03355 | null |
2024-10-04 | CLOVE: Travelling Salesman's approach to hyperbolic embeddings of complex networks with communities | Sámuel G. Balogh et.al. | 2410.03270 | null |
2024-10-04 | Parallel Corpus Augmentation using Masked Language Models | Vibhuti Kumari et.al. | 2410.03194 | null |
2024-10-04 | ECHOPulse: ECG controlled echocardio-grams video generation | Yiwei Li et.al. | 2410.03143 | link |
2024-10-03 | Diffusion-based Extreme Image Compression with Compressed Feature Initialization | Zhiyuan Li et.al. | 2410.02640 | link |
2024-10-03 | An Improved Variational Method for Image Denoising | Jing-En Huang et.al. | 2410.02587 | null |
2024-10-03 | Combining Pre- and Post-Demosaicking Noise Removal for RAW Video | Marco Sánchez-Beeckman et.al. | 2410.02572 | null |
2024-10-03 | Dog-IQA: Standard-guided Zero-shot MLLM for Mix-grained Image Quality Assessment | Kai Liu et.al. | 2410.02505 | link |
2024-10-03 | Eliminating Oversaturation and Artifacts of High Guidance Scales in Diffusion Models | Seyedmorteza Sadat et.al. | 2410.02416 | null |
2024-10-03 | Morphological evaluation of subwords vocabulary used by BETO language model | Óscar García-Sierra et.al. | 2410.02283 | null |
2024-10-03 | SC-CDM: Enhancing Quality of Image Semantic Communication with a Compact Diffusion Model | Kexin Zhang et.al. | 2410.02121 | null |
2024-10-02 | DisEnvisioner: Disentangled and Enriched Visual Prompt for Customized Image Generation | Jing He et.al. | 2410.02067 | null |
2024-10-02 | Impact of White-Box Adversarial Attacks on Convolutional Neural Networks | Rakesh Podder et.al. | 2410.02043 | null |
2024-10-02 | Social Media Authentication and Combating Deepfakes using Semi-fragile Invisible Image Watermarking | Aakash Varma Nadimpalli et.al. | 2410.01906 | null |
2024-10-02 | Enhancing LLM Fine-tuning for Text-to-SQLs by SQL Quality Measurement | Shouvon Sarker et.al. | 2410.01869 | null |
2024-10-02 | ComfyGen: Prompt-Adaptive Workflows for Text-to-Image Generation | Rinon Gal et.al. | 2410.01731 | null |
2024-10-04 | HarmoniCa: Harmonizing Training and Inference for Better Feature Cache in Diffusion Transformer Acceleration | Yushi Huang et.al. | 2410.01723 | null |
2024-10-02 | Accelerating Auto-regressive Text-to-Image Generation with Training-free Speculative Jacobi Decoding | Yao Teng et.al. | 2410.01699 | link |
2024-10-02 | SAFE: Semantic Adaptive Feature Extraction with Rate Control for 6G Wireless Communications | Yuna Yan et.al. | 2410.01597 | null |
2024-10-02 | Frequency-Dependent F-Numbers Suppress Grating Lobes and Improve the Lateral Resolution in Line-by-Line Scanning | Martin F. Schiffner et.al. | 2410.01593 | null |
2024-10-02 | Imaging foundation model for universal enhancement of non-ideal measurement CT | Yuxin Liu et.al. | 2410.01591 | link |
2024-10-02 | HARMONI at ELT: tolerance analysis and expected as-build imaging performance of the infrared spectrograph | Eduard Muslimov et.al. | 2410.01581 | null |
2024-10-02 | Adaptive Radiofrequency Shimming in MRI using Reconfigurable Dielectric Materials | Paulina Šiurytė et.al. | 2410.01501 | null |
2024-10-02 | Quo Vadis RankList-based System in Face Recognition? | Xinyi Zhang et.al. | 2410.01498 | null |
2024-10-02 | Design of a custom wideband camera for MISTRAL imager-spectrograph | Eduard Muslimov et.al. | 2410.01414 | null |
2024-10-02 | CSIM: A Copula-based similarity index sensitive to local changes for Image quality assessment | Safouane El Ghazouali et.al. | 2410.01411 | link |
2024-10-01 | Generating Seamless Virtual Immunohistochemical Whole Slide Images with Content and Color Consistency | Sitong Liu et.al. | 2410.01072 | null |
2024-10-01 | LaDTalk: Latent Denoising for Synthesizing Talking Head Videos with High Frequency Details | Jian Yang et.al. | 2410.00990 | null |
2024-10-01 | Energy-Quality-aware Variable Framerate Pareto-Front for Adaptive Video Streaming | Prajit T Rajendran et.al. | 2410.00849 | null |
2024-10-01 | Maximum entropy and quantized metric models for absolute category ratings | Dietmar Saupe et.al. | 2410.00817 | null |
2024-10-01 | Basis function compression for field probe monitoring | Paul Dubovan et.al. | 2410.00754 | null |
2024-10-01 | Development of the normalization method for the first large field-of-view plastic-based PET Modular scanner | A. Coussat et.al. | 2410.00669 | null |
2024-10-01 | Contribution of soundscape appropriateness to soundscape quality assessment in space: a mediating variable affecting acoustic comfort | Xinhao Yang et.al. | 2410.00667 | null |
2024-10-01 | AutoTM 2.0: Automatic Topic Modeling Framework for Documents Analysis | Maria Khodorchenko et.al. | 2410.00655 | null |
2024-10-01 | Dynamic and Scalable Data Preparation for Object-Centric Process Mining | Lien Bosmans et.al. | 2410.00596 | null |
2024-09-30 | UIR-LoRA: Achieving Universal Image Restoration through Multiple Low-Rank Adaptation | Cheng Zhang et.al. | 2409.20197 | link |
2024-09-30 | Segmenting Wood Rot using Computer Vision Models | Roland Kammerbauer et.al. | 2409.20137 | null |
2024-09-30 | Machine Learning in Industrial Quality Control of Glass Bottle Prints | Maximilian Bundscherer et.al. | 2409.20132 | null |
2024-09-30 | Q-Bench-Video: Benchmarking the Video Quality Understanding of LMMs | Zicheng Zhang et.al. | 2409.20063 | null |
2024-09-30 | Is Preference Alignment Always the Best Option to Enhance LLM-Based Translation? An Empirical Analysis | Hippolyte Gisserot-Boukhlef et.al. | 2409.20059 | null |
2024-10-01 | UniSumEval: Towards Unified, Fine-Grained, Multi-Dimensional Summarization Evaluation for LLMs | Yuho Lee et.al. | 2409.19898 | link |
2024-09-29 | OrganiQ: Mitigating Classical Resource Bottlenecks of Quantum Generative Adversarial Networks on NISQ-Era Machines | Daniel Silver et.al. | 2409.19823 | null |
2024-09-29 | SemiDDM-Weather: A Semi-supervised Learning Framework for All-in-one Adverse Weather Removal | Fang Long et.al. | 2409.19679 | link |
2024-09-29 | Effective Diffusion Transformer Architecture for Image Super-Resolution | Kun Cheng et.al. | 2409.19589 | link |
2024-09-29 | High Quality Human Image Animation using Regional Supervision and Motion Blur Condition | Zhongcong Xu et.al. | 2409.19580 | null |
2024-09-27 | A comprehensive review and new taxonomy on superpixel segmentation | I. B. Barcelos et.al. | 2409.19179 | link |
2024-09-27 | Multimodal Pragmatic Jailbreak on Text-to-image Models | Tong Liu et.al. | 2409.19149 | null |
2024-09-27 | ReviveDiff: A Universal Diffusion Model for Restoring Images in Adverse Weather Conditions | Wenfeng Huang et.al. | 2409.18932 | null |
2024-09-27 | Unsupervised Low-light Image Enhancement with Lookup Tables and Diffusion Priors | Yunlong Lin et.al. | 2409.18899 | null |
2024-09-27 | Effectiveness of learning-based image codecs on fingerprint storage | Daniele Mari et.al. | 2409.18730 | link |
2024-09-27 | Decoding Complexity-Rate-Quality Pareto-Front for Adaptive VVC Streaming | Angeliki Katsenou et.al. | 2409.18713 | null |
2024-09-27 | Align | Hongzhe Huang et.al. | 2409.18541 | link |
2024-09-27 | Underwater Image Enhancement with Physical-based Denoising Diffusion Implicit Models | Nguyen Gia Bach et.al. | 2409.18476 | link |
2024-09-27 | GenesisTex2: Stable, Consistent and High-Quality Text-to-Texture Generation | Jiawei Lu et.al. | 2409.18401 | null |
2024-09-27 | SinoSynth: A Physics-based Domain Randomization Approach for Generalizable CBCT Image Enhancement | Yunkui Pang et.al. | 2409.18355 | link |
2024-09-26 | FlowTurbo: Towards Real-time Flow-Based Image Generation with Velocity Refiner | Wenliang Zhao et.al. | 2409.18128 | link |
2024-09-26 | Low Photon Number Non-Invasive Imaging Through Time-Varying Diffusers | Adrian Makowski et.al. | 2409.18072 | null |
2024-09-26 | LightAvatar: Efficient Head Avatar as Dynamic Neural Light Field | Huan Wang et.al. | 2409.18057 | link |
2024-09-26 | MARS: Multi-radio Architecture with Radio Selection using Decision Trees for emerging mesoscale CPS/IoT applications | Jothi Prasanna Shanmuga Sundaram et.al. | 2409.18043 | null |
2024-09-26 | PhoCoLens: Photorealistic and Consistent Reconstruction in Lensless Imaging | Xin Cai et.al. | 2409.17996 | null |
2024-09-26 | Resolving Multi-Condition Confusion for Finetuning-Free Personalized Image Generation | Qihan Huang et.al. | 2409.17920 | link |
2024-09-26 | Cross-lingual Human-Preference Alignment for Neural Machine Translation with Direct Quality Optimization | Kaden Uhlig et.al. | 2409.17673 | null |
2024-09-26 | FlowMAC: Conditional Flow Matching for Audio Coding at Low Bit Rates | Nicola Pia et.al. | 2409.17635 | null |
2024-09-26 | Pixel-Space Post-Training of Latent Diffusion Models | Christina Zhang et.al. | 2409.17565 | null |
2024-09-26 | Study of Subjective and Objective Quality in Super-Resolution Enhanced Broadcast Images on a Novel SR-IQA Dataset | Yongrok Kim et.al. | 2409.17451 | null |
2024-09-25 | DreamWaltz-G: Expressive 3D Gaussian Avatars from Skeleton-Guided 2D Diffusion | Yukun Huang et.al. | 2409.17145 | link |
2024-09-25 | Text2CAD: Generating Sequential CAD Models from Beginner-to-Expert Level Text Prompts | Mohammad Sadil Khan et.al. | 2409.17106 | link |
2024-09-25 | Language-oriented Semantic Communication for Image Transmission with Fine-Tuned Diffusion Model | Xinfeng Wei et.al. | 2409.17104 | null |
2024-09-25 | The effect of image quality on galaxy merger identification with deep learning | Robert W. Bickley et.al. | 2409.17081 | null |
2024-09-25 | Degradation-Guided One-Step Image Super-Resolution with Diffusion Priors | Aiping Zhang et.al. | 2409.17058 | link |
2024-09-25 | MaViLS, a Benchmark Dataset for Video-to-Slide Alignment, Assessing Baseline Accuracy with a Multimodal Alignment Algorithm Leveraging Speech, OCR, and Visual Features | Katharina Anderer et.al. | 2409.16765 | link |
2024-09-25 | Pix2Next: Leveraging Vision Foundation Models for RGB to NIR Image Translation | Youngwan Jin et.al. | 2409.16706 | null |
2024-09-25 | In which fields can ChatGPT detect journal article quality? An evaluation of REF2021 results | Mike Thelwall et.al. | 2409.16695 | null |
2024-09-25 | Morphological-consistent Diffusion Network for Ultrasound Coronal Image Enhancement | Yihao Zhou et.al. | 2409.16661 | null |
2024-09-25 | Pre-trained Language Models Return Distinguishable Probability Distributions to Unfaithfully Hallucinated Texts | Taehun Cha et.al. | 2409.16658 | link |
2024-09-25 | Enabling Auditory Large Language Models for Automatic Speech Quality Evaluation | Siyin Wang et.al. | 2409.16644 | null |
2024-09-25 | DeformStream: Deformation-based Adaptive Volumetric Video Streaming | Boyan Li et.al. | 2409.16615 | null |
2024-09-25 | Prompt Sliders for Fine-Grained Control, Editing and Erasing of Concepts in Diffusion Models | Deepak Sridhar et.al. | 2409.16535 | link |
2024-09-24 | Low Latency Point Cloud Rendering with Learned Splatting | Yueyu Hu et.al. | 2409.16504 | link |
2024-09-24 | A Unified Hallucination Mitigation Framework for Large Vision-Language Models | Yue Chang et.al. | 2409.16494 | link |
2024-09-24 | AIM 2024 Challenge on UHD Blind Photo Quality Assessment | Vlad Hosu et.al. | 2409.16271 | null |
2024-09-26 | Enhanced Unsupervised Image-to-Image Translation Using Contrastive Learning and Histogram of Oriented Gradients | Wanchen Zhao et.al. | 2409.16042 | null |
2024-09-24 | Deep chroma compression of tone-mapped images | Xenios Milidonis et.al. | 2409.16032 | link |
2024-09-24 | VascX Models: Model Ensembles for Retinal Vascular Analysis from Color Fundus Images | Jose Vargas Quiros et.al. | 2409.16016 | link |
2024-09-24 | Semantics-Controlled Gaussian Splatting for Outdoor Scene Reconstruction and Rendering in Virtual Reality | Hannah Schieber et.al. | 2409.15959 | null |
2024-09-24 | Unsupervised dMRI Artifact Detection via Angular Resolution Enhancement and Cycle Consistency Learning | Sheng Chen et.al. | 2409.15883 | null |
2024-09-25 | Ring Artifacts Removal Based on Implicit Neural Representation of Sinogram Data | Ligen Shi et.al. | 2409.15731 | null |
2024-09-23 | Blind Localization of Early Room Reflections with Arbitrary Microphone Array | Yogev Hadadi et.al. | 2409.15484 | null |
2024-09-23 | Simplifying Triangle Meshes in the Wild | Hsueh-Ti Derek Liu et.al. | 2409.15458 | null |
2024-09-23 | MIMAFace: Face Animation via Motion-Identity Modulated Appearance Feature Learning | Yue Han et.al. | 2409.15179 | null |
2024-09-23 | Advancing Video Quality Assessment for AIGC | Xinli Yue et.al. | 2409.14888 | null |
2024-09-23 | Revisiting Video Quality Assessment from the Perspective of Generalization | Xinli Yue et.al. | 2409.14847 | link |
2024-09-23 | AIM 2024 Challenge on Video Saliency Prediction: Methods and Results | Andrey Moskalenko et.al. | 2409.14827 | link |
2024-09-23 | HiFi-Glot: Neural Formant Synthesis with Differentiable Resonant Filters | Lauri Juvela et.al. | 2409.14823 | null |
2024-09-22 | Robust Audio-Visual Speech Enhancement: Correcting Misassignments in Complex Environments with Advanced Post-Processing | Wenze Ren et.al. | 2409.14554 | null |
2024-09-22 | Improved direction of arrival estimations with a wearable microphone array for dynamic environments by reliability weighting | Daniel A. Mitchell et.al. | 2409.14346 | null |
2024-09-22 | MQM-APE: Toward High-Quality Error Annotation Predictors with Automatic Post-Editing in LLM Translation Evaluators | Qingyu Lu et.al. | 2409.14335 | link |
2024-09-22 | Quantitative and Qualitative Evaluation of NLM and Wavelet Methods in Image Enhancement | Cameron Khanpour et.al. | 2409.14334 | null |
2024-09-21 | JVID: Joint Video-Image Diffusion for Visual-Quality and Temporal-Consistency in Video Generation | Hadrien Reynaud et.al. | 2409.14149 | null |
2024-09-21 | N-Version Assessment and Enhancement of Generative AI | Marcus Kessel et.al. | 2409.14071 | null |
2024-09-18 | An Efficient Projection-Based Next-best-view Planning Framework for Reconstruction of Unknown Objects | Zhizhou Jia et.al. | 2409.12096 | null |
2024-09-18 | Dense-TSNet: Dense Connected Two-Stage Structure for Ultra-Lightweight Speech Enhancement | Zizhen Lin et.al. | 2409.11725 | null |
2024-09-18 | DAF-Net: A Dual-Branch Feature Decomposition Fusion Network with Domain Adaptive for Infrared and Visible Image Fusion | Jian Xu et.al. | 2409.11642 | link |
2024-09-17 | Noise-aware Dynamic Image Denoising and Positron Range Correction for Rubidium-82 Cardiac PET Imaging via Self-supervision | Huidong Xie et.al. | 2409.11543 | null |
2024-09-17 | Online 4D Ultrasound-Guided Robotic Tracking Enables 3D Ultrasound Localisation Microscopy with Large Tissue Displacements | Jipeng Yan et.al. | 2409.11391 | null |
2024-09-17 | Ultrasound Image Enhancement with the Variance of Diffusion Models | Yuxin Zhang et.al. | 2409.11380 | link |
2024-09-17 | Uncertainty and Prediction Quality Estimation for Semantic Segmentation via Graph Neural Networks | Edgar Heinert et.al. | 2409.11373 | link |
2024-09-17 | Edge-based Denoising Image Compression | Ryugo Morita et.al. | 2409.10978 | null |
2024-09-17 | CUNSB-RFIE: Context-aware Unpaired Neural Schrödinger Bridge in Retinal Fundus Image Enhancement | Xuanzhao Dong et.al. | 2409.10966 | link |
2024-09-17 | Towards Effective User Attribution for Latent Diffusion Models via Watermark-Informed Blending | Yongyang Pan et.al. | 2409.10958 | null |
2024-09-17 | Neural Fields for Adaptive Photoacoustic Computed Tomography | Tianao Li et.al. | 2409.10876 | link |
2024-09-16 | Investigating Training Objectives for Generative Speech Enhancement | Julius Richter et.al. | 2409.10753 | link |
2024-09-16 | Taming Diffusion Models for Image Restoration: A Review | Ziwei Luo et.al. | 2409.10353 | null |
2024-09-16 | FGR-Net: Interpretable fundus image gradeability classification based on deep reconstruction learning | Saif Khalid et.al. | 2409.10246 | null |
2024-09-16 | RF-GML: Reference-Free Generative Machine Listener | Arijit Biswas et.al. | 2409.10210 | null |
2024-09-16 | Towards Explainable Automated Data Quality Enhancement without Domain Knowledge | Djibril Sarr et.al. | 2409.10139 | null |
2024-09-16 | 2S-ODIS: Two-Stage Omni-Directional Image Synthesis by Geometric Distortion Correction | Atsuya Nakata et.al. | 2409.09969 | link |
2024-09-15 | A Global Perspective on the Past, Present, and Future of Video Streaming over Starlink | Liz Izhikevich et.al. | 2409.09846 | null |
2024-09-15 | Underwater Image Enhancement via Dehazing and Color Restoration | Chengqin Wu et.al. | 2409.09779 | null |
2024-09-15 | High Definition Map Mapping and Update: A General Overview and Future Directions | Benny Wijaya et.al. | 2409.09726 | null |
2024-09-15 | Superconducting and low temperature RF Coils for Ultra-Low-Field MRI: A Study on SNR Performance | Aditya A Bhosale et.al. | 2409.09608 | null |
2024-09-14 | Estimating Neural Orientation Distribution Fields on High Resolution Diffusion MRI Scans | Mohammed Munzer Dwedari et.al. | 2409.09387 | link |
2024-09-13 | Emerging Reliance Behaviors in Human-AI Text Generation: Hallucinations, Data Quality Assessment, and Cognitive Forcing Functions | Zahra Ashktorab et.al. | 2409.08937 | null |
2024-09-13 | Confocal Raman Microscopy with Adaptive Optics | J. D. Munoz-Bolanos et.al. | 2409.08725 | null |
2024-09-13 | Joint image reconstruction and segmentation of real-time cardiac MRI in free-breathing using a model based on disentangled representation learning | Tobias Wech et.al. | 2409.08619 | null |
2024-09-13 | DiffFAS: Face Anti-Spoofing via Generative Diffusion Models | Xinxu Ge et.al. | 2409.08572 | link |
2024-09-13 | CasDyF-Net: Image Dehazing via Cascaded Dynamic Filters | Wang Yinglong et.al. | 2409.08510 | link |
2024-09-12 | OpenACE: An Open Benchmark for Evaluating Audio Coding Performance | Jozef Coldenhoff et.al. | 2409.08374 | link |
2024-09-12 | Expansive Supervision for Neural Radiance Field | Weixiang Zhang et.al. | 2409.08056 | null |
2024-09-12 | OCTAMamba: A State-Space Model Approach for Precision OCTA Vasculature Segmentation | Shun Zou et.al. | 2409.08000 | link |
2024-09-14 | Exploring Kolmogorov-Arnold networks for realistic image sharpness assessment | Shaode Yu et.al. | 2409.07762 | null |
2024-09-11 | Foundation Models Boost Low-Level Perceptual Similarity Metrics | Abhijay Ghildyal et.al. | 2409.07650 | link |
2024-09-11 | Machine Learning and Constraint Programming for Efficient Healthcare Scheduling | Aymen Ben Said et.al. | 2409.07547 | null |
2024-09-11 | FreeEnhance: Tuning-Free Image Enhancement via Content-Consistent Noising-and-Denoising Process | Yang Luo et.al. | 2409.07451 | null |
2024-09-11 | EMOdiffhead: Continuously Emotional Control in Talking Head Generation via Diffusion | Jian Zhang et.al. | 2409.07255 | link |
2024-09-12 | 3DGCQA: A Quality Assessment Database for 3D AI-Generated Contents | Yingjie Zhou et.al. | 2409.07236 | link |
2024-09-11 | Phantom-based gradient waveform measurements with compensated variable-prephasing: Description and application to EPI at 7T | Hannah Scholten et.al. | 2409.07203 | null |
2024-09-11 | Attention Down-Sampling Transformer, Relative Ranking and Self-Consistency for Blind Image Quality Assessment | Mohammed Alsaafin et.al. | 2409.07115 | link |
2024-09-11 | CPSample: Classifier Protected Sampling for Guarding Training Data During Diffusion | Joshua Kazdan et.al. | 2409.07025 | null |
2024-09-11 | AdvLogo: Adversarial Patch Attack against Object Detectors based on Diffusion Models | Boming Miao et.al. | 2409.07002 | null |
2024-09-10 | ExIQA: Explainable Image Quality Assessment Using Distortion Attributes | Sepehr Kazemi Ranjbar et.al. | 2409.06853 | null |
2024-09-10 | Universal End-to-End Neural Network for Lossy Image Compression | Bouzid Arezki et.al. | 2409.06586 | null |
2024-09-10 | Three-dimensional generative adversarial networks for turbulent flow estimation from wall measurements | Antonio Cuéllar et.al. | 2409.06548 | null |
2024-09-11 | AMNS: Attention-Weighted Selective Mask and Noise Label Suppression for Text-to-Image Person Retrieval | Runqing Zhang et.al. | 2409.06385 | null |
2024-09-10 | Multi-Weather Image Restoration via Histogram-Based Transformer Feature Enhancement | Yang Wen et.al. | 2409.06334 | null |
2024-09-10 | DeWinder: Single-Channel Wind Noise Reduction using Ultrasound Sensing | Kuang Yuan et.al. | 2409.06137 | null |
2024-09-09 | Enhancing Cross-Modality Synthesis: Subvolume Merging for MRI-to-CT Conversion | Fuxin Fan et.al. | 2409.05982 | null |
2024-09-09 | SynMorph: Generating Synthetic Face Morphing Dataset with Mated Samples | Haoyu Zhang et.al. | 2409.05595 | null |
2024-09-09 | Efficient Quality Estimation of True Random Bit-streams | Cesare Caratozzolo et.al. | 2409.05543 | null |
2024-09-09 | Exploring Rich Subjective Quality Information for Image Quality Assessment in the Wild | Xiongkuo Min et.al. | 2409.05540 | null |
2024-09-09 | A Taxonomy of Miscompressions: Preparing Image Forensics for Neural Compression | Nora Hofer et.al. | 2409.05490 | null |
2024-09-09 | Boosting CLIP Adaptation for Image Quality Assessment via Meta-Prompt Learning and Gradient Regularization | Xudong Li et.al. | 2409.05381 | null |
2024-09-09 | PersonaTalk: Bring Attention to Your Persona in Visual Dubbing | Longhao Zhang et.al. | 2409.05379 | null |
2024-09-09 | BigCodec: Pushing the Limits of Low-Bitrate Neural Speech Codec | Detai Xin et.al. | 2409.05377 | link |
2024-09-09 | Adaptive Offloading and Enhancement for Low-Light Video Analytics on Mobile Devices | Yuanyi He et.al. | 2409.05297 | null |
2024-09-08 | Ultron: Enabling Temporal Geometry Compression of 3D Mesh Sequences using Temporal Correspondence and Mesh Deformation | Haichao Zhu et.al. | 2409.05151 | null |
2024-09-07 | Plug-and-Hide: Provable and Adjustable Diffusion Generative Steganography | Jiahao Zhu et.al. | 2409.04878 | null |
2024-09-07 | Metadata augmented deep neural networks for wild animal classification | Aslak Tøn et.al. | 2409.04825 | link |
2024-09-11 | Fisheye-GS: Lightweight and Extensible Gaussian Splatting Module for Fisheye Cameras | Zimu Liao et.al. | 2409.04751 | link |
2024-09-06 | Whole Heart Perfusion with High-Multiband Simultaneous Multislice Imaging via Linear Phase Modulated Extended Field of View (SMILE) | Shen Zhao et.al. | 2409.04353 | link |
2024-09-06 | Design and Characterization of MRI-compatible Plastic Ultrasonic Motor | Zhanyue Zhao et.al. | 2409.04006 | null |
2024-09-06 | Bi-modality Images Transfer with a Discrete Process Matching Method | Zhe Xiong et.al. | 2409.03977 | null |
2024-09-03 | Applications and Advances of Artificial Intelligence in Music Generation: A Review | Yanxu Chen et.al. | 2409.03715 | null |
2024-09-05 | Enabling Practical and Privacy-Preserving Image Processing | Chao Wang et.al. | 2409.03568 | null |
2024-09-05 | Use of triplet loss for facial restoration in low-resolution images | Sebastian Pulgar et.al. | 2409.03530 | null |
2024-09-05 | Improving Uncertainty-Error Correspondence in Deep Bayesian Medical Image Segmentation | Prerak Mody et.al. | 2409.03470 | link |
2024-09-05 | Multiple weather images restoration using the task transformer and adaptive mixup strategy | Yang Wen et.al. | 2409.03249 | null |
2024-09-05 | Perceptual-Distortion Balanced Image Super-Resolution is a Multi-Objective Optimization Problem | Qiwen Zhu et.al. | 2409.03179 | link |
2024-09-05 | Large Étendue 3D Holographic Display with Content-adaptive Dynamic Fourier Modulation | Brian Chao et.al. | 2409.03143 | null |
2024-09-04 | Incorporating dense metric depth into neural 3D representations for view synthesis and relighting | Arkadeep Narayan Chaudhury et.al. | 2409.03061 | null |
2024-09-04 | Rate-Adaptive Generative Semantic Communication Using Conditional Diffusion Models | Pujing Yang et.al. | 2409.02597 | null |
2024-09-04 | Coral Model Generation from Single Images for Virtual Reality Applications | Jie Fu et.al. | 2409.02376 | null |
2024-09-04 | Image Registration with Averaging Network and Edge-Based Loss for Low-SNR Cardiac MRI | Xuan Lei et.al. | 2409.02348 | null |
2024-09-03 | Coaching a Robotic Sonographer: Learning Robotic Ultrasound with Sparse Expert's Feedback | Deepak Raina et.al. | 2409.02337 | null |
2024-09-03 | Unveiling Deep Shadows: A Survey on Image and Video Shadow Detection, Removal, and Generation in the Era of Deep Learning | Xiaowei Hu et.al. | 2409.02108 | link |
2024-09-03 | AllWeatherNet: Unified Image enhancement for autonomous driving under adverse weather and lowlight-conditions | Chenghao Qian et.al. | 2409.02045 | link |
2024-09-03 | Map-Assisted Remote-Sensing Image Compression at Extremely Low Bitrates | Yixuan Ye et.al. | 2409.01935 | link |
2024-09-03 | UWStereo: A Large Synthetic Dataset for Underwater Stereo Matching | Qingxuan Lv et.al. | 2409.01782 | null |
2024-09-03 | Boron Isotope Effects on Raman Scattering in Bulk BN, BP, and BAs: A Density-Functional Theory Study | Nima Ghafari Cherati et.al. | 2409.01671 | null |
2024-09-03 | GaussianPU: A Hybrid 2D-3D Upsampling Framework for Enhancing Color Point Clouds via 3D Gaussian Splatting | Zixuan Guo et.al. | 2409.01581 | null |
2024-09-03 | Learning Task-Specific Sampling Strategy for Sparse-View CT Reconstruction | Liutao Yang et.al. | 2409.01544 | null |
2024-09-03 | Long-Range Biometric Identification in Real World Scenarios: A Comprehensive Evaluation Framework Based on Missions | Deniz Aykac et.al. | 2409.01540 | null |
2024-09-02 | Real-Time Multi-Scene Visibility Enhancement for Promoting Navigational Safety of Vessels Under Complex Weather Conditions | Ryan Wen Liu et.al. | 2409.01500 | link |
2024-09-02 | Spectron: Target Speaker Extraction using Conditional Transformer with Adversarial Refinement | Tathagata Bandyopadhyay et.al. | 2409.01352 | link |
2024-09-02 | A Roadmap to Holographic Focused Ultrasound Approaches to Generate Thermal Patterns | Ceren Cengiz et.al. | 2409.01323 | null |
2024-09-02 | Investigation of the spatial resolution of PET imaging system measuring polarization-correlated Compton events | Ana Marija Kožuljević et.al. | 2409.01238 | null |
2024-09-02 | MobileIQA: Exploiting Mobile-level Diverse Opinion Network For No-Reference Image Quality Assessment Using Knowledge Distillation | Zewen Chen et.al. | 2409.01212 | link |
2024-09-02 | Generating Synthetic Satellite Imagery for Rare Objects: An Empirical Comparison of Models and Metrics | Tuong Vy Nguyen et.al. | 2409.01138 | null |
2024-09-02 | Rapid GPU-Based Pangenome Graph Layout | Jiajie Li et.al. | 2409.00876 | null |
2024-09-01 | An Optimized Binning and Probabilistic Slice Sharing Algorithm for Motion Correction in Abdominal DW-MRI | Michelle Su et.al. | 2409.00798 | null |
2024-08-30 | Subspace Diffusion Posterior Sampling for Travel-Time Tomography | Xiang Cao et.al. | 2408.17333 | null |
2024-08-30 | Image-Perfect Imperfections: Safety, Bias, and Authenticity in the Shadow of Text-To-Image Model Evolution | Yixin Wu et.al. | 2408.17285 | null |
2024-08-30 | LAR-IQA: A Lightweight, Accurate, and Robust No-Reference Image Quality Assessment Model | Nasim Jamshidi Avanaki et.al. | 2408.17057 | link |
2024-08-30 | Efficient Camera Exposure Control for Visual Odometry via Deep Reinforcement Learning | Shuyang Zhang et.al. | 2408.17005 | link |
2024-08-29 | Legacy Learning Using Few-Shot Font Generation Models for Automatic Text Design in Metaverse Content: Cases Studies in Korean and Chinese | Younghwi Kim et.al. | 2408.16900 | null |
2024-08-29 | The Continuous Electron Beam Accelerator Facility at 12 GeV | P. A. Adderley et.al. | 2408.16880 | null |
2024-08-29 | MSLIQA: Enhancing Learning Representations for Image Quality Assessment through Multi-Scale Learning | Nasim Jamshidi Avanaki et.al. | 2408.16879 | null |
2024-09-04 | Auto-resolving atomic structure at van der Waal interfaces using a generative model | Wenqiang Huang et.al. | 2408.16802 | link |
2024-09-02 | RLCP: A Reinforcement Learning-based Copyright Protection Method for Text-to-Image Diffusion Model | Zhuan Shi et.al. | 2408.16634 | null |
2024-09-02 | A Deep-Learning-Based Label-free No-Reference Image Quality Assessment Metric: Application in Sodium MRI Denoising | Shuaiyu Yuan et.al. | 2408.16481 | null |
2024-08-29 | LMT-GP: Combined Latent Mean-Teacher and Gaussian Process for Semi-supervised Low-light Image Enhancement | Ye Yu et.al. | 2408.16235 | link |
2024-08-28 | TEDRA: Text-based Editing of Dynamic and Photoreal Actors | Basavaraj Sunagad et.al. | 2408.15995 | null |
2024-08-28 | Segmentation-guided Layer-wise Image Vectorization with Gradient Fills | Hengyu Zhou et.al. | 2408.15741 | link |
2024-08-28 | Merging and Splitting Diffusion Paths for Semantically Coherent Panoramas | Fabio Quattrini et.al. | 2408.15660 | link |
2024-08-28 | Avoiding Generative Model Writer's Block With Embedding Nudging | Ali Zand et.al. | 2408.15450 | null |
2024-09-02 | Pitfalls and Outlooks in Using COMET | Vilém Zouhar et.al. | 2408.15366 | link |
2024-08-27 | Histo-Diffusion: A Diffusion Super-Resolution Method for Digital Pathology with Comprehensive Quality Assessment | Xuan Xu et.al. | 2408.15218 | null |
2024-08-27 | CLIP-AGIQA: Boosting the Performance of AI-Generated Image Quality Assessment with CLIP | Zhenchen Tang et.al. | 2408.15098 | null |
2024-08-27 | Towards Real-world Event-guided Low-light Video Enhancement and Deblurring | Taewoo Kim et.al. | 2408.14916 | link |
2024-08-27 | Alfie: Democratising RGBA Image Generation With No $$$ | Fabio Quattrini et.al. | 2408.14826 | link |
2024-08-27 | Sequential-Scanning Dual-Energy CT Imaging Using High Temporal Resolution Image Reconstruction and Error-Compensated Material Basis Image Generation | Qiaoxin Li et.al. | 2408.14754 | null |
2024-08-26 | Gallery-Aware Uncertainty Estimation For Open-Set Face Recognition | Leonid Erlygin et.al. | 2408.14229 | null |
2024-08-27 | SwiftBrush v2: Make Your One-step Diffusion Model Better Than Its Teacher | Trung Dao et.al. | 2408.14176 | link |
2024-08-27 | Improving Water Quality Time-Series Prediction in Hong Kong using Sentinel-2 MSI Data and Google Earth Engine Cloud Computing | Rohin Sood et.al. | 2408.14010 | null |
2024-08-26 | LMM-VQA: Advancing Video Quality Assessment with Large Multimodal Models | Qihang Ge et.al. | 2408.14008 | null |
2024-08-25 | Draw Like an Artist: Complex Scene Generation with Diffusion Model via Composition, Painting, and Retouching | Minghao Liu et.al. | 2408.13858 | null |
2024-08-25 | Guardians of the Machine Translation Meta-Evaluation: Sentinel Metrics Fall In! | Stefano Perrella et.al. | 2408.13831 | link |
2024-08-24 | G3DST: Generalizing 3D Style Transfer with Neural Radiance Fields across Scenes and Styles | Adil Meric et.al. | 2408.13508 | null |
2024-08-23 | ReCon: Reconfiguring Analog Rydberg Atom Quantum Computers for Quantum Generative Adversarial Networks | Nicholas S. DiBrita et.al. | 2408.13389 | link |
2024-08-23 | Re-evaluation of Face Anti-spoofing Algorithm in Post COVID-19 Era Using Mask Based Occlusion Attack | Vaibhav Sundharam et.al. | 2408.13251 | null |
2024-08-23 | ResSR: A Residual Approach to Super-Resolving Multispectral Images | Haley Duba-Sullivan et.al. | 2408.13225 | link |
2024-08-23 | A density ratio framework for evaluating the utility of synthetic data | Thom Benjamin Volker et.al. | 2408.13167 | null |
2024-08-23 | When Diffusion MRI Meets Diffusion Model: A Novel Deep Generative Model for Diffusion MRI Generation | Xi Zhu et.al. | 2408.12897 | null |
2024-08-22 | Variable Stars in M31 Stellar Clusters from the Panchromatic Hubble Andromeda Treasury | Richard Smith et.al. | 2408.12765 | null |
2024-08-22 | Visual Verity in AI-Generated Imagery: Computational Metrics and Human-Centric Analysis | Memoona Aziz et.al. | 2408.12762 | null |
2024-08-22 | Unlocking Intrinsic Fairness in Stable Diffusion | Eunji Kim et.al. | 2408.12692 | null |
2024-08-22 | Developing vocal system impaired patient-aimed voice quality assessment approach using ASR representation-included multiple features | Shaoxiang Dang et.al. | 2408.12279 | null |
2024-08-21 | MBSS-T1: Model-Based Self-Supervised Motion Correction for Robust Cardiac T1 Mapping | Eyal Hanania et.al. | 2408.11992 | null |
2024-08-21 | AIM 2024 Challenge on Compressed Video Quality Assessment: Methods and Results | Maksim Smirnov et.al. | 2408.11982 | link |
2024-08-21 | Estimating Contribution Quality in Online Deliberations Using a Large Language Model | Lodewijk Gelauff et.al. | 2408.11936 | null |
2024-08-21 | FRAP: Faithful and Realistic Text-to-Image Generation with Adaptive Prompt Weighting | Liyao Jiang et.al. | 2408.11706 | null |
2024-08-21 | Interpretable Long-term Action Quality Assessment | Xu Dong et.al. | 2408.11687 | link |
2024-08-21 | E-Bench: Subjective-Aligned Benchmark Suite for Text-Driven Video Editing Quality Assessment | Shangkun Sun et.al. | 2408.11481 | link |
2024-08-21 | Fairness measures for biometric quality assessment | André Dörsch et.al. | 2408.11392 | null |
2024-08-21 | Gender Bias Evaluation in Text-to-image Generation: A Survey | Yankun Wu et.al. | 2408.11358 | null |
2024-08-21 | Image Score: Learning and Evaluating Human Preferences for Mercari Search | Chingis Oinar et.al. | 2408.11349 | null |
2024-08-21 | High-quality imaging of large areas through path-difference ptychography | Jizhe Cui et.al. | 2408.11332 | null |
2024-08-21 | Optimizing Transmit Field Inhomogeneity of Parallel RF Transmit Design in 7T MRI using Deep Learning | Zhengyi Lu et.al. | 2408.11323 | null |
2024-08-21 | Transfer Learning and the Early Estimation of Single-Photon Source Quality using Machine Learning Methods | David Jacob Kedziora et.al. | 2408.11322 | link |
2024-08-20 | Compress Guidance in Conditional Diffusion Sampling | Anh-Dung Dinh et.al. | 2408.11194 | null |
2024-08-20 | Prompt-Guided Image-Adaptive Neural Implicit Lookup Tables for Interpretable Image Enhancement | Satoshi Kosugi et.al. | 2408.11055 | link |
2024-08-20 | Denoising Plane Wave Ultrasound Images Using Diffusion Probabilistic Models | Hojat Asgariandehkordi et.al. | 2408.10987 | null |
2024-08-20 | Influence of Medical Foreign Bodies on Dark-Field Chest Radiographs: First experiences | Lennard Kaster et.al. | 2408.10855 | null |
2024-08-19 | Kubrick: Multimodal Agent Collaborations for Synthetic Video Generation | Liu He et.al. | 2408.10453 | null |
2024-08-19 | Perceptual Depth Quality Assessment of Stereoscopic Omnidirectional Images | Wei Zhou et.al. | 2408.10134 | null |
2024-08-19 | Sliced Maximal Information Coefficient: A Training-Free Approach for Image Quality Assessment Enhancement | Kang Xiao et.al. | 2408.09920 | link |
2024-08-19 | Anim-Director: A Large Multimodal Model Powered Agent for Controllable Animation Video Generation | Yunxin Li et.al. | 2408.09787 | link |
2024-08-21 | Reconstruct Spine CT from Biplanar X-Rays via Diffusion Learning | Zhi Qiao et.al. | 2408.09731 | null |
2024-08-18 | FD2Talk: Towards Generalized Talking Head Generation with Facial Decoupled Diffusion Model | Ziyu Yao et.al. | 2408.09384 | null |
2024-08-17 | Optimal Strip Attitude Command of Earth Observation Satellite using Differential Dynamic Programming | Seungyeop Han et.al. | 2408.09244 | null |
2024-08-16 | Explore Cross-Codec Quality-Rate Convex Hulls Relation for Adaptive Streaming | Masoumeh Farhadi Nia et.al. | 2408.09044 | null |
2024-08-16 | Evaluating the Evaluator: Measuring LLMs' Adherence to Task Evaluation Instructions | Bhuvanashree Murugadoss et.al. | 2408.08781 | null |
2024-08-16 | Speckle Noise Analysis for Synthetic Aperture Radar (SAR) Space Data | Sanjjushri Varshini R et.al. | 2408.08774 | null |
2024-08-16 | Tell Codec What Worth Compressing: Semantically Disentangled Image Coding for Machine with LMMs | Jinming Liu et.al. | 2408.08575 | null |
2024-08-16 | Visual-Friendly Concept Protection via Selective Adversarial Perturbations | Xiaoyue Mi et.al. | 2408.08518 | link |
2024-08-16 | Achieving Complex Image Edits via Function Aggregation with Diffusion Models | Mohammadreza Samadi et.al. | 2408.08495 | null |
2024-08-15 | Level Up Your Tutorials: VLMs for Game Tutorials Quality Assessment | Daniele Rege Cambrin et.al. | 2408.08396 | link |
2024-08-15 | METR: Image Watermarking with Large Number of Unique Messages | Alexander Varlamov et.al. | 2408.08340 | link |
2024-08-15 | Accelerated Image-Aware Generative Diffusion Modeling | Tanmay Asthana et.al. | 2408.08306 | null |
2024-08-15 | Rethinking Medical Anomaly Detection in Brain MRI: An Image Quality Assessment Perspective | Zixuan Pan et.al. | 2408.08228 | link |
2024-08-15 | When Video Coding Meets Multimodal Large Language Models: A Unified Paradigm for Video Coding | Pingping Zhang et.al. | 2408.08093 | null |
2024-08-15 | KGV: Integrating Large Language Models with Knowledge Graphs for Cyber Threat Intelligence Credibility Assessment | Zongzong Wu et.al. | 2408.08088 | null |
2024-08-15 | Conditional Brownian Bridge Diffusion Model for VHR SAR to Optical Image Translation | Seon-Hoon Kim et.al. | 2408.07947 | link |
2024-08-15 | MobileMEF: Fast and Efficient Method for Multi-Exposure Fusion | Lucas Nedel Kirsten et.al. | 2408.07932 | link |
2024-08-14 | New Curriculum, New Chance -- Retrieval Augmented Generation for Lesson Planning in Ugandan Secondary Schools. Prototype Quality Evaluation | Simon Kloker et.al. | 2408.07542 | null |
2024-08-14 | Unsupervised Blind Joint Dereverberation and Room Acoustics Estimation with Diffusion Models | Jean-Marie Lemercier et.al. | 2408.07472 | null |
2024-08-14 | DPSNN: Spiking Neural Network for Low-Latency Streaming Speech Enhancement | Tao Sun et.al. | 2408.07388 | null |
2024-08-13 | Direction of Arrival Correction through Speech Quality Feedback | Caleb Rascon et.al. | 2408.07234 | link |
2024-08-13 | SeLoRA: Self-Expanding Low-Rank Adaptation of Latent Diffusion Model for Medical Image Synthesis | Yuchen Mao et.al. | 2408.07196 | null |
2024-08-13 | BVI-UGC: A Video Quality Database for User-Generated Content Transcoding | Zihao Qi et.al. | 2408.07171 | null |
2024-08-13 | Efficient Deep Model-Based Optoacoustic Image Reconstruction | Christoph Dehner et.al. | 2408.07109 | null |
2024-08-13 | Subjective and Objective Quality Assessment of Rendered Human Avatar Videos in Virtual Reality | Yu-Chih Chen et.al. | 2408.07041 | null |
2024-08-13 | Feature-Preserving Rate-Distortion Optimization in Image Coding for Machines | Samuel Fernández Menduiña et.al. | 2408.07028 | null |
2024-08-13 | Low-Bitwidth Floating Point Quantization for Efficient High-Quality Diffusion Models | Cheng Chen et.al. | 2408.06995 | null |
2024-08-13 | Evaluating Research Quality with Large Language Models: An Analysis of ChatGPT's Effectiveness with Different Settings and Inputs | Mike Thelwall et.al. | 2408.06752 | null |
2024-08-13 | Hybrid SD: Edge-Cloud Collaborative Inference for Stable Diffusion Models | Chenqian Yan et.al. | 2408.06646 | null |
2024-08-13 | Potamoi: Accelerating Neural Rendering via a Unified Streaming Architecture | Yu Feng et.al. | 2408.06608 | null |
2024-08-13 | HDRGS: High Dynamic Range Gaussian Splatting | Jiahao Wu et.al. | 2408.06543 | link |
2024-08-12 | FoVNet: Configurable Field-of-View Speech Enhancement with Low Computation and Distortion for Smart Glasses | Zhongweiyang Xu et.al. | 2408.06468 | null |
2024-08-12 | Palantir: Towards Efficient Super Resolution for Ultra-high-definition Live Streaming | Xinqi Jin et.al. | 2408.06152 | link |
2024-08-12 | A-BDD: Leveraging Data Augmentations for Safe Autonomous Driving in Adverse Weather and Lighting | Felix Assion et.al. | 2408.06071 | null |
2024-08-12 | DiagESC: Dialogue Synthesis for Integrating Depression Diagnosis into Emotional Support Conversation | Seungyeon Seo et.al. | 2408.06044 | link |
2024-08-12 | A Sharpness Based Loss Function for Removing Out-of-Focus Blur | Uditangshu Aurangabadkar et.al. | 2408.06014 | link |
2024-08-12 | A Simple Early Exiting Framework for Accelerated Sampling in Diffusion Models | Taehong Moon et.al. | 2408.05927 | link |
2024-08-12 | Creating Arabic LLM Prompts at Scale | Abdelrahman El-Sheikh et.al. | 2408.05882 | null |
2024-08-11 | LaWa: Using Latent Space for In-Generation Image Watermarking | Ahmad Rezaei et.al. | 2408.05868 | null |
2024-08-14 | Iterative Improvement of an Additively Regularized Topic Model | Alex Gorbulev et.al. | 2408.05840 | null |
2024-08-11 | SSL: A Self-similarity Loss for Improving Generative Image Super-resolution | Du Chen et.al. | 2408.05713 | link |
2024-08-11 | Efficient Diffusion Transformer with Step-wise Dynamic Attention Mediators | Yifan Pu et.al. | 2408.05710 | link |
2024-08-11 | Evaluating BM3D and NBNet: A Comprehensive Study of Image Denoising Across Multiple Datasets | Ghazal Kaviani et.al. | 2408.05697 | null |
2024-08-09 | CBCT scatter correction with dual-layer flat-panel detector | Xin Zhang et.al. | 2408.04943 | null |
2024-08-09 | Self-augmented Gaussian Splatting with Structure-aware Masks for Sparse-view 3D Reconstruction | Lingbei Meng et.al. | 2408.04831 | null |
2024-08-08 | DaedalusData: Exploration, Knowledge Externalization and Labeling of Particles in Medical Manufacturing -- A Design Study | Alexander Wyss et.al. | 2408.04749 | null |
2024-08-08 | Sampling for View Synthesis: From Local Light Field Fusion to Neural Radiance Fields and Beyond | Ravi Ramamoorthi et.al. | 2408.04586 | null |
2024-08-11 | Synchronous Multi-modal Semantic Communication System with Packet-level Coding | Yun Tian et.al. | 2408.04535 | null |
2024-08-08 | Robustness investigation of quality measures for the assessment of machine learning models | Thomas Most et.al. | 2408.04391 | null |
2024-08-08 | SG-JND: Semantic-Guided Just Noticeable Distortion Predictor For Image Compression | Linhan Cao et.al. | 2408.04273 | null |
2024-08-08 | LLDif: Diffusion Models for Low-light Emotion Recognition | Zhifeng Wang et.al. | 2408.04235 | null |
2024-08-07 | Performance and Non-adversarial Robustness of the Segment Anything Model 2 in Surgical Video Segmentation | Yiqing Shen et.al. | 2408.04098 | null |
2024-08-07 | Machine Learning-Based Reward-Driven Tuning of Scanning Probe Microscopy: Towards Fully Automated Microscopy | Yu Liu et.al. | 2408.04055 | null |
2024-08-07 | Global-Local Progressive Integration Network for Blind Image Quality Assessment | Xiaoqi Wang et.al. | 2408.03885 | null |
2024-08-07 | Compact 3D Gaussian Splatting for Static and Dynamic Radiance Fields | Joo Chan Lee et.al. | 2408.03822 | null |
2024-08-07 | Soft-Hard Attention U-Net Model and Benchmark Dataset for Multiscale Image Shadow Removal | Eirini Cholopoulou et.al. | 2408.03734 | null |
2024-08-07 | Monitoring of Hermit Crabs Using drone-captured imagery and Deep Learning based Super-Resolution Reconstruction and Improved YOLOv8 | Fan Zhao et.al. | 2408.03559 | null |
2024-08-07 | D2Styler: Advancing Arbitrary Style Transfer with Discrete Diffusion Methods | Onkar Susladkar et.al. | 2408.03558 | link |
2024-08-07 | PRTGS: Precomputed Radiance Transfer of Gaussian Splats for Real-Time High-Quality Relighting | Yijia Guo et.al. | 2408.03538 | null |
2024-08-06 | Image Quality Transfer of Diffusion MRI Guided By High-Resolution Structural MRI | Alp G. Cicimen et.al. | 2408.03216 | null |
2024-08-06 | Iterative CT Reconstruction via Latent Variable Optimization of Shallow Diffusion Models | Sho Ozaki et.al. | 2408.03156 | null |
2024-08-05 | VidGen-1M: A Large-Scale Dataset for Text-to-video Generation | Zhiyu Tan et.al. | 2408.02629 | null |
2024-08-05 | Cascading Refinement Video Denoising with Uncertainty Adaptivity | Xinyuan Yu et.al. | 2408.02284 | null |
2024-08-04 | PanoFree: Tuning-Free Holistic Multi-view Image Generation with Cross-view Self-Guidance | Aoming Liu et.al. | 2408.02157 | null |
2024-08-06 | RICA2: Rubric-Informed, Calibrated Assessment of Actions | Abrar Majeedi et.al. | 2408.02138 | link |
2024-08-04 | View-consistent Object Removal in Radiance Fields | Yiren Lu et.al. | 2408.02100 | null |
2024-08-04 | Constructing Per-Shot Bitrate Ladders using Visual Information Fidelity | Krishna Srikar Durbha et.al. | 2408.01932 | null |
2024-08-03 | Landmark-guided Diffusion Model for High-fidelity and Temporally Coherent Talking Head Generation | Jintao Tan et.al. | 2408.01732 | null |
2024-08-03 | JambaTalk: Speech-Driven 3D Talking Head Generation Based on Hybrid Transformer-Mamba Model | Farzaneh Jafari et.al. | 2408.01627 | null |
2024-08-02 | Guardians of Image Quality: Benchmarking Defenses Against Adversarial Attacks on Image Quality Metrics | Alexander Gushchin et.al. | 2408.01541 | link |
2024-08-02 | Underwater Object Detection Enhancement via Channel Stabilization | Muhammad Ali et.al. | 2408.01293 | link |
2024-08-02 | Wave-Mamba: Wavelet State Space Model for Ultra-High-Definition Low-Light Image Enhancement | Wenbin Zou et.al. | 2408.01276 | link |
2024-08-02 | Reality Fusion: Robust Real-time Immersive Mobile Robot Teleoperation with Volumetric Visual Data Fusion | Ke Li et.al. | 2408.01225 | link |
2024-08-02 | Validation of an Analysability Model in Hybrid Quantum Software | Díaz-Muñoz Ana et.al. | 2408.01105 | null |
2024-08-06 | FBSDiff: Plug-and-Play Frequency Band Substitution of Diffusion Features for Highly Controllable Text-Driven Image Translation | Xiang Gao et.al. | 2408.00998 | link |
2024-08-01 | SF3D: Stable Fast 3D Mesh Reconstruction with UV-unwrapping and Illumination Disentanglement | Mark Boss et.al. | 2408.00653 | null |
2024-08-01 | Regional quality estimation for echocardiography using deep learning | Gilles Van De Vyver et.al. | 2408.00591 | link |
2024-08-01 | Image Super-Resolution with Taylor Expansion Approximation and Large Field Reception | Jiancong Feng et.al. | 2408.00470 | null |
2024-08-01 | RDP: Ranked Differential Privacy for Facial Feature Protection in Multiscale Sparsified Subspace | Lu Ou et.al. | 2408.00294 | null |
2024-07-31 | Generative Diffusion Model for Seismic Imaging Improvement of Sparsely Acquired Data and Uncertainty Quantification | Xingchen Shi et.al. | 2407.21683 | null |
2024-07-31 | Benchmarking AIGC Video Quality Assessment: A Dataset and Unified Model | Zhichao Zhang et.al. | 2407.21408 | null |
2024-07-31 | An all-sky catalogue of stellar reddening values | E. Paunzen et.al. | 2407.21373 | null |
2024-07-31 | ESIQA: Perceptual Quality Assessment of Vision-Pro-based Egocentric Spatial Images | Xilei Zhu et.al. | 2407.21363 | null |
2024-08-01 | Outlier Detection in Large Radiological Datasets using UMAP | Mohammad Tariqul Islam et.al. | 2407.21263 | link |
2024-07-30 | MP-You: A Web-based MPI Simulation Tool | The-Vinh Tran-Luu et.al. | 2407.21155 | null |
2024-07-30 | Simultaneous Multi-Slice Diffusion Imaging using Navigator-free Multishot Spiral Acquisition | Yuancheng Jiang et.al. | 2407.20904 | null |
2024-07-30 | Highly Efficient No-reference 4K Video Quality Assessment with Full-Pixel Covering Sampling and Training Strategy | Xiaoheng Tan et.al. | 2407.20766 | null |
2024-07-30 | Questionnaires for Everyone: Streamlining Cross-Cultural Questionnaire Adaptation with GPT-Based Translation Quality Evaluation | Otso Haavisto et.al. | 2407.20608 | link |
2024-07-29 | Mean Opinion Score as a New Metric for User-Evaluation of XAI Methods | Hyeon Yu et.al. | 2407.20427 | null |
2024-07-29 | Sun Off, Lights On: Photorealistic Monocular Nighttime Simulation for Robust Semantic Perception | Konstantinos Tzevelekakis et.al. | 2407.20336 | null |
2024-07-29 | DDAP: Dual-Domain Anti-Personalization against Text-to-Image Diffusion Models | Jing Yang et.al. | 2407.20141 | null |
2024-07-29 | HeadsetOff: Enabling Photorealistic Video Conferencing on Economical VR Headsets | Yili Jin et.al. | 2407.19988 | null |
2024-07-29 | Noise-Resilient Unsupervised Graph Representation Learning via Multi-Hop Feature Quality Estimation | Shiyuan Li et.al. | 2407.19944 | null |
2024-07-29 | FreeLong: Training-Free Long Video Generation with SpectralBlend Temporal Attention | Yu Lu et.al. | 2407.19918 | null |
2024-07-29 | ALEN: A Dual-Approach for Uniform and Non-Uniform Low-Light Image Enhancement | Ezequiel Perez-Zarate et.al. | 2407.19708 | link |
2024-07-29 | UNQA: Unified No-Reference Quality Assessment for Audio, Image, Video, and Audio-Visual Content | Yuqin Cao et.al. | 2407.19704 | null |
2024-07-29 | Semi-Supervised Teacher-Reference-Student Architecture for Action Quality Assessment | Wulian Yun et.al. | 2407.19675 | null |
2024-07-28 | X-Fake: Juggling Utility Evaluation and Explanation of Simulated SAR Images | Zhongling Huang et.al. | 2407.19436 | null |
2024-07-27 | Sewer Image Super-Resolution with Depth Priors and Its Lightweight Network | Gang Pan et.al. | 2407.19271 | null |
2024-07-27 | Towards Clean-Label Backdoor Attacks in the Physical World | Thinh Dao et.al. | 2407.19203 | null |
2024-07-26 | Regularized Multi-Decoder Ensemble for an Error-Aware Scene Representation Network | Tianyu Xiong et.al. | 2407.19082 | null |
2024-07-26 | Correcting for objective sample refractive index mismatch in extended field of view selective plane illumination microscopy | Steven J. Sheppard et.al. | 2407.18862 | null |
2024-07-25 | Joint RGB-Spectral Decomposition Model Guided Image Enhancement in Mobile Photography | Kailai Zhou et.al. | 2407.17996 | link |
2024-07-29 | Invariance of deep image quality metrics to affine transformations | Nuria Alabau-Bosque et.al. | 2407.17927 | link |
2024-07-25 | Artificial Immunofluorescence in a Flash: Rapid Synthetic Imaging from Brightfield Through Residual Diffusion | Xiaodan Xing et.al. | 2407.17882 | null |
2024-07-24 | Final Alignment and Image Quality Test for the Acquisition and Guiding System of SOXS | J. A. Araiza-Duran et.al. | 2407.17382 | null |
2024-07-24 | SOXS NIR: Optomechanical integration and alignment, optical performance verification before full instrument assembly | M. Genoni et.al. | 2407.17244 | null |
2024-07-24 | Q-Ground: Image Quality Grounding with Large Multi-modality Models | Chaofeng Chen et.al. | 2407.17035 | link |
2024-07-24 | 3DAttGAN: A 3D Attention-based Generative Adversarial Network for Joint Space-Time Video Super-Resolution | Congrui Fu et.al. | 2407.16965 | link |
2024-07-24 | SAR to Optical Image Translation with Color Supervised Diffusion Model | Xinyu Bai et.al. | 2407.16921 | null |
2024-07-23 | QPT V2: Masked Image Modeling Advances Visual Scoring | Qizhi Xie et.al. | 2407.16541 | link |
2024-07-23 | ToDER: Towards Colonoscopy Depth Estimation and Reconstruction with Geometry Constraint Adaptation | Zhenhua Wu et.al. | 2407.16508 | null |
2024-07-23 | On Differentially Private 3D Medical Image Synthesis with Controllable Latent Diffusion Models | Deniz Daum et.al. | 2407.16405 | link |
2024-07-23 | Improving multidimensional projection quality with user-specific metrics and optimal scaling | Maniru Ibrahim et.al. | 2407.16328 | null |
2024-07-23 | A new visual quality metric for Evaluating the performance of multidimensional projections | Maniru Ibrahim et.al. | 2407.16309 | null |
2024-07-23 | Integrating Meshes and 3D Gaussians for Indoor Scene Reconstruction with SAM Mask Guidance | Jiyeop Kim et.al. | 2407.16173 | null |
2024-07-23 | Fréchet Video Motion Distance: A Metric for Evaluating Motion Consistency in Videos | Jiahe Liu et.al. | 2407.16124 | link |
2024-07-22 | Enhancing Cell Instance Segmentation in Scanning Electron Microscopy Images via a Deep Contour Closing Operator | Florian Robert et.al. | 2407.15817 | null |
2024-07-22 | SS-SFR: Synthetic Scenes Spatial Frequency Response on Virtual KITTI and Degraded Automotive Simulations for Object Detection | Daniel Jakab et.al. | 2407.15646 | null |
2024-07-22 | Experimenting with Adaptive Bitrate Algorithms for Virtual Reality Streaming over Wi-Fi | Ferran Maura et.al. | 2407.15614 | link |
2024-07-22 | SpotDiffusion: A Fast Approach For Seamless Panorama Generation Over Time | Stanislav Frolov et.al. | 2407.15507 | link |
2024-07-22 | Enhancement of 3D Gaussian Splatting using Raw Mesh for Photorealistic Recreation of Architectures | Ruizhe Wang et.al. | 2407.15435 | null |
2024-07-21 | Assessing Sample Quality via the Latent Space of Generative Models | Jingyi Xu et.al. | 2407.15171 | link |
2024-07-20 | Non-Reference Quality Assessment for Medical Imaging: Application to Synthetic Brain MRIs | Karl Van Eeden Risager et.al. | 2407.14994 | null |
2024-07-20 | Deep Learning CT Image Restoration using System Blur and Noise Models | Yijie Yuan et.al. | 2407.14983 | null |
2024-07-20 | GreenStableYolo: Optimizing Inference Time and Image Quality of Text-to-Image Generation | Jingzhi Gong et.al. | 2407.14982 | link |
2024-07-20 | Dual High-Order Total Variation Model for Underwater Image Restoration | Yuemei Li et.al. | 2407.14868 | link |
2024-07-20 | CBCTLiTS: A Synthetic, Paired CBCT/CT Dataset For Segmentation And Style Transfer | Maximilian E. Tschuchnig et.al. | 2407.14853 | null |
2024-07-20 | Realistic Surgical Image Dataset Generation Based On 3D Gaussian Splatting | Tianle Zeng et.al. | 2407.14846 | null |
2024-07-20 | Difflare: Removing Image Lens Flare with Latent Diffusion Model | Tianwen Zhou et.al. | 2407.14746 | link |
2024-07-20 | Polarimetric compressed sensing with hollow, self-assembled diffractive films | Ji Feng et.al. | 2407.14722 | null |
2024-07-19 | A Minibatch Alternating Projections Algorithm for Robust and Efficient Magnitude Least-Squares RF Pulse Design in MRI | Jonathan B. Martin et.al. | 2407.14696 | link |
2024-07-19 | A Benchmark for Gaussian Splatting Compression and Quality Assessment Study | Qi Yang et.al. | 2407.14197 | link |
2024-07-19 | Shape and Style GAN-based Multispectral Data Augmentation for Crop/Weed Segmentation in Precision Farming | Mulham Fawakherji et.al. | 2407.14119 | null |
2024-07-19 | DirectL: Efficient Radiance Fields Rendering for 3D Light Field Displays | Zongyuan Yang et.al. | 2407.14053 | null |
2024-07-19 | Personalized Privacy Protection Mask Against Unauthorized Facial Recognition | Ka-Ho Chow et.al. | 2407.13975 | link |
2024-07-18 | Streetscapes: Large-scale Consistent Street View Generation Using Autoregressive Video Diffusion | Boyang Deng et.al. | 2407.13759 | null |
2024-07-18 | A Novel Freeform Slicer IFU for the Magellan InfraRed Multi-Object Spectrograph (MIRMOS) | Maren Cosens et.al. | 2407.13747 | null |
2024-07-18 | HazeCLIP: Towards Language Guided Real-World Image Dehazing | Ruiyi Wang et.al. | 2407.13719 | link |
2024-07-18 | Removing cloud shadows from ground-based solar imagery | Amal Chaoui et.al. | 2407.13379 | null |
2024-07-18 | Any Image Restoration with Efficient Automatic Degradation Adaptation | Bin Ren et.al. | 2407.13372 | link |
2024-07-18 | Heterogeneous Clinical Trial Outcomes via Multi-Output Gaussian Processes | Owen Thomas et.al. | 2407.13283 | null |
2024-07-18 | Research on Image Super-Resolution Reconstruction Mechanism based on Convolutional Neural Network | Hao Yan et.al. | 2407.13211 | null |
2024-07-18 | Learned HDR Image Compression for Perceptually Optimal Storage and Display | Peibei Cao et.al. | 2407.13179 | null |
2024-07-18 | Image Inpainting Models are Effective Tools for Instruction-guided Image Editing | Xuan Ju et.al. | 2407.13139 | null |
2024-07-18 | Enhanced Denoising of OCT Images Using Residual U-Net: A Cross-Modality Approach on PSOCT and ASOCT for Clinical Diagnostics | Akkidas Noel Prakasha et.al. | 2407.13090 | null |
2024-07-17 | Hallucination Index: An Image Quality Metric for Generative Reconstruction Models | Matthew Tivnan et.al. | 2407.12780 | null |
2024-07-17 | CoSIGN: Few-Step Guidance of ConSIstency Model to Solve General INverse Problems | Jiankun Zhao et.al. | 2407.12676 | link |
2024-07-17 | High Frequency Matters: Uncertainty Guided Image Compression with Wavelet Diffusion | Juan Song et.al. | 2407.12538 | link |
2024-07-17 | Fast Context-Based Low-Light Image Enhancement via Neural Implicit Representations | Tomáš Chobola et.al. | 2407.12511 | link |
2024-07-17 | Enhancing Film Grain Coding in VVC: Improving Encoding Quality and Efficiency | Vignesh V Menon et.al. | 2407.12465 | null |
2024-07-17 | Voltage-Controlled Magnetoelectric Devices for Neuromorphic Diffusion Process | Yang Cheng et.al. | 2407.12261 | null |
2024-07-16 | Semantic Communication for the Internet of Sounds: Architecture, Design Principles, and Challenges | Chengsi Liang et.al. | 2407.12203 | null |
2024-07-16 | Neural Passage Quality Estimation for Static Pruning | Xuejun Chang et.al. | 2407.12170 | link |
2024-07-16 | MVG-Splatting: Multi-View Guided Gaussian Splatting with Adaptive Quantile-Based Geometric Consistency Densification | Zhuoxiao Li et.al. | 2407.11840 | null |
2024-07-16 | LoFTI: Localization and Factuality Transfer to Indian Locales | Sona Elza Simon et.al. | 2407.11833 | link |
2024-07-16 | XEdgeAI: A Human-centered Industrial Inspection Framework with Data-centric Explainable Edge AI Approach | Truong Thanh Hung Nguyen et.al. | 2407.11771 | link |
2024-07-16 | ITI-IQA: a Toolbox for Heterogeneous Univariate and Multivariate Missing Data Imputation Quality Assessment | Pedro Pons-Suñer et.al. | 2407.11767 | null |
2024-07-16 | Magnetogram-to-Magnetogram: Generative Forecasting of Solar Evolution | Francesco Pio Ramunno et.al. | 2407.11659 | link |
2024-07-16 | ReLaX-VQA: Residual Fragment and Layer Stack Extraction for Enhancing Video Quality Assessment | Xinyi Wang et.al. | 2407.11496 | link |
2024-07-16 | Cover-separable Fixed Neural Network Steganography via Deep Generative Models | Guobiao Li et.al. | 2407.11405 | link |
2024-07-16 | Ev-GS: Event-based Gaussian splatting for Efficient and Accurate Radiance Field Rendering | Jingqian Wu et.al. | 2407.11343 | null |
2024-07-15 | UFQA: Utility guided Fingerphoto Quality Assessment | Amol S. Joshi et.al. | 2407.11141 | null |
2024-07-15 | Foundational Autoraters: Taming Large Language Models for Better Automatic Evaluation | Tu Vu et.al. | 2407.10817 | null |
2024-07-15 | Melon Fruit Detection and Quality Assessment Using Generative AI-Based Image Data Augmentation | Seungri Yoon et.al. | 2407.10413 | null |
2024-07-15 | Exploring the Impact of Moire Pattern on Deepfake Detectors | Razaib Tariq et.al. | 2407.10399 | null |
2024-07-14 | Noise Calibration: Plug-and-play Content-Preserving Video Enhancement using Pre-trained Video Diffusion Models | Qinyu Yang et.al. | 2407.10285 | link |
2024-07-14 | Low Sensitivity Hopsets | Vikrant Ashvinkumar et.al. | 2407.10249 | null |
2024-07-14 | A Novel Approach to Ultrasound Beamforming using Synthetic Transmit Aperture with Low Complexity and High SNR for Medical Imaging | Thenmozhi Elango et.al. | 2407.10242 | null |
2024-07-13 | Asynchronous Feedback Network for Perceptual Point Cloud Quality Assessment | Yujie Zhang et.al. | 2407.09806 | link |
2024-07-12 | Quantum-dot-based Kitaev chains: Majorana quality measures and scaling with increasing chain length | Viktor Svensson et.al. | 2407.09211 | null |
2024-07-12 | HPC: Hierarchical Progressive Coding Framework for Volumetric Video | Zihan Zheng et.al. | 2407.09026 | null |
2024-07-12 | Task-driven single-image super-resolution reconstruction of document scans | Maciej Zyrek et.al. | 2407.08993 | null |
2024-07-12 | LightenDiffusion: Unsupervised Low-Light Image Enhancement with Latent-Retinex Diffusion Models | Hai Jiang et.al. | 2407.08939 | link |
2024-07-12 | 15M Multimodal Facial Image-Text Dataset | Dawei Dai et.al. | 2407.08515 | null |
2024-07-11 | Imitation Learning for Robotic Assisted Ultrasound Examination of Deep Venous Thrombosis using Kernelized Movement Primitives | Diego Dall'Alba et.al. | 2407.08506 | null |
2024-07-11 | E2VIDiff: Perceptual Events-to-Video Reconstruction using Diffusion Priors | Jinxiu Liang et.al. | 2407.08231 | null |
2024-07-11 | Explicit_NeRF_QA: A Quality Assessment Database for Explicit NeRF Model Compression | Yuke Xing et.al. | 2407.08165 | null |
2024-07-10 | Coherent and Multi-modality Image Inpainting via Latent Space Optimization | Lingzhi Pan et.al. | 2407.08019 | link |
2024-07-10 | Intensity-sensitive quality assessment of extended sources in astronomical images | X. Li et.al. | 2407.07863 | link |
2024-07-12 | Towards Adaptive Pseudo-label Learning for Semi-Supervised Temporal Action Localization | Feixiang Zhou et.al. | 2407.07673 | null |
2024-07-10 | Video In-context Learning | Wentao Zhang et.al. | 2407.07356 | null |
2024-07-10 | Pairwise Distance Distillation for Unsupervised Real-World Image Super-Resolution | Yuehan Zhang et.al. | 2407.07302 | link |
2024-07-09 | HAMIL-QA: Hierarchical Approach to Multiple Instance Learning for Atrial LGE MRI Quality Assessment | K M Arefeen Sultan et.al. | 2407.07254 | null |
2024-07-09 | Scaling Up Personalized Aesthetic Assessment via Task Vector Customization | Jooyeol Yun et.al. | 2407.07176 | link |
2024-07-09 | Microsoft Cloud-based Digitization Workflow with Rich Metadata Acquisition for Cultural Heritage Objects | Krzysztof Kutt et.al. | 2407.06972 | null |
2024-07-09 | CoLA: Conditional Dropout and Language-driven Robust Dual-modal Salient Object Detection | Shuang Hao et.al. | 2407.06780 | link |
2024-07-09 | Masked Video and Body-worn IMU Autoencoder for Egocentric Action Recognition | Mingfang Zhang et.al. | 2407.06628 | null |
2024-07-09 | Sparse-DeRF: Deblurred Neural Radiance Fields from Sparse View | Dogyoon Lee et.al. | 2407.06613 | null |
2024-07-09 | Low-dose, high-resolution CT of infant-sized lungs via propagation-based phase contrast | James A. Pollock et.al. | 2407.06527 | null |
2024-07-08 | MiraData: A Large-Scale Video Dataset with Long Durations and Structured Captions | Xuan Ju et.al. | 2407.06358 | null |
2024-07-08 | Video-STaR: Self-Training Enables Video Instruction Tuning with Any Supervision | Orr Zohar et.al. | 2407.06189 | link |
2024-07-08 | PanDORA: Casual HDR Radiance Acquisition for Indoor Scenes | Mohammad Reza Karimi Dastjerdi et.al. | 2407.06150 | null |
2024-07-08 | Accelerating Diffusion for SAR-to-Optical Image Translation via Adversarial Consistency Distillation | Xinyu Bai et.al. | 2407.06095 | null |
2024-07-08 | Pan-denoising: Guided Hyperspectral Image Denoising via Weighted Represent Coefficient Total Variation | Shuang Xu et.al. | 2407.06064 | link |
2024-07-08 | MobilePortrait: Real-Time One-Shot Neural Head Avatars on Mobile Devices | Jianwen Jiang et.al. | 2407.05712 | null |
2024-07-09 | PCAC-GAN: A Sparse-Tensor-Based Generative Adversarial Network for 3D Point Cloud Attribute Compression | Xiaolong Mao et.al. | 2407.05677 | null |
2024-07-08 | GSBIQA: Green Saliency-guided Blind Image Quality Assessment Method | Zhanxuan Mei et.al. | 2407.05590 | null |
2024-07-08 | Audio-driven High-resolution Seamless Talking Head Video Editing via StyleGAN | Jiacheng Su et.al. | 2407.05577 | null |
2024-07-06 | Panopticon: a telescope for our times | Will Saunders et.al. | 2407.05103 | null |
2024-07-06 | CLIPVQA: Video Quality Assessment via CLIP | Fengchuang Xing et.al. | 2407.04928 | link |
2024-07-06 | OmChat: A Recipe to Train Multimodal Language Models with Strong Long Context and Video Understanding | Tiancheng Zhao et.al. | 2407.04923 | null |
2024-07-05 | MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation? | Zhaorun Chen et.al. | 2407.04842 | link |
2024-07-05 | Resource-Efficient Speech Quality Prediction through Quantization Aware Training and Binary Activation Maps | Mattias Nilsson et.al. | 2407.04578 | link |
2024-07-05 | Rethinking Image Compression on the Web with Generative AI | Shayan Ali Hassan et.al. | 2407.04542 | null |
2024-07-05 | Optimizing the image correction pipeline for pedestrian detection in the thermal-infrared domain | Christophe Karam et.al. | 2407.04484 | null |
2024-07-05 | Unsupervised Video Summarization via Reinforcement Learning and a Trained Evaluator | Mehryar Abbasi et.al. | 2407.04258 | null |
2024-07-05 | HCS-TNAS: Hybrid Constraint-driven Semi-supervised Transformer-NAS for Ultrasound Image Segmentation | Renqi Chen et.al. | 2407.04203 | null |
2024-07-04 | Performance of Medical Image Fusion in High-level Analysis Tasks: A Mutual Enhancement Framework for Unaligned PAT and MRI Image Fusion | Yutian Zhong et.al. | 2407.03992 | link |
2024-07-04 | DSMix: Distortion-Induced Sensitivity Map Based Pre-training for No-Reference Image Quality Assessment | Jinsong Shi et.al. | 2407.03886 | link |
2024-07-04 | Perception-Guided Quality Metric of 3D Point Clouds Using Hybrid Strategy | Yujie Zhang et.al. | 2407.03885 | link |
2024-07-04 | DiffRetouch: Using Diffusion to Retouch on the Shoulder of Experts | Zheng-Peng Duan et.al. | 2407.03757 | null |
2024-07-04 | Adaptive sampling strategy for tolerance analysis of freeform optical surfaces based on critical ray aiming | Rundong Fan et.al. | 2407.03688 | null |
2024-07-04 | Pathological Semantics-Preserving Learning for H&E-to-IHC Virtual Staining | Fuqiang Chen et.al. | 2407.03655 | link |
2024-07-04 | Diff-Restorer: Unleashing Visual Prompts for Diffusion-based Universal Image Restoration | Yuhong Zhang et.al. | 2407.03636 | null |
2024-07-04 | Orthogonal Constrained Minimization with Tensor | Xiaoxia Liu et.al. | 2407.03605 | null |
2024-07-03 | Semantic-Aware Power Allocation for Generative Semantic Communications with Foundation Models | Chunmei Xu et.al. | 2407.03050 | null |
2024-07-03 | Single Image Rolling Shutter Removal with Diffusion Models | Zhanglei Yang et.al. | 2407.02906 | null |
2024-07-03 | FedPot: A Quality-Aware Collaborative and Incentivized Honeypot-Based Detector for Smart Grid Networks | Abdullatif Albaseer et.al. | 2407.02845 | null |
2024-07-03 | Data Overfitting for On-Device Super-Resolution with Dynamic Algorithm and Compiler Co-Design | Gen Li et.al. | 2407.02813 | link |
2024-07-03 | SF-GNN: Self Filter for Message Lossless Propagation in Deep Graph Neural Network | Yushan Zhu et.al. | 2407.02762 | null |
2024-07-03 | MentalAgora: A Gateway to Advanced Personalized Care in Mental Health through Multi-Agent Debating and Attribute Control | Yeonji Lee et.al. | 2407.02736 | null |
2024-07-02 | Meta 3D Gen | Raphael Bensadoun et.al. | 2407.02599 | null |
2024-07-02 | Off-Grid Ultrasound Imaging by Stochastic Optimization | Vincent van de Schaft et.al. | 2407.02285 | link |
2024-07-02 | SwiftDiffusion: Efficient Diffusion Model Serving with Add-on Modules | Suyi Li et.al. | 2407.02031 | null |
2024-07-01 | Free-text Rationale Generation under Readability Level Control | Yi-Sheng Hsu et.al. | 2407.01384 | null |
2024-07-01 | GaussianStego: A Generalizable Stenography Pipeline for Generative 3D Gaussians Splatting | Chenxin Li et.al. | 2407.01301 | null |
2024-07-01 | Optical turbulence vertical distribution at the Peak Terskol Observatory and Mt. Kurapdag | A. Y. Shikhovtsev et.al. | 2407.00960 | null |
2024-07-01 | Diffusion Transformer Model With Compact Prior for Low-dose PET Reconstruction | Bin Huang et.al. | 2407.00944 | link |
2024-06-30 | A Comparative Study of Quality Evaluation Methods for Text Summarization | Huyen Nguyen et.al. | 2407.00747 | null |
2024-06-30 | DCI: An Accurate Quality Assessment Criteria for Protein Complex Structure Models | Wenda Wang et.al. | 2407.00560 | null |
2024-06-29 | Dynamic Optimization of Video Streaming Quality Using Network Digital Twin Technology | Zurh Farus et.al. | 2407.00513 | null |
2024-07-02 | RTGS: Enabling Real-Time Gaussian Splatting on Mobile Devices Using Efficiency-Guided Pruning and Foveated Rendering | Weikai Lin et.al. | 2407.00435 | link |
2024-06-29 | Benchmark Evaluation of Image Fusion algorithms for Smartphone Camera Capture | Lucas N. Kirsten et.al. | 2407.00301 | null |
2024-06-28 | PathGen-1.6M: 1.6 Million Pathology Image-text Pairs Generation through Multi-agent Collaboration | Yuxuan Sun et.al. | 2407.00203 | null |
2024-06-28 | Quantitative Methods in Research Evaluation Citation Indicators, Altmetrics, and Artificial Intelligence | Mike Thelwall et.al. | 2407.00135 | null |
2024-06-28 | MR-zero meets FLASH -- Controlling the transient signal decay in gradient- and rf-spoiled gradient echo sequences | Simon Weinmüller et.al. | 2406.19877 | null |
2024-06-28 | Deep Fusion Model for Brain Tumor Classification Using Fine-Grained Gradient Preservation | Niful Islam et.al. | 2406.19690 | null |
2024-06-28 | UltraGelBot: Autonomous Gel Dispenser for Robotic Ultrasound | Deepak Raina et.al. | 2406.19678 | null |
2024-06-28 | PopAlign: Population-Level Alignment for Fair Text-to-Image Generation | Shufan Li et.al. | 2406.19668 | link |
2024-06-27 | Robustness Testing of Black-Box Models Against CT Degradation Through Test-Time Augmentation | Jack Highton et.al. | 2406.19557 | null |
2024-06-27 | Lightweight Predictive 3D Gaussian Splats | Junli Cao et.al. | 2406.19434 | link |
2024-06-27 | Looking 3D: Anomaly Detection with 2D-3D Alignment | Ankan Bhunia et.al. | 2406.19393 | link |
2024-06-27 | AI Data Readiness Inspector (AIDRIN) for Quantitative Assessment of Data Readiness for AI | Kaveen Hiniduma et.al. | 2406.19256 | null |
2024-06-27 | Staggered Quantizers for Perfect Perceptual Quality: A Connection between Quantizers with Common Randomness and Without | Ruida Zhou et.al. | 2406.19248 | null |
2024-06-27 | Local Manifold Learning for No-Reference Image Quality Assessment | Timin Gao et.al. | 2406.19247 | null |
2024-06-27 | Complex-valued scatter compensation in nonlinear microscopy | Maximilian Sohmen et.al. | 2406.19031 | null |
2024-06-27 | Using diffusion model as constraint: Empower Image Restoration Network Training with Diffusion Model | Jiangtong Tan et.al. | 2406.19030 | link |
2024-06-26 | IDA-UIE: An Iterative Framework for Deep Network-based Degradation Aware Underwater Image Enhancement | Pranjali Singh et.al. | 2406.18628 | null |
2024-06-26 | On Scaling Up 3D Gaussian Splatting Training | Hexu Zhao et.al. | 2406.18533 | link |
2024-06-26 | MultiDiff: Consistent Novel View Synthesis from a Single Image | Norman Müller et.al. | 2406.18524 | null |
2024-06-26 | ChronoMagic-Bench: A Benchmark for Metamorphic Evaluation of Text-to-Time-lapse Video Generation | Shenghai Yuan et.al. | 2406.18522 | link |
2024-06-26 | MFDNet: Multi-Frequency Deflare Network for Efficient Nighttime Flare Removal | Yiguo Jiang et.al. | 2406.18079 | link |
2024-06-26 | Leveraging Pre-trained Models for FF-to-FFPE Histopathological Image Translation | Qilai Zhang et.al. | 2406.18054 | link |
2024-06-25 | Burst Image Super-Resolution with Base Frame Selection | Sanghyun Kim et.al. | 2406.17869 | null |
2024-06-25 | Sparse-view Signal-domain Photoacoustic Tomography Reconstruction Method Based on Neural Representation | Bowei Yao et.al. | 2406.17578 | null |
2024-06-25 | UHD-IQA Benchmark Database: Pushing the Boundaries of Blind Photo Quality Assessment | Vlad Hosu et.al. | 2406.17472 | null |
2024-06-25 | Leveraging LLMs for Dialogue Quality Measurement | Jinghan Jia et.al. | 2406.17304 | null |
2024-06-25 | HD snapshot diffractive spectral imaging and inferencing | Apratim Majumder et.al. | 2406.17302 | null |
2024-06-25 | Image-Guided Outdoor LiDAR Perception Quality Assessment for Autonomous Driving | Ce Zhang et.al. | 2406.17265 | null |
2024-06-25 | Disentangled Motion Modeling for Video Frame Interpolation | Jaihyun Lew et.al. | 2406.17256 | link |
2024-06-24 | Evaluating the Quality of Hallucination Benchmarks for Large Vision-Language Models | Bei Yan et.al. | 2406.17115 | link |
2024-06-24 | Fine-tuning Diffusion Models for Enhancing Face Quality in Text-to-image Generation | Zhenyi Liao et.al. | 2406.17100 | link |
2024-06-24 | Reducing the Memory Footprint of 3D Gaussian Splatting | Panagiotis Papantonakis et.al. | 2406.17074 | null |
2024-06-24 | 3D distortion-free, reduced field of view diffusion-prepared GRE at 3T | Sarah McElroy et.al. | 2406.16809 | null |
2024-06-24 | Beyond Thumbs Up/Down: Untangling Challenges of Fine-Grained Feedback for Text-to-Image Generation | Katherine M. Collins et.al. | 2406.16807 | null |
2024-06-24 | Vision-Language Consistency Guided Multi-modal Prompt Learning for Blind AI Generated Image Quality Assessment | Jun Fu et.al. | 2406.16641 | link |
2024-06-24 | DaLPSR: Leverage Degradation-Aligned Language Prompt for Real-World Image Super-Resolution | Aiwen Jiang et.al. | 2406.16477 | link |
2024-06-24 | Approximate DCT and Quantization Techniques for Energy-Constrained Image Sensors | Ming-Che Li et.al. | 2406.16358 | null |
2024-06-24 | Priorformer: A UGC-VQA Method with content and distortion priors | Yajing Pei et.al. | 2406.16297 | null |
2024-06-23 | Listen and Move: Improving GANs Coherency in Agnostic Sound-to-Video Generation | Rafael Redondo et.al. | 2406.16155 | null |
2024-06-23 | LGS: A Light-weight 4D Gaussian Splatting for Efficient Surgical Scene Reconstruction | Hengyu Liu et.al. | 2406.16073 | link |
2024-06-22 | Quality-guided Skin Tone Enhancement for Portrait Photography | Shiqi Gao et.al. | 2406.15848 | null |
2024-06-21 | Adaptive Self-Supervised Consistency-Guided Diffusion Model for Accelerated MRI Reconstruction | Mojtaba Safari et.al. | 2406.15656 | null |
2024-06-21 | Contrastive Entity Coreference and Disambiguation for Historical Texts | Abhishek Arora et.al. | 2406.15576 | null |
2024-06-21 | Masked Extended Attention for Zero-Shot Virtual Try-On In The Wild | Nadav Orzech et.al. | 2406.15331 | null |
2024-06-21 | Towards Robust Training Datasets for Machine Learning with Ontologies: A Case Study for Emergency Road Vehicle Detection | Lynn Vonderhaar et.al. | 2406.15268 | null |
2024-06-24 | VideoScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation | Xuan He et.al. | 2406.15252 | null |
2024-06-21 | Unsupervised Bayesian Generation of Synthetic CT from CBCT Using Patient-Specific Score-Based Prior | Junbo Peng et.al. | 2406.15219 | null |
2024-06-21 | Benchmarking Retinal Blood Vessel Segmentation Models for Cross-Dataset and Cross-Disease Generalization | Jeremiah Fadugba et.al. | 2406.14994 | link |
2024-06-21 | Fair Text to Medical Image Diffusion Model with Subgroup Distribution Aligned Tuning | Xu Han et.al. | 2406.14847 | null |
2024-06-21 | Is this a bad table? A Closer Look at the Evaluation of Table Generation from Text | Pritika Ramu et.al. | 2406.14829 | null |
2024-06-20 | Holistic Evaluation for Interleaved Text-and-Image Generation | Minqian Liu et.al. | 2406.14643 | null |
2024-06-20 | A Fuzzy Logic-Based Quality Model For Identifying Microservices With Low Maintainability | Rahime Yilmaz et.al. | 2406.14489 | null |
2024-06-20 | Enhancing multivariate post-processed visibility predictions utilizing CAMS forecasts | Mária Lakatos et.al. | 2406.14159 | null |
2024-06-20 | EnTruth: Enhancing the Traceability of Unauthorized Dataset Usage in Text-to-image Diffusion Models with Minimal and Robust Alterations | Jie Ren et.al. | 2406.13933 | null |
2024-06-19 | IG-CFAT: An Improved GAN-Based Framework for Effectively Exploiting Transformers in Real-World Image Super-Resolution | Alireza Aghelan et.al. | 2406.13815 | link |
2024-06-19 | Convex-hull Estimation using XPSNR for Versatile Video Coding | Vignesh V Menon et.al. | 2406.13712 | null |
2024-06-19 | Assessing the 3D resolution of refocused correlation plenoptic images using a general-purpose image quality estimator | Gianlorenzo Massaro et.al. | 2406.13501 | null |
2024-06-19 | ALiiCE: Evaluating Positional Fine-grained Citation Generation | Yilong Xu et.al. | 2406.13375 | link |
2024-06-19 | AniFaceDiff: High-Fidelity Face Reenactment via Facial Parametric Conditioned Diffusion Models | Ken Chen et.al. | 2406.13272 | null |
2024-06-19 | New methods for ALMA angular-scale based observation scheduling, quality assessment, and beam shaping II: refinements | Dirk Petry et.al. | 2406.13199 | null |
2024-06-18 | NTIRE 2024 Challenge on Night Photography Rendering | Egor Ershov et.al. | 2406.13007 | null |
2024-06-18 | Pattern or Artifact? Interactively Exploring Embedding Quality with TRACE | Edith Heiter et.al. | 2406.12953 | link |
2024-06-18 | Automatic generation of insights from workers' actions in industrial workflows with explainable Machine Learning | Francisco de Arriba-Pérez et.al. | 2406.12732 | null |
2024-06-18 | Learned Image Compression for HE-stained Histopathological Images via Stain Deconvolution | Maximilian Fischer et.al. | 2406.12623 | null |
2024-06-18 | Training Diffusion Models with Federated Learning | Matthijs de Goede et.al. | 2406.12575 | null |
2024-06-18 | Automated MRI Quality Assessment of Brain T1-weighted MRI in Clinical Data Warehouses: A Transfer Learning Approach Relying on Artefact Simulation | Sophie Loizillon et.al. | 2406.12448 | link |
2024-06-18 | AI-Assisted Human Evaluation of Machine Translation | Vilém Zouhar et.al. | 2406.12419 | link |
2024-06-18 | SDNIA-YOLO: A Robust Object Detection Model for Extreme Weather Conditions | Yuexiong Ding et.al. | 2406.12395 | null |
2024-06-17 | A Hierarchical 3D Gaussian Representation for Real-Time Rendering of Very Large Datasets | Bernhard Kerbl et.al. | 2406.12080 | null |
2024-06-17 | FinTruthQA: A Benchmark Dataset for Evaluating the Quality of Financial Information Disclosure | Ziyue Xu et.al. | 2406.12009 | link |
2024-06-17 | RetinaGS: Scalable Training for Dense Scene Rendering with Billion-Scale 3D Gaussians | Bingling Li et.al. | 2406.11836 | null |
2024-06-17 | Latent Denoising Diffusion GAN: Faster sampling, Higher image quality | Luan Thanh Trinh et.al. | 2406.11713 | link |
2024-06-17 | Multimodal Learning To Improve Segmentation With Intraoperative CBCT & Preoperative CT | Maximilian E. Tschuchnig et.al. | 2406.11650 | null |
2024-06-17 | Unveiling the Power of Source: Source-based Minimum Bayes Risk Decoding for Neural Machine Translation | Boxuan Lyu et.al. | 2406.11632 | null |
2024-06-17 | Compressed Skinning for Facial Blendshapes | Ladislav Kavan et.al. | 2406.11597 | null |
2024-06-17 | Energy Reduction Opportunities in HDR Video Encoding | Christian Herglotz et.al. | 2406.11492 | null |
2024-06-17 | A Dictionary Based Approach for Removing Out-of-Focus Blur | Uditangshu Aurangabadkar et.al. | 2406.11330 | link |
2024-06-17 | NLDF: Neural Light Dynamic Fields for Efficient 3D Talking Head Generation | Niu Guanchen et.al. | 2406.11259 | null |
2024-06-17 | Incentivizing Quality Text Generation via Statistical Contracts | Eden Saig et.al. | 2406.11118 | link |
2024-06-16 | Parameter Blending for Multi-Camera Harmonization for Automotive Surround View Systems | Yuzhuo Ren et.al. | 2406.11066 | null |
2024-06-16 | SingMOS: An extensive Open-Source Singing Voice Dataset for MOS Prediction | Yuxun Tang et.al. | 2406.10911 | null |
2024-06-15 | MDeRainNet: An Efficient Neural Network for Rain Streak Removal from Macro-pixel Images | Tao Yan et.al. | 2406.10652 | null |
2024-06-15 | Exploring the Impact of AI-generated Image Tools on Professional and Non-professional Users in the Art and Design Fields | Yuying Tang et.al. | 2406.10640 | null |
2024-06-15 | Full reference point cloud quality assessment using support vector regression | Ryosuke Watanabe et.al. | 2406.10520 | link |
2024-06-15 | CoMM: A Coherent Interleaved Image-Text Dataset for Multimodal Understanding and Generation | Wei Chen et.al. | 2406.10462 | null |
2024-06-14 | Consistency-diversity-realism Pareto fronts of conditional image generative models | Pietro Astolfi et.al. | 2406.10429 | null |
2024-06-14 | PUP 3D-GS: Principled Uncertainty Pruning for 3D Gaussian Splatting | Alex Hanson et.al. | 2406.10219 | link |
2024-06-14 | AlignNet: Learning dataset score alignment functions to enable better training of speech quality estimators | Jaden Pieper et.al. | 2406.10205 | null |
2024-06-14 | D-NPC: Dynamic Neural Point Clouds for Non-Rigid View Synthesis from Monocular Video | Moritz Kappel et.al. | 2406.10078 | null |
2024-06-14 | Vision Language Modeling of Content, Distortion and Appearance for Image Quality Assessment | Fei Zhou et.al. | 2406.09858 | null |
2024-06-14 | Full-reference Point Cloud Quality Assessment Using Spectral Graph Wavelets | Ryosuke Watanabe et.al. | 2406.09762 | null |
2024-06-14 | Compressed Video Quality Enhancement with Temporal Group Alignment and Fusion | Qiang Zhu et.al. | 2406.09693 | null |
2024-06-13 | DSL-FIQA: Assessing Facial Image Quality via Dual-Set Degradation Learning and Landmark-Guided Transformer | Wei-Ting Chen et.al. | 2406.09622 | null |
2024-06-13 | Q-Mamba: On First Exploration of Vision Mamba for Image Quality Assessment | Fengbin Guan et.al. | 2406.09546 | null |
2024-06-13 | Modeling Ambient Scene Dynamics for Free-view Synthesis | Meng-Li Shih et.al. | 2406.09395 | null |
2024-06-14 | WonderWorld: Interactive 3D Scene Generation from a Single Image | Hong-Xing Yu et.al. | 2406.09394 | null |
2024-06-13 | LRM-Zero: Training Large Reconstruction Models with Synthesized Data | Desai Xie et.al. | 2406.09371 | link |
2024-06-13 | CMC-Bench: Towards a New Paradigm of Visual Signal Compression | Chunyi Li et.al. | 2406.09356 | link |
2024-06-13 | StableMaterials: Enhancing Diversity in Material Generation via Semi-Supervised Learning | Giuseppe Vecchio et.al. | 2406.09293 | null |
2024-06-13 | SR-CACO-2: A Dataset for Confocal Fluorescence Microscopy Image Super-Resolution | Soufiane Belharbi et.al. | 2406.09168 | link |
2024-06-13 | Adaptive Cooperative Streaming of Holographic Video Over Wireless Networks: A Proximal Policy Optimization Solution | Wanli Wen et.al. | 2406.08806 | null |
2024-06-13 | Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation | Mingwang Xu et.al. | 2406.08801 | null |
2024-06-13 | FouRA: Fourier Low Rank Adaptation | Shubhankar Borse et.al. | 2406.08798 | null |
2024-06-12 | Exploring Fact Memorization and Style Imitation in LLMs Using QLoRA: An Experimental Study and Quality Assessment Methods | Eugene Vyborov et.al. | 2406.08582 | null |
2024-06-12 | IMFL-AIGC: Incentive Mechanism Design for Federated Learning Empowered by Artificial Intelligence Generated Content | Guangjing Huang et.al. | 2406.08526 | null |
2024-06-12 | DDR: Exploiting Deep Degradation Response as Flexible Image Descriptor | Juncheng Wu et.al. | 2406.08377 | link |
2024-06-12 | WMAdapter: Adding WaterMark Control to Latent Diffusion Models | Hai Ci et.al. | 2406.08337 | null |
2024-06-12 | Guiding In-Context Learning of LLMs through Quality Estimation for Machine Translation | Javad Pourmostafa Roshan Sharami et.al. | 2406.07970 | link |
2024-06-12 | DemosaicFormer: Coarse-to-Fine Demosaicing Network for HybridEVS Camera | Senyan Xu et.al. | 2406.07951 | link |
2024-06-12 | Emotional Conversation: Empowering Talking Faces with Cohesive Expression, Gaze and Pose Generation | Jiadong Liang et.al. | 2406.07895 | null |
2024-06-11 | A PRISMA Driven Systematic Review of Publicly Available Datasets for Benchmark and Model Developments for Industrial Defect Detection | Can Akbas et.al. | 2406.07694 | null |
2024-06-11 | Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense? | Xingyu Fu et.al. | 2406.07546 | null |
2024-06-11 | Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance | Kuan Heng Lin et.al. | 2406.07540 | null |
2024-06-11 | Trim 3D Gaussian Splatting for Accurate Geometry Representation | Lue Fan et.al. | 2406.07499 | null |
2024-06-11 | Textual Similarity as a Key Metric in Machine Translation Quality Estimation | Kun Sun et.al. | 2406.07440 | null |
2024-06-11 | Redefining Automotive Radar Imaging: A Domain-Informed 1D Deep Learning Approach for High-Resolution and Efficient Performance | Ruxin Zheng et.al. | 2406.07399 | null |
2024-06-11 | DiffCom: Channel Received Signal is a Natural Condition to Guide Diffusion Posterior Sampling | Sixian Wang et.al. | 2406.07390 | null |
2024-06-11 | Noise-Robust Voice Conversion by Conditional Denoising Training Using Latent Variables of Recording Quality and Environment | Takuto Igarashi et.al. | 2406.07280 | null |
2024-06-11 | Accurate estimate of the ESPRESSO fiber-injection losses inferred from integrated field-stabilization images | Tobias M. Schmidt et.al. | 2406.07193 | null |
2024-06-11 | Motion Consistency Model: Accelerating Video Diffusion with Disentangled Motion-Appearance Distillation | Yuanhao Zhai et.al. | 2406.06890 | link |
2024-06-11 | A Subjective Quality Evaluation of 3D Mesh with Dynamic Level of Detail in Virtual Reality | Duc Nguyen et.al. | 2406.06888 | null |
2024-06-09 | Latent Diffusion Model-Enabled Real-Time Semantic Communication Considering Semantic Ambiguities and Channel Noises | Jianhua Pei et.al. | 2406.06644 | link |
2024-06-10 | Autoregressive Model Beats Diffusion: Llama for Scalable Image Generation | Peize Sun et.al. | 2406.06525 | link |
2024-06-10 | MVGamba: Unify 3D Content Generation as State Space Sequence Modeling | Xuanyu Yi et.al. | 2406.06367 | link |
2024-06-10 | Federated learning in food research | Zuzanna Fendor et.al. | 2406.06202 | null |
2024-06-10 | Generalized Nested Latent Variable Models for Lossy Coding applied to Wind Turbine Scenarios | Raül Pérez-Gonzalo et.al. | 2406.06165 | null |
2024-06-10 | JenGAN: Stacked Shifted Filters in GAN-Based Speech Synthesis | Hyunjae Cho et.al. | 2406.06111 | null |
2024-06-10 | GAIA: Rethinking Action Quality Assessment for AI-Generated Videos | Zijian Chen et.al. | 2406.06087 | link |
2024-06-10 | FRAG: Frequency Adapting Group for Diffusion Video Editing | Sunjae Yoon et.al. | 2406.06044 | link |
2024-06-12 | MLCM: Multistep Consistency Distillation of Latent Diffusion Model | Qingsong Xie et.al. | 2406.05768 | link |
2024-06-08 | Energy-Efficient Approximate Full Adders Applying Memristive Serial IMPLY Logic For Image Processing | Seyed Erfan Fatemieh et.al. | 2406.05525 | null |
2024-06-08 | Blurry-Consistency Segmentation Framework with Selective Stacking on Differential Interference Contrast 3D Breast Cancer Spheroid | Thanh-Huy Nguyen et.al. | 2406.05349 | null |
2024-06-08 | Deep convolutional demosaicking network for multispectral polarization filter array | Tomoharu Ishiuchi et.al. | 2406.05312 | null |
2024-06-08 | YouTube SFV+HDR Quality Dataset | Yilin Wang et.al. | 2406.05305 | null |
2024-06-07 | Spectral Codecs: Spectrogram-Based Audio Codecs for High Quality Speech Synthesis | Ryan Langman et.al. | 2406.05298 | null |
2024-06-07 | GANetic Loss for Generative Adversarial Networks with a Focus on Medical Applications | Shakhnaz Akhmedova et.al. | 2406.05023 | link |
2024-06-07 | Ada-VE: Training-Free Consistent Video Editing Using Adaptive Motion Prior | Tanvir Mahmud et.al. | 2406.04873 | link |
2024-06-07 | SMC++: Masked Learning of Unsupervised Video Semantic Compression | Yuan Tian et.al. | 2406.04765 | link |
2024-06-07 | The Active Optics System on the Vera C. Rubin Observatory: Optimal Control of Degeneracy Among the Large Number of Degrees of Freedom | Guillem Megias Homar et.al. | 2406.04656 | null |
2024-06-07 | GenzIQA: Generalized Image Quality Assessment using Prompt-Guided Latent Diffusion Models | Diptanu De et.al. | 2406.04654 | null |
2024-06-07 | StreamOptix: A Cross-layer Adaptive Video Delivery Scheme | Mufan Liu et.al. | 2406.04632 | link |
2024-06-07 | Attention Fusion Reverse Distillation for Multi-Lighting Image Anomaly Detection | Yiheng Zhang et.al. | 2406.04573 | null |
2024-06-06 | Improving Geo-diversity of Generated Images with Contextualized Vendi Score Guidance | Reyhane Askari Hemmat et.al. | 2406.04551 | null |
2024-06-06 | A Versatile Collage Visualization Technique | Zhenyu Wang et.al. | 2406.04008 | null |
2024-06-06 | JIGMARK: A Black-Box Approach for Enhancing Image Watermarks against Diffusion Model Edits | Minzhou Pan et.al. | 2406.03720 | link |
2024-06-06 | Superpoint Gaussian Splatting for Real-Time High-Fidelity Dynamic Scene Reconstruction | Diwen Wan et.al. | 2406.03697 | link |
2024-06-05 | Anatomy-based quality metric of diffusion-weighted MRI data for accurate derivation of muscle fiber orientation | Nadya Shusharina et.al. | 2406.03560 | null |
2024-06-05 | Globally and Locally Optimized Pannini Projection for High FoV Rendering of 360-degree Images | Falah Jabar et.al. | 2406.03282 | null |
2024-06-05 | FAPNet: An Effective Frequency Adaptive Point-based Eye Tracker | Xiaopeng Lin et.al. | 2406.03177 | null |
2024-06-05 | Dynamic 3D Gaussian Fields for Urban Areas | Tobias Fischer et.al. | 2406.03175 | null |
2024-06-05 | The new Herschel/PACS Point Source Catalogue | Gábor Marton et.al. | 2406.03116 | null |
2024-06-05 | A-Bench: Are LMMs Masters at Evaluating AI-generated Images? | Zicheng Zhang et.al. | 2406.03070 | link |
2024-06-05 | DifAttack++: Query-Efficient Black-Box Adversarial Attack via Hierarchical Disentangled Feature Space in Cross Domain | Jun Liu et.al. | 2406.03017 | link |
2024-06-05 | Efficient Minimum Bayes Risk Decoding using Low-Rank Matrix Completion Algorithms | Firas Trabelsi et.al. | 2406.02832 | null |
2024-06-04 | ViDiT-Q: Efficient and Accurate Quantization of Diffusion Transformers for Image and Video Generation | Tianchen Zhao et.al. | 2406.02540 | link |
2024-06-04 | Guiding a Diffusion Model with a Bad Version of Itself | Tero Karras et.al. | 2406.02507 | link |
2024-06-04 | Towards AI-Assisted Sustainable Adaptive Video Streaming Systems: Tutorial and Survey | Reza Farahani et.al. | 2406.02302 | null |
2024-06-04 | I4VGen: Image as Stepping Stone for Text-to-Video Generation | Xiefan Guo et.al. | 2406.02230 | null |
2024-06-04 | OTTAWA: Optimal TransporT Adaptive Word Aligner for Hallucination and Omission Translation Errors Detection | Chenyang Huang et.al. | 2406.01919 | link |
2024-06-04 | Rank-based No-reference Quality Assessment for Face Swapping | Xinghui Zhou et.al. | 2406.01884 | null |
2024-06-03 | Video Coding with Cross-Component Sample Offset | Han Gao et.al. | 2406.01795 | null |
2024-06-03 | DEFT: Efficient Finetuning of Conditional Diffusion Models by Learning the Generalised | Alexander Denker et.al. | 2406.01781 | link |
2024-06-03 | Enhancing Dynamic CT Image Reconstruction with Neural Fields Through Explicit Motion Regularizers | Pablo Arratia et.al. | 2406.01299 | null |
2024-06-03 | Capsule Enhanced Variational AutoEncoder for Underwater Image Reconstruction | Rita Pucci et.al. | 2406.01294 | link |
2024-06-03 | Dimba: Transformer-Mamba Diffusion Models | Zhengcong Fei et.al. | 2406.01159 | null |
2024-06-03 | Visual Car Brand Classification by Implementing a Synthetic Image Dataset Creation Pipeline | Jan Lippemeier et.al. | 2406.01071 | null |
2024-06-03 | UniQA: Unified Vision-Language Pre-training for Image Quality and Aesthetic Assessment | Hantao Zhou et.al. | 2406.01069 | link |
2024-06-03 | CLIP-Guided Attribute Aware Pretraining for Generalizable Image Quality Assessment | Daekyu Kwon et.al. | 2406.01020 | null |
2024-06-02 | EchoNet-Synthetic: Privacy-preserving Video Generation for Safe Medical Data Sharing | Hadrien Reynaud et.al. | 2406.00808 | link |
2024-06-04 | Unsupervised Contrastive Analysis for Salient Pattern Detection using Conditional Diffusion Models | Cristiano Patrício et.al. | 2406.00772 | link |
2024-06-02 | W-Net: A Facial Feature-Guided Face Super-Resolution Network | Hao Liu et.al. | 2406.00676 | null |
2024-06-01 | Bilateral Guided Radiance Field Processing | Yuehao Wang et.al. | 2406.00448 | null |
2024-06-01 | Localize, Understand, Collaborate: Semantic-Aware Dragging via Intention Reasoner | Xing Cui et.al. | 2406.00432 | link |
2024-06-01 | Hybrid attention structure preserving network for reconstruction of under-sampled OCT images | Zezhao Guo et.al. | 2406.00279 | null |
2024-05-31 | Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis | Chaoyou Fu et.al. | 2405.21075 | null |
2024-05-31 | Kaleido Diffusion: Improving Conditional Diffusion Models with Autoregressive Latent Modeling | Jiatao Gu et.al. | 2405.21048 | null |
2024-05-31 | Tsang's resolution enhancement method for imaging with focused illumination | Alexander Duplinskiy et.al. | 2405.20979 | null |
2024-05-31 | Fourier123: One Image to High-Quality 3D Object Generation with Hybrid Fourier Score Distillation | Shuzhou Yang et.al. | 2405.20669 | link |
2024-05-30 | An Automatic Question Usability Evaluation Toolkit | Steven Moore et.al. | 2405.20529 | link |
2024-05-30 | Can No-Reference Quality-Assessment Methods Serve as Perceptual Losses for Super-Resolution? | Egor Kashkarov et.al. | 2405.20392 | null |
2024-05-30 | CoSy: Evaluating Textual Explanations of Neurons | Laura Kopf et.al. | 2405.20331 | link |
2024-05-31 | NeRF View Synthesis: Subjective Quality Assessment and Objective Metrics Evaluation | Pedro Martin et.al. | 2405.20078 | null |
2024-05-30 | Promptus: Can Prompts Streaming Replace Video Streaming with Stable Diffusion | Jiangkai Wu et.al. | 2405.20032 | link |
2024-06-03 | DP-IQA: Utilizing Diffusion Prior for Blind Image Quality Assessment in the Wild | Honghao Fu et.al. | 2405.19996 | link |
2024-05-29 | CLIPLoss and Norm-Based Data Selection Methods for Multimodal Contrastive Learning | Yiping Wang et.al. | 2405.19547 | link |
2024-05-29 | A Full-duplex Speech Dialogue Scheme Based On Large Language Models | Peng Wang et.al. | 2405.19487 | null |
2024-05-29 | VisTA-SR: Improving the Accuracy and Resolution of Low-Cost Thermal Imaging Cameras for Agriculture | Heesup Yun et.al. | 2405.19413 | null |
2024-05-29 | Adaptive Image Quality Assessment via Teaching Large Multimodal Model to Compare | Hanwei Zhu et.al. | 2405.19298 | link |
2024-05-29 | A study on the adequacy of common IQA measures for medical images | Anna Breger et.al. | 2405.19224 | link |
2024-05-29 | A study of why we need to reassess full reference image quality assessment with medical images | Anna Breger et.al. | 2405.19097 | null |
2024-05-31 | Benchmarking and Improving Detail Image Caption | Hongyuan Dong et.al. | 2405.19092 | link |
2024-05-29 | Tuning-Free Alignment of Diffusion Models with Direct Noise Optimization | Zhiwei Tang et.al. | 2405.18881 | link |
2024-05-29 | Descriptive Image Quality Assessment in the Wild | Zhiyuan You et.al. | 2405.18842 | null |
2024-05-29 | Opinion-Unaware Blind Image Quality Assessment using Multi-Scale Deep Feature Statistics | Zhangkai Ni et.al. | 2405.18790 | link |
2024-05-28 | Are Image Distributions Indistinguishable to Humans Indistinguishable to Classifiers? | Zebin You et.al. | 2405.18029 | null |
2024-05-30 | Proof of Quality: A Costless Paradigm for Trustless Generative AI Model Inference on Blockchains | Zhenjie Zhang et.al. | 2405.17934 | null |
2024-05-30 | MixDQ: Memory-Efficient Few-Step Text-to-Image Diffusion Models with Metric-Decoupled Mixed Precision Quantization | Tianchen Zhao et.al. | 2405.17873 | null |
2024-05-28 | PTM-VQA: Efficient Video Quality Assessment Leveraging Diverse PreTrained Models from the Wild | Kun Yuan et.al. | 2405.17765 | null |
2024-05-28 | AdapNet: Adaptive Noise-Based Network for Low-Quality Image Retrieval | Sihe Zhang et.al. | 2405.17718 | null |
2024-05-27 | Enhancing Global Sensitivity and Uncertainty Quantification in Medical Image Reconstruction with Monte Carlo Arbitrary-Masked Mamba | Jiahao Huang et.al. | 2405.17659 | null |
2024-05-27 | Partitioned Hankel-based Diffusion Models for Few-shot Low-dose CT Reconstruction | Wenhao Zhang et.al. | 2405.17167 | null |
2024-05-28 | F-3DGS: Factorized Coordinates and Representations for 3D Gaussian Splatting | Xiangyu Sun et.al. | 2405.17083 | null |
2024-05-29 | The Multi-Range Theory of Translation Quality Measurement: MQM scoring models and Statistical Quality Control | Arle Lommel et.al. | 2405.16969 | null |
2024-05-27 | EM Distillation for One-step Diffusion Models | Sirui Xie et.al. | 2405.16852 | null |
2024-05-27 | Controlling Rate, Distortion, and Realism: Towards a Single Comprehensive Neural Image Compression Model | Shoma Iwai et.al. | 2405.16817 | link |
2024-05-26 | Coil Reweighting to Suppress Motion Artifacts in Real-Time Exercise Cine Imaging | Chong Chen et.al. | 2405.16715 | null |
2024-05-26 | Deep learning improved autofocus for motion artifact reduction and its application in quantitative susceptibility mapping | Chao Li et.al. | 2405.16664 | null |
2024-05-26 | Looks Too Good To Be True: An Information-Theoretic Analysis of Hallucinations in Generative Restoration Models | Regev Cohen et.al. | 2405.16475 | null |
2024-05-25 | Enhancing Consistency-Based Image Generation via Adversarialy-Trained Classification and Energy-Based Discrimination | Shelly Golan et.al. | 2405.16260 | link |
2024-05-25 | Maintaining and Managing Road Quality: Using MLP and DNN | Makgotso Jacqueline Maotwana et.al. | 2405.16196 | null |
2024-05-25 | Accelerating Inference of Retrieval-Augmented Generation via Sparse Context Selection | Yun Zhu et.al. | 2405.16178 | null |
2024-05-24 | Diff-DTI: Fast Diffusion Tensor Imaging Using A Feature-Enhanced Joint Diffusion Model | Lang Zhang et.al. | 2405.15830 | null |
2024-05-24 | Erase to Enhance: Data-Efficient Machine Unlearning in MRI Reconstruction | Yuyang Xue et.al. | 2405.15517 | link |
2024-05-24 | Benchmarking Pre-trained Large Language Models' Potential Across Urdu NLP tasks | Munief Hassan Tahir et.al. | 2405.15453 | null |
2024-05-24 | Fieldscale: Locality-Aware Field-based Adaptive Rescaling for Thermal Infrared Image | Hyeonjae Gil et.al. | 2405.15395 | link |
2024-05-24 | CPT-Interp: Continuous sPatial and Temporal Motion Modeling for 4D Medical Image Interpolation | Xia Li et.al. | 2405.15385 | null |
2024-05-24 | Seeing the World through an Antenna's Eye: Reception Quality Visualization Using Incomplete Technical Signal Information | Leif Bergerhoff et.al. | 2405.15253 | null |
2024-05-24 | Improved Distribution Matching Distillation for Fast Image Synthesis | Tianwei Yin et.al. | 2405.14867 | link |
2024-05-23 | Physics-informed Score-based Diffusion Model for Limited-angle Reconstruction of Cardiac Computed Tomography | Shuo Han et.al. | 2405.14770 | null |
2024-05-23 | Convolutional Neural Network Model Observers Discount Signal-like Anatomical Structures During Search in Virtual Digital Breast Tomosynthesis Phantoms | Aditya Jonnalagadda et.al. | 2405.14720 | null |
2024-05-23 | OpFlowTalker: Realistic and Natural Talking Face Generation via Optical Flow Guidance | Shuheng Ge et.al. | 2405.14709 | null |
2024-05-24 | Autoregressive Image Diffusion: Generation of Image Sequence and Application in MRI | Guanxiong Luo et.al. | 2405.14327 | link |
2024-05-23 | Survey on Visual Signal Coding and Processing with Generative Models: Technologies, Standards and Optimization | Zhibo Chen et.al. | 2405.14221 | null |
2024-05-22 | Uncertainty-aware Evaluation of Auxiliary Anomalies with the Expected Anomaly Posterior | Lorenzo Perini et.al. | 2405.13699 | null |
2024-05-22 | Euclid: Early Release Observations -- Programme overview and pipeline for compact- and diffuse-emission photometry | J. -C. Cuillandre et.al. | 2405.13496 | null |
2024-05-25 | Class-Conditional self-reward mechanism for improved Text-to-Image models | Safouane El Ghazouali et.al. | 2405.13473 | link |
2024-05-22 | Comparative Analysis of Hyperspectral Image Reconstruction Using Deep Learning for Agricultural and Biological Applications | Md. Toukir Ahmed et.al. | 2405.13331 | null |
2024-05-21 | Geometric Transformation Uncertainty for Improving 3D Fetal Brain Pose Prediction from Freehand 2D Ultrasound Videos | Jayroop Ramesh et.al. | 2405.13235 | link |
2024-05-24 | Computational Tradeoffs in Image Synthesis: Diffusion, Masked-Token, and Next-Token Prediction | Maciej Kilian et.al. | 2405.13218 | null |
2024-05-21 | NieR: Normal-Based Lighting Scene Rendering | Hongsheng Wang et.al. | 2405.13097 | null |
2024-05-21 | MOSS: Motion-based 3D Clothed Human Synthesis from Monocular Video | Hongsheng Wang et.al. | 2405.12806 | null |
2024-05-21 | Is Dataset Quality Still a Concern in Diagnosis Using Large Foundation Model? | Ziqin Lin et.al. | 2405.12584 | null |
2024-05-20 | Paired Conditional Generative Adversarial Network for Highly Accelerated Liver 4D MRI | Di Xu et.al. | 2405.12357 | null |
2024-05-20 | Deep learning-based hyperspectral image reconstruction for quality assessment of agro-product | Md. Toukir Ahmed et.al. | 2405.12313 | null |
2024-05-20 | GGAvatar: Geometric Adjustment of Gaussian Head Avatar | Xinyang Li et.al. | 2405.11993 | null |
2024-05-20 | On Efficient and Statistical Quality Estimation for Data Annotation | Jan-Christoph Klie et.al. | 2405.11919 | null |
2024-05-20 | ViViD: Video Virtual Try-on using Diffusion Models | Zixun Fang et.al. | 2405.11794 | null |
2024-05-19 | Solar image quality assessment: a proof of concept using Variance of Laplacian method and its application to optical atmospheric condition monitoring | Chu Wing So et.al. | 2405.11490 | null |
2024-05-18 | Sampling Strategies for Mitigating Bias in Face Synthesis Methods | Emmanouil Maragkoudakis et.al. | 2405.11320 | null |
2024-05-18 | Dreamer XL: Towards High-Resolution Text-to-3D Generation via Trajectory Score Matching | Xingyu Miao et.al. | 2405.11252 | link |
2024-05-18 | Testing the Performance of Face Recognition for People with Down Syndrome | Christian Rathgeb et.al. | 2405.11240 | null |
2024-05-21 | SPOR: A Comprehensive and Practical Evaluation Method for Compositional Generalization in Data-to-Text Generation | Ziyao Xu et.al. | 2405.10650 | link |
2024-05-17 | Simultaneous Deep Learning of Myocardium Segmentation and T2 Quantification for Acute Myocardial Infarction MRI | Yirong Zhou et.al. | 2405.10570 | null |
2024-05-17 | Enhancing Perception Quality in Remote Sensing Image Compression via Invertible Neural Network | Junhui Li et.al. | 2405.10518 | null |
2024-05-16 | Diversity-Aware Sign Language Production through a Pose Encoding Variational Autoencoder | Mohamed Ilyes Lakhal et.al. | 2405.10423 | null |
2024-05-16 | GS-Planner: A Gaussian-Splatting-based Planning Framework for Active High-Fidelity Reconstruction | Rui Jin et.al. | 2405.10142 | null |
2024-05-16 | Semantic Communication via Rate Distortion Perception Bottleneck | Zihe Zhao et.al. | 2405.09995 | null |
2024-05-16 | VirtualModel: Generating Object-ID-retentive Human-object Interaction Image by Diffusion Model for E-commerce Marketing | Binghui Chen et.al. | 2405.09985 | null |
2024-05-16 | NTIRE 2024 Restore Any Image Model (RAIM) in the Wild Challenge | Jie Liang et.al. | 2405.09923 | null |
2024-05-16 | DiffAM: Diffusion-based Adversarial Makeup Transfer for Facial Privacy Protection | Yuhao Sun et.al. | 2405.09882 | link |
2024-05-15 | Perception- and Fidelity-aware Reduced-Reference Super-Resolution Image Quality Assessment | Xinying Lin et.al. | 2405.09472 | null |
2024-05-16 | Global-Local Image Perceptual Score (GLIPS): Evaluating Photorealistic Quality of AI-Generated Images | Memoona Aziz et.al. | 2405.09426 | null |
2024-05-15 | Application of Gated Recurrent Units for CT Trajectory Optimization | Yuedong Yuan et.al. | 2405.09333 | null |
2024-05-21 | Deep Blur Multi-Model (DeepBlurMM) - a strategy to mitigate the impact of image blur on deep learning model performance in histopathology image analysis | Yujie Xiang et.al. | 2405.09298 | null |
2024-05-15 | Sensitivity Decouple Learning for Image Compression Artifacts Reduction | Li Ma et.al. | 2405.09291 | null |
2024-05-15 | Shacl4Bib: custom validation of library data | Péter Király et.al. | 2405.09177 | null |
2024-05-18 | Scalable Image Coding for Humans and Machines Using Feature Fusion Network | Takahiro Shindo et.al. | 2405.09152 | link |
2024-05-15 | RSHazeDiff: A Unified Fourier-aware Diffusion Model for Remote Sensing Image Dehazing | Jiamei Xiong et.al. | 2405.09083 | link |
2024-05-14 | Chemically peculiar stars on the pre-main sequence | L. Kueß et.al. | 2405.08946 | null |
2024-05-14 | Enhancing Blind Video Quality Assessment with Rich Quality-aware Features | Wei Sun et.al. | 2405.08745 | link |
2024-05-13 | The Lost Melody: Empirical Observations on Text-to-Video Generation From A Storytelling Perspective | Andrew Shin et.al. | 2405.08720 | null |
2024-05-14 | Using autoencoders and deep transfer learning to determine the stellar parameters of 286 CARMENES M dwarfs | P. Mas-Buitrago et.al. | 2405.08703 | link |
2024-05-15 | RMT-BVQA: Recurrent Memory Transformer-based Blind Video Quality Assessment for Enhanced Video Content | Tianhao Peng et.al. | 2405.08621 | null |
2024-05-14 | Dual-Branch Network for Portrait Image Quality Assessment | Wei Sun et.al. | 2405.08555 | link |
2024-05-14 | WaterMamba: Visual State Space Model for Underwater Image Enhancement | Meisheng Guan et.al. | 2405.08419 | null |
2024-05-14 | Perivascular space Identification Nnunet for Generalised Usage (PINGU) | Benjamin Sinclair et.al. | 2405.08337 | link |
2024-05-14 | Progressive enhancement and restoration for mural images under low-light and defected conditions based on multi-receptive field strategy | Xiameng Wei et.al. | 2405.08245 | link |
2024-05-13 | Quality of Experience Optimization for Real-time XR Video Transmission with Energy Constraints | Guangjin Pan et.al. | 2405.07689 | null |
2024-05-15 | PRANK: a singular value based noise filtering approach | Francesco Trainotti et.al. | 2405.07578 | null |
2024-05-13 | Dehazing Remote Sensing and UAV Imagery: A Review of Deep Learning, Prior-based, and Hybrid Approaches | Gao Yu Lee et.al. | 2405.07520 | null |
2024-05-12 | Understanding and Evaluating Human Preferences for AI Generated Images with Instruction Tuning | Jiarui Wang et.al. | 2405.07346 | link |
2024-05-12 | PotatoGANs: Utilizing Generative Adversarial Networks, Instance Segmentation, and Explainable AI for Enhanced Potato Disease Identification and Classification | Mohammad Shafiul Alam et.al. | 2405.07332 | link |
2024-05-12 | Stable Signature is Unstable: Removing Image Watermark from Diffusion Models | Yuepeng Hu et.al. | 2405.07145 | null |
2024-05-11 | Large Language Model-aided Edge Learning in Distribution System State Estimation | Renyou Xie et.al. | 2405.06999 | null |
2024-05-15 | Generation of Granular-Balls for Clustering Based on the Principle of Justifiable Granularity | Zihang Jia et.al. | 2405.06904 | null |
2024-05-11 | FineParser: A Fine-grained Spatio-temporal Action Parser for Human-centric Action Quality Assessment | Jinglin Xu et.al. | 2405.06887 | link |
2024-05-10 | Multi-Object Tracking in the Dark | Xinzhe Wang et.al. | 2405.06600 | link |
2024-05-10 | Compression-Realized Deep Structural Network for Video Quality Enhancement | Hanchi Sun et.al. | 2405.06342 | null |
2024-05-09 | Perceptual Crack Detection for Rendered 3D Textured Meshes | Armin Shafiee Sarvestani et.al. | 2405.06143 | link |
2024-05-09 | Distilling Diffusion Models into Conditional GANs | Minguk Kang et.al. | 2405.05967 | null |
2024-05-09 | How Quality Affects Deep Neural Networks in Fine-Grained Image Classification | Joseph Smith et.al. | 2405.05742 | null |
2024-05-09 | LatentColorization: Latent Diffusion-Based Speaker Video Colorization | Rory Ward et.al. | 2405.05707 | null |
2024-05-09 | SwapTalk: Audio-Driven Talking Face Generation with One-Shot Customization in Latent Space | Zeren Zhang et.al. | 2405.05636 | null |
2024-05-09 | Array SAR 3D Sparse Imaging Based on Regularization by Denoising Under Few Observed Data | Yangyang Wang et.al. | 2405.05565 | null |
2024-05-08 | Imagine Flash: Accelerating Emu Diffusion Models with Backward Distillation | Jonas Kohler et.al. | 2405.05224 | null |
2024-05-08 | Bridging the Gap Between Saliency Prediction and Image Quality Assessment | Kirillov Alexey et.al. | 2405.04997 | link |
2024-05-07 | Remote Diffusion | Kunal Sunil Kasodekar et.al. | 2405.04717 | null |
2024-05-07 | Novel View Synthesis with Neural Radiance Fields for Industrial Robot Applications | Markus Hillemann et.al. | 2405.04345 | null |
2024-05-07 | Audio-Visual Speech Representation Expert for Enhanced Talking Face Video Generation and Evaluation | Dogucan Yaman et.al. | 2405.04327 | null |
2024-05-07 | Cross-IQA: Unsupervised Learning for Image Quality Assessment | Zhen Zhang et.al. | 2405.04311 | null |
2024-05-07 | Sora Detector: A Unified Hallucination Detection for Large Text-to-Video Models | Zhixuan Chu et.al. | 2405.04180 | link |
2024-05-07 | Bridging the Synthetic-to-Authentic Gap: Distortion-Guided Unsupervised Domain Adaptation for Blind Image Quality Assessment | Aobo Li et.al. | 2405.04167 | link |
2024-05-07 | Lossy Compression with Data, Perception, and Classification Constraints | Yuhan Wang et.al. | 2405.04144 | null |
2024-05-07 | Joint Estimation of Identity Verification and Relative Pose for Partial Fingerprints | Xiongjun Guan et.al. | 2405.03959 | link |
2024-05-06 | AI-Driven Frameworks for Enhancing Data Quality in Big Data Ecosystems: Error_Detection, Correction, and Metadata Integration | Widad Elouataoui et.al. | 2405.03870 | null |
2024-05-06 | Accelerated MR Cholangiopancreatography with Deep Learning-based Reconstruction | Jinho Kim et.al. | 2405.03732 | link |
2024-05-06 | All-in-One Deep Learning Framework for MR Image Reconstruction | Geunu Jeong et.al. | 2405.03684 | null |
2024-05-06 | An Image Quality Evaluation and Masking Algorithm Based On Pre-trained Deep Neural Networks | Peng Jia et.al. | 2405.03408 | null |
2024-05-06 | Retinexmamba: Retinex-based Mamba for Low-light Image Enhancement | Jiesong Bai et.al. | 2405.03349 | link |
2024-05-06 | Light-VQA+: A Video Quality Assessment Model for Exposure Correction with Vision-Language Guidance | Xunchu Zhou et.al. | 2405.03333 | link |
2024-05-06 | Multi-Modality Spatio-Temporal Forecasting via Self-Supervised Learning | Jiewen Deng et.al. | 2405.03255 | link |
2024-05-05 | Matten: Video Generation with Mamba-Attention | Yu Gao et.al. | 2405.03025 | null |
2024-05-05 | Design, analysis, and manufacturing of a glass-plastic hybrid minimalist aspheric panoramic annular lens | Shaohua Gao et.al. | 2405.02942 | null |
2024-05-05 | Residual-Conditioned Optimal Transport: Towards Structure-preserving Unpaired and Paired Image Restoration | Xiaole Tang et.al. | 2405.02843 | link |
2024-05-04 | Deep Image Restoration For Image Anti-Forensics | Eren Tahir et.al. | 2405.02751 | link |
2024-05-04 | DiffuseTrace: A Transparent and Flexible Watermarking Scheme for Latent Diffusion Model | Liangqi Lei et.al. | 2405.02696 | null |
2024-05-03 | On the test-time zero-shot generalization of vision-language models: Do we really need prompt learning? | Maxime Zanella et.al. | 2405.02266 | link |
2024-05-01 | Reference-Free Image Quality Metric for Degradation and Reconstruction Artifacts | Han Cui et.al. | 2405.02208 | null |
2024-05-03 | HoloGS: Instant Depth-based 3D Gaussian Splatting with Microsoft HoloLens 2 | Miriam Jäger et.al. | 2405.02005 | null |
2024-05-03 | Report on the AAPM Grand Challenge on deep generative modeling for learning medical image statistics | Rucha Deshpande et.al. | 2405.01822 | null |
2024-05-07 | Towards Inclusive Face Recognition Through Synthetic Ethnicity Alteration | Praveen Kumar Chandaliya et.al. | 2405.01273 | null |
2024-05-02 | Singular Value and Frame Decomposition-based Reconstruction for Atmospheric Tomography | Lukas Weissinger et.al. | 2405.01079 | null |
2024-05-01 | Brighteye: Glaucoma Screening with Color Fundus Photographs based on Vision Transformer | Hui Lin et.al. | 2405.00857 | link |
2024-05-01 | Deep Reward Supervisions for Tuning Text-to-Image Diffusion Models | Xiaoshi Wu et.al. | 2405.00760 | null |
2024-05-01 | Adapting Pretrained Networks for Image Quality Assessment on High Dynamic Range Displays | Andrei Chubarau et.al. | 2405.00670 | link |
2024-05-01 | Monte Carlo Tree Search Boosts Reasoning via Iterative Preference Learning | Yuxi Xie et.al. | 2405.00451 | link |
2024-04-30 | Fast MRI Reconstruction Using Deep Learning-based Compressed Sensing: A Systematic Review | Mojtaba Safari et.al. | 2405.00241 | link |
2024-04-30 | Charting the Path Forward: CT Image Quality Assessment -- An In-Depth Review | Siyi Xun et.al. | 2405.00075 | null |
2024-04-30 | Beyond MOS: Subjective Image Quality Score Preprocessing Method Based on Perceptual Similarity | Lei Wang et.al. | 2404.19666 | null |
2024-04-30 | Perceptual Constancy Constrained Single Opinion Score Calibration for Image Quality Assessment | Lei Wang et.al. | 2404.19595 | null |
2024-04-30 | Causal Perception Inspired Representation Learning for Trustworthy Image Quality Assessment | Lei Wang et.al. | 2404.19567 | null |
2024-05-04 | Towards Real-world Video Face Restoration: A New Benchmark | Ziyan Chen et.al. | 2404.19500 | null |
2024-04-30 | NeRF-Insert: 3D Local Editing with Multimodal Control Signals | Benet Oriol Sabat et.al. | 2404.19204 | null |
2024-04-30 | Global Search Optics: Automatically Exploring Optimal Solutions to Compact Computational Imaging Systems | Yao Gao et.al. | 2404.19201 | null |
2024-04-30 | Advancing low-field MRI with a universal denoising imaging transformer: Towards fast and high-quality imaging | Zheren Zhu et.al. | 2404.19167 | link |
2024-04-29 | A Comprehensive Rubric for Annotating Pathological Speech | Mario Corrales-Astorgano et.al. | 2404.18851 | null |
2024-04-29 | Autonomous Quality and Hallucination Assessment for Virtual Tissue Staining and Digital Pathology | Luzhe Huang et.al. | 2404.18458 | null |
2024-04-29 | PKU-AIGIQA-4K: A Perceptual Quality Assessment Database for Both Text-to-Image and Image-to-Image AI-Generated Images | Jiquan Yuan et.al. | 2404.18409 | link |
2024-04-29 | G-Refine: A General Quality Refiner for Text-to-Image Generation | Chunyi Li et.al. | 2404.18343 | link |
2024-04-28 | An automated pipeline for computation and analysis of functional ventilation and perfusion lung MRI with matrix pencil decomposition: TrueLung | Orso Pusterla et.al. | 2404.18275 | null |
2024-04-28 | LMM-PCQA: Assisting Point Cloud Quality Assessment with LMM | Zic |