Ep. 246 - Part 2 - June 12, 2024

カートのアイテムが多すぎます

ご購入は五十タイトルがカートに入っている場合のみです。

カートに追加できませんでした。

しばらく経ってから再度お試しください。

ウィッシュリストに追加できませんでした。

しばらく経ってから再度お試しください。

ほしい物リストの削除に失敗しました。

しばらく経ってから再度お試しください。

ポッドキャストのフォローに失敗しました

ポッドキャストのフォロー解除に失敗しました

Ep. 246 - Part 2 - June 12, 2024

無料で聴く

ポッドキャストの詳細を見る

このコンテンツについて

ArXiv Computer Vision research for Wednesday, June 12, 2024.

00:21: From Sim-to-Real: Toward General Event-based Low-light Frame Interpolation with Per-scene Optimization

01:44: Make Your Actor Talk: Generalizable and High-Fidelity Lip Sync with Motion and Appearance Disentanglement

03:20: Adversarial Patch for 3D Local Feature Extractor

04:00: Valeo4Cast: A Modular Approach to End-to-End Forecasting

05:38: The impact of deep learning aid on the workload and interpretation accuracy of radiologists on chest computed tomography: a cross-over reader study

08:50: Universal Scale Laws for Colors and Patterns in Imagery

10:11: CT3D++: Improving 3D Object Detection with Keypoint-induced Channel-wise Transformer

11:44: ConMe: Rethinking Evaluation of Compositional Reasoning for Modern VLMs

13:25: Continuous fake media detection: adapting deepfake detectors to new generative techniques

15:18: Category-level Neural Field for Reconstruction of Partially Observed Objects in Indoor Environment

16:23: One-Step Effective Diffusion Network for Real-World Image Super-Resolution

18:12: 2nd Place Solution for MOSE Track in CVPR 2024 PVUW workshop: Complex Video Object Segmentation

19:22: Diffusion-Promoted HDR Video Reconstruction

21:09: Runtime Freezing: Dynamic Class Loss for Multi-Organ 3D Segmentation

21:52: A Sociotechnical Lens for Evaluating Computer Vision Models: A Case Study on Detecting and Reasoning about Gender and Emotion

23:54: DistilDoc: Knowledge Distillation for Visually-Rich Document Applications

25:28: Using Deep Convolutional Neural Networks to Detect Rendered Glitches in Video Games

26:39: OpenCOLE: Towards Reproducible Automatic Graphic Design Generation

27:23: Dataset Enhancement with Instance-Level Augmentations

28:33: Interpretable Representation Learning of Cardiac MRI via Attribute Regularization

29:33: A New Class Biorthogonal Spline Wavelet for Image Edge Detection

30:48: Outdoor Scene Extrapolation with Hierarchical Generative Cellular Automata

32:10: Vessel Re-identification and Activity Detection in Thermal Domain for Maritime Surveillance

33:32: AdaNCA: Neural Cellular Automata As Adaptors For More Robust Vision Transformer

35:09: From Chaos to Clarity: 3DGS in the Dark

36:32: LaMOT: Language-Guided Multi-Object Tracking

38:07: UDON: Universal Dynamic Online distillatioN for generic image representations

39:49: WMAdapter: Adding WaterMark Control to Latent Diffusion Models

40:48: Blind Image Deblurring using FFT-ReLU with Deep Learning Pipeline Integration

42:06: DocSynthv2: A Practical Autoregressive Modeling for Document Generation