Spaces:
Sleeping
Sleeping
commit files to HF hub
Browse files- papers.csv +8 -8
papers.csv
CHANGED
@@ -12,7 +12,7 @@ Test-time Personalizable Forecasting of 3D Human Poses,"Cui, Qiongjie*; Sun, Hua
|
|
12 |
HM-ViT: Hetero-modal Vehicle-to-Vehicle Cooperative perception with vision transformer,"Xiang, Hao; Xu, Runsheng; Ma, Jiaqi*",poster,,,,,,,,,
|
13 |
Efficient neural supersampling on a novel gaming dataset,"Mercier, Antoine*; Erasmus, Ruan S; Savani, Yashesh ; Dhingra, Manik; Porikli, Fatih; Berger, Guillaume J. F.",poster,2308.01483,https://arxiv.org/abs/2308.01483,,https://huggingface.co/papers/2308.01483,,,,6,0
|
14 |
Locally Stylized Neural Radiance Fields,"Pang, Hong Wing*; Hua, Binh-Son; Yeung, Sai-Kit",poster,,,,,,,,,
|
15 |
-
NEMTO: Neural Environment Matting for Novel View and Relighting Synthesis of Transparent Objects,"Wang, Dongqing *; Zhang, Tong ; Süsstrunk, Sabine",poster,2303.11963,https://arxiv.org/abs/2303.11963,,https://huggingface.co/papers/2303.11963,,,,3,
|
16 |
DDColor: Towards Photo-Realistic and Semantic-Aware Image Colorization via Dual Decoders,"Kang, Xiaoyang*; Yang, Tao; Ouyang, Wenqi; REN, PEIRAN; Li, Lingzhi; Xie, Xuansong",poster,,,,,,,,,
|
17 |
IntrinsicNeRF: Learning Intrinsic Neural Radiance Fields for Editable Novel View Synthesis,"Ye, Weicai*; CHEN, SHUO; Bao, Chong; Bao, Hujun; Pollefeys, Marc; Cui, Zhaopeng; Zhang, Guofeng",poster,2210.00647,https://arxiv.org/abs/2210.00647,,https://huggingface.co/papers/2210.00647,,,,7,0
|
18 |
PARIS: Part-level Reconstruction and Motion Analysis for Articulated Objects,"Liu, Jiayi*; Mahdavi-Amiri, Ali; Savva, Manolis",poster,2308.07391,https://arxiv.org/abs/2308.07391,,https://huggingface.co/papers/2308.07391,,,,3,0
|
@@ -48,7 +48,7 @@ Multi-granularity Interaction Simulation for Unsupervised Interactive Segmentati
|
|
48 |
Texture Learning Domain Randomization for Domain Generalized Segmentation,"Kim, Sunghwan*; Kim, Dae-hwan; Kim, Hoseong",poster,2303.11546,https://arxiv.org/abs/2303.11546,https://github.com/ssssshwan/TLDR,https://huggingface.co/papers/2303.11546,,,,3,0
|
49 |
Unsupervised Video Object Segmentation with Online Adversarial Self-Tuning,"Su, Tiankang*; Song, Huihui; Liu, Dong; Liu, Bo; Liu, Qingshan",poster,,,,,,,,,
|
50 |
Exploring Open-Vocabulary Semantic Segmentation without Human Labels,"Chen, Jun*; Zhu, Deyao; Qian, Guocheng; Ghanem, Bernard; Yan, Zhicheng; Zhu, Chenchen; Xiao, Fanyi; Elhoseiny, Mohamed; Culatana, Sean",poster,2306.00450,https://arxiv.org/abs/2306.00450,,https://huggingface.co/papers/2306.00450,,,,9,0
|
51 |
-
RbA: Segmenting Unknown Regions Rejected by All,"Nayal, Nazir*; YAVUZ, MISRA; Henriques, Joao F; Guney, Fatma",poster,2211.14293,https://arxiv.org/abs/2211.14293,,https://huggingface.co/papers/2211.14293,,,,4,
|
52 |
SEMPART: Self-supervised Multi-resolution Partitioning of Image Semantics,"Ravindran, Sriram; Basu, Debraj D*",poster,,,,,,,,,
|
53 |
Multi-Object Discovery by Low-Dimensional Object Motion,"Safadoust, Sadra*; Guney, Fatma",poster,2307.08027,https://arxiv.org/abs/2307.08027,,https://huggingface.co/papers/2307.08027,,,,2,0
|
54 |
MemorySeg: Online LiDAR Semantic Segmentation with a Latent Memory,"Li, Enxu*; Casas, Sergio; Urtasun, Raquel",poster,,,,,,,,,
|
@@ -478,7 +478,7 @@ Spatio-Temporal Crop Aggregation for Video Representation Learning,"Sameni, Sepe
|
|
478 |
Semantic Information in Contrastive Learning,"Quan, Shengjiang*; Hirano, Masahiro; Yamakawa, Yuji",poster,,,,,,,,,
|
479 |
Cross-Domain Product Representation Learning for Rich-Content E-Commerce,"bai, xuehan; Li, Yan; Cheng, Yanhua; Yang, Wenjie; Chen, Quan*; Li, Han",poster,,,,,,,,,
|
480 |
Contrastive Continuity on Augmentation Stability Rehearsal for Continual Self-Supervised Learning,"Cheng, Haoyang*; Wen, Haitao; Zhang, Xiaoliang; Qiu, Heqian; Wang, Lanxiao; Li, Hongliang",poster,,,,,,,,,
|
481 |
-
HybridAugment++: Unified Frequency Spectra Perturbations for Model Robustness,"Yücel, Mehmet Kerim*; Cinbis, Ramazan Gokberk; Duygulu, Pinar",poster,2307.11823,https://arxiv.org/abs/2307.11823,,https://huggingface.co/papers/2307.11823,,,,3,
|
482 |
Unleashing Text-to-Image Diffusion Models for Visual Perception,"Zhao, Wenliang; Rao, Yongming; Liu, Zuyan; Liu, Benlin; Zhou, Jie; Lu, Jiwen*",poster,2303.02153,https://arxiv.org/abs/2303.02153,https://github.com/wl-zhao/VPD,https://huggingface.co/papers/2303.02153,,,,6,0
|
483 |
Efficient Controllable Multi-Task Architectures,"Aich, Abhishek*; Schulter, Samuel; Roy-Chowdhury, Amit K. ; Chandraker, Manmohan; Suh, Yumin",poster,2308.11744,https://arxiv.org/abs/2308.11744,,https://huggingface.co/papers/2308.11744,,,,5,0
|
484 |
ParCNetV2: Oversized Kernel with Enhanced Attention,"Xu, Ruihan; Zhang, Haokui; Hu, Wenze; Zhang, Shiliang*; Wang, Xiaoyu",poster,2211.07157,https://arxiv.org/abs/2211.07157,,https://huggingface.co/papers/2211.07157,,,,5,0
|
@@ -617,7 +617,7 @@ UMFuse: Unified Multi View Fusion for Human Editing applications,"Jain, Rishabh*
|
|
617 |
Evaluating Data Attribution for Text-to-Image Models,"Wang, Sheng-Yu*; Efros, Alexei A; Zhu, Jun-Yan; Zhang, Richard ",poster,2306.09345,https://arxiv.org/abs/2306.09345,,https://huggingface.co/papers/2306.09345,,,,4,0
|
618 |
Neural Characteristic Function Learning for Conditional Image Generation,"Li, Shengxi; Zhang, Jialu; Li, Yifei; Xu, Mai*; Deng, Xin; Li, Li",poster,,,,,,,,,
|
619 |
WaveIPT: Joint Attention and Flow Alignment in the Wavelet domain for Pose Transfer,"Ma, Liyuan*; Gao, Tingwei; Jiang, Haitian; Shen, Haibin; Huang, Kejie",poster,,,,,,,,,
|
620 |
-
LayoutDiffusion: Improving Graphic Layout Generation by Discrete Diffusion Probabilistic Models,"Zhang, Junyi*; Guo, Jiaqi; Sun, Shizhao; Lou, Jian-Guang; Zhang, Dongmei",poster,2303.11589,https://arxiv.org/abs/2303.11589,,https://huggingface.co/papers/2303.11589,,,,5,
|
621 |
Human-inspired Facial Sketch Synthesis with Dynamic Adaptation,"Gao, Fei*; Zhu, Yifan; Jiang, Chang; Wang, Nannan",poster,,,,,,,,,
|
622 |
Conceptual and Hierarchical Latent Space Decomposition for Face Editing,"Ozkan, Savas*; Ozay, Mete; Robinson, Thomas W",poster,,,,,,,,,
|
623 |
Improving Diversity in Zero-Shot GAN Adaptation with Semantic Variations,"Jeon, Seogkyu*; Liu, Bei; Lee, Pilhyeon; Hong , Kibeom; Fu, Jianlong; Byun, Hyeran",poster,2308.10554,https://arxiv.org/abs/2308.10554,,https://huggingface.co/papers/2308.10554,,,,6,0
|
@@ -704,7 +704,7 @@ Beyond One-to-One: Rethinking the Referring Image Segmentation,"Hu, Yutao*; Wang
|
|
704 |
Multiple Instance Learning Framework with Masked Hard Instance Mining for Whole Slide Image Classification,"Tang, Wenhao; Huang, Sheng*; Zhang, Xiaoxian; Zhou, Fengtao; Zhang, Yi; Liu, Bo",oral,2307.15254,https://arxiv.org/abs/2307.15254,https://github.com/DearCaat/MHIM-MIL,https://huggingface.co/papers/2307.15254,,,,6,0
|
705 |
Scale-MAE: A Scale-Aware Masked Autoencoder for Multiscale Geospatial Representation Learning,"Reed, Colorado J; Gupta, Ritwik*; Li, Shufan; Brockman, Sarah; Funk, Christopher; Clipp, Brian S; Keutzer, Kurt; Candido, Salvatore; Uyttendaele, Matt; Darrell, Trevor",oral,,,,,,,,,
|
706 |
Progressive Spatio-Temporal Prototype Matching for Text-Video Retrieval,"Li, Pandeng*; Xie, Chen-Wei; Zhao, Liming; Xie, Hongtao; Ge, Jiannan; Zheng, Yun; Zhao, Deli; Zhang, Yongdong",oral,,,,,,,,,
|
707 |
-
Towards Deeply Unified Depth-aware Panoptic Segmentation with Bi-directional Guidance Learning,"He, Junwen*; Wang, Yifan; Wang, Lijun; Lu, Huchuan; Luo, Bin; He, Jun-Yan; Lan, Jin-Peng; Geng, Yifeng; Xie, Xuansong",oral,2307.14786,https://arxiv.org/abs/2307.14786,,https://huggingface.co/papers/2307.14786,,,,9,
|
708 |
LogicSeg: Parsing Visual Semantics with Neural Logic Learning and Reasoning,"Li, Liulei; Wang, Wenguan*; Yang, Yi",oral,,,,,,,,,
|
709 |
ASIC: Aligning Sparse in-the-wild Image Collections,"Gupta, Kamal*; Jampani, Varun; Shrivastava, Abhinav; Makadia, Ameesh; Snavely, Noah; Esteves, Carlos; Kar, Abhishek",oral,2303.16201,https://arxiv.org/abs/2303.16201,,https://huggingface.co/papers/2303.16201,,,,7,0
|
710 |
CLIPascene: Scene Sketching with Different Types and Levels of Abstraction,"Vinker, Yael*; Alaluf, Yuval; Cohen-Or, Danny; Shamir, Ariel",oral,2211.17256,https://arxiv.org/abs/2211.17256,,https://huggingface.co/papers/2211.17256,,,,4,0
|
@@ -1547,7 +1547,7 @@ Essential Matrix Estimation using Convex Relaxations in Orthogonal Space,"Karimi
|
|
1547 |
TripLe: Revisiting Pretrained Model Reuse and Progressive Learning for Efficient Vision Transformer Scaling and Searching,"Fu, Cheng*; Huang, Hanxian; Jiang, Zixuan; Ni, Yun; Nai, Lifeng; Wu, Gang; Cheng, Liqun; Zhou, Yanqi; Li, Sheng; Li, Andrew; Zhao, Jishen",poster,,,,,,,,,
|
1548 |
DiffRate : Differentiable Compression Rate for Efficient Vision Transformers,"Chen, Mengzhao*; Shao, Wenqi; Xu, Peng; Lin, Mingbao; Zhang, Kaipeng; Chao, Fei; Ji, Rongrong; Qiao, Yu; Luo, Ping",poster,2305.17997,https://arxiv.org/abs/2305.17997,https://github.com/OpenGVLab/DiffRate,https://huggingface.co/papers/2305.17997,,,,9,0
|
1549 |
Bridging Cross-task Protocol Inconsistency for Distillation in Dense Object Detection,"Yang, Longrong; Zhou, Xianpan; Li, Xuewei; Qiao, Liang; Li, Zheyang; Yang, Ziwei; Wang, Gaoang; Li, Xi*",poster,2308.14286,https://arxiv.org/abs/2308.14286,https://github.com/TinyTigerPan/BCKD,https://huggingface.co/papers/2308.14286,,,,8,0
|
1550 |
-
From Knowledge Distillation to Self-Knowledge Distillation: A Unified Approach with Normalized Loss and Customized Soft Labels,"Yang, Zhendong*; Zeng, Ailing; Li, Zhe; Zhang, Tianke; Yuan, Chun; Li, Yu",poster,2303.13005,https://arxiv.org/abs/2303.13005,https://github.com/yzd-v/cls_KD,https://huggingface.co/papers/2303.13005,,,,6,
|
1551 |
Efficient 3D Semantic Segmentation with Superpoint Transformer,"ROBERT, Damien*; Raguet, Hugo; Landrieu, Loic",poster,2306.08045,https://arxiv.org/abs/2306.08045,,https://huggingface.co/papers/2306.08045,,,,3,1
|
1552 |
Dataset Quantization,"Zhou, Daquan; Wang, Kai*; Gu, Jianyang; Peng, Xiangyu; Lian, Dongze; Zhang, Yifan; You, Yang; Feng, Jiashi",poster,2308.10524,https://arxiv.org/abs/2308.10524,,https://huggingface.co/papers/2308.10524,,,,8,0
|
1553 |
Revisiting the Parameter Efficiency of Adapters from the Perspective of Precision Redundancy,"Jie, Shibo*; Wang, Haoqing; Deng, Zhi-Hong",poster,2307.16867,https://arxiv.org/abs/2307.16867,https://github.com/JieShibo/PETL-ViT,https://huggingface.co/papers/2307.16867,,,,3,0
|
@@ -1817,7 +1817,7 @@ Contrastive Automatic Model Evaluation,"Peng, Ru; Duan, Qiuyang; Wang, Haobo; Ma
|
|
1817 |
Aria Digital Twin: A New Benchmark Dataset for Egocentric 3D Machine Perception,"Pan, Xiaqing*; Charron, Nicholas; Yang, Yongqian; Peters, Scott C; Whelan, Thomas; Kong, Chen; Parkhi, Omkar M; Newcombe, Richard; Ren, Yuheng",poster,2306.06362,https://arxiv.org/abs/2306.06362,,https://huggingface.co/papers/2306.06362,,,,9,0
|
1818 |
Exploring Video Quality Assessment on User Generated Contents from Aesthetic and Technical Perspectives,"Wu, Haoning*; Zhang, Erli; Liao, Liang; Chen, Chaofeng; Hou, Jingwen; Wang, Annan; Sun, Wenxiu; Yan, Qiong; Lin, Weisi",poster,2211.04894,https://arxiv.org/abs/2211.04894,https://github.com/VQAssessment/DOVER,https://huggingface.co/papers/2211.04894,,,,9,0
|
1819 |
Going Beyond Nouns With Vision & Language Models Using Synthetic Data,"Cascante-Bonilla, Paola*; Shehada, Khaled; Smith, James S; Doveh, Sivan; Kim, Donghyun; Panda, Rameswar; Varol, Gul; Oliva, Aude; Ordonez, Vicente; Feris, Rogerio; Karlinsky, Leonid",poster,2303.17590,https://arxiv.org/abs/2303.17590,,https://huggingface.co/papers/2303.17590,,,,11,0
|
1820 |
-
H3WB: Human3.6M 3D WholeBody Dataset and Benchmark,"Zhu, Yue*; Samet, Nermin; Picard, David",poster,2211.15692,https://arxiv.org/abs/2211.15692,https://github.com/wholebody3d/wholebody3d,https://huggingface.co/papers/2211.15692,,,,3,
|
1821 |
ZOD: A large-scale and diverse multimodal dataset for autonomous driving,"Alibeigi, Mina*; Ljungbergh, William; Tonderski, Adam; Hess, Georg; Lilja, Adam; Lindström, Carl; Motorniuk, Daria; Fu, Junsheng; Widahl, Jenny; Petersson, Christoffer",poster,,,,,,,,,
|
1822 |
CAD-Estate: Large-scale CAD Model Annotation in RGB Videos,"Maninis, Kevis-Kokitsi*; Popov, Stefan; Niessner, Matthias; Ferrari, Vittorio",poster,,,,,,,,,
|
1823 |
Neglected Free Lunch - Learning Image Classifiers Using Annotation Byproducts,"Han, Dongyoon; Choe, Junsuk; Chun, Seonghyeok; Chung, John JY; Chang, Minsuk; Yun, Sangdoo; Song, Jean Y; Oh, Seong Joon*",poster,,,,,,,,,
|
@@ -1950,7 +1950,7 @@ Disentangle then Parse: Night-time Semantic Segmentation with Illumination Disen
|
|
1950 |
Visual Traffic Knowledge Graph Generation from Scene Images,"Guo, Yunfei*; yin, Fei; Li, Xiao-Hui; YAN, XUDONG; XUE, TAO; mei, shuqi; Liu, Cheng-Lin",poster,,,,,,,,,
|
1951 |
Agglomerative Transformer for Human-Object Interaction Detection,"Tu, Danyang*; Sun, Wei; Zhai, Guangtao; Shen, Wei",poster,2308.08370,https://arxiv.org/abs/2308.08370,,https://huggingface.co/papers/2308.08370,,,,4,0
|
1952 |
3D Neural Embedding Likelihood for Robust Probabilistic Inverse Graphics,"Zhou, Guangyao*; Gothoskar, Nishad; Wang, Lirui; Tenenbaum, Joshua; Gutfreund, Dan; Lázaro-Gredilla, Miguel; George, Dileep; Mansinghka, Vikash",poster,2302.03744,https://arxiv.org/abs/2302.03744,,https://huggingface.co/papers/2302.03744,,,,8,0
|
1953 |
-
HiLo: Exploiting High Low Frequency Relations for Unbiased Panoptic Scene Graph Generation,"Zhou, Zijian*; Shi, Miaojing; Caesar, Holger",poster,2303.15994,https://arxiv.org/abs/2303.15994,https://github.com/franciszzj/HiLo,https://huggingface.co/papers/2303.15994,,,,3,
|
1954 |
SRLIP: Fast Scaling of Relational Language-Image Pre-training,"Yuan, Hangjie*; Zhang, Shiwei; Wang, Xiang; Albanie, Samuel; Pan, Yining; Feng, Tao; Jiang, Jianwen; Ni, Dong; Zhang, Yingya; Zhao, Deli",poster,,,,,,,,,
|
1955 |
UniSeg: A Unified Multi-Modal LiDAR Segmentation Network and the OpenPCSeg Codebase,"Liu, Youquan*; Chen, Runnan; Li, Xin; Kong, Lingdong; Yang, Yuchen; Xia, Zhaoyang; Bai, Yeqi; Zhu, Xinge; Ma, Yuexin; Li, Yikang; HOU, Yuenan; Qiao, Yu",poster,,,,,,,,,
|
1956 |
See More and Know More: Zero-shot Point Cloud Segmentation via Multi-modal Visual Data,"Lu, Yuhang*; Jiang, Qi; Chen, Runnan; HOU, Yuenan; Zhu, Xinge; Ma, Yuexin",poster,2307.10782,https://arxiv.org/abs/2307.10782,,https://huggingface.co/papers/2307.10782,,,,6,0
|
|
|
12 |
HM-ViT: Hetero-modal Vehicle-to-Vehicle Cooperative perception with vision transformer,"Xiang, Hao; Xu, Runsheng; Ma, Jiaqi*",poster,,,,,,,,,
|
13 |
Efficient neural supersampling on a novel gaming dataset,"Mercier, Antoine*; Erasmus, Ruan S; Savani, Yashesh ; Dhingra, Manik; Porikli, Fatih; Berger, Guillaume J. F.",poster,2308.01483,https://arxiv.org/abs/2308.01483,,https://huggingface.co/papers/2308.01483,,,,6,0
|
14 |
Locally Stylized Neural Radiance Fields,"Pang, Hong Wing*; Hua, Binh-Son; Yeung, Sai-Kit",poster,,,,,,,,,
|
15 |
+
NEMTO: Neural Environment Matting for Novel View and Relighting Synthesis of Transparent Objects,"Wang, Dongqing *; Zhang, Tong ; Süsstrunk, Sabine",poster,2303.11963,https://arxiv.org/abs/2303.11963,,https://huggingface.co/papers/2303.11963,,,,3,1
|
16 |
DDColor: Towards Photo-Realistic and Semantic-Aware Image Colorization via Dual Decoders,"Kang, Xiaoyang*; Yang, Tao; Ouyang, Wenqi; REN, PEIRAN; Li, Lingzhi; Xie, Xuansong",poster,,,,,,,,,
|
17 |
IntrinsicNeRF: Learning Intrinsic Neural Radiance Fields for Editable Novel View Synthesis,"Ye, Weicai*; CHEN, SHUO; Bao, Chong; Bao, Hujun; Pollefeys, Marc; Cui, Zhaopeng; Zhang, Guofeng",poster,2210.00647,https://arxiv.org/abs/2210.00647,,https://huggingface.co/papers/2210.00647,,,,7,0
|
18 |
PARIS: Part-level Reconstruction and Motion Analysis for Articulated Objects,"Liu, Jiayi*; Mahdavi-Amiri, Ali; Savva, Manolis",poster,2308.07391,https://arxiv.org/abs/2308.07391,,https://huggingface.co/papers/2308.07391,,,,3,0
|
|
|
48 |
Texture Learning Domain Randomization for Domain Generalized Segmentation,"Kim, Sunghwan*; Kim, Dae-hwan; Kim, Hoseong",poster,2303.11546,https://arxiv.org/abs/2303.11546,https://github.com/ssssshwan/TLDR,https://huggingface.co/papers/2303.11546,,,,3,0
|
49 |
Unsupervised Video Object Segmentation with Online Adversarial Self-Tuning,"Su, Tiankang*; Song, Huihui; Liu, Dong; Liu, Bo; Liu, Qingshan",poster,,,,,,,,,
|
50 |
Exploring Open-Vocabulary Semantic Segmentation without Human Labels,"Chen, Jun*; Zhu, Deyao; Qian, Guocheng; Ghanem, Bernard; Yan, Zhicheng; Zhu, Chenchen; Xiao, Fanyi; Elhoseiny, Mohamed; Culatana, Sean",poster,2306.00450,https://arxiv.org/abs/2306.00450,,https://huggingface.co/papers/2306.00450,,,,9,0
|
51 |
+
RbA: Segmenting Unknown Regions Rejected by All,"Nayal, Nazir*; YAVUZ, MISRA; Henriques, Joao F; Guney, Fatma",poster,2211.14293,https://arxiv.org/abs/2211.14293,,https://huggingface.co/papers/2211.14293,,,,4,1
|
52 |
SEMPART: Self-supervised Multi-resolution Partitioning of Image Semantics,"Ravindran, Sriram; Basu, Debraj D*",poster,,,,,,,,,
|
53 |
Multi-Object Discovery by Low-Dimensional Object Motion,"Safadoust, Sadra*; Guney, Fatma",poster,2307.08027,https://arxiv.org/abs/2307.08027,,https://huggingface.co/papers/2307.08027,,,,2,0
|
54 |
MemorySeg: Online LiDAR Semantic Segmentation with a Latent Memory,"Li, Enxu*; Casas, Sergio; Urtasun, Raquel",poster,,,,,,,,,
|
|
|
478 |
Semantic Information in Contrastive Learning,"Quan, Shengjiang*; Hirano, Masahiro; Yamakawa, Yuji",poster,,,,,,,,,
|
479 |
Cross-Domain Product Representation Learning for Rich-Content E-Commerce,"bai, xuehan; Li, Yan; Cheng, Yanhua; Yang, Wenjie; Chen, Quan*; Li, Han",poster,,,,,,,,,
|
480 |
Contrastive Continuity on Augmentation Stability Rehearsal for Continual Self-Supervised Learning,"Cheng, Haoyang*; Wen, Haitao; Zhang, Xiaoliang; Qiu, Heqian; Wang, Lanxiao; Li, Hongliang",poster,,,,,,,,,
|
481 |
+
HybridAugment++: Unified Frequency Spectra Perturbations for Model Robustness,"Yücel, Mehmet Kerim*; Cinbis, Ramazan Gokberk; Duygulu, Pinar",poster,2307.11823,https://arxiv.org/abs/2307.11823,,https://huggingface.co/papers/2307.11823,,,,3,1
|
482 |
Unleashing Text-to-Image Diffusion Models for Visual Perception,"Zhao, Wenliang; Rao, Yongming; Liu, Zuyan; Liu, Benlin; Zhou, Jie; Lu, Jiwen*",poster,2303.02153,https://arxiv.org/abs/2303.02153,https://github.com/wl-zhao/VPD,https://huggingface.co/papers/2303.02153,,,,6,0
|
483 |
Efficient Controllable Multi-Task Architectures,"Aich, Abhishek*; Schulter, Samuel; Roy-Chowdhury, Amit K. ; Chandraker, Manmohan; Suh, Yumin",poster,2308.11744,https://arxiv.org/abs/2308.11744,,https://huggingface.co/papers/2308.11744,,,,5,0
|
484 |
ParCNetV2: Oversized Kernel with Enhanced Attention,"Xu, Ruihan; Zhang, Haokui; Hu, Wenze; Zhang, Shiliang*; Wang, Xiaoyu",poster,2211.07157,https://arxiv.org/abs/2211.07157,,https://huggingface.co/papers/2211.07157,,,,5,0
|
|
|
617 |
Evaluating Data Attribution for Text-to-Image Models,"Wang, Sheng-Yu*; Efros, Alexei A; Zhu, Jun-Yan; Zhang, Richard ",poster,2306.09345,https://arxiv.org/abs/2306.09345,,https://huggingface.co/papers/2306.09345,,,,4,0
|
618 |
Neural Characteristic Function Learning for Conditional Image Generation,"Li, Shengxi; Zhang, Jialu; Li, Yifei; Xu, Mai*; Deng, Xin; Li, Li",poster,,,,,,,,,
|
619 |
WaveIPT: Joint Attention and Flow Alignment in the Wavelet domain for Pose Transfer,"Ma, Liyuan*; Gao, Tingwei; Jiang, Haitian; Shen, Haibin; Huang, Kejie",poster,,,,,,,,,
|
620 |
+
LayoutDiffusion: Improving Graphic Layout Generation by Discrete Diffusion Probabilistic Models,"Zhang, Junyi*; Guo, Jiaqi; Sun, Shizhao; Lou, Jian-Guang; Zhang, Dongmei",poster,2303.11589,https://arxiv.org/abs/2303.11589,,https://huggingface.co/papers/2303.11589,,,,5,1
|
621 |
Human-inspired Facial Sketch Synthesis with Dynamic Adaptation,"Gao, Fei*; Zhu, Yifan; Jiang, Chang; Wang, Nannan",poster,,,,,,,,,
|
622 |
Conceptual and Hierarchical Latent Space Decomposition for Face Editing,"Ozkan, Savas*; Ozay, Mete; Robinson, Thomas W",poster,,,,,,,,,
|
623 |
Improving Diversity in Zero-Shot GAN Adaptation with Semantic Variations,"Jeon, Seogkyu*; Liu, Bei; Lee, Pilhyeon; Hong , Kibeom; Fu, Jianlong; Byun, Hyeran",poster,2308.10554,https://arxiv.org/abs/2308.10554,,https://huggingface.co/papers/2308.10554,,,,6,0
|
|
|
704 |
Multiple Instance Learning Framework with Masked Hard Instance Mining for Whole Slide Image Classification,"Tang, Wenhao; Huang, Sheng*; Zhang, Xiaoxian; Zhou, Fengtao; Zhang, Yi; Liu, Bo",oral,2307.15254,https://arxiv.org/abs/2307.15254,https://github.com/DearCaat/MHIM-MIL,https://huggingface.co/papers/2307.15254,,,,6,0
|
705 |
Scale-MAE: A Scale-Aware Masked Autoencoder for Multiscale Geospatial Representation Learning,"Reed, Colorado J; Gupta, Ritwik*; Li, Shufan; Brockman, Sarah; Funk, Christopher; Clipp, Brian S; Keutzer, Kurt; Candido, Salvatore; Uyttendaele, Matt; Darrell, Trevor",oral,,,,,,,,,
|
706 |
Progressive Spatio-Temporal Prototype Matching for Text-Video Retrieval,"Li, Pandeng*; Xie, Chen-Wei; Zhao, Liming; Xie, Hongtao; Ge, Jiannan; Zheng, Yun; Zhao, Deli; Zhang, Yongdong",oral,,,,,,,,,
|
707 |
+
Towards Deeply Unified Depth-aware Panoptic Segmentation with Bi-directional Guidance Learning,"He, Junwen*; Wang, Yifan; Wang, Lijun; Lu, Huchuan; Luo, Bin; He, Jun-Yan; Lan, Jin-Peng; Geng, Yifeng; Xie, Xuansong",oral,2307.14786,https://arxiv.org/abs/2307.14786,,https://huggingface.co/papers/2307.14786,,,,9,1
|
708 |
LogicSeg: Parsing Visual Semantics with Neural Logic Learning and Reasoning,"Li, Liulei; Wang, Wenguan*; Yang, Yi",oral,,,,,,,,,
|
709 |
ASIC: Aligning Sparse in-the-wild Image Collections,"Gupta, Kamal*; Jampani, Varun; Shrivastava, Abhinav; Makadia, Ameesh; Snavely, Noah; Esteves, Carlos; Kar, Abhishek",oral,2303.16201,https://arxiv.org/abs/2303.16201,,https://huggingface.co/papers/2303.16201,,,,7,0
|
710 |
CLIPascene: Scene Sketching with Different Types and Levels of Abstraction,"Vinker, Yael*; Alaluf, Yuval; Cohen-Or, Danny; Shamir, Ariel",oral,2211.17256,https://arxiv.org/abs/2211.17256,,https://huggingface.co/papers/2211.17256,,,,4,0
|
|
|
1547 |
TripLe: Revisiting Pretrained Model Reuse and Progressive Learning for Efficient Vision Transformer Scaling and Searching,"Fu, Cheng*; Huang, Hanxian; Jiang, Zixuan; Ni, Yun; Nai, Lifeng; Wu, Gang; Cheng, Liqun; Zhou, Yanqi; Li, Sheng; Li, Andrew; Zhao, Jishen",poster,,,,,,,,,
|
1548 |
DiffRate : Differentiable Compression Rate for Efficient Vision Transformers,"Chen, Mengzhao*; Shao, Wenqi; Xu, Peng; Lin, Mingbao; Zhang, Kaipeng; Chao, Fei; Ji, Rongrong; Qiao, Yu; Luo, Ping",poster,2305.17997,https://arxiv.org/abs/2305.17997,https://github.com/OpenGVLab/DiffRate,https://huggingface.co/papers/2305.17997,,,,9,0
|
1549 |
Bridging Cross-task Protocol Inconsistency for Distillation in Dense Object Detection,"Yang, Longrong; Zhou, Xianpan; Li, Xuewei; Qiao, Liang; Li, Zheyang; Yang, Ziwei; Wang, Gaoang; Li, Xi*",poster,2308.14286,https://arxiv.org/abs/2308.14286,https://github.com/TinyTigerPan/BCKD,https://huggingface.co/papers/2308.14286,,,,8,0
|
1550 |
+
From Knowledge Distillation to Self-Knowledge Distillation: A Unified Approach with Normalized Loss and Customized Soft Labels,"Yang, Zhendong*; Zeng, Ailing; Li, Zhe; Zhang, Tianke; Yuan, Chun; Li, Yu",poster,2303.13005,https://arxiv.org/abs/2303.13005,https://github.com/yzd-v/cls_KD,https://huggingface.co/papers/2303.13005,,,,6,1
|
1551 |
Efficient 3D Semantic Segmentation with Superpoint Transformer,"ROBERT, Damien*; Raguet, Hugo; Landrieu, Loic",poster,2306.08045,https://arxiv.org/abs/2306.08045,,https://huggingface.co/papers/2306.08045,,,,3,1
|
1552 |
Dataset Quantization,"Zhou, Daquan; Wang, Kai*; Gu, Jianyang; Peng, Xiangyu; Lian, Dongze; Zhang, Yifan; You, Yang; Feng, Jiashi",poster,2308.10524,https://arxiv.org/abs/2308.10524,,https://huggingface.co/papers/2308.10524,,,,8,0
|
1553 |
Revisiting the Parameter Efficiency of Adapters from the Perspective of Precision Redundancy,"Jie, Shibo*; Wang, Haoqing; Deng, Zhi-Hong",poster,2307.16867,https://arxiv.org/abs/2307.16867,https://github.com/JieShibo/PETL-ViT,https://huggingface.co/papers/2307.16867,,,,3,0
|
|
|
1817 |
Aria Digital Twin: A New Benchmark Dataset for Egocentric 3D Machine Perception,"Pan, Xiaqing*; Charron, Nicholas; Yang, Yongqian; Peters, Scott C; Whelan, Thomas; Kong, Chen; Parkhi, Omkar M; Newcombe, Richard; Ren, Yuheng",poster,2306.06362,https://arxiv.org/abs/2306.06362,,https://huggingface.co/papers/2306.06362,,,,9,0
|
1818 |
Exploring Video Quality Assessment on User Generated Contents from Aesthetic and Technical Perspectives,"Wu, Haoning*; Zhang, Erli; Liao, Liang; Chen, Chaofeng; Hou, Jingwen; Wang, Annan; Sun, Wenxiu; Yan, Qiong; Lin, Weisi",poster,2211.04894,https://arxiv.org/abs/2211.04894,https://github.com/VQAssessment/DOVER,https://huggingface.co/papers/2211.04894,,,,9,0
|
1819 |
Going Beyond Nouns With Vision & Language Models Using Synthetic Data,"Cascante-Bonilla, Paola*; Shehada, Khaled; Smith, James S; Doveh, Sivan; Kim, Donghyun; Panda, Rameswar; Varol, Gul; Oliva, Aude; Ordonez, Vicente; Feris, Rogerio; Karlinsky, Leonid",poster,2303.17590,https://arxiv.org/abs/2303.17590,,https://huggingface.co/papers/2303.17590,,,,11,0
|
1820 |
+
H3WB: Human3.6M 3D WholeBody Dataset and Benchmark,"Zhu, Yue*; Samet, Nermin; Picard, David",poster,2211.15692,https://arxiv.org/abs/2211.15692,https://github.com/wholebody3d/wholebody3d,https://huggingface.co/papers/2211.15692,,,,3,1
|
1821 |
ZOD: A large-scale and diverse multimodal dataset for autonomous driving,"Alibeigi, Mina*; Ljungbergh, William; Tonderski, Adam; Hess, Georg; Lilja, Adam; Lindström, Carl; Motorniuk, Daria; Fu, Junsheng; Widahl, Jenny; Petersson, Christoffer",poster,,,,,,,,,
|
1822 |
CAD-Estate: Large-scale CAD Model Annotation in RGB Videos,"Maninis, Kevis-Kokitsi*; Popov, Stefan; Niessner, Matthias; Ferrari, Vittorio",poster,,,,,,,,,
|
1823 |
Neglected Free Lunch - Learning Image Classifiers Using Annotation Byproducts,"Han, Dongyoon; Choe, Junsuk; Chun, Seonghyeok; Chung, John JY; Chang, Minsuk; Yun, Sangdoo; Song, Jean Y; Oh, Seong Joon*",poster,,,,,,,,,
|
|
|
1950 |
Visual Traffic Knowledge Graph Generation from Scene Images,"Guo, Yunfei*; yin, Fei; Li, Xiao-Hui; YAN, XUDONG; XUE, TAO; mei, shuqi; Liu, Cheng-Lin",poster,,,,,,,,,
|
1951 |
Agglomerative Transformer for Human-Object Interaction Detection,"Tu, Danyang*; Sun, Wei; Zhai, Guangtao; Shen, Wei",poster,2308.08370,https://arxiv.org/abs/2308.08370,,https://huggingface.co/papers/2308.08370,,,,4,0
|
1952 |
3D Neural Embedding Likelihood for Robust Probabilistic Inverse Graphics,"Zhou, Guangyao*; Gothoskar, Nishad; Wang, Lirui; Tenenbaum, Joshua; Gutfreund, Dan; Lázaro-Gredilla, Miguel; George, Dileep; Mansinghka, Vikash",poster,2302.03744,https://arxiv.org/abs/2302.03744,,https://huggingface.co/papers/2302.03744,,,,8,0
|
1953 |
+
HiLo: Exploiting High Low Frequency Relations for Unbiased Panoptic Scene Graph Generation,"Zhou, Zijian*; Shi, Miaojing; Caesar, Holger",poster,2303.15994,https://arxiv.org/abs/2303.15994,https://github.com/franciszzj/HiLo,https://huggingface.co/papers/2303.15994,,,,3,1
|
1954 |
SRLIP: Fast Scaling of Relational Language-Image Pre-training,"Yuan, Hangjie*; Zhang, Shiwei; Wang, Xiang; Albanie, Samuel; Pan, Yining; Feng, Tao; Jiang, Jianwen; Ni, Dong; Zhang, Yingya; Zhao, Deli",poster,,,,,,,,,
|
1955 |
UniSeg: A Unified Multi-Modal LiDAR Segmentation Network and the OpenPCSeg Codebase,"Liu, Youquan*; Chen, Runnan; Li, Xin; Kong, Lingdong; Yang, Yuchen; Xia, Zhaoyang; Bai, Yeqi; Zhu, Xinge; Ma, Yuexin; Li, Yikang; HOU, Yuenan; Qiao, Yu",poster,,,,,,,,,
|
1956 |
See More and Know More: Zero-shot Point Cloud Segmentation via Multi-modal Visual Data,"Lu, Yuhang*; Jiang, Qi; Chen, Runnan; HOU, Yuenan; Zhu, Xinge; Ma, Yuexin",poster,2307.10782,https://arxiv.org/abs/2307.10782,,https://huggingface.co/papers/2307.10782,,,,6,0
|