multiview transformers for video recognition github

ICVGIP 2021. S/R: Synthetic (S) or Real (R) or Both (B), [1] Big Hand 2.2M Benchmark: Hand Pose Data Set and State of the Art Analysis, CVPR 2017, [2] Depth-based hand pose estimation: methods, data, and challenges, ICCV 2015, [3] Capturing Hand-Object Interaction and Reconstruction of Manipulated Objects, IJCV 2016. [PDF], ShaRF: Shape-conditioned Radiance Fields from a Single View. Black, Supreeth Narasimhaswamy*, Zhengwei Wei*, Yang Wang, Justin Zhang, Minh Hoai, Shangchen Han, Beibei Liu, Robert Wang, Yuting Ye, Christopher D. Twigg, Kenrick Kin, Rza Alp Gler, Natalia Neverova, Iasonas Kokkinos, Ayan Sinha, Asim Unmesh, Qixing Huang, Karthik Ramani, Ashish Shrivastava, Tomas Pfister, Oncel Tuzel, Josh Susskind, Wenda Wang, Russ Webb, Julien Valentin, Angela Dai, Matthias Niessner, Pushmeet Kohli, Philip H.S. [PDF], Panoptic Neural Fields: A Semantic Object-Aware Neural Scene Representation. [PDF], [arXiv:2011.04307] EfficientPose: An efficient, accurate and scalable end-to-end 6D multi object pose estimation approach. https://github.com/taksau/GPS-Net, 26. Michael Oechsle, Songyou Peng, Andreas Geiger. WACV 2022. (arXiv 2022.07) Hybrid CNN-Transformer Model For Facial Affect Recognition In the ABAW4 Challenge. [PDF] [Github], TAVA: Template-free Animatable Volumetric Actors. [PDF], [2017 Neurocomputing] Multi-task, Multi-domain Learning: application to semantic segmentation and pose regression. (arXiv 2022.06) FIT: Parameter Efficient Few-shot Transfer Learning for Personalized and Federated Image Classification. [PDF], GHUM & GHUML: Generative 3D Human Shape and Articulated Pose Models. CVPR 2021 (Best Paper). [PDF], [2022 CVPR] Understanding 3D Object Articulation in Internet Videos. ECCV 2022 issueECCV 2020 - GitHub - amusi/ECCV2022-Papers-with-Code: ECCV 2022 issueECCV 2020 img. (arXiv 2022.02) Arbitrary Shape Text Detection using Transformers. Christopher Xie, Keunhong Park, Ricardo Martin-Brualla, Matthew Brown. 
(arXiv 2022.08) Towards Efficient Use of Multi-Scale Features in Transformer-Based Object Detectors. ECCV 2022. [PDF], Learning Dynamic View Synthesis With Few RGBD Cameras. (arXiv 2022.07) Semantic-Aligned Matching for Enhanced DETR Convergence and Multi-Scale Feature Fusion. CVPR 2021. [PDF] [Project] [Github], X-Fields: Implicit Neural View-, Light- and Time-Image Interpolation. [PDF], Learning To Stylize Novel Views. Data, Augmentation, and Regularization in Vision Transformers. [PDF], AUTO3D: Novel View Synthesis Through Unsupervisely Learned Variational Viewpoint and Global 3D Representation. Nilesh Kulkarni, Abhinav Gupta, David F. Fouhey, Shubham Tulsiani. [PDF], HeadNeRF: A Real-time NeRF-based Parametric Head Model. [PDF], [2013 CVPR] Tracking Human Pose by Tracking Symmetric Parts. (arXiv 2022.04) Self-Driving Car Steering Angle Prediction: Let Transformer Be a Car Again. Seeing All the Angles: Learning Multiview Manipulation Policies for Contact-Rich Tasks from Demonstrations; Adaptive T-Momentum-Based Optimization for Unknown Ratio of Outliers in Amateur Data in Imitation Learning; Robust Behavior Cloning with Adversarial Demonstration Detection; State-Only Imitation Learning for Dexterous Manipulation David Palmer, Dmitriy Smirnov, Stephanie Wang, Albert Chern, Justin Solomon. (arXiv 2022.06) Visual Transformer for Task-aware Active Learning. [PDF], Semantic-NeRF: In-Place Scene Labelling and Understanding with Implicit Scene Representation. (arXiv 2021.07) Surgical Instruction Generation with Transformers. (arXiv 2021.12) iSegFormer: Interactive Image Segmentation with Transformers. Mohammed Suhail, Carlos Esteves, Leonid Sigal, Ameesh Makadia. [PDF][Project][Code], Identity-aware Hand Mesh Estimation and Personalization from RGB Images . 
A brief background of AI, and specifically machine learning (ML) algorithms, is provided including convolutional neural networks (CNNs), generative adversarial networks (GANs), recurrent neural networks (RNNs) (arXiv 2022.05) Improving Transferability for Domain Adaptive Detection Transformers. (arXiv 2022.01) Scene-Adaptive Attention Network for Crowd Counting. (arXiv 2021.03) TransBTS: Multimodal Brain Tumor Segmentation Using Transformer. [PDF], Graph-PCNN: Two Stage Human Pose Estimation with Graph Pose Refinement. Shuaifeng Zhi, Tristan Laidlow, Stefan Leutenegger, Andrew J. Davison. [PDF], Neural Rerendering in the Wild. [PDF], [2022 CVPR] SurfEmb: Dense and Continuous Correspondence Distributions for Object Pose Estimation with Learnt Surface Embeddings. (arXiv 2021.11) NomMer: Nominate Synergistic Context in Vision Transformer for Visual Recognition. (arXiv 2022.10) End-to-end Transformer for Compressed Video Quality Enhancement, (arXiv 2021.03) Face Transformer for Recognition, [. CVPR 2022 (Oral). https://menyifang.github.io/projects/ADGAN/ADGAN.html, 28. (arXiv 2022.04) Consistency Learning via Decoding Path Augmentation for Transformers in Human Object Interaction Detection. [PDF], [arXiv:2105.04112] ROBI: A Multi-View Dataset for Reflective Objects in Robotic Bin-Picking. ACM MM 2021. [PDF], [arXiv:2109.11747] A Multi-View Video-Based 3D Hand Pose Estimation. 
); YOU ZHOU (Tokyo Research Center, Huawei); Elif Bozkurt (Huawei Turkey R&D Center, Istanbul, Turkey); Bo Zheng (Huawei), Active Pointly-Supervised Instance Segmentation, Chufeng Tang (Tsinghua University)*; Lingxi Xie (Huawei Inc.); Gang Zhang (Tsinghua University); xiaopeng zhang (Huawei Cloud EI ); Qi Tian (Huawei Cloud & AI); Xiaolin Hu (Tsinghua University), DecoupleNet: Decoupled Network for Domain Adaptive Semantic Segmentation, Xin Lai (The Chinese University of Hong Kong)*; Zhuotao Tian (The Chinese University of Hong Kong); Xiaogang XU (The Chinese University of Hong Kong); Yingcong Chen (Hong Kong University of Science and Technology); Shu Liu (SmartMore); Hengshuang Zhao (University of Oxford); Liwei Wang (CUHK); Jiaya Jia (Chinese University of Hong Kong), ByteTrack: Multi-Object Tracking by Associating Every Detection Box, Yifu Zhang (Huazhong University of Science and Technology); Peize Sun (The University of Hong Kong); Yi Jiang (Bytedance); Dongdong Yu (ByteDance Inc.); Fucheng Weng (Huazhong University of Science and Technology); Zehuan Yuan (Bytedance.Inc); Ping Luo (The University of Hong Kong); Wenyu Liu (Huazhong University of Science and Technology); Xinggang Wang (Huazhong University of Science and Technology)*, Robust Multi-Object Tracking by Marginal Inference, Yifu Zhang (Huazhong University of Science and Technology); Chunyu Wang (Microsoft Research asia); Xinggang Wang (Huazhong University of Science and Technology)*; Wenjun Zeng (EIT Institute for Advanced Study); Wenyu Liu (Huazhong University of Science and Technology), Doubly-Fused ViT: Fuse Information from Vision Transformer Doubly with Local Representation, Li Gao (Wuhan University)*; Dong Nie (UNC); Bo Li (Alibaba Group); Xiaofeng Ren (alibaba group), CATRE: Iterative Point Clouds Alignment for Category-level Object Pose Refinement, Xingyu Liu (Tsinghua University); Gu Wang (JD.COM); Yi Li (University of Washington); Xiangyang Ji (Tsinghua University)*, Spatiotemporal 
Self-attention Modeling with Temporal Patch Shift for Action Recognition, Wangmeng Xiang (The Hong Kong Polytechnic University)*; Chao Li (Alibaba); Biao Wang (Alibaba); Xihan Wei (Alibaba); Xian-Sheng Hua (Damo Academy, Alibaba Group); Lei Zhang (Hong Kong Polytechnic University, Hong Kong, China), Efficient Long-Range Attention Network for Image Super-resolution, Xindong Zhang (The Hong Kong Polytechnic University)*; Hui Zeng (OPPO); Shi Guo (The Hong Kong Polytechnic University); Lei Zhang (Hong Kong Polytechnic University, Hong Kong, China), DID-M3D: Decoupling Instance Depth for Monocular 3D Object Detection, Liang Peng (ZJU)*; Xiaopei Wu (Zhejiang University); Zheng Yang (FABU); Haifeng Liu (ZJU); Deng Cai (ZJU), FlowFormer: A Transformer Architecture for Optical Flow, Zhaoyang Huang (Chinese University of Hong Kong)*; Xiaoyu Shi (CUHK); Chao Zhang (Samsung Telecommunication Research Institute); Qiang Wang (Samsung Research China, Beijing); Ka Chun Cheung (Nvidia); Hongwei Qin (Sensetime); Jifeng Dai (SenseTime); Hongsheng Li (The Chinese University of Hong Kong), Coarse-to-Fine Sparse Transformer for Hyperspectral Image Reconstruction, Yuanhao Cai (Tsinghua University, Tsinghua Shenzhen International Graduate School); Jing Lin (Tsinghua University, Tsinghua Shenzhen International Graduate School)*; Xiaowan Hu (Tsinghua University, Tsinghua Shenzhen International Graduate School); Haoqian Wang (Tsinghua Shenzhen International Graduate School, Tsinghua University); Xin Yuan (Westlake University); Yulun Zhang (ETH Zurich); Radu Timofte (University of Wurzburg & ETH Zurich); Luc Van Gool (ETH Zurich), An Embedded Feature Whitening Approach to Deep Neural Network Optimization, Hongwei Yong (The Hong Kong Polytechnic University)*; Lei Zhang (Hong Kong Polytechnic University, Hong Kong, China), Optimization over Disentangled Encoding: Unsupervised Cross-Domain Point Cloud Completion via Occlusion Factor Manipulation, Jingyu Gong (Shanghai Jiao Tong University)*; Fengqi
Liu (Shanghai Jiao Tong University); Jiachen Xu (Shanghai Jiao Tong University); Min Wang (Sensetime Group); Xin Tan (Shanghai Jiao Tong University); Zhizhong Zhang (East China Normal University); Ran Yi (Shanghai Jiao Tong University); Haichuan Song (East China Normal University); Yuan Xie (East China Normal University); Lizhuang Ma (Shanghai Jiao Tong University), Source-Free Domain Adaptation with Contrastive Domain Alignment and Self-supervised Exploration for Face Anti-Spoofing, Yuchen Liu (Shanghai Jiao Tong university)*; Yabo Chen (Shanghai Jiao Tong University ); Wenrui Dai (Shanghai Jiao Tong University); Mengran Gou (Qualcomm); Chun-Ting Huang (Qualcomm); Hongkai Xiong (Shanghai Jiao Tong University), MPPNet: Multi-Frame Feature Intertwining with Proxy Points for 3D Temporal Object Detection, Xuesong Chen (The Chinese University of Hong Kong)*; Shaoshuai Shi (MPI Informatics); Benjin Zhu (MEGVII); Ka Chun Cheung (Nvidia); Hang Xu (Huawei Noahs Ark Lab); Hongsheng Li (The Chinese University of Hong Kong), SdAE: Self-distillated Masked Autoencoder, Yabo Chen (Shanghai Jiao Tong University ); Yuchen Liu (Shanghai Jiao Tong university); Dongsheng Jiang (Huawei Cloud & AI); xiaopeng zhang (Huawei Cloud EI )*; Wenrui Dai (Shanghai Jiao Tong University); Hongkai Xiong (Shanghai Jiao Tong University); Qi Tian (Huawei Cloud & AI), A Transformer-based Decoder for Semantic Segmentation with Multi-level Context Mining, Bowen Shi (Shanghai Jiao Tong University)*; Dongsheng Jiang (Huawei Cloud & AI); xiaopeng zhang (Huawei Cloud EI ); Han Li (Shanghai Jiao Tong University); Wenrui Dai (Shanghai Jiao Tong University); Junni Zou (Shanghai Jiao Tong University); Hongkai Xiong (Shanghai Jiao Tong University); Qi Tian (Huawei Cloud & AI), Graph-constrained Contrastive Regularization for Semi-weakly Volumetric Segmentation, Simon Rei (Karlsruhe Institute of Technology)*; Constantin Marc Seibold (Karlsruhe Institute of Technology); Alexander Freytag (Carl Zeiss AG, Jena, 
Germany); Rodner Erik (University of Applied Sciences Berlin); Rainer Stiefelhagen (Karlsruhe Institute of Technology), Improving Vision Transformers by Revisiting High-frequency Components, Jiawang Bai (Tsinghua University)*; Li Yuan (Peking University); Shu-Tao Xia (Tsinghua University); Shuicheng Yan (Sea AI Labs); Zhifeng Li (Tencent AI Lab); Wei Liu (Tencent), Adaptive Co-Teaching for Unsupervised Monocular Depth Estimation, Weisong Ren (Dalian University of Technology); Lijun Wang (Dalian University of Technology)*; Yongri Piao (Dalian University of Technology); Miao Zhang (Dalian University of Technology); Huchuan Lu (Dalian University of Technology); Ting Liu (Alibaba), FurryGAN: High quality foreground-aware image synthesis, Jeongmin Bae (Yonsei University); Mingi Kwon (Yonsei University); Youngjung Uh (Yonsei University)*, An Efficient Spatio-Temporal Pyramid Transformer for Action Detection, Yuetian Weng (Monash University); Zizheng Pan (Monash University); Mingfei Han (Monash University; DATA61, CSIRO); Xiaojun Chang (University of Technology Sydney); Bohan Zhuang (Monash University)*, LocVTP: Video-Text Pre-training for Temporal Localization, Meng Cao (Peking University); Tianyu Yang (Tencent AI Lab); Junwu Weng (Tencent AI Lab); Can Zhang (Peking University); Jue Wang (Tencent AI Lab); Yuexian Zou (Peking University)*, Fusing Local Similarities for Retrieval-based 3D Orientation Estimation of Unseen Objects, Chen Zhao (EPFL)*; Yinlin Hu (EPFL); Mathieu Salzmann (EPFL), Online Segmentation of LiDAR Sequences: Dataset and Algorithm, Romain Loiseau (cole des ponts ParisTech)*; Mathieu Aubry (cole des ponts ParisTech); loic landrieu (IGN), MVSTER: Epipolar Transformer for Efficient Multi-View Stereo, Xiaofeng Wang (Institute of Automation, Chinese Academy of Sciences; School of Artificial Intelligence, University of Chinese Academy of Sciences)*; Zheng Zhu (Tsinghua University); Guan Huang (Institute of Automation, Chinese Academy of Sciences); Fangbo Qin 
(Institute of Automation, Chinese Academy of Sciences); Yun Ye (XForwardAI Technology Co., Ltd, Beijing, China); Yijia He (Beijing Kuaishou Technology Co., Ltd); Xu Chi (Phigent Robotics); Xingang Wang (Institute of Automation, CAS), Unsupervised Learning of 3D Semantic Keypoints with Mutual Reconstruction, Haocheng Yuan (Northwestern Polytechnical University); Chen Zhao (EPFL); Shichao Fan (Northwestern Polytechnical University); Jiaxi Jiang (Northwestern Polytechnical University); Jiaqi Yang (Northwestern Polytechnical University)*, Generalizable Medical Image Segmentation via Random Amplitude Mixup and Domain-Specific Image Restoration, Ziqi Zhou (Nanjing University)*; Lei Qi (Southeast University); Yinghuan Shi (Nanjing University), Demystifying Unsupervised Semantic Correspondence Estimation, Mehmet Aygn (The University of Edinburgh)*; Oisin Mac Aodha (University of Edinburgh), Learning Shadow Correspondence for Video Shadow Detection, Xinpeng Ding (The Hong Kong University of Science and Technology); Jingwen Yang (The Hong Kong University of Science and Technology); Xiaowei Hu (Shanghai AI Laboratory); Xiaomeng Li (The Hong Kong University of Science and Technology)*. Haoyu Chen, Hao Tang, Henglin Shi, Wei Peng, Nicu Sebe, Guoying Zhao. [PDF] [Project], DeepCurrents: Learning Implicit Representations of Shapes With Boundaries. ECCV 2020. [PDF] [Project], Neural Radiance Transfer Fields for Relightable Novel-view Synthesis with Global Illumination. ICML 2021. (arXiv 2022.03)Towards Exemplar-Free Continual Learning in Vision Transformers: an Account of Attention, Functional and Weight Regularization. [PDF][Github] [[Project](https://sherwinbahmani.github.io/3dvidgen/], FDNeRF: Few-shot Dynamic Neural Radiance Fields for Face Reconstruction and Expression Editing. Bangbang Yang, Chong Bao, Junyi Zeng, Hujun Bao, Yinda Zhang, Zhaopeng Cui, Guofeng Zhang. (arXiv 2022.07) Hunting Group Clues with Transformers for Social Group Activity Recognition. 
[PDF], [arXiv:2206.07117] TriHorn-Net: A Model for Accurate Depth-Based 3D Hand Pose Estimation. [PDF][Code], [2020 WACV] Nonparametric Structure Regularization Machine for 2D Hand Pose Estimation. (arXiv 2022.01) Patches Are All You Need?. [PDF], Object-Occluded Human Shape and Pose Estimation From a Single Color Image. Xingang Pan, Xudong Xu, Chen Change Loy, Christian Theobalt, Bo Dai. (arXiv 2021.12) A Bilingual, OpenWorld Video Text Dataset and End-to-end Video Text Spotter with Transformer. (arXiv 2021.12) End-to-End Learning of Multi-category 3D Pose and Shape Estimation. Soubhik Sanyal, Alex Vorobiov, Timo Bolkart, Matthew Loper, Betty Mohler, Larry Davis, Javier Romero, Michael J. (arXiv 2022.06) Cross-domain Detection Transformer based on Spatial-aware and Semantic-aware Token Alignment. (arXiv 2022.03) An End-to-End Transformer Model for Crowd Localization. [PDF][Project], JGR-P2O: Joint Graph Reasoning based Pixel-to-Offset Prediction Network for 3D Hand Pose Estimation from a Single Depth Image. (arXiv 2022.01) Splicing ViT Features for Semantic Appearance Transfer. https://github.com/Megvii-Nanjing/BBN, 15. (arXiv 2021.09) Pose Transformers (POTR): Human Motion Prediction with Non-Autoregressive Transformers. (arXiv 2021.07) Combiner: Full Attention Transformer with Sparse Computation Cost. arxiv 2021. Wohlhart, Paul and Lepetit, Vincent Category-Specific Object Reconstruction From a Single Image. [Paper], (arXiv 2022.03) Transformers Meet Visual Learning Understanding: A Comprehensive Review. [PDF], [2021 WACV] A Pose Proposal and Refinement Network for Better 6D Object Pose Estimation. Daniel Rebain, Wei Jiang, Soroosh Yazdani, Ke Li, Kwang Moo Yi, Andrea Tagliasacchi. (arXiv 2021.09) Transformer-Unet: Raw Image Processing with Unet. [PDF] [Github], Towards Realistic Visual Dubbing with Heterogeneous Sources. (arXiv 2021.06) KVT: k-NN Attention for Boosting Vision Transformers. NeurIPS 2019. 
(arXiv 2022.08) Jointformer: Single-Frame Lifting Transformer with Error Prediction and Refinement for 3D Human Pose Estimation. [PDF], [arXiv:1911.12501] An End-to-end Framework for Unconstrained Monocular 3D Hand Pose Estimation. [PDF], View-Invariant Probabilistic Embedding for Human Pose. A large scale comparison of deep instance segmentation, Johannes Theodoridis (Hochschule der Medien Stuttgart)*; Jessica Hofmann (Hochschule der Medien); Johannes Maucher (Media University Stuttgart); Andreas G Schilling (University of Tbingen), MVDG: A Unified Multi-view Framework for Domain Generalization, Jian Zhang (Nanjing University)*; Lei Qi (Southeast University); Yinghuan Shi (Nanjing University); Yang Gao (Nanjing University), MINER: Multiscale Implicit Neural Representation, Vishwanath Saragadam (Rice University)*; Jasper T Tan (Rice University); Guha Balakrishnan (Rice University); Richard Baraniuk (Rice University); Ashok Veeraraghavan (Rice University), PTQ4ViT: Post-Training Quantization for Vision Transformers with Twin Uniform Quantization, Zhihang Yuan (Peking University)*; Chenhao Xue (Peking University); Yiqi Chen (Peking University); Qiang Wu (HOUMO.AI); Guangyu Sun (Peking University), Context-Consistent Semantic Image Editing with Style-Preserved Modulation, Wuyang Luo (School of Computer Science, Fudan University); Su Yang (School of Computer Science, Fudan University)*; Hong Wang (School of Computer Science, Fudan University); Bo Long (School of Computer Science, Fudan University ); Weishan Zhang (Department of Software Engineering, China University of Petroleum), Distilling the Undistillable: Learning from a Nasty Teacher, Surgan Jandial (MDSR Labs, Adobe)*; Yash Khasbage (Indian Institute of Technology, Hyderabad); Arghya Pal (Harvard University); Vineeth N Balasubramanian (Indian Institute of Technology, Hyderabad); Balaji Krishnamurthy (), Grounding Visual Representations with Texts for Domain Generalization, Seonwoo Min (LG AI Research)*; Nokyung 
Park (Korea University); Siwon Kim (Seoul National University); Seunghyun Park (Clova AI Research, NAVER Corp.); Jinkyu Kim (Korea University), Towards Accurate Open-Set Recognition via Background-Class Regularization, Wonwoo Cho (Korea Advanced Institute of Science and Technology)*; Jaegul Choo (Korea Advanced Institute of Science and Technology), In Defense of Image Pre-Training for Spatiotemporal Recognition, Xianhang Li (University of California, Santa Cruz)*; Huiyu Wang (JHU); Chen Wei (Johns Hopkins University); Jieru Mei (Johns Hopkins University); Alan Yuille (Johns Hopkins University); Yuyin Zhou (UC Santa Cruz); Cihang Xie (University of California, Santa Cruz), SocialVAE: Human Trajectory Prediction using Timewise Latents, Pei Xu (Clemson University)*; Jean-Bernard Hayet (CIMAT); Ioannis Karamouzas (Clemson University), BodySLAM: Joint Camera Localisation, Mapping, and Human Motion Tracking, Dorian F Henning (Imperial College London)*; Tristan Laidlow (Imperial College London); Stefan Leutenegger (TU Munich), Eliminating Gradient Conflict in Reference-based Line-Art Colorization, zekun li (University of Electronic Science and Technology of China)*; Zhengyang Geng (Peking University); Zhao Kang (University of Electronic Science and Technology of China); Wenyu Chen (University of Electronic Science and Technology of China); Yibo Yang (Peking University), Matteo Boschini (University of Modena and Reggio Emilia)*; Lorenzo Bonicelli (Universit of Modena and Reggio Emilia); Angelo Porrello (University of Modena and Reggio Emilia); Giovanni Bellitto (University of Catania); Matteo Pennisi (University of Catania); Simone Palazzo (University of Catania); Concetto Spampinato (University of Catania); SIMONE CALDERARA (University of Modena and Reggio Emilia, Italy), DSR A dual subspace re-projection network for surface anomaly detection, Vitjan Zavrtanik (University of Ljubljana)*; Matej Kristan (University of Ljubljana); Danijel Skocaj (University of Ljubljana), 
Multi-Exit Semantic Segmentation Networks, Alexandros Kouris (Imperial College London and Samsung AI)*; Stylianos Venieris (Samsung AI); Stefanos Laskaridis (Samsung AI); Nicholas Lane (University of Cambridge and Samsung AI), Almost-Orthogonal Layers for Efficient General-Purpose Lipschitz Networks, Bernd Prach (IST Austria)*; Christoph H Lampert (IST Austria), Bridging the visual semantic gap in VLN via semantically richer instructions, Joaquín Ignacio Ossandón (Universidad Católica de Chile)*; Benjamín Earle (Universidad Católica de Chile); Alvaro Soto (Universidad Católica de Chile), Kernel Relative-prototype Spectral Filtering for Few-shot Learning, Tao Zhang (Chengdu Techman Software Co., Ltd.)*; Wu Huang (Sichuan University), StoryDALL-E: Adapting Pretrained Text-to-image Transformers for Story Continuation, Adyasha Maharana (UNC Chapel Hill)*; Darryl Hannan (University of North Carolina at Chapel Hill); Mohit Bansal (University of North Carolina at Chapel Hill), Unsupervised Learning of Efficient Geometry-Aware Neural Articulated Representations, Atsuhiro Noguchi (The University of Tokyo)*; Xiao Sun (Microsoft Research Asia); Stephen Lin (Microsoft Research); Tatsuya Harada (The University of Tokyo / RIKEN), PANDORA: Polarization-Aided Neural Decomposition Of Radiance, Akshat Dave (Rice University)*; Yongyi Zhao (Rice University); Ashok Veeraraghavan (Rice University), OCR-free Document Understanding Transformer, Geewook Kim (NAVER Corporation)*; Teakgyu Hong (Upstage AI); Moonbin Yim (Clova AI Research, NAVER Corp.); Jeongyeon Nam (Naver); Jinyoung Park (TmaxAI); Jinyeong Yim (Google); Wonseok Hwang (LBox); Sangdoo Yun (NAVER AI LAB); Dongyoon Han (NAVER AI Lab); Seunghyun Park (Clova AI Research, NAVER Corp.), VQGAN-CLIP: Open Domain Image Generation and Manipulation Using Natural Language, Katherine B Crowson (EleutherAI); Stella R Biderman (Booz Allen Hamilton)*; daniel kornis (Eleuther.ai); Dashiell Stander (Eleuther AI); Eric Hallahan (EleutherAI); Louis
J Castricato (Georgia Tech); Edward Raff (Booz Allen Hamilton), Learning to use unlabeled data in data augmentation for 3D detection, Zhaoqi Leng (Waymo)*; Shuyang Cheng (Waymo LLC); Ben Caine (Google); Weiyue Wang (Waymo); Xiao Zhang (Cruise); Jonathon Shlens (Google); Mingxing Tan (Waymo); Dragomir Anguelov (Waymo), Differentiable Zooming for Multiple Instance Learning on Whole-Slide Images, Kevin Thandiackal (ETH Zurich / IBM Research)*; Boqi Chen (ETH Zurich ); Pushpak Pati (IBM Research Zurich); Guillaume Jaume (Harvard); Drew Williamson (Pathology, Brigham and Womens Hospital, Harvard Medical School); Maria Gabrani (IBM Research); Orcun Goksel (ETH Zurich), Towards Learning Neural Representations from Shadows, Kushagra Tiwary (MIT)*; Tzofi M Klinghoffer (Massachusetts Institute of Technology); Ramesh Raskar (Massachusetts Institute of Technology), Augmenting Deep Classifiers with Polynomial Neural Networks, Grigorios Chrysos (EPFL)*; Markos Georgopoulos (Imperial College London); Jiankang Deng (Imperial College London); Jean Kossaifi (NVIDIA); Yannis Panagakis (University of Athens); Animashree Anandkumar (Caltech), AdaBest: Minimizing Client Drift in Federated Learning via Adaptive Bias Estimation, Farshid Varno (Dalhousie/Imagia)*; Marzie Saghayi (Dalhousie University); Laya Rafiee Sevyeri (Concordia); Sharut Gupta (MILA, Imagia, Indian Institute of Technology Delhi (IIT Delhi)); Stan Matwin (Dalhouise University); Mohammad Havaei (Imagia), A Simple Approach and Benchmark for 21,000-Category Object Detection, Yutong Lin (Xian Jiaotong University); Chen Li (Xian Jiaotong University); Yue Cao (Microsoft Research); Zheng Zhang (MSRA); Jianfeng Wang (Microsoft); Lijuan Wang (Microsoft); Zicheng Liu (Microsoft); Han Hu (Microsoft Research Asia)*, Bitwidth-Adaptive Quantization-Aware Neural Network Training: A Meta-Learning Approach, Jiseok Youn (Seoul National University)*; Jaehun Song (Seoul National University); Hyung-Sin Kim (Seoul National University); 
Saewoong Bahk (Seoul National University), Learning with Noisy Labels by Efficient Transition Matrix Estimation to Combat Label Miscorrection, Seong Min Kye (KAIST); Kwanghee Choi (Sogang University); Joonyoung Yi (Hyperconnect); Buru Chang (Hyperconnect)*, Online Task-free Continual Learning with Dynamic Sparse Distributed Memory, Julien Pourcel (ENSEA)*; Ngoc-Son Vu (ETIS/Université Paris Seine, Université Cergy-Pontoise, ENSEA, CNRS/ 95000-Cergy); Robert M FRENCH (CNRS).
