000 06488nam a22006255i 4500
001 978-3-031-20059-5
003 DE-He213
005 20240730170351.0
007 cr nn 008mamaa
008 221028s2022 sz | s |||| 0|eng d
020 _a9783031200595
_9978-3-031-20059-5
024 7 _a10.1007/978-3-031-20059-5
_2doi
050 4 _aTA1634
072 7 _aUYQV
_2bicssc
072 7 _aCOM016000
_2bisacsh
072 7 _aUYQV
_2thema
082 0 4 _a006.37
_223
245 1 0 _aComputer Vision - ECCV 2022
_h[electronic resource] :
_b17th European Conference, Tel Aviv, Israel, October 23-27, 2022, Proceedings, Part XXXVI /
_cedited by Shai Avidan, Gabriel Brostow, Moustapha Cissé, Giovanni Maria Farinella, Tal Hassner.
250 _a1st ed. 2022.
264 1 _aCham :
_bSpringer Nature Switzerland :
_bImprint: Springer,
_c2022.
300 _aLVI, 755 p. 212 illus., 206 illus. in color.
_bonline resource.
336 _atext
_btxt
_2rdacontent
337 _acomputer
_bc
_2rdamedia
338 _aonline resource
_bcr
_2rdacarrier
347 _atext file
_bPDF
_2rda
490 1 _aLecture Notes in Computer Science,
_x1611-3349 ;
_v13696
505 0 _aMaking the Most of Text Semantics to Improve Biomedical Vision-Language Processing -- Generative Negative Text Replay for Continual Vision-Language Pretraining -- Video Graph Transformer for Video Question Answering -- Trace Controlled Text to Image Generation -- Video Question Answering with Iterative Video-Text Co-Tokenization -- Rethinking Data Augmentation for Robust Visual Question Answering -- Explicit Image Caption Editing -- Can Shuffling Video Benefit Temporal Bias Problem: A Novel Training Framework for Temporal Grounding -- Reliable Visual Question Answering: Abstain Rather Than Answer Incorrectly -- GRIT: Faster and Better Image Captioning Transformer Using Dual Visual Features -- Selective Query-Guided Debiasing for Video Corpus Moment Retrieval -- Spatial and Visual Perspective-Taking via View Rotation and Relation Reasoning for Embodied Reference Understanding -- Object-Centric Unsupervised Image Captioning -- Contrastive Vision-Language Pre-training with Limited Resources -- Learning Linguistic Association towards Efficient Text-Video Retrieval -- ASSISTER: Assistive Navigation via Conditional Instruction Generation -- X-DETR: A Versatile Architecture for Instance-Wise Vision-Language Tasks -- Learning Disentanglement with Decoupled Labels for Vision-Language Navigation -- Switch-BERT: Learning to Model Multimodal Interactions by Switching Attention and Input -- Word-Level Fine-Grained Story Visualization -- Unifying Event Detection and Captioning as Sequence Generation via Pre-training -- Multimodal Transformer with Variable-Length Memory for Vision-and-Language Navigation -- Fine-Grained Visual Entailment -- Bottom Up Top Down Detection Transformers for Language Grounding in Images and Point Clouds -- New Datasets and Models for Contextual Reasoning in Visual Dialog -- VisageSynTalk: Unseen Speaker Video-to-Speech Synthesis via Speech-Visage Feature Selection -- Classification-Regression for Chart Comprehension -- AssistQ: Affordance-Centric Question-Driven Task Completion for Egocentric Assistant -- FindIt: Generalized Localization with Natural Language Queries -- UniTAB: Unifying Text and Box Outputs for Grounded Vision-Language Modeling -- Scaling Open-Vocabulary Image Segmentation with Image-Level Labels -- The Abduction of Sherlock Holmes: A Dataset for Visual Abductive Reasoning -- Speaker-Adaptive Lip Reading with User-Dependent Padding -- TISE: Bag of Metrics for Text-to-Image Synthesis Evaluation -- SemAug: Semantically Meaningful Image Augmentations for Object Detection through Language Grounding -- Referring Object Manipulation of Natural Images with Conditional Classifier-Free Guidance -- NewsStories: Illustrating Articles with Visual Summaries -- Webly Supervised Concept Expansion for General Purpose Vision Models -- FedVLN: Privacy-Preserving Federated Vision-and-Language Navigation -- CODER: Coupled Diversity-Sensitive Momentum Contrastive Learning for Image-Text Retrieval -- Language-Driven Artistic Style Transfer -- Single-Stream Multi-level Alignment for Vision-Language Pretraining.
520 _aThe 39-volume set, comprising LNCS volumes 13661 through 13699, constitutes the refereed proceedings of the 17th European Conference on Computer Vision, ECCV 2022, held in Tel Aviv, Israel, during October 23-27, 2022. The 1645 papers presented in these proceedings were carefully reviewed and selected from a total of 5804 submissions. The papers deal with topics such as computer vision; machine learning; deep neural networks; reinforcement learning; object recognition; image classification; image processing; object detection; semantic segmentation; human pose estimation; 3D reconstruction; stereo vision; computational photography; neural networks; image coding; image reconstruction; motion estimation.
650 0 _aComputer vision.
_993025
650 0 _aComputer engineering.
_910164
650 0 _aComputer networks.
_931572
650 0 _aPattern recognition systems.
_93953
650 0 _aNatural language processing (Computer science).
_94741
650 1 4 _aComputer Vision.
_993026
650 2 4 _aComputer Engineering and Networks.
_993027
650 2 4 _aAutomated Pattern Recognition.
_931568
650 2 4 _aNatural Language Processing (NLP).
_931587
700 1 _aAvidan, Shai.
_eeditor.
_4edt
_4http://id.loc.gov/vocabulary/relators/edt
_993028
700 1 _aBrostow, Gabriel.
_eeditor.
_4edt
_4http://id.loc.gov/vocabulary/relators/edt
_993029
700 1 _aCissé, Moustapha.
_eeditor.
_4edt
_4http://id.loc.gov/vocabulary/relators/edt
_993030
700 1 _aFarinella, Giovanni Maria.
_eeditor.
_0(orcid)
_10000-0002-6034-0432
_4edt
_4http://id.loc.gov/vocabulary/relators/edt
_993031
700 1 _aHassner, Tal.
_eeditor.
_4edt
_4http://id.loc.gov/vocabulary/relators/edt
_993032
710 2 _aSpringerLink (Online service)
_993033
773 0 _tSpringer Nature eBook
776 0 8 _iPrinted edition:
_z9783031200588
776 0 8 _iPrinted edition:
_z9783031200601
830 0 _aLecture Notes in Computer Science,
_x1611-3349 ;
_v13696
_923263
856 4 0 _uhttps://doi.org/10.1007/978-3-031-20059-5
912 _aZDB-2-SCS
912 _aZDB-2-SXCS
912 _aZDB-2-LNC
942 _cELN
999 _c86872
_d86872