Maybury M. - Multimedia Information Extraction: Advances in Video, Audio, and Imagery Analysis for Search, Data Mining, Surveillance, and Authoring :: Electronic Library of the Board of Trustees of the MSU Faculty of Mechanics and Mathematics

The advent of increasingly large consumer collections of audio (e.g., iTunes), imagery (e.g., Flickr), and video (e.g., YouTube) is driving a need not only for multimedia retrieval but also for information extraction from and across media. Furthermore, industrial and government collections fuel requirements for stock media access, media preservation, broadcast news retrieval, identity management, and video surveillance. While significant advances have been made in language processing for information extraction from unstructured multilingual text and in the extraction of objects from imagery and video, these advances have been explored in largely independent research communities that have each addressed extracting information from a single medium (e.g., text, imagery, audio). And yet users need to search for concepts across individual media, author multimedia artifacts, and perform multimedia analysis in many domains.

This collection is intended to serve several purposes, including reporting the current state of the art, stimulating novel research, and encouraging cross-fertilization of distinct research disciplines. The collection and integration of a common base of intellectual material will provide an invaluable resource from which to teach a future generation of cross-disciplinary media scientists and engineers.

Contents:
Chapter 1 Introduction (pages 1–12): Mark T. Maybury
Chapter 2 Multimedia Information Extraction: History and State of the Art (pages 13–40): Mark T. Maybury
Chapter 3 Visual Feature Localization for Detecting Unique Objects in Images (pages 45–61): Madirakshi Das, Alexander C. Loui and Andrew C. Blose
Chapter 4 Entropy-Based Analysis of Visual and Geolocation Concepts in Images (pages 63–80): Keiji Yanai, Hidetoshi Kawakubo and Kobus Barnard
Chapter 5 The Meaning of 3D Shape and Some Techniques to Extract it (pages 81–97): Sven Havemann, Torsten Ullrich and Dieter W. Fellner
Chapter 6 A Data-Driven Meaningful Representation of Emotional Facial Expressions (pages 99–112): Nicolas Stoiber, Gaspard Breton and Renaud Seguier
Chapter 7 Visual Semantics for Reducing False Positives in Video Search (pages 119–128): Rohini K. Srihari and Adrian Novischi
Chapter 8 Automated Analysis of Ideological Bias in Video (pages 129–143): Wei-Hao Lin and Alexander G. Hauptmann
Chapter 9 Multimedia Information Extraction in a Live Multilingual News Monitoring System (pages 145–157): David D. Palmer, Marc B. Reichman and Noah White
Chapter 10 Semantic Multimedia Extraction Using Audio and Video (pages 159–174): Evelyne Tzoukermann, Geetu Ambwani, Amit Bagga, Leslie Chipman, Anthony R. Davis, Ryan Farrell, David Houghton, Oliver Jojic, Jan Neumann, Robert Rubinoff, Bageshree Shevade and Hongzhong Zhou
Chapter 11 Analysis of Multimodal Natural Language Content in Broadcast Video (pages 175–184): Prem Natarajan, Ehry Macrostie, Rohit Prasad and Jonathan Watson
Chapter 12 Web-Based Multimedia Information Extraction Based on Social Redundancy (pages 185–200): Jose San Pedro, Stefan Siersdorfer, Vaiva Kalnikaite and Steve Whittaker
Chapter 13 Information Fusion and Anomaly Detection with Uncalibrated Cameras in Video Surveillance (pages 201–216): Erhan Baki Ermis, Venkatesh Saligrama and Pierre-Marc Jodoin
Chapter 14 Automatic Detection, Indexing, and Retrieval of Multiple Attributes from Cross-Lingual Multimedia Data (pages 221–233): Qian Hu, Fred J. Goodman, Stanley M. Boykin, Randall K. Fish, Warren R. Greiff, Stephen R. Jones and Stephen R. Moore
Chapter 15 Information Graphics in Multimodal Documents (pages 235–252): Sandra Carberry, Stephanie Elzer, Richard Burns, Peng Wu, Daniel Chester and Seniz Demir
Chapter 16 Extracting Information from Human Behavior (pages 253–267): Fabio Pianesi, Bruno Lepri, Nadia Mana, Alessandro Cappelletti and Massimo Zancanaro
Chapter 17 Retrieval of Paralinguistic Information in Broadcasts (pages 273–287): Bjorn Schuller, Martin Wollmer, Florian Eyben and Gerhard Rigoll
Chapter 18 Audience Reactions for Information Extraction About Persuasive Language in Political Communication (pages 289–304): Marco Guerini, Carlo Strapparava and Oliviero Stock
Chapter 19 The Need for Affective Metadata in Content-Based Recommender Systems for Images (pages 305–319): Marko Tkalcic, Jurij Tasic and Andrej Kosir
Chapter 20 Affect-Based Indexing for Multimedia Data (pages 321–345): Gareth J. F. Jones and Ching Hau Chan
Chapter 21 Multimedia Annotation, Querying, and Analysis in Anvil (pages 351–367): Michael Kipp
Chapter 22 Toward Formalization of Display Grammar for Interactive Media Production with Multimedia Information Extraction (pages 369–384): Robin Bargar
Chapter 23 Media Authoring with Ontological Reasoning: Use Case for Multimedia Information Extraction (pages 385–400): Insook Choi
Chapter 24 Annotating Significant Relations on Multimedia Web Documents (pages 401–417): Matusala Addisu, Danilo Avola, Paola Bianchi, Paolo Bottoni, Stefano Levialdi and Emanuele Panizzi