Cambazoglu, B. Barla.

Scalability Challenges in Web Search Engines [electronic resource] / by B. Barla Cambazoglu, Ricardo Baeza-Yates. - 1st ed. 2016. - XV, 122 p. online resource. - Synthesis Lectures on Information Concepts, Retrieval, and Services, 1947-9468 . - Synthesis Lectures on Information Concepts, Retrieval, and Services, .

Preface -- Introduction -- The Web Crawling System -- The Indexing System -- The Query Processing System -- Concluding Remarks -- Bibliography -- Authors' Biographies .

In this book, we aim to provide a fairly comprehensive overview of the scalability and efficiency challenges in large-scale web search engines. More specifically, we cover the issues involved in the design of three separate systems that are commonly available in every web-scale search engine: web crawling, indexing, and query processing systems. We present the performance challenges encountered in these systems and review a wide range of design alternatives employed as solution to these challenges, specifically focusing on algorithmic and architectural optimizations. We discuss the available optimizations at different computational granularities, ranging from a single computer node to a collection of data centers. We provide some hints to both the practitioners and theoreticians involved in the field about the way large-scale web search engines operate and the adopted design choices. Moreover, we survey the efficiency literature, providing pointers to a large number of relatively important research papers. Finally, we discuss some open research problems in the context of search engine efficiency.

9783031022982

10.1007/978-3-031-02298-2 doi


Computer networks .
Computer Communication Networks.

TK5105.5-5105.9

004.6