Apache Solr for indexing data enhance your Solr indexing experience with advanced techniques and the built-in functionalities available in Apache Solr

Enhance your Solr indexing experience with advanced techniques and the built-in functionalities available in Apache Solr About This Book Learn about distributed indexing and real-time optimization to change index data on fly Index data from various sources and web crawlers using built-in analyzers a...

Descripción completa

Detalles Bibliográficos
Otros Autores: Handiekar, Sachin, author (author), Johri, Anshul, author
Formato: Libro electrónico
Idioma:Inglés
Publicado: Birmingham : Packt Publishing 2015.
Edición:1st edition
Colección:Community experience distilled.
Materias:
Ver en Biblioteca Universitat Ramon Llull:https://discovery.url.edu/permalink/34CSUC_URL/1im36ta/alma991009629915206719
Tabla de Contenidos:
  • Cover; Copyright; Credits; About the Authors; About the Reviewers; www.PacktPub.com; Table of Contents; Preface; Chapter 1: Getting Started; Overview and installation of Solr; Installing Solr in OS X (Mac); Running Solr; Installing Solr in Windows; Installing Solr on Linux; The Solr architecture and directory structure; Solr directory structure; Cores in Solr (Multicore Solr); Summary; Chapter 2: Understanding Analyzers, Tokenizers, and Filters; Introducing analyzers; Analysis phases; Tokenizers; Standard tokenizer; Keyword tokenizer; Lowercase tokenizer; N-gram tokenizer; Filters
  • Lowercase filterSynonym filter; Porter stem filter; Running your analyzer; Summary; Chapter 3: Indexing Data; Indexing data in Solr; Introducing field types; Defining fields; Defining an unique key; Copy fields and dynamic fields; Building our musicCatalogue example; Using the Solr Admin UI; Facet searching; Summary; Chapter 4: Index Data - Basic Technique and Using Index Handlers; Inserting data into Solr; Configuring UpdateRequestHandler; Indexing documents using XML; Adding and updating documents; Deleting a document; Indexing documents using JSON; Adding a single document
  • Adding multiple JSON documentsSequential JSON update commands; Indexing updates using CSV; Summary; Chapter 5: Index Data Using Structured Data Source Using DIH; Indexing data from MySQL; Configuring datasource; DIH commands; Indexing data using XPath; Summary; Chapter 6: Indexing Data Using Apache Tika; Introducing Apache Tika; Configuring Apache Tika in Solr; Indexing PDF and Word documents; Summary; Chapter 7: Apache Nutch; Introducing Apache Nutch; Installing Apache Nutch; Configuring Solr with Nutch; Summary; Chapter 8: Commits, Real-Time Index Optimizations, and Atomic Updates
  • Understanding soft commit, optimize, and hard commitUsing atomic updates in Solr; Using RealTime Get; Summary; Chapter 9: Advanced Topics - Multilanguage, Deduplication, and Others; Multilanguage indexing; Removing duplicate documents (deduplication); Content streaming; UIMA integration with Solr; Summary; Chapter 10: Distributed Indexing; Setting up SolrCloud; The collections API; Updating configuration files; Distributed indexing and searching; Summary; Chapter 11: Case Study of Using Solr in E-Commerce; Creating an AutoSuggest feature; Facet navigation; Search filtering and sorting
  • Relevancy boostingSummary; Index