Mastering Apache Solr 7.x an expert guide to advancing, optimizing, and scaling your enterprise search
Accelerate your enterprise search engine and bring relevancy in your search analytics About This Book A practical guide in building expertise with Indexing, Faceting, Clustering and Pagination Master the management and administration of Enterprise Search Applications and services seamlessly Handle m...
Otros Autores: | , , |
---|---|
Formato: | Libro electrónico |
Idioma: | Inglés |
Publicado: |
Birmingham, England ; Mumbai, [India] :
Packt Publishing
2018.
|
Edición: | 1st edition |
Materias: | |
Ver en Biblioteca Universitat Ramon Llull: | https://discovery.url.edu/permalink/34CSUC_URL/1im36ta/alma991009631108006719 |
Tabla de Contenidos:
- Cover
- Title Page
- Copyright and Credits
- Packt Upsell
- Contributors
- Table of Contents
- Preface
- Chapter 1: Introduction to Solr 7
- Introduction to Solr
- History of Solr
- Lucene - the backbone of Solr
- Why choose Solr?
- Benefits of keyword search
- Benefits of ranked results
- Solr use cases
- Social media
- Science and research
- Search engine
- E-commerce
- Media and entertainment
- Government
- Education
- What's new in Solr 7?
- Replication for SolrCloud
- TLOG replicas
- PULL replicas
- Schemaless improvements
- Autoscaling
- Default numeric types
- Spatial fields
- SolrJ
- JMX and MBeans
- Other changes
- Summary
- Chapter 2: Getting Started
- Solr installation
- Understanding various files and the folder structure
- bin
- Solr script
- Post script
- contrib
- DataImportHandler
- ContentExtractionLibrary
- LanguageIdentifier
- Clustering
- VelocityIntegration
- dist and docs
- example
- core.properties
- zoo.cfg
- solr.xml
- server
- Running Solr
- Running basic Solr commands
- Production Solr setup
- Loading sample data
- Loading data from MySQL
- Understanding the browse interface
- Using the Solr admin interface
- Dashboard
- Logging
- Cloud screens
- Tree view
- Graph view
- Collections or core admin
- Java properties
- Thread dump
- Collection-specific tools
- Overview
- Analysis
- DataImport
- Documents
- Files
- Query
- Stream
- Schema
- Core-specific tools
- Summary
- Chapter 3: Designing Schemas
- How Solr works
- Getting started with Solr's basics
- The schema file of Solr
- Understanding field types
- Definitions and properties of field types
- Field type properties
- Field types available in Solr
- Understanding date fields
- Understanding currencies and exchange rates
- Understanding enum fields
- Field management.
- Field properties
- Copying fields
- Dynamic fields
- Mastering Schema API
- Schema API in detail
- Schema operations
- Listing fields, field types, DynamicFields, and CopyField rules
- Deciphering schemaless mode
- Creating a schemaless example
- Schemaless mode configuration
- Managed schema
- Field guessing
- Summary
- Chapter 4: Mastering Text Analysis Methodologies
- Understanding text analysis
- What is text analysis?
- How text analysis works
- Understanding analyzer
- What is an analyzer?
- How an analyzer works
- Understanding tokenizers
- What is a tokenizer?
- Available tokenizers in Solr
- Standard tokenizer
- White space tokenizer
- Classic tokenizer
- Keyword tokenizer
- Lower case tokenizer
- Letter tokenizer
- N-gram tokenizer
- Edge n-gram tokenizer
- Understanding filters
- What is a filter?
- Available filters in Solr
- Stop filter
- Classic filter
- Synonym filter
- Synonym graph filter
- ASCII folding filter
- Keep word filter
- KStem filter
- KeywordMarkerFilterFactory
- Word delimiter graph filter
- Understanding CharFilter
- Understanding PatternReplaceCharFilterFactor
- Understanding multilingual analysis
- Language identification
- Configuring Solr for multiple language search
- Creating separate fields per language
- Creating separate indexes per language
- Understanding phonetic matching
- Understanding Beider-Morse phonetic matching
- Summary
- Chapter 5: Data Indexing and Operations
- Basics of Solr indexing
- Installing Postman
- Exploring the post tool
- Understanding index handlers
- Working with an index handler with the XML format
- Index handler with JSON
- Apache Tika and indexing
- Solr Cell basics
- Indexing a binary using Tika
- Language detection
- Language detection configuration
- Client APIs
- Summary
- Chapter 6: Advanced Queries - Part I.
- Search relevance
- Velocity search UI
- Query parsing and syntax
- Common query parameters
- Standard query parser
- Advantage
- Disadvantage
- Searching terms for standard query parser
- Term modifiers
- Wildcard searches
- Fuzzy searches
- Proximity searching
- Range searches
- Boolean operators
- Escaping special characters
- Grouping terms
- Dates and times in query strings
- Adding comments to the query string
- The DisMax Query Parser
- Advantages
- DisMax query parser parameters
- eDisMax Query Parser
- Response writer
- JSON
- Standard XML
- CSV
- Velocity
- Faceting
- Common parameters
- Field-value faceting parameters
- Range faceting
- Pivot faceting
- Interval faceting
- Highlighting
- Highlighting parameters
- Highlighter
- Unified highlighter (hl.method=unified)
- Original highlighter (hl.method=original)
- FastVector highlighter (hl.method=fastVector)
- Boundary scanners
- The breakIterator boundary scanner
- The simple boundary scanner
- Summary
- Chapter 7: Advanced Queries - Part II
- Spellchecking
- Spellcheck parameters
- Implementation approaches
- IndexBasedSpellChecker
- DirectSolrSpellChecker
- FileBasedSpellChecker
- WordBreakSolrSpellChecker
- Distributed spellcheck
- Suggester
- Suggester parameters
- Running suggestions
- Pagination
- How to implement pagination
- Cursor pagination
- Result grouping
- Result grouping parameters
- Running result grouping
- Result clustering
- Result clustering parameters
- Result clustering implementation
- Install the clustering contrib
- Declare the cluster search component
- Declare the request handler and include the cluster search component
- Spatial search
- Spatial search implementation
- Field types
- Query parser
- Spatial search query parser parameters
- Function queries
- Summary.
- Chapter 8: Managing and Fine-Tuning Solr
- JVM configuration
- Managing the memory heap
- Managing solrconfig.xml
- User-defined properties
- Implicit Solr core properties
- Managing backups
- Backup in SolrCloud
- Standalone mode backups
- Backup API
- Backup status
- API to restore
- Restore status API
- Snapshot API
- JMX with Solr
- JMX configuration
- Logging configuration
- Log settings using the admin web interface
- Log level at startup
- Setting the environment variable
- Passing parameters in the startup script
- Configuring Log4J for logging
- SolrCloud overview
- SolrCloud in interactive mode
- SolrCloud - core concepts
- Routing documents
- Splitting shards
- Setting up ignore commits from client applications
- Enabling SSL - Solr security
- Prerequisites
- Generating a key and self-signed certificate
- Starting Solr with SSL system properties
- Performance statistics
- Statistics for request handlers
- Summary
- Chapter 9: Client APIs - An Overview
- Client API overview
- JavaScript Client API
- SolrJ Client API
- Ruby Client API
- Python Client API
- Summary
- Index.