TY - BOOK AU - Kuc,Rafal TI - Solr Cookbook: Solve Real-Time Problems Related to Apache Solr 4. x and 5. 0 Effectively with the Help of over 100 Easy-to-follow Recipes SN - 9781783553167 AV - TK5105.884.K87 2015 PY - 2015/// CY - Birmingham PB - Packt Publishing, Limited KW - Search engines -- Programming KW - Web search engines KW - Electronic books N1 - Intro -- Solr Cookbook Third Edition -- Table of Contents -- Solr Cookbook Third Edition -- Credits -- About the Author -- Acknowledgments -- About the Reviewers -- www.PacktPub.com -- Support files, eBooks, discount offers, and more -- Why subscribe? -- Free access for Packt account holders -- Preface -- What this book covers -- What you need for this book -- Who this book is for -- Sections -- Getting ready -- How to do it… -- How it works… -- There's more… -- See also -- Conventions -- Reader feedback -- Customer support -- Downloading the example code -- Errata -- Piracy -- Questions -- 1. Apache Solr Configuration -- Introduction -- Running Solr on a standalone Jetty -- Getting ready -- How to do it... -- How it works... -- There's more... -- I want Jetty to run on a different port -- Buffer size is too small -- Installing ZooKeeper for SolrCloud -- Getting ready -- How to do it... -- How it works... -- Migrating configuration from master-slave to SolrCloud -- Getting ready -- How to do it... -- How it works... -- Choosing the proper directory configuration -- How to do it... -- How it works... -- Configuring the Solr spellchecker -- How to do it... -- How it works... -- There's more... -- More than one spellchecker -- Using Solr in a schemaless mode -- How to do it... -- How it works... -- Limiting I/O usage -- Getting ready -- How to do it... -- How it works... -- Using core discovery -- How to do it... -- How it works... -- There's more... -- Configuring SolrCloud for NRT use cases -- How to do it... -- How it works... -- Configuring SolrCloud for high-indexing use cases -- Getting ready -- How to do it... -- How it works... -- Configuring SolrCloud for high-querying use cases -- Getting ready -- How to do it... -- How it works... -- Configuring the Solr heartbeat mechanism -- How to do it... -- How it works... -- There's more; Enabling and disabling the heartbeat mechanism -- Changing similarity -- Getting ready -- How to do it... -- How it works... -- There's more... -- Changing the global similarity -- 2. Indexing Your Data -- Introduction -- Indexing PDF files -- How to do it... -- How it works... -- Counting the number of fields -- How to do it... -- How it works... -- Using parsing update processors to parse data -- Getting ready -- How to do it... -- How it works... -- See also -- Using scripting update processors to modify documents -- Getting ready -- How to do it... -- How it works... -- See also -- Indexing data from a database using Data Import Handler -- How to do it... -- How it works... -- There's more... -- How to change the default behavior of deleting index contents at the beginning of a full import -- Incremental imports with DIH -- Getting ready -- How to do it... -- How it works... -- See also -- Transforming data when using DIH -- Getting ready -- How to do it... -- How it works... -- There's more... -- Using scripts other than JavaScript -- Indexing multiple geographical points -- How to do it... -- How it works... -- See also -- Updating document fields -- How to do it... -- How it works... -- Detecting the document language during indexation -- How to do it... -- How it works... -- There's more... -- Language identification based on Apache Tika -- Optimizing the primary key indexation -- How to do it... -- How it works... -- See also -- Handling multiple currencies -- How to do it... -- How it works... -- There's more... -- Setting up your own currency provider -- 3. Analyzing Your Text Data -- Introduction -- Using the enumeration type -- How to do it... -- How it works... -- Removing HTML tags during indexing -- How to do it... -- How it works... -- There's more... -- Preserving defined tags -- See also -- Storing data outside of Solr index; How to do it... -- How it works... -- Using synonyms -- How to do it... -- How it works... -- There's more... -- Equivalent synonyms setup -- See also -- Stemming different languages -- How to do it... -- How it works... -- There's more... -- Using nonaggressive stemmers -- How to do it... -- How it works... -- There's more... -- Using the n-gram approach to do performant trailing wildcard searches -- How to do it... -- How it works... -- Using position increment to divide sentences -- How to do it... -- How it works... -- Using patterns to replace tokens -- How to do it... -- How it works... -- There's more... -- Using solr.PatternReplaceCharFilterFactory -- 4. Querying Solr -- Introduction -- Understanding and using the Lucene query language -- How to do it... -- How it works... -- See also -- Using position aware queries -- How to do it... -- How it works... -- There's more... -- Too many generated queries -- Using boosting with autocomplete -- How to do it... -- How it works... -- Phrase queries with shingles -- How to do it... -- How it works... -- See also -- Handling user queries without errors -- Getting ready -- How to do it... -- How it works... -- See also -- Handling hierarchies with nested documents -- How to do it... -- How it works... -- There's more... -- Returning children documents in the query -- Sorting data on the basis of a function value -- How to do it... -- How it works... -- Controlling the number of terms needed to match -- Getting ready -- How to do it... -- How it works... -- See also -- Affecting document score using function queries -- How to do it... -- How it works... -- See also -- Using simple nested queries -- How to do it... -- How it works... -- Using the Solr document query join functionality -- How to do it... -- How it works... -- Handling typos with n-grams -- How to do it... -- How it works; Rescoring query results -- How to do it... -- How it works... -- 5. Faceting -- Introduction -- Getting the number of documents with the same field value -- How to do it... -- How it works... -- There's more... -- How to show facets with counts greater than zero -- Lexicographical sorting of the faceting results -- Getting the number of documents with the same value range -- How to do it... -- How it works... -- Getting the number of documents matching the query and subquery -- How to do it... -- How it works... -- Removing filters from faceting results -- Getting ready -- How to do it... -- How it works... -- Using decision tree faceting -- How to do it... -- How it works... -- Calculating faceting for relevant documents in groups -- Getting ready -- How to do it... -- How it works... -- Improving faceting performance for low cardinality fields -- Getting ready -- How to do it... -- How it works... -- There's more... -- Using per segment field cache for faceting calculation -- Specifying the number of faceting threads -- 6. Improving Solr Performance -- Introduction -- Handling deep paging efficiently -- How to do it... -- How it works... -- See also -- Configuring the document cache -- Getting ready -- How to do it... -- How it works... -- Configuring the query result cache -- Getting ready -- How to do it... -- How it works... -- Configuring the filter cache -- Getting ready -- How to do it... -- How it works... -- Improving Solr query performance after the start and commit operations -- How to do it... -- How it works... -- There's more... -- Improving Solr performance after committing operations -- Lowering the memory consumption of faceting and sorting -- How to do it... -- How it works... -- Speeding up indexing with Solr segment merge tuning -- How to do it... -- How it works... -- There's more; Increasing the RAM buffer size to improve the indexing throughput -- Speeding up querying with merge policy tuning -- See also -- Avoiding caching of rare filters to improve the performance -- How to do it... -- How it works... -- Controlling the filter execution to improve expensive filter performance -- Getting ready -- How to do it... -- How it works... -- Configuring numerical fields for high-performance sorting and range queries -- How to do it... -- How it works... -- See also -- 7. In the Cloud -- Introduction -- Creating a new SolrCloud cluster -- Getting ready -- How to do it... -- How it works... -- There's more... -- Starting an embedded ZooKeeper server -- Specifying the Solr server name -- Setting up multiple collections on a single cluster -- Getting ready -- How to do it... -- How it works... -- Splitting shards -- Getting ready -- How to do it... -- How it works... -- Having more than a single shard from a collection on a node -- Getting ready -- How to do it... -- How it works... -- Creating a collection on defined nodes -- Getting ready -- How to do it... -- How it works... -- Adding replicas after collection creation -- Getting ready -- How to do it... -- How it works... -- Removing replicas -- Getting ready -- How to do it... -- How it works... -- Moving shards between nodes -- Getting ready -- How to do it... -- How it works... -- Using aliasing -- Getting ready -- How to do it... -- How it works... -- Using routing -- Getting ready -- How to do it... -- How it works... -- 8. Using Additional Functionalities -- Introduction -- Finding similar documents -- How to do it... -- How it works... -- Highlighting fragments found in documents -- How to do it... -- How it works... -- There's more... -- Changing the default HTML tags that surround the matched content -- Efficient highlighting -- How to do it... -- How it works; Using versioning UR - https://ebookcentral.proquest.com/lib/orpp/detail.action?docID=1931591 ER -