Apache Solr

The RheinInsights Retrieval Suite supports Apache Solr, either in cloud mode or as a standalone instance. It supports Solr both for querying data and for ingesting data from crawls.

Apache Solr - Indexing Configuration

Apache Solr - Query Configuration

Technical Prerequisites

We assume that the Solr instance is set up and running. Furthermore, the Solr instance must be accessible from the connector VM(s) or Docker container(s).
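A quick way to confirm the second prerequisite is to call Solr from the connector host. The following is a minimal sketch, not part of the product: it assumes Solr's system info endpoint (`/admin/info/system`) is exposed under the base URL and that no authentication is required. The function names and the example base URL are hypothetical.

```python
import urllib.parse
import urllib.request


def system_info_url(base_url: str) -> str:
    """Build the system info URL for a base URL such as http://host:8983/solr."""
    query = urllib.parse.urlencode({"wt": "json"})
    return base_url.rstrip("/") + "/admin/info/system?" + query


def is_solr_reachable(base_url: str, timeout: float = 5.0) -> bool:
    """Return True if the system info endpoint answers with HTTP 200."""
    try:
        with urllib.request.urlopen(system_info_url(base_url), timeout=timeout) as response:
            return response.status == 200
    except OSError:
        # Covers refused connections, DNS failures, timeouts and HTTP error statuses.
        return False


if __name__ == "__main__":
    # Hypothetical base URL; replace it with the address of your Solr instance.
    print(is_solr_reachable("http://localhost:8983/solr"))
```

Run this on the connector VM or inside the Docker container itself, since that is the machine that must reach Solr. If your instance is secured, the check returns False even though Solr is running, because a 401 response is treated as a failure here.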

Creating an index in Apache Solr is not straightforward. Although the connector can manage index fields (as described below), in most circumstances it cannot create a search index itself. Please ask your Solr administrator to prepare the physical document and principal indexes (using a standard index template) so that the connector can tailor them to its needs.
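For orientation, this is roughly what the administrator's preparation step can look like in cloud mode. It is a sketch under assumptions: it only builds a request URL for Solr's Collections API (`action=CREATE`), the collection names `documents` and `principals` are hypothetical, and the shard and replica counts are placeholders. On a standalone instance, cores are created differently, for example with `bin/solr create`.

```python
import urllib.parse


def create_collection_url(
    base_url: str,
    name: str,
    num_shards: int = 1,
    replication_factor: int = 1,
) -> str:
    """Build a Collections API CREATE request URL for a SolrCloud cluster."""
    params = {
        "action": "CREATE",
        "name": name,
        "numShards": num_shards,
        "replicationFactor": replication_factor,
    }
    return base_url.rstrip("/") + "/admin/collections?" + urllib.parse.urlencode(params)


if __name__ == "__main__":
    # Hypothetical names for the document index and the principal index.
    for collection in ("documents", "principals"):
        print(create_collection_url("http://localhost:8983/solr", collection))
```

The sketch deliberately does not send the request. Creating collections usually requires administrative permissions and a decision on the config set, which is why this step is left to the Solr administrator.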

Managing the Search Schema

The Search schema field and the related controls are described in Index Creation and Management.

The connector ships with a standard schema, which you can manage from the configuration dialog. The schema also contains vector field definitions, which you can use if needed. However, using this schema is not mandatory.
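To illustrate what a vector field definition means on the Solr side, the sketch below builds a Schema API payload for a dense vector field as supported by Solr 9 and later (`solr.DenseVectorField`). This is an assumption-laden example, not the connector's actual schema: the field name `content_vector`, the type name and the dimension 384 are hypothetical, and the connector normally manages these definitions for you from the configuration dialog.

```python
import json


def vector_field_commands(field_name: str, dimension: int, similarity: str = "cosine") -> dict:
    """Build Schema API commands that define a dense vector field type and one field using it."""
    type_name = f"knn_vector_{dimension}"  # hypothetical naming convention
    return {
        "add-field-type": {
            "name": type_name,
            "class": "solr.DenseVectorField",
            "vectorDimension": dimension,
            "similarityFunction": similarity,
        },
        "add-field": {
            "name": field_name,
            "type": type_name,
            "indexed": True,
            "stored": True,
        },
    }


if __name__ == "__main__":
    # Hypothetical field name and dimension; the dimension must match your embedding model.
    print(json.dumps(vector_field_commands("content_vector", 384), indent=2))
```

Such a payload would be sent as a POST request to the `/schema` endpoint of the respective index. The vector dimension is fixed per field type, so it has to match the output size of the embedding model in use.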

The principal index schema is created when you click Create Index. This schema should not be changed, and it is not displayed in the configuration dialog.