Enterprise Search Connectors

In this section, you will learn how to manage and configure the enterprise search and retrieval-augmented generation connectors provided by the RheinInsights Retrieval Suite.

A key feature of our Retrieval Suite is that you can configure any combination of content source and search engine. In addition, each connector can be configured to use its own content processing and principal processing pipelines.

  1. First, add the connectors you would like to use, as described in Managing Connectors.

  2. Afterwards, configure:

    1. the content source connection, see Sources.

    2. the search engine connection, see Search Engines.

    3. content transformation, see Content Transformation.

    4. principal transformation, see Security Transformation.

  3. Once you have successfully validated all configurations, you can start or schedule crawls for your new connector.
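
The order of the steps above can be sketched as a small checklist model. This is only an illustration and not part of the RheinInsights Retrieval Suite: the class `ConnectorSetup`, the step names, and the method names are hypothetical and chosen for this example. It shows that a crawl should only be started once all four configurations of a connector have been validated.

```python
from dataclasses import dataclass, field

# Hypothetical step names, mirroring the four configuration areas listed above.
STEPS = (
    "source",
    "search_engine",
    "content_transformation",
    "security_transformation",
)


@dataclass
class ConnectorSetup:
    """Illustrative checklist for one connector (not a product API)."""

    name: str
    validated: dict = field(
        default_factory=lambda: {step: False for step in STEPS}
    )

    def mark_validated(self, step: str) -> None:
        # Reject step names that are not part of the checklist.
        if step not in self.validated:
            raise KeyError(f"unknown configuration step: {step}")
        self.validated[step] = True

    def missing_steps(self) -> list:
        # Steps that still need to be configured and validated.
        return [step for step in STEPS if not self.validated[step]]

    def can_start_crawl(self) -> bool:
        # A crawl may start only when nothing is missing.
        return not self.missing_steps()


setup = ConnectorSetup("my-connector")
setup.mark_validated("source")
setup.mark_validated("search_engine")
remaining = setup.missing_steps()  # the two transformation steps
```

In this sketch, `can_start_crawl()` stays `False` until the content and security transformation steps have been marked as validated as well.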

Starting and Managing Crawls

For crawl modes and crawling, please refer to Crawl Modes.

Multi-Node Environments and Load Balancing

As described in Deployment and Base Configuration, the RheinInsights Retrieval Suite is not only multi-threaded but can also run in multi-node environments.

All connectors are configured centrally for all nodes. The configuration is valid for all instances of the RheinInsights Retrieval Suite that are connected to the database instance configured under Database Settings. When you start a crawl, it is picked up by the first node of the RheinInsights Retrieval Suite that has sufficient capacity. That node then performs the crawl.

A node can execute up to four crawls in parallel, where a crawl can be a principal synchronization, an incremental crawl, a checksum crawl, or a full crawl.
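
The pick-up behavior described above can be illustrated with a minimal simulation. It is a sketch under stated assumptions, not the Suite's actual scheduler: the names `Node`, `assign_crawl`, and `MAX_PARALLEL_CRAWLS` are hypothetical, the order of the node list stands in for whichever node picks up the crawl first, and returning `None` when every node is busy is an assumption, since the text above does not say what happens in that case.

```python
from dataclasses import dataclass, field
from typing import List, Optional, Tuple

# Per-node limit stated in the text above.
MAX_PARALLEL_CRAWLS = 4

# The four kinds of crawl that count toward the limit (names are illustrative).
CRAWL_TYPES = {"principal_sync", "incremental", "checksum", "full"}


@dataclass
class Node:
    """Illustrative model of one Retrieval Suite node (not a product API)."""

    name: str
    running: List[Tuple[str, str]] = field(default_factory=list)

    def has_capacity(self) -> bool:
        return len(self.running) < MAX_PARALLEL_CRAWLS


def assign_crawl(nodes: List[Node], connector: str, crawl_type: str) -> Optional[Node]:
    """Give the crawl to the first node with free capacity, if there is one."""
    if crawl_type not in CRAWL_TYPES:
        raise ValueError(f"unknown crawl type: {crawl_type}")
    for node in nodes:
        if node.has_capacity():
            node.running.append((connector, crawl_type))
            return node
    # Assumption: no node has capacity, so the crawl is not assigned here.
    return None


nodes = [Node("node-a"), Node("node-b")]
assigned = [
    assign_crawl(nodes, f"connector-{i}", "incremental").name
    for i in range(5)
]
# The first four crawls land on node-a, the fifth on node-b.
```

Because the configuration is shared through the database, any node can run any connector's crawl in this model; only the per-node limit of four decides where it lands.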