Documentation
Enterprise Search Connectors
In this section, you will learn how to manage and configure the enterprise search and retrieval augmented generation connectors, provided by the RheinInsights Retrieval Suite.
The following content sources are available for crawling. The following connectors come with full support for secure search, i.e., early binding security trimming:
Atlassian Confluence Connector
Atlassian Confluence Cloud Connector
Atlassian Jira Cloud Connector
GitHub Enterprise Cloud Connector
GitHub Enterprise Server Connector
JDBC and SQL Server Connector - see the security concept described at the connector page
Microsoft SharePoint Online Connector
SAP Success Factors (documentation follows soon)
The following connector does not come with security trimming as of now
Git Connector - due to the general Git protocol
Managing Connectors
A key feature of our Retrieval Suite is that you can configure any combination of content source and search engine. Also each connector can be configured to use distinct content processing and principal processing pipelines.
Please first add the connectors, you like to use at Managing Connectors .
Afterwards, configure
the content source connection, cf. Sources .
the search engine connection, cf. Search Engines .
content transformation, cf. Content Transformation .
principal transformation, cf. Security Transformation
If you successfully validated all configurations, you can start or schedule crawls for your new connector.
Starting and Managing Crawls
For crawl modes and crawling, please refer to Crawl Modes .
Multi-Node Environments and Load Balancing
As written at Deployment and Base Configuration , the RheinInsights Retrieval Suite is not just multi-threaded but can run in multi-node environments.
All connector will be configured centrally for all nodes. The configuration is valid for all instances of the RheinInsights Retrieval Suite which are connected to the database instance at Database Settings . When starting a crawl, the crawl is picked up by the first node of the RheinInsights Retrieval Suite, which has sufficient capacity. This is where the crawl will be performed.
A node can execute up to four crawls in parallel. Where a crawl can be a principal synchronization, an incremental, a checksum or a full crawl.