Standard Schema
We believe that consistent naming of metadata (the same things named the same, different things named differently) is key to easily integrating your content sources into enterprise search or retrieval-augmented generation. Our search schemas therefore use the following field names.
All connectors fill a subset of these fields, and the search interface uses them to render search results and filters.
Fieldname | Type | Semantics |
---|---|---|
id | String | Unique id of this item |
title | String | Title of this item |
url | String | Clickable URL |
body | byte[] | Document body of this item (e.g. HTML for a web page or bytes for an Excel) |
parentItemTitle | String | Title of the parent item, for instance a space |
parentItemUrl | String | URL of the parent item, for instance a space |
rootItemUrl | String | URL of the content source |
rootItemTitle | String | Title of the content source |
attachmentParentTitle | String | Title of the parent item of an attachment, for instance of a page |
attachmentParentUrl | String | URL of the parent item of an attachment, for instance of a page |
itemType | String | Item type in human-friendly form, for instance PDF document, web page, wiki page, issue or card |
fileExtension | String | File extension of this document, if applicable |
iconSvg | String | Contains an SVG which can be used for rendering a standardized icon for this item |
lastModifiedDate | Date | Date, when the item was last modified |
createdDate | Date | Date, when the item was created |
allow_acl | List<String> | Allow access control list which comprises user Ids and group Ids |
deny_acl | List<String> | Deny access control list which comprises user Ids and group Ids (which are excluded from access to this item) |
authors | List<String> | Author names (display names) of this item |
authorsMail | List<String> | Mail addresses of the authors |
creators | List<String> | Creator names (display names) of this item |
keywords | List<String> | List of keywords or tags |
Additional Schema Fields | | |
iconUrl | String | URL of a custom icon to use for this item |
iconBytes | String | Icon bytes, as used for e.g. Jira project images |
assignee | String | Assignee of an issue at indexing time |
status | List<String> | Status of an issue |
previewImageUrl | String | Contains a link to a static preview image URL, if the content source supports it. For instance our SharePoint and SharePoint Online connectors fill this field. |
interactivePreviewUrl | String | Contains a link to an interactive preview, if the content source supports it. For instance our SharePoint and SharePoint Online connectors fill this field. |
comment | List<String> | A list of comments, for instance on wiki pages or issues |
commitId | List<String> | Unique id of a commit, as given by our Git connectors |
profilePictureUrl | String | Profile information. Link to a profile picture. |
mobilePhone | String | Profile information |
businessPhone | String | Profile information |
jobTitle | String | Profile information |
department | String | Profile information |
officeLocation | String | Profile information |
mailAddress | String | Profile information |
userPrincipalName | String | Profile information |
city | String | Profile information |
zipCode | String | Profile information |
street | String | Profile information |
country | String | Profile information |
position | String | Profile information |
language | String | Content language of the item, if one is given |
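
To make the field names in the table more concrete, here is a minimal Java sketch of a single item as a connector might emit it. The class `SchemaItemSketch`, its helper methods, and all sample values are hypothetical and are not part of any connector API. The rule that an entry in deny_acl overrides an entry in allow_acl is an assumption made for illustration, since the table only says that denied principals are excluded.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

public class SchemaItemSketch {

    // Builds a hypothetical wiki page item; keys are field names from the table,
    // values are made-up sample data.
    static Map<String, Object> sampleWikiPage() {
        Map<String, Object> item = new LinkedHashMap<>();
        item.put("id", "wiki-42");
        item.put("title", "Release checklist");
        item.put("url", "https://wiki.example.com/pages/42");
        item.put("parentItemTitle", "Engineering");
        item.put("parentItemUrl", "https://wiki.example.com/spaces/ENG");
        item.put("itemType", "wiki page");
        item.put("allow_acl", List.of("group-engineering", "user-alice"));
        item.put("deny_acl", List.of("user-bob"));
        item.put("authors", List.of("Alice Example"));
        item.put("keywords", List.of("release", "checklist"));
        return item;
    }

    // Reads a List<String> field; returns an empty list if the field is absent.
    static List<String> strings(Map<String, Object> item, String field) {
        List<String> result = new ArrayList<>();
        Object value = item.get(field);
        if (value instanceof List<?>) {
            for (Object entry : (List<?>) value) {
                result.add(String.valueOf(entry));
            }
        }
        return result;
    }

    // Assumption: any matching deny_acl entry wins over allow_acl entries.
    static boolean mayRead(Map<String, Object> item, List<String> principals) {
        List<String> allowed = strings(item, "allow_acl");
        List<String> denied = strings(item, "deny_acl");
        boolean isAllowed = false;
        for (String principal : principals) {
            if (denied.contains(principal)) {
                return false;
            }
            if (allowed.contains(principal)) {
                isAllowed = true;
            }
        }
        return isAllowed;
    }

    public static void main(String[] args) {
        Map<String, Object> page = sampleWikiPage();
        System.out.println(mayRead(page, List.of("user-alice")));
        System.out.println(mayRead(page, List.of("user-bob", "group-engineering")));
    }
}
```

Under these assumptions, the first call prints true because user-alice is listed in allow_acl, and the second prints false because user-bob is listed in deny_acl even though the group is allowed.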
Dates
Dates are provided in the format yyyy-MM-dd'T'HH:mm:ss.SSSZ.
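
As an illustration of this date format, the following Java sketch formats and parses a timestamp with the pattern. It assumes the pattern is read with the semantics of `java.time.format.DateTimeFormatter`, where `Z` is a numeric offset without a colon. The class name `SchemaDates` and the sample timestamp are hypothetical.

```java
import java.time.OffsetDateTime;
import java.time.ZoneOffset;
import java.time.format.DateTimeFormatter;

public class SchemaDates {

    // Pattern taken from the documentation; interpreting it with
    // DateTimeFormatter is an assumption of this sketch.
    static final DateTimeFormatter FORMAT =
            DateTimeFormatter.ofPattern("yyyy-MM-dd'T'HH:mm:ss.SSSZ");

    // Renders a timestamp as a schema date string.
    static String format(OffsetDateTime value) {
        return FORMAT.format(value);
    }

    // Parses a schema date string back into a timestamp.
    static OffsetDateTime parse(String text) {
        return OffsetDateTime.parse(text, FORMAT);
    }

    public static void main(String[] args) {
        // Made-up sample value for a lastModifiedDate field.
        OffsetDateTime modified =
                OffsetDateTime.of(2024, 3, 15, 9, 30, 0, 0, ZoneOffset.UTC);
        System.out.println(format(modified)); // prints 2024-03-15T09:30:00.000+0000
    }
}
```

Note that with this pattern a UTC offset is written as +0000 rather than as a literal Z.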