Configuration
- Site List: Comma-delimited list of tenant sites to crawl for content types. Use ‘root’ for the main site.
- Library List: Libraries to crawl. List names do not require their parent sites (for example, ‘Documents’, not ‘sites/Test/Documents’).
- Crawl Subsites: If a site has subsites, crawl them as well. For example, if the Site List is left as ‘root’ and a subsite exists ([tenant].sharepoint.com/mySite), that subsite is not crawled unless this box is checked.
- Process Folders: Process folders as well as documents.
- Get Versions: Retrieve document versions.
CASE SENSITIVITY: Site and library names are case-sensitive. If the case is wrong, the job will complete successfully, but no documents will be picked up.
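Because a case mismatch fails silently, it can help to check the configured names before running a job. The following is a minimal sketch, not the connector's own code: the function names are hypothetical, the list of actual library names is assumed to come from wherever you can look them up, and trimming whitespace around the commas is an assumption about how the Site List is read.

```python
def parse_site_list(raw: str) -> list[str]:
    """Split a comma-delimited Site List value into site names.

    Assumption: surrounding whitespace is not significant and empty
    entries are skipped. The connector may treat these differently.
    """
    return [part.strip() for part in raw.split(",") if part.strip()]


def match_libraries(configured: list[str], available: list[str]) -> list[str]:
    """Return the configured names that exactly match an available library.

    The comparison is case-sensitive, mirroring the behavior described above.
    """
    available_set = set(available)
    return [name for name in configured if name in available_set]


def case_mismatches(configured: list[str], available: list[str]) -> list[tuple[str, str]]:
    """Return (configured, actual) pairs that differ only by letter case."""
    by_folded = {name.casefold(): name for name in available}
    return [
        (name, by_folded[name.casefold()])
        for name in configured
        if name not in available and name.casefold() in by_folded
    ]


if __name__ == "__main__":
    sites = parse_site_list("root, sites/Test")
    print(sites)
    # 'documents' matches nothing, so a job configured this way would
    # finish successfully without picking up any documents.
    print(match_libraries(["documents"], ["Documents", "Invoices"]))
    print(case_mismatches(["documents"], ["Documents", "Invoices"]))
```

A non-empty result from `case_mismatches` points to a name that should be corrected in the Site List or Library List before the job is run.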
Output Specification
- Tenant Name: The name of the tenant. All O365 SharePoint instances use the structure [tenant].sharepoint.com. We use this to construct URLs and gather site IDs.
- Output Folder Path: The folder where the files will be stored. Do not include the library root folder in this path. If left blank, documents will be written to the library root (for example, the Documents library writes directly to the Shared Documents folder).
- Output Site: The tenant site (sites/mySite) or subsite (/mySite) where the documents will be written. Defaults to root.
- Output List: The name of the library within the site. Defaults to Documents.
- Check documents for a value to override destination site: Each document will have its metadata checked for a new subsite or tenant site.
- Field to check for site override value: If checking for site overrides, this field will be checked for the new site path. If present, the document will be uploaded to that site. The default is ‘site’. If this value is set and no list override is present, the files will upload to ‘Documents’ on the new site.
- Check documents for a value to override destination list: Each document will have its metadata checked for a new list name.
- Field to check for list override value: If checking for list overrides, this field will be checked for the new list name. If present, the document will be uploaded to that list. The default is ‘list’. This can be combined with a site override.
- Retry Attempts: The number of times to try completing a document upload before failing.
- Roll Back files: In some cases a document uploads successfully, but its metadata upload exhausts the allowed number of retry attempts. If this flag is checked, that document will be deleted (and will be noted in the Removed column in the job status screen).
- Set Permissions: Set ACLs for documents when available. Expected format is “(email)=writer” or “(email)=reader”.
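The way the site and list overrides combine can be sketched as follows. This is an illustration of the rules stated above, not the connector's implementation: `resolve_destination` and `parse_acl_entry` are hypothetical names, document metadata is assumed to be a simple mapping of field names to strings, and “(email)” in the ACL format is read as a placeholder for an address rather than literal parentheses.

```python
from typing import Mapping, Tuple

DEFAULT_LIST = "Documents"
VALID_ROLES = {"reader", "writer"}


def resolve_destination(
    metadata: Mapping[str, str],
    output_site: str = "root",
    output_list: str = DEFAULT_LIST,
    check_site: bool = False,
    site_field: str = "site",
    check_list: bool = False,
    list_field: str = "list",
) -> Tuple[str, str]:
    """Return the (site, list) a document would be written to."""
    site, target_list = output_site, output_list

    # An override is only considered when its checkbox is enabled.
    site_override = metadata.get(site_field) if check_site else None
    list_override = metadata.get(list_field) if check_list else None

    if site_override:
        site = site_override
        # With a site override and no list override, files go to
        # 'Documents' on the new site.
        target_list = DEFAULT_LIST
    if list_override:
        target_list = list_override
    return site, target_list


def parse_acl_entry(entry: str) -> Tuple[str, str]:
    """Split one 'email=role' entry into (email, role).

    Assumption: the role is matched exactly as 'reader' or 'writer'.
    """
    email, sep, role = entry.rpartition("=")
    email, role = email.strip(), role.strip()
    if not sep or not email or role not in VALID_ROLES:
        raise ValueError(f"expected 'email=reader' or 'email=writer', got {entry!r}")
    return email, role


if __name__ == "__main__":
    doc = {"site": "sites/HR", "list": "Contracts"}
    print(resolve_destination(doc, check_site=True, check_list=True))
    print(resolve_destination({"site": "sites/HR"}, output_list="Invoices", check_site=True))
    print(parse_acl_entry("user@example.com=writer"))
```

In this sketch a list override is applied last, so it takes effect whether or not a site override is also present, which matches the note that the two can be combined.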