Unified endpoint to ingest documents from various sources.
This endpoint handles file uploads and URL scraping in a single request. The request body must be encoded as multipart/form-data.
Note: Supported file types depend on the server’s active capabilities (e.g. audio/video files require transcription to be enabled).
A list of files to upload.
A comma-separated list of URLs to scrape.
A comma-separated list of workspace IDs to add the documents to.
Custom API metadata, passed as a JSON-encoded string.
Scraping mode ('Simple Scraping' or 'Crawling').
The maximum depth for crawling links.
Whether to only crawl links on the same domain.
The maximum number of pages to crawl.
The delay, in seconds, between successive scraping requests.
The format to download YouTube videos in ('audio' or 'video').
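The request parameters above can be assembled as in the following minimal sketch. The field names (`urls`, `workspace_ids`, `metadata`, `scrape_mode`, and so on) and the `"true"`/`"false"` encoding of booleans are hypothetical placeholders, since this reference lists only the descriptions; substitute the names and encodings from the actual schema. Only the comma-separated lists, the JSON-string metadata, and the allowed values for the scraping mode and YouTube format come from the descriptions themselves.

```python
import json
from typing import Any, Optional


def build_ingest_form(
    urls: list[str],
    workspace_ids: list[str],
    metadata: Optional[dict[str, Any]] = None,
    scrape_mode: str = "Simple Scraping",
    max_depth: Optional[int] = None,
    same_domain_only: Optional[bool] = None,
    max_pages: Optional[int] = None,
    delay_seconds: Optional[float] = None,
    youtube_format: str = "audio",
) -> dict[str, str]:
    """Build the text fields of a multipart/form-data ingest request.

    Every key in the returned dict is a hypothetical field name.
    """
    if scrape_mode not in ("Simple Scraping", "Crawling"):
        raise ValueError(f"unsupported scraping mode: {scrape_mode!r}")
    if youtube_format not in ("audio", "video"):
        raise ValueError(f"unsupported YouTube format: {youtube_format!r}")
    # A comma inside a URL would be ambiguous in a comma-separated list.
    if any("," in url for url in urls):
        raise ValueError("URLs must not contain commas")

    form = {
        "urls": ",".join(urls),                    # comma-separated list
        "workspace_ids": ",".join(workspace_ids),  # comma-separated list
        "scrape_mode": scrape_mode,
        "youtube_format": youtube_format,
    }
    if metadata is not None:
        # Metadata travels as a JSON string, not as nested form fields.
        form["metadata"] = json.dumps(metadata)

    # Optional crawl settings are included only when explicitly set.
    if max_depth is not None:
        form["max_depth"] = str(max_depth)
    if same_domain_only is not None:
        # Assumed boolean encoding; the server may expect another form.
        form["same_domain_only"] = "true" if same_domain_only else "false"
    if max_pages is not None:
        form["max_pages"] = str(max_pages)
    if delay_seconds is not None:
        form["delay_seconds"] = str(delay_seconds)
    return form
```

With an HTTP client such as `requests`, the returned dict could be passed as `data=` alongside a `files=` list of uploads, which makes the client encode the body as multipart/form-data. The endpoint URL and any authentication headers are not given in this section and would need to come from the rest of the API documentation.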
Successful Response
The unique identifier for the document.
The display name of the document.
The current processing status of the document.
The timestamp when the document was created.
The timestamp when the document was last updated.
The MIME type of the document file.
The name of the processing pipeline used for this document.
If processing failed, this field will contain the error message.
Custom API metadata resolved for the current user. Useful for retrieving stored external references (e.g. {'internal_app_ref': 'REF-123'}). Note: This value is resolved based on the request's external_user_id, so a document that is shared globally but also scoped to a specific user may return different metadata depending on which user makes the request.
The original file name of the document.
The full content of the document converted to Markdown format.
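The response fields above could be consumed along the lines of the sketch below. The JSON keys (`id`, `name`, `status`, `error`, `metadata`) are hypothetical, as is the assumption that the metadata arrives as a JSON object and that the error field is absent or null on success; this reference gives only the field descriptions. The sketch treats a populated error field as the signal that processing failed, rather than guessing at status values.

```python
from dataclasses import dataclass, field
from typing import Any, Optional


@dataclass
class IngestedDocument:
    """A subset of the response fields, under hypothetical key names."""

    id: str
    name: str
    status: str
    error: Optional[str] = None
    metadata: dict[str, Any] = field(default_factory=dict)

    @property
    def failed(self) -> bool:
        # The error field is described as populated only on failure.
        return self.error is not None


def parse_document(raw: dict[str, Any]) -> IngestedDocument:
    """Convert one decoded JSON document object into a typed record."""
    return IngestedDocument(
        id=raw["id"],
        name=raw["name"],
        status=raw["status"],
        error=raw.get("error"),
        # Metadata is user-resolved, so it may be missing or empty.
        metadata=raw.get("metadata") or {},
    )
```

Because the metadata is resolved per `external_user_id`, a client that caches parsed documents would likely need to key the cache on both the document identifier and the requesting user.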