Box Data Extract
The Box Data Extract Connector enables seamless data extraction from Box, allowing users to access file content from their Box cloud storage.
This connector helps automate workflows by using Box AI to extract data directly from documents stored in Box.
For example, a company managing lease contracts can automatically capture key details and populate their system of record. Likewise, a financial institution can extract critical data from loan application documents—such as bank statements and W-2s—and integrate it into approval workflows.
By prioritizing content intelligence, this connector unlocks new opportunities for automation and AI-driven insights.
This page contains the setup guide and reference information for the Box Data Extract source connector.
Prerequisites
You will need a Box application configured to use Client Credential Grants (CCG) Follow this guide to complete authentication.
From your box app configuration take note of:
- Client ID: You Box App client ID. Find yours in the Box App configurations.
- Client Secret: You Box App client secret.
Decide on what account is going to login to Box:
- Box Subject Type: Represents the type of user to login as ("user" or "enterprise"). Enterprise will login with the application service account. User will login with the user if app can impersonate users.
- Box Subject ID: If subject type is "enterprise", use your enterprise ID If subject type is "user", use the user id to login as.
Choose the which Box folder conatins the files you want to process:
- Folder ID: Folder to retreive data from.
- Recursive: Read the folders recursively.
If you are using Box AI you'll need:
- Ask AI Prompt: If using the Ask AI, what prompt to send the AI about the document
- Extract AI Prompt: If using the Extract AI, what prompt to send the AI about the document
- Extract Structured AI Fields: If using the Extract Structured AI, what prompt to send the AI about the document. For more information on the expected format of this prompt check our developer documentation.
Setup guide
Set up Box Data Extract
For Airbyte Cloud:
- Log into your Airbyte Cloud account.
- Click Sources and then click + New source.
- On the Set up the source page, select Box Data Extract from the Source type dropdown.
- Enter a name for the Box Data Extract connector.
- Fill in the information:
- Client ID
- Client Secret
- Box Subject Type
- Box Subject ID
- Folder ID
- Recursive
 
- Click Setup source
For Airbyte Open Source:
- Navigate to the Airbyte Open Source dashboard.
- Click Sources and then click + New source.
- On the Set up the source page, select Box Data Extract from the Source type dropdown.
- Enter a name for the Box Data Extract connector.
- Fill in the information:
- Client ID
- Client Secret
- Box Subject Type
- Box Subject ID
- Folder ID
- Recursive
 
- Click Setup source
Supported sync modes
The Box Data Extract source connector supports the following sync modes:
| Feature | Supported? | 
|---|---|
| Full Refresh Sync | Yes | 
| Incremental Sync | No | 
| SSL connection | Yes | 
| Namespaces | No | 
Supported Streams
- File text extraction: Extract a text representation from your Box documents
- AI Ask: Ask AI something about your Box documents
- AI Extract: Extract structured data from your Box documents
- AI Extract Structured: Extract structured data in a strctured format from your Box documents
Reference
Config fields reference
Changelog
Expand to review
| Version | Date | Pull Request | Subject | 
|---|---|---|---|
| 0.1.13 | 2025-05-10 | 59315 | Update dependencies | 
| 0.1.12 | 2025-04-26 | 58737 | Update dependencies | 
| 0.1.11 | 2025-04-19 | 58294 | Update dependencies | 
| 0.1.10 | 2025-04-12 | 57611 | Update dependencies | 
| 0.1.9 | 2025-04-05 | 57154 | Update dependencies | 
| 0.1.8 | 2025-03-29 | 56593 | Update dependencies | 
| 0.1.7 | 2025-02-13 | Adding Box Extract Structured AI stream | |
| 0.1.6 | 2025-02-13 | Adding Box Extract AI stream | |
| 0.1.5 | 2025-02-13 | Adding Box Ask AI stream | |
| 0.1.4 | 2025-02-12 | Initial release by @BoxDevRel |