Version: 0.2.3

Dropbox

Dropbox Connector

The Dropbox connector indexes files from your Dropbox account, including documents, PDFs, and other file types.

Prerequisites

Before using the Dropbox connector, you need to create a Dropbox App:

Go to the Dropbox App Console
Click Create app
Select Scoped access
Choose Full Dropbox or App folder depending on your needs
Give your app a name
Click Create app

Configure Permissions

In your app settings, under the Permissions tab, enable:

files.metadata.read - Read file and folder metadata
files.content.read - Download file content
account_info.read - Read user account info (usually enabled by default)

Configure OAuth

In your app settings, under the Settings tab:

Add the redirect URI: http://localhost:18080/callback
Note your App key (Client ID)
Note your App secret (Client Secret)

Capabilities

Capability	Supported	Notes
Full sync	Yes	Indexes all files from configured path
Incremental sync	Yes	Uses Dropbox cursor for efficient updates
Watch mode	No	Webhook integration not available in CLI
Hierarchy	Yes	Folder structure preserved via parent URIs
Binary content	No	Downloads text/PDF content only
Validation	Yes	Verifies credentials before sync

What Gets Indexed

The connector indexes:

File names and paths
File content (for supported types up to 5MB)
File metadata (size, modification time, revision)
Content hash for change detection

Supported Content Types

Content is downloaded and indexed for:

Text files (.txt, .md, .html, .css, .csv, etc.)
Code files (.js, .ts, .py, .go, .java, etc.)
PDF documents
JSON, XML, YAML files

Configuration

These options control what gets indexed during sync.

Option	Description	Default
`folder_path`	Root folder path to sync	`""` (root)
`recursive`	Include subfolders	`true`
`mime_types`	Filter by MIME types	All types

Folder Path

Sync a specific folder instead of the entire Dropbox:

--config "folder_path=/Documents"

The path should start with / and match the Dropbox folder path.

Recursive Sync

By default, all subfolders are included. To sync only the specified folder:

--config "recursive=false"

MIME Type Filter

Limit syncing to specific file types:

--config "mime_types=text/plain,application/pdf"

Document Structure

URI Pattern

Files are identified by URIs:

dropbox://files/{file_id}

Example: dropbox://files/id:ABC123DEF456

Folder Hierarchy

Files reference their parent folder via ParentURI:

dropbox://folders/Documents/Reports

Metadata

Each file includes:

Field	Description
`file_id`	Dropbox file ID
`title`	File name
`path`	Full path (e.g., `/Documents/report.pdf`)
`size`	File size in bytes
`modified_time`	Server modified timestamp
`rev`	Revision ID
`content_hash`	Dropbox content hash

Sync Behaviour

Full Sync

Full sync retrieves all files from the configured path:

Calls files/list_folder with configured path
Paginates with files/list_folder/continue while has_more=true
Downloads content for supported file types
Stores cursor for incremental sync

Incremental Sync

Incremental sync uses the Dropbox cursor:

Calls files/list_folder/continue with stored cursor
Processes file additions, modifications, and deletions
Updates cursor for next sync

Cursor Expiration

If the cursor expires:

The connector detects the reset error
Returns an error indicating full sync is required
Run a full sync to re-establish the cursor

Rate Limiting

Dropbox has API rate limits. The connector uses:

Setting	Value
Requests per second	5
Burst size	10

When throttled (HTTP 429), the connector waits and retries with backoff.

Error Handling

Error	Handling
Rate limit (429)	Wait and retry with backoff
Cursor expired	Trigger full resync
File not found	Skip and continue
Authentication failure	Report error, stop sync

Limitations

Limitation	Description
File size	Files over 5MB are indexed without content
Watch mode	Not supported in CLI
Shared folders	Accessible if user has permission
Team folders	Requires appropriate permissions

Maximum Content Size (5MB)

Files larger than 5MB have their metadata indexed but their content is not downloaded. This prevents excessive memory usage and API quota consumption during sync.

Impact:

File metadata (name, path, size, modification time) is always indexed
Full-text search will not find content within files over 5MB
The file will still appear in search results by name/path

Workaround - Building from Source:

If you need to change this limit, modify the MaxContentSize constant in internal/connectors/dropbox/file.go:

// MaxContentSize is the maximum file size to download.
// Default: 5MB (5 * 1024 * 1024)
const MaxContentSize = 10 * 1024 * 1024  // Change to 10MB

Then rebuild:

CGO_ENABLED=1 go build -o sercha ./cmd/sercha

caution

Increasing this limit may:

Significantly increase memory usage during sync
Slow down sync operations
Consume more Dropbox API quota

Future Enhancement: Configuration via environment variable is planned for a future release.

Example Usage

Add Dropbox authentication:

sercha auth add --provider dropbox --name "My Dropbox"

Create a Dropbox source with default settings (entire Dropbox):

sercha source add \
  --type dropbox \
  --name "My Files" \
  --auth "My Dropbox"

Create a source for a specific folder:

sercha source add \
  --type dropbox \
  --name "Documents" \
  --auth "My Dropbox" \
  --config "folder_path=/Documents"

Create a source for PDFs only:

sercha source add \
  --type dropbox \
  --name "PDF Files" \
  --auth "My Dropbox" \
  --config "mime_types=application/pdf"

Sync the source:

sercha sync <source-id>

List indexed documents:

sercha document list <source-id>

Supported Connectors - Browse all connectors
Filesystem - Index local files
Google Drive - Index Google Drive files

Dropbox Connector

Prerequisites​

Configure Permissions​

Configure OAuth​

Capabilities​

What Gets Indexed​

Supported Content Types​

Configuration​

Folder Path​

Recursive Sync​

MIME Type Filter​

Document Structure​

URI Pattern​

Folder Hierarchy​

Metadata​

Sync Behaviour​

Full Sync​

Incremental Sync​

Cursor Expiration​

Rate Limiting​

Error Handling​

Limitations​

Maximum Content Size (5MB)​

Example Usage​

Next​