Vendored deer-flow upstream (bytedance/deer-flow) plus prompt-injection hardening: - New deerflow.security package: content_delimiter, html_cleaner, sanitizer (8 layers — invisible chars, control chars, symbols, NFC, PUA, tag chars, horizontal whitespace collapse with newline/tab preservation, length cap) - New deerflow.community.searx package: web_search, web_fetch, image_search backed by a private SearX instance, every external string sanitized and wrapped in <<<EXTERNAL_UNTRUSTED_CONTENT>>> delimiters - All native community web providers (ddg_search, tavily, exa, firecrawl, jina_ai, infoquest, image_search) replaced with hard-fail stubs that raise NativeWebToolDisabledError at import time, so a misconfigured tool.use path fails loud rather than silently falling back to unsanitized output - Native client back-doors (jina_client.py, infoquest_client.py) stubbed too - Native-tool tests quarantined under tests/_disabled_native/ (collect_ignore_glob via local conftest.py) - Sanitizer Layer 7 fix: only collapse horizontal whitespace, preserve newlines and tabs so list/table structure survives - Hardened runtime config.yaml references only the searx-backed tools - Factory overlay (backend/) kept in sync with deer-flow tree as a reference / source See HARDENING.md for the full audit trail and verification steps.
2.7 KiB
2.7 KiB
Setup Guide
Quick setup instructions for DeerFlow.
Configuration Setup
DeerFlow uses a YAML configuration file that should be placed in the project root directory.
Steps
-
Navigate to project root:
cd /path/to/deer-flow -
Copy example configuration:
cp config.example.yaml config.yaml -
Edit configuration:
# Option A: Set environment variables (recommended) export OPENAI_API_KEY="your-key-here" # Option B: Edit config.yaml directly vim config.yaml # or your preferred editor -
Verify configuration:
cd backend python -c "from deerflow.config import get_app_config; print('✓ Config loaded:', get_app_config().models[0].name)"
Important Notes
- Location:
config.yamlshould be indeer-flow/(project root), notdeer-flow/backend/ - Git:
config.yamlis automatically ignored by git (contains secrets) - Priority: If both
backend/config.yamland../config.yamlexist, backend version takes precedence
Configuration File Locations
The backend searches for config.yaml in this order:
DEER_FLOW_CONFIG_PATHenvironment variable (if set)backend/config.yaml(current directory when running from backend/)deer-flow/config.yaml(parent directory - recommended location)
Recommended: Place config.yaml in project root (deer-flow/config.yaml).
Sandbox Setup (Optional but Recommended)
If you plan to use Docker/Container-based sandbox (configured in config.yaml under sandbox.use: deerflow.community.aio_sandbox:AioSandboxProvider), it's highly recommended to pre-pull the container image:
# From project root
make setup-sandbox
Why pre-pull?
- The sandbox image (~500MB+) is pulled on first use, causing a long wait
- Pre-pulling provides clear progress indication
- Avoids confusion when first using the agent
If you skip this step, the image will be automatically pulled on first agent execution, which may take several minutes depending on your network speed.
Troubleshooting
Config file not found
# Check where the backend is looking
cd deer-flow/backend
python -c "from deerflow.config.app_config import AppConfig; print(AppConfig.resolve_config_path())"
If it can't find the config:
- Ensure you've copied
config.example.yamltoconfig.yaml - Verify you're in the correct directory
- Check the file exists:
ls -la ../config.yaml
Permission denied
chmod 600 ../config.yaml # Protect sensitive configuration
See Also
- Configuration Guide - Detailed configuration options
- Architecture Overview - System architecture