Initial commit: hardened DeerFlow factory
Vendored deer-flow upstream (bytedance/deer-flow) plus prompt-injection hardening:

- New deerflow.security package: content_delimiter, html_cleaner, sanitizer (8 layers: invisible chars, control chars, symbols, NFC, PUA, tag chars, horizontal whitespace collapse with newline/tab preservation, length cap)
- New deerflow.community.searx package: web_search, web_fetch, image_search backed by a private SearX instance, every external string sanitized and wrapped in <<<EXTERNAL_UNTRUSTED_CONTENT>>> delimiters
- All native community web providers (ddg_search, tavily, exa, firecrawl, jina_ai, infoquest, image_search) replaced with hard-fail stubs that raise NativeWebToolDisabledError at import time, so a misconfigured tool.use path fails loud rather than silently falling back to unsanitized output
- Native client back-doors (jina_client.py, infoquest_client.py) stubbed too
- Native-tool tests quarantined under tests/_disabled_native/ (collect_ignore_glob via local conftest.py)
- Sanitizer Layer 7 fix: only collapse horizontal whitespace, preserve newlines and tabs so list/table structure survives
- Hardened runtime config.yaml references only the searx-backed tools
- Factory overlay (backend/) kept in sync with deer-flow tree as a reference / source

See HARDENING.md for the full audit trail and verification steps.
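To make the sanitizer and delimiter description concrete, here is a minimal sketch of how such a pipeline might look. It is not the actual deerflow.security code: the function names `sanitize` and `wrap_untrusted`, the `max_len` default, the exact character ranges, and the closing delimiter are all assumptions for illustration. The "symbols" layer is omitted because the commit message does not say which symbols it targets.

```python
import re
import unicodedata

# Opening marker is quoted from the commit message; the closing marker is an
# assumption, since the message does not show it.
OPEN = "<<<EXTERNAL_UNTRUSTED_CONTENT>>>"
CLOSE = "<<<END_EXTERNAL_UNTRUSTED_CONTENT>>>"

# Illustrative character classes (assumed, not taken from the repository).
_INVISIBLE = re.compile("[\u200b\u200c\u200d\u2060\ufeff]")   # zero-width chars, BOM
_PUA = re.compile("[\ue000-\uf8ff\U000f0000-\U000ffffd\U00100000-\U0010fffd]")
_TAG_CHARS = re.compile("[\U000e0000-\U000e007f]")            # Unicode tag block
_HSPACE = re.compile(r"[^\S\n\t]+")                           # whitespace except \n and \t


def sanitize(text: str, max_len: int = 10_000) -> str:
    """Apply the layers in the order listed in the commit message (minus 'symbols')."""
    text = _INVISIBLE.sub("", text)
    # Drop control characters (category Cc) but keep newlines and tabs.
    text = "".join(
        ch for ch in text if ch in "\n\t" or unicodedata.category(ch) != "Cc"
    )
    text = unicodedata.normalize("NFC", text)
    text = _PUA.sub("", text)
    text = _TAG_CHARS.sub("", text)
    # Layer 7 as described: collapse horizontal whitespace only, so that
    # newlines and tabs (list and table structure) survive.
    text = _HSPACE.sub(" ", text)
    return text[:max_len]


def wrap_untrusted(text: str) -> str:
    """Sanitize external text and fence it so a prompt can treat it as data."""
    return f"{OPEN}\n{sanitize(text)}\n{CLOSE}"
```

A real implementation would likely also neutralize any delimiter strings that appear inside the external text itself; that step is left out here to keep the sketch short.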
@@ -0,0 +1,35 @@
=== TITANIC DATASET ANALYSIS SUMMARY ===

Dataset shape: (891, 20)
Total passengers: 891
Survivors: 342 (38.38%)

KEY FINDINGS:
1. Gender disparity: Female survival rate was much higher than male.
   - Female: 74.20%
   - Male: 18.89%

2. Class disparity: Higher classes had better survival rates.
   - Class 1: 62.96% (136/216)
   - Class 2: 47.28% (87/184)
   - Class 3: 24.24% (119/491)

3. Children had better survival rates than adults.
   - Child (0-12): 57.97% (40/69)
   - Teen (13-18): 42.86% (30/70)
   - Young Adult (19-30): 35.56% (96/270)
   - Adult (31-50): 42.32% (102/241)
   - Senior (51+): 34.38% (22/64)

4. Passengers with cabins had much higher survival rates.
   - With cabin: 66.67%
   - Without cabin: 29.99%

5. Family size affected survival.
   - Alone: 30.35%
   - With family: 50.56%

6. Embarkation port correlated with survival.
   - Port C: 55.36% (93/168)
   - Port Q: 38.96% (30/77)
   - Port S: 33.70% (217/644)
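Each rate in the summary is the number of survivors divided by the group size. The sketch below recomputes the percentages from the counts quoted in parentheses as a consistency check. The dictionary layout and the helper name `mismatches` are hypothetical; only the numbers come from the summary.

```python
# (survivors, group size, stated percentage), copied from the summary above.
STATED = {
    "Overall": (342, 891, 38.38),
    "Class 1": (136, 216, 62.96),
    "Class 2": (87, 184, 47.28),
    "Class 3": (119, 491, 24.24),
    "Child (0-12)": (40, 69, 57.97),
    "Teen (13-18)": (30, 70, 42.86),
    "Young Adult (19-30)": (96, 270, 35.56),
    "Adult (31-50)": (102, 241, 42.32),
    "Senior (51+)": (22, 64, 34.38),
    "Port C": (93, 168, 55.36),
    "Port Q": (30, 77, 38.96),
    "Port S": (217, 644, 33.70),
}


def mismatches(stated, tol=0.006):
    """Return the groups whose stated percentage is not survivors / size * 100.

    The tolerance allows for rounding to two decimal places.
    """
    return [
        name
        for name, (survived, size, pct) in stated.items()
        if abs(100 * survived / size - pct) > tol
    ]


# The three passenger classes should also add up to the overall totals.
class_rows = [STATED[f"Class {i}"] for i in (1, 2, 3)]
class_survivors = sum(row[0] for row in class_rows)
class_passengers = sum(row[1] for row in class_rows)
```

Groups reported without counts (gender, cabin, family size) cannot be checked this way from the summary alone.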
[Seven image files added; sizes: 47 KiB, 62 KiB, 69 KiB, 62 KiB, 89 KiB, 64 KiB, 58 KiB]