kaggle
MEHMET ISIK

WWTP LLM Defense: Can AI Protect Critical Infrastructure?

A 36-hour SCADA cyberattack simulation on a real wastewater treatment plant (WWTP), covering 5 attack scenarios, 3 defense modes, and 2 awareness levels. The attacks use Stuxnet-inspired sensor manipulation applied to real plant data.


Results (7)

Model                    Latest Result   Cost             Time
Gemini 2.5 Flash         89.80           $0.12            5m 12s
Gemini 2.0 Flash Lite    85.10           $0.08            3m 44s
Gemini 2.0 Flash         78.20           $0.10            4m 30s
Claude Opus 4.1          running...      $2.40 so far     18m ...
Gemini 3 Flash Preview   ERROR           $0.31            7m 02s
Gemini 2.5 Pro           ERROR           $4.87            22m 15s
Claude Opus 4.6          92.40           $8.62            46m 20s
License: Apache 2.0
Version: v2
Updated: 5 days ago