M
MEHMET ISIK
WWTP LLM Defense: Can AI Protect Critical Infrastructure?
36-hour SCADA cyber attack simulation on a real WWTP. 5 attack scenarios, 3 defense modes, 2 awareness levels. Stuxnet-inspired sensor manipulation with real plant data.
Task Detail
Discussion
Settings
This task is Private. To make it visible on a public benchmark, set visibility to Public in Settings.
Edit visibility
Description
This task does not have a description yet.
Results (7)
| Model | ↓ Latest Result | Cost NEW | Time NEW | |
|---|---|---|---|---|
Gemini 2.5 Flash |
89.80 | $0.12 | 5m 12s | |
Gemini 2.0 Flash Lite |
85.10 | $0.08 | 3m 44s | |
Gemini 2.0 Flash |
78.20 | $0.10 | 4m 30s | |
Claude Opus 4.1 |
running... | $2.40 so far | 18m ... | |
Gemini 3 Flash Preview |
ERROR | $0.31 | 7m 02s | |
Gemini 2.5 Pro |
ERROR | $4.87 | 22m 15s | |
Claude Opus 4.6 |
92.40 | $8.62 | 46m 20s |
1–7 of 7
Daily AI Quota: $0.00 used / $100.00
⚠ Monthly AI Quota: $499.80 used / $500.00