Contents

CTFs and AI

CTFs and Artificial Intelligence. Post Ph0wn 2026 Thoughts.

Myth #1: “CTF players hate AI because it spoils CTFs”

Reality: this is not the opinion of the majority of CTF players

/images/ctf2026-destroyed.png

“AI has destroyed CTFs” - this kind of message is frequent on the net. However, in a feedback form of Ph0wn CTF 2026, 80% hold no grudge against AI and are either happy with it, or just want to adapt:

  • “I just want to win, with or without AI”, 30%
  • “It’s a new way of solving challenges. It requires different skills. I still like CTFs.”, 20%
  • “I want to learn, so I look into only few challenges and take time to solve them and learn, with or without AI” 30%.

/images/ctf2026-povai.png

Learning about AI is a strong motivation for players. On future types of challenges, AI-based challenges were 70% of demands for the next edition (prompt injection, MCP, data injection in LLM, creating agents, vibe coding…).

Only 20% express issues with AI:

  • “When a challenge is solved easily by AI, it ruins the fun for me” ~10%
  • “I don’t use AI” ~10%

/images/ctf2026-flag.png

It seems AI is rather an issue for challenge creators who

  1. Hate to see their hard work be solved by AI in seconds,
  2. Don’t understand teams who solve without understanding the challenge,
  3. Don’t like to spend time hardening their challenge towards AI or (4) are unsure what’s the best option to do so.

Conclusion: 80% of CTF players see no issue with using AI over CTF challenges. They welcome the technology and want to learn about it. Challenge creators are more reluctant, partly discouraged or helpless faced to machines.

Myth #2: “Solving with AI requires no skills”

Reality: AI requires different skills

People often show contempt towards players who solve with AI. A little like Scripts Kiddies for hackers.

In reality, while some challenges are solved with no glory, several challenges require (1) a good prompt, (2) human guidance, (3) a good AI setup with tailored agents, skills, scripts.

Ph0wn feedback demonstrates this point: in a question, the feedback form asked how much AI helped for each challenge. The responses show very different results.

In absolutely all challenges, we had some teams report “AI did not help”, while other teams reported it helped. Sometimes, the answers are opposites. For example, for Sploosh Kaboom 2 challenges, 1 team reported an entirely automatic solve with AI, while 3 teams said AI did not help at all.

/images/ctf2026-aihelp.png

Conclusion: AI does not help all teams the same way. It would be unfair to see AI does not give a strong advantage to teams, but it would be equally unfair to claim teams who flag with AI demonstrate no intelligence ๐Ÿ’ก of their own.

Myth #3: “CTFs are Pay to Win”

Reality: very few CTF players will pay to win

Some teams very probably use AI better than others, or have a better setup. Better setup… is that about AI subscription? Does a large AI subscription give unfair advantage to the team? For sure, it helps.

Ph0wn feedback forms tend to prove that most CTF players do not have costly setups: less than 30% have a paid account.

/images/ctf2026-aisetup.png

But this is probably different in more competitve CTFs like Insomni’hack CTF, where it seemed that nearly all teams had access to a paid subscription, sometimes “unlimited plans”. Discussing with a few teams though, I haven’t heard yet of any CTF which purchases an AI plan only for the CTFs. Most of the time, they are re-using a subscription they have from their work, from a friend, or from access to in-house AI servers.

There is a discrepancy between what CTF players say publicly and what they do

Publicly, CTF players “hate AI”, “never use AI”, “use it responsibly”, “only for side questions” etc. From organizers view or an external visitor, AI is on all screens, many (if not most) teams use it intensively.

/images/ctf2026-lie.png

I think it’s not a blatant lie, but more a cognitive biais: we convince ourselves our brains did the hard part, it’s above us to humanly admit we were lazy and that it’s the opposite, AI did most of the job (AI can’t talk and say we lied). Lack of proof.

Solutions to cope with AI in CTFs

SolutionTypeComment
Ask not to use AIOrganizationSome (many?) teams will cheat
No prizes to winOrganizationLess incentive
No rankingOrganizationSome teams want to use the ranking on their CVs
Provide laptops with no InternetOrganizationCost + people like their own setup…
Submit writeup instead of flagOrganizationAI is good at creating writeups too…
Detect AI-written writeupsTechnicalBypassable. Adding “human written parts”, instructing AI to insert typos etc
Lightning talks of solutionsOrganizationAI can write the slides, human rehearses them. At least, doing so, they learn about the challenges
Ban teams with quick solvesTechnicalBypassable: teams will wait to flag + how do we tell the difference between AI and a genuine human genius?
Report use of AI when you flag a challengeOrganization / TechnicalImplementable like the challenge feedback plugin in CTFd. Players don’t have to say the truth + what about challenges where AI “helped a bit”?. Used at NorthSec 2026
Separate scoreboards: AI no AIOrganization / TechnicalTeams can cheat
Send flag found by AI to a MCP serverTechnicalPlayers can still submit their flag manually and not count as “found by AI” + what about challenges where AI “helped a bit” only? Used at NorthSec 2026.
No InternetTechnicalBypassed by 4G/5G
Faraday cageOrganizationCost, size of the cage…
Watch participants screensOrganizationWhen the cat doesn’t watch, mice dance.
Live broadcast of 1 participant. All others watch.OrganizationVery different from CTFs. Hackers don’t like to share all their secrets
Block major AI websites from WiFiTechnicalOnly works for on-site CTFs which provide WiFi/LAN access to Internet. Even in this case, easily bypassable by 4G/5G, VPNs, SSH tunnels
Block AI traffic (e.g CloudFlare)TechnicalOnly works for challenges when participants access a service, server, container or remote host that runs on a host behind the filter/CloudFlare. Bypassed by VPNs or SSH tunnels
Catch flags from AI trafficTechnicalOnly works for on-site CTFs which provide WiFi/LAN access. Requires SSL inspection
Detect AI User AgentsTechnicalOnly works for challenges with participants access a service, server or remote host. Used at FCSC 2026
Redirect AI websites to fake oneTechnicalBypassed by 4G/5G, VPNs, SSH tunnels
Inject prompt with fake flagTechnicalNeeds to be fine tuned. Only works for a couple of months.
Use custom CTF interfaceTechnicalLimits use of on-the-shelf agents that download challenges from CTFd for example. But new ones can be created.
Shaming of teams who supply fake AI flagOrganizationIs it really ethical to shame?
Challenge instructions by videoTechnicalFile size
Inject prompt to convince AI not to solveTechnicalNeeds to be fine tuned. Only works for a couple of months.
Injecting a prompt that suggest something is illegalTechnicalSee PagedOut magazine. Tried but abandonned for Ph0wn because we couldn’t make it work
Inject false lead that slows AITechnicalDoes not affect all teams - Ph0wn 2026 “Ancient Stories”, Hack10
Insert easy fake flag and hope AI stops on itTechnicalDoes not work all the time
Inject prompt in polyglotTechnicalContemplated for Ph0wn 2026. Used at RITSEC 2026
Insert misleading AGENTS.md, CLAUDE.md or SKILL.mdTechnicalSeen at Insomni’hack CTF 2026
Require physical interaction with deviceTechnicalBeware SSH/Serial/USB access + simulators/containers. Used Ph0wn 2026
Use custom virtual machinesTechnicalDoes not necessarily stop AI…
Use air gapped challengesTechnicalWorks well, but hard constraint on challenges. Used at Ph0wn 2026 Minitel challenges
Protocols with no/little docTechnicalOnly few such protocols! Partly used at Ph0wn 2026
Unknown tricks / 0-daysTechnicalTransient state - Ph0wn 2026 “Flagged Pages”
Only 1 connection per team to challengeTechnicalAI can use the connection…
Challenges around optimizations e.g “create a binary with x/y/z below 200 bytes”TechnicalAI isn’t so good at optimization yet (but it might improve). We contemplated this sort of challenge for Ph0wn 2026 but finally didn’t play it.
Challenges with guessy partsTechnicalAI is usually better than humans to solve the guessy part. This does not refrain AI usage, but it opens up to different challenges
Wider scope challenges where there are many possibilities to investigateTechnicalAI is not scared by work, it will investigate possibilities. Humans can guide towards the best option. This does not refrain use of AI, but it opens up to a different category of challenges

Challenges categories which resist best to AI

  • Air gapped challenges
  • OSINT
  • Reverse and Pwn

What will we do for Ph0wn 2027?

Currently, I do not know. We are not taking any decision yet, we want to hear from other attempts in other CTFs, and pick up what worked the best.

After our talk at THCon, with Damien Cauquil, we had many valuable discussions:

  • Some CTFs are going to test the non-competitive mode (no ranking, no scoreboard). I won’t spoil which CTF that is ๐Ÿ˜‰ , but I am very impatient to see how it goes.
  • THCon CTF announced they have a few challenges intended to be “AI-resistant”. For sure, it won’t be perfect, but they may have come up with excellent ideas. CTF starts tomorrow.
  • Collaborative challenges where a given team solves a first part, explains it to another team to continue second part…
  • Blocking AI or not? People have different ideas on the questions, both being perfectly valid opinions. On one side, some participants say they no longer want to participate in CTFs where AI is permitted. On the other side, others say hackers have always had access to all tools, and AI is one of the tools to learn to use.

References ๐Ÿ“š

CTFs and AI talks:

Feedbacks from CTFs:

User feedback:

Tools:

  • CAI wins the NeuroGrid CTF: European-built CAI (CyberSecurity AI) sets a new global benchmark, November 25, 2025 ๐Ÿ”—
  • Yuwen Zou, Jia Liu, Wenjun Fan, CTFAgent: An LLM-powered Agent for CTF Challenge Solving, Journal of Information Security and Applications, DOI ๐Ÿ”—

CTF write-ups, solved with AI (at least parts):

  • Cryptax, Drone and secret message, THCon CTF 2023 ๐Ÿ”—
  • Cryptax, MANUAL (crypto) - flagged with no skill, Hack.Lu CTF 2025 ๐Ÿ”—
  • Cryptax, AI: Ask for It, Double Protection 1 and 2, N0PS CTF 2024 ๐Ÿ”—
  • Cryptax, Xmas Root Me CTF 2025 ๐Ÿ”—

Misc:

  • CSAW'25, Agentic Automated CTF, July-October 2025 ๐Ÿ”—
  • Peter Whiting, Piracy as Proof of Personhood, PagedOut #7, page 9 ๐Ÿ”—
  • Write-ups for challenges, Ph0wn Mag #3 ๐Ÿ”—