Can Claude Code Reason About Physics? (PDE Solver Benchmark)

Name: Can Claude Code Reason About Physics? (PDE Solver Benchmark)
Uploaded: Dec 25, 2025
Duration: 1281 s

Surya Sathujoda37 subscribers

599 views

Dec 25, 2025

21:21

Large language models have already shown strong performance on full-stack software projects, especially in TypeScript and Python. What’s been tested far less is how these systems perform in scientific computing, where correctness depends on numerical stability, physical constraints, and careful validation - not just code that compiles. In this video, I benchmark Claude Code by asking it to build a C++ solver for the two-dimensional heat equation, live. I guide the model the way a senior researcher would guide a junior engineer, but the numerical decisions are its own. We evaluate the result using standard numerical-methods criteria: stability under time-step changes correctness of boundary conditions comparison against analytical solutions and predictable failure modes This is not a demo of code generation — it’s a test of scientific reasoning.

Download

1 formats

Video Formats

360pmp433.8 MB

Download

Right-click 'Download' and select 'Save Link As' if the file opens in a new tab.