Agents learn to identify 'reactor heat' (stress) and use cooling valves to maintain operational status.
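
As a rough illustration only, the behavior described above can be sketched as a simple threshold controller: heat (stress) accumulates, and the agent opens a cooling valve once it crosses a warning level. Everything named here is hypothetical and not taken from the source: the `Reactor` class, the `step` and `run` functions, and the `HEAT_LIMIT`, `WARNING_LEVEL`, and `COOLING_PER_VALVE` values are assumptions chosen for the example.

```python
from dataclasses import dataclass

# All constants below are assumed values for illustration.
HEAT_LIMIT = 100.0        # heat at which the reactor is no longer operational
WARNING_LEVEL = 70.0      # heat at which the agent decides to cool
COOLING_PER_VALVE = 30.0  # heat removed by opening one cooling valve


@dataclass
class Reactor:
    """Hypothetical state container: current heat and valves opened so far."""
    heat: float = 0.0
    valves_opened: int = 0

    @property
    def operational(self) -> bool:
        # The reactor counts as operational while heat stays below the limit.
        return self.heat < HEAT_LIMIT


def step(reactor: Reactor, incoming_heat: float) -> Reactor:
    """Add incoming heat (stress), then open one valve if heat is high."""
    reactor.heat += incoming_heat
    if reactor.heat >= WARNING_LEVEL:
        # Identify the high-heat condition and respond by cooling.
        reactor.heat = max(0.0, reactor.heat - COOLING_PER_VALVE)
        reactor.valves_opened += 1
    return reactor


def run(heat_inputs) -> Reactor:
    """Feed a sequence of heat inputs through a fresh reactor."""
    reactor = Reactor()
    for incoming in heat_inputs:
        step(reactor, incoming)
    return reactor


if __name__ == "__main__":
    final = run([40, 40, 40])
    print(final.heat, final.valves_opened, final.operational)
```

In this sketch the agent opens at most one valve per step, so a large enough spike can still push the reactor past `HEAT_LIMIT`. Whether the actual system behaves that way is not stated in the source.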