The computational layer. The agent that learns. The system that decides.
Every decision happens inside a light cone. Everything else is noise.
Special relativity tells you that nothing outside your past light cone can have caused the present moment. An intelligent system operates under the same constraint. it can only act on information that has arrived. Everything else is unmeasured state.
GPS satellites drift by 38 microseconds per day without relativistic correction. That is not philosophy. That is a navigation error that compounds until the system fails. Intelligence has to know where it is in time.
Reinforcement learning, control theory, and orbital mechanics are all versions of the same problem: an agent, bounded by what it can observe, deciding what to do next inside a universe that does not pause to explain itself.
Classical control tells the system what to do. Reinforcement learning lets the system figure it out. Given a state, take an action, observe a reward, update the policy. Repeat until the behaviour emerges.
The policy was never written. It was discovered through interaction.