Once code is in production, it serves as its own specification, making it crucial to track deviations from expected behavior. Testing should document actual behavior rather than just correct behavior, allowing developers to discern between bugs and expected outcomes. This approach is exemplified by the challenges faced in maintaining legacy systems, such as the rounding errors in Excel, where specified behavior must remain unchanged despite being technically incorrect.