gh-135871: Reload lock internal state while spinning in `PyMutex_LockTimed` (gh-146064)

Add atomic loads in the slow path of PyMutex to increase the number of lock acquisitions per second that threads can make on a shared mutex.
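
To illustrate the idea of reloading lock state while spinning, here is a minimal sketch in C11. It is not the CPython implementation: `sketch_mutex`, `sketch_lock_spin`, `sketch_unlock`, `SKETCH_LOCKED`, and `SKETCH_MAX_SPINS` are hypothetical names, the single-bit layout is an assumption, and a real slow path would park the thread rather than give up when the spin budget runs out.

```c
#include <assert.h>
#include <stdatomic.h>
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical names and values for illustration; not CPython's layout. */
#define SKETCH_LOCKED 1u
#define SKETCH_MAX_SPINS 40

typedef struct {
    _Atomic uint8_t bits;
} sketch_mutex;

/* Spin for a bounded number of iterations trying to take the lock.
 * Returns true on success, false when the spin budget is exhausted
 * (a real lock would park the thread at that point). */
static bool
sketch_lock_spin(sketch_mutex *m)
{
    uint8_t v = atomic_load_explicit(&m->bits, memory_order_relaxed);
    for (int spins = 0; spins < SKETCH_MAX_SPINS; spins++) {
        if (!(v & SKETCH_LOCKED)) {
            /* On failure the compare-exchange stores the current value
             * of m->bits into v, so v is refreshed on this path too. */
            if (atomic_compare_exchange_weak_explicit(
                    &m->bits, &v, (uint8_t)(v | SKETCH_LOCKED),
                    memory_order_acquire, memory_order_relaxed)) {
                return true;
            }
            continue;
        }
        /* The lock is held: reload the state with an atomic load.
         * Without this reload v would stay stale, and the loop could
         * not notice an unlock that happens while it is spinning. */
        v = atomic_load_explicit(&m->bits, memory_order_relaxed);
    }
    return false;
}

/* Clear the locked bit with release ordering. */
static void
sketch_unlock(sketch_mutex *m)
{
    uint8_t prev = atomic_fetch_and_explicit(
        &m->bits, (uint8_t)~SKETCH_LOCKED, memory_order_release);
    assert(prev & SKETCH_LOCKED);
    (void)prev;
}
```

In this sketch, a spinning thread that sees the lock held refreshes its local copy of the state on every iteration, so it can attempt the compare-exchange as soon as another thread calls `sketch_unlock`.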