Additionally, model updates need to be aggregated at the appropriate timing to avoid outdated information affecting overall learning performance. The reinforcement learning algorithm can simulate ...