See b99c1abc3a for more information about how using fewer threads for just copying data can improve performance. In my simple test file with mesh data re-uploaded every frame, this improved performance from 23.5 FPS to 25.5 FPS (almost 9%).
b99c1abc3a
get_keyframe_values