Vision algorithm: improving the recognition rate

I am identifying objects with the Vision algorithm and drawing the box returned by boundingBox, but the box is too inaccurate. How can I improve the accuracy?

https://developer.apple.com/documentation/vision/tracking_multiple_objects_or_rectangles_in_video