I'm experiencing a very similar issue on an M2 Max GPU. I'm using wgpu to run compute shaders, but internally it is using the Metal APIs.
Restarting the computer is the only way to reduce the usage after the program has exited.
Did you manage to find a workaround for this to eliminate the leak or to manually clean up the resources?