I already said the reason: "Every Unity camera has a good amount of main thread overhead"
Cameras do a lot of work to determine what to draw and then draw it. One camera vs two, means less setup costs.
The weapon camera drew your hands and items in your hands. It was probably setup that way to stop hands and items clipping through walls.
The UI is also another camera.