Somehow I'm not getting what the problem is here. Most interfaces have the ability to balance the input signal with the playback from the DAW. Very few do not offer this feature. It might be a physical knob or it might be via a Software mixer. This feature is pretty darn important.
There was a similar thread a few days ago and in the end the person was having to spend more money to make up for a shortcoming in the monitoring system. In this case the lack of a output volume separate from the headphone volume.
Bottom line is when shopping for an audio interface, make sure it has the options you need.
Example , my Scarlett 6i6 ( and others in the series above it) have the software "Mix Control" that allows dozens of options for headphone and cue mixes that do not involve changes to your the DAW's mix. I haven't had to use more than one mix so far, but looks like I could have 4 different cue mixes happening. It takes only a couple clicks to set a nice balance between my input and the playback.
My Tascam interface did this with a knob but it was not very loud. The Scarlett is real loud! .