Another things to check/tailor is your I/0 buffers (Advanced Mode: Preferences->Audio->Synch and Caching). Typically for the ASIO latency you have (128), I/O buffers of 256 or 512 are optimal. Too extreme high or low can also affect things.
I just got a new interface, and observed a similar delay when the sample rates on the interface and in the project did not match (never realized you could do this before, but had 48KHz feeding a 44.1KHz project). If these match, disregard this one.