IB/mlx4: Add counter based implementation for QP multicast loopback block