You will necessarily have differences between the primary database and the replica one, due to delay / latency while replicating the data.
You have to be aware of this delay / latency and you can't do without it (hmmm ... I don't know this kind of architecture, so I only think that you can't do without it).
Knowing this delay / latency, this is what I would do :
- not critical data : let the server ask either the primary database or the replica one
- critial data : force the server to ask only the primary database
With 16 req/s, I don't think that you need any replica database for performance purpose. So if you really want to replicate the database, you should rather do it for availability (in case of failover) and not for performance.