@sahil
feed pulls - current aws setup can serve few hundred requests per second for the For You feed (most computationally expensive), with 1 sec latency. global trending feed is 3 second latency, but can be sub-second purely by linearly scaling infra. all of this can be scaled up based on real demand.
we do feed A/B testing based on network engagement data and user/client interviews for now. we can't do anything based on client side data/metrics, yet. in the future, clients can customize feed parameters or model weights using OpenRank SDK based on their client side data.