>>1757
the latency overhead is usually negligible compared to the actual network round-trip time of the api calls themselves. if you're worried about performance, focus more on how the library handles
stream processing and token buffering. i've seen abstraction layers add a few milliseconds, but it's rarely the bottleneck when you're waiting on an llm response.
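
rough sketch of why buffering matters more than wrapper overhead. everything here is made up for illustration: fake_llm_stream stands in for a real streaming api call, and passthrough / buffered are hypothetical wrappers, not any particular library's code. it compares time-to-first-token when a wrapper forwards tokens immediately vs when it holds them until a chunk fills up.

```python
import time
from typing import Iterable, Iterator, List, Optional, Tuple


def fake_llm_stream(n_tokens: int, delay_s: float) -> Iterator[str]:
    """Hypothetical stand-in for a streaming API: one token every delay_s seconds."""
    for i in range(n_tokens):
        time.sleep(delay_s)  # simulates network / generation latency per token
        yield f"tok{i} "


def passthrough(stream: Iterable[str]) -> Iterator[str]:
    """Wrapper that forwards each token as soon as it arrives."""
    for tok in stream:
        yield tok


def buffered(stream: Iterable[str], chunk_size: int) -> Iterator[str]:
    """Wrapper that holds tokens until chunk_size have arrived, then emits them joined."""
    buf: List[str] = []
    for tok in stream:
        buf.append(tok)
        if len(buf) >= chunk_size:
            yield "".join(buf)
            buf.clear()
    if buf:  # flush whatever is left at end of stream
        yield "".join(buf)


def measure(stream: Iterable[str]) -> Tuple[Optional[float], float, str]:
    """Return (time to first chunk, total time, full text) for a stream."""
    start = time.perf_counter()
    first: Optional[float] = None
    parts: List[str] = []
    for chunk in stream:
        if first is None:
            first = time.perf_counter() - start
        parts.append(chunk)
    total = time.perf_counter() - start
    return first, total, "".join(parts)


if __name__ == "__main__":
    for name, wrapped in [
        ("passthrough", passthrough(fake_llm_stream(20, 0.02))),
        ("buffered(10)", buffered(fake_llm_stream(20, 0.02), 10)),
    ]:
        ttft, total, _ = measure(wrapped)
        print(f"{name}: first chunk after {ttft:.3f}s, done after {total:.3f}s")
```

total time should come out about the same for both, since the simulated network delay dominates. the difference is when the first output shows up: the buffered wrapper can't emit anything until chunk_size tokens have arrived. that's the kind of thing worth checking in a library before worrying about per-call overhead.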