
Enhance SDG to Support Multiple OpenAI Endpoints for Improved Performance #216

Open

npalaska opened this issue Jul 25, 2024 · 5 comments
Labels: enhancement (New feature or request)

@npalaska (Contributor)

Currently, SDG supports only a single OpenAI endpoint. Adding support for multiple OpenAI endpoints could significantly improve overall SDG performance: we have observed nearly a 50% improvement in total SDG time by running two replicas of the vLLM server instead of one and load balancing between them internally.

Consider the following scenarios:

Scenario 1

- Teacher model sharded across 2 GPUs -> endpoint A
- Teacher model sharded across 2 GPUs -> endpoint B

Scenario 2

- Teacher model sharded across 4 GPUs -> endpoint A

Running SDG with Scenario 1 showed nearly a 50% improvement over Scenario 2. If SDG can work with multiple replicas of vLLM, we can adopt Scenario 1 for better performance.

@shivchander (Member)

@njhill would be good to have your thoughts on this

@njhill (Contributor) commented Jul 26, 2024

I think it's a good option to have in the toolbox for throughput-maximization experimentation. A wrapper client could be used which just wraps two different clients configured with different endpoints.
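A minimal sketch of that wrapper idea (the class and method names here are hypothetical; the wrapped clients could be, e.g., two `openai.OpenAI` instances pointed at endpoint A and endpoint B):

```python
from itertools import cycle


class RoundRobinClient:
    """Hypothetical wrapper that alternates requests across several
    OpenAI-compatible clients, each configured with a different endpoint."""

    def __init__(self, clients):
        # cycle() yields the wrapped clients in round-robin order forever.
        self._clients = cycle(clients)

    def next_client(self):
        """Return the client that should handle the next request."""
        return next(self._clients)

    def create_completion(self, **kwargs):
        # Forward the call to whichever replica is next in rotation;
        # each client is assumed to expose the OpenAI completions API.
        return self.next_client().completions.create(**kwargs)
```

Each SDG request would then hit the next replica in rotation; retries or least-loaded selection could be layered onto the same wrapper later.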

@russellb (Member) commented Aug 5, 2024

This seems like a pretty normal load balancer use case?
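For context, the standard setup alluded to here could look like the following nginx fragment (ports and upstream name are illustrative, not from this issue), which exposes both vLLM replicas behind one OpenAI-compatible base URL:

```nginx
# Round-robin two vLLM replicas behind a single endpoint.
upstream vllm_replicas {
    server 127.0.0.1:8001;  # endpoint A
    server 127.0.0.1:8002;  # endpoint B
}

server {
    listen 8000;
    location /v1/ {
        proxy_pass http://vllm_replicas;
    }
}
```

With this in place, SDG would keep its single-endpoint configuration and point at port 8000.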

@nathan-weinberg nathan-weinberg added the enhancement New feature or request label Aug 20, 2024

This issue has been automatically marked as stale because it has not had activity within 90 days. It will be automatically closed if no further activity occurs within 30 days.

@github-actions github-actions bot added the stale label Nov 20, 2024
@bbrowning (Contributor)

My initial reaction is to echo what Russell said: this is really the concern of a load balancer. However, is there a specific reason we need client-side load balancing and management of a pool of multiple OpenAI endpoints? Perhaps I'm overlooking a reason why a single endpoint backed by a standalone load balancer isn't ideal?

@github-actions github-actions bot removed the stale label Nov 21, 2024
6 participants