Capturing some observations from #157
1. `Pipeline` has "batching" support - it can shard the dataset and spawn an instance of the pipeline for each shard - `batch_num_workers` and `batch_size`
2. `LLMBlock` has "batching" support - it can request multiple chat completions from the OpenAI server using the `n` argument - `num_instructions_to_generate` (sketched below)
3. In `ilab` we disable (1) with llama-cpp by passing `batch_size=None` - see instructlab/instructlab#346
4. In `LLMBlock` we disable (2) with llama-cpp via the `server_supports_batched` check, which probes whether the `n` argument works

Resolve: `server_supports_batched` should be a property on `PipelineContext`, not something we set on the OpenAI client object.
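For reference, a rough sketch of how (2) works - not the actual `LLMBlock` code: the `n` argument asks the OpenAI-compatible server for several chat completions in one request, and a probe in the spirit of `server_supports_batched` checks whether the server actually honors it (llama-cpp ignores `n`, which is why we fall back to one request per completion there). The `base_url`, model name, and helper name are illustrative.

```python
from openai import OpenAI

# Hypothetical local OpenAI-compatible endpoint (vllm, llama-cpp, etc.).
client = OpenAI(base_url="http://localhost:8000/v1", api_key="empty")


def probe_server_supports_batched(client: OpenAI, model: str) -> bool:
    """Request n=2 completions and check how many choices come back."""
    try:
        resp = client.chat.completions.create(
            model=model,
            messages=[{"role": "user", "content": "Say hi"}],
            n=2,
            max_tokens=1,
        )
        # Servers that ignore or reject `n` return fewer choices or error out.
        return len(resp.choices) == 2
    except Exception:
        return False


def generate(client: OpenAI, model: str, prompt: str, num_instructions_to_generate: int) -> list[str]:
    """Batched generation when the server supports it, otherwise one request per completion."""
    messages = [{"role": "user", "content": prompt}]
    if probe_server_supports_batched(client, model):
        resp = client.chat.completions.create(
            model=model, messages=messages, n=num_instructions_to_generate
        )
        return [choice.message.content for choice in resp.choices]
    return [
        client.chat.completions.create(model=model, messages=messages)
        .choices[0]
        .message.content
        for _ in range(num_instructions_to_generate)
    ]
```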
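And one possible shape for the "Resolve" item - a hypothetical sketch, not merged code: `PipelineContext` probes the server once, lazily, and caches the answer, so no block needs to stash an ad-hoc attribute on the OpenAI client object. Field names are illustrative, and this reuses the probe from the sketch above.

```python
from dataclasses import dataclass, field


@dataclass
class PipelineContext:
    client: OpenAI                        # OpenAI-compatible client
    model_id: str
    batch_size: int | None = None         # None disables dataset sharding, per (3)
    batch_num_workers: int | None = None
    _supports_batched: bool | None = field(default=None, init=False, repr=False)

    @property
    def server_supports_batched(self) -> bool:
        # Probe once and cache; every LLMBlock reads the same answer from the context.
        if self._supports_batched is None:
            self._supports_batched = probe_server_supports_batched(self.client, self.model_id)
        return self._supports_batched
```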