Chat Streaming doesn't work with audio modalities #398

vbandi · 2025-01-10T08:20:02Z

Bug Report

Overview

The gpt-4o-audio-preview-2024-12-17 model supports audio as an output modality, but requesting audio output in a streamed chat completion doesn't seem to work.

To Reproduce

Steps to reproduce the behavior:

With the code below, the streamed chunks contain no Delta and no audio data.

async Task GPTSpeech()
{
    var client = new OpenAIClient();
    var speaker = new SpeakerOutput();

    var chatRequest = new ChatRequest([new Message(Role.System, "Count from 1 to 10. Whisper please")],
        audioConfig: new AudioConfig(Voice.Nova), model: "gpt-4o-audio-preview-2024-12-17");  // Doesn't seem to work... OpenAI Lib issue??
        
    await foreach (var chunk in client.ChatEndpoint.StreamCompletionEnumerableAsync(chatRequest))
    {
        if (chunk.FirstChoice.Delta is not null)
            Console.Write(chunk.FirstChoice.Delta.Content);

        if (chunk.FirstChoice.Message?.AudioOutput is not null)
            Console.WriteLine(chunk.FirstChoice.Message.AudioOutput.Data.Length);
    }

    Console.WriteLine("Done.");
    Console.ReadKey();
}

However, streaming still works when the ChatRequest does not include an audio configuration:

    var chatRequest = new ChatRequest([new Message(Role.System, "Count from 1 to 10. Whisper please")]);

Expected behavior

Chunks should contain text and/or audio content when the model is generating audio.
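To make the expectation concrete, here is a minimal sketch of what a consumer might do with a working stream. It does not use the OpenAI-DotNet types: `StreamChunk` and `StreamAccumulator` are hypothetical stand-ins, and the sketch assumes that text arrives as string fragments and audio arrives as base64 fragments that can each be decoded on their own.

```csharp
#nullable enable
using System;
using System.Collections.Generic;
using System.IO;
using System.Text;

// Hypothetical stand-in for one streamed chunk; NOT an OpenAI-DotNet type.
// Assumption: either field may be missing on any given chunk.
public sealed record StreamChunk(string? TextDelta, string? AudioBase64Delta);

public static class StreamAccumulator
{
    // Appends text fragments and decoded audio fragments in arrival order.
    public static (string Text, byte[] Audio) Accumulate(IEnumerable<StreamChunk> chunks)
    {
        var text = new StringBuilder();
        using var audio = new MemoryStream();

        foreach (var chunk in chunks)
        {
            if (!string.IsNullOrEmpty(chunk.TextDelta))
            {
                text.Append(chunk.TextDelta);
            }

            if (!string.IsNullOrEmpty(chunk.AudioBase64Delta))
            {
                // Assumption: each fragment is a complete base64 string.
                var bytes = Convert.FromBase64String(chunk.AudioBase64Delta);
                audio.Write(bytes, 0, bytes.Length);
            }
        }

        return (text.ToString(), audio.ToArray());
    }
}
```

With a stream behaving as expected, both the accumulated text and the audio byte array would be non-empty at the end; in the reproduction above, neither ever receives any data.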

StephenHodgson · 2025-01-12T20:07:14Z

This ended up being broken for both the async and Enumerable streaming paths.

vbandi added the bug label Jan 10, 2025

StephenHodgson added the help wanted, good first issue, and enhancement labels and removed the bug label Jan 10, 2025

StephenHodgson self-assigned this Jan 10, 2025

StephenHodgson linked a pull request Jan 11, 2025 that will close this issue

OpenAI-DotNet 8.4.2 #399

Merged

StephenHodgson added a commit that referenced this issue Jan 12, 2025

fix #398

54f05cc

StephenHodgson changed the title ~~StreamCompletionEnumerableAsync doesn't work with audio~~ Chat Streaming doesn't work with audio modalities Jan 12, 2025

StephenHodgson added the bug label and removed the enhancement, help wanted, and good first issue labels Jan 12, 2025

StephenHodgson closed this as completed in #399 Jan 15, 2025

StephenHodgson linked a pull request Jan 15, 2025 that will close this issue

com.openai.unity 8.4.5 RageAgainstThePixel/com.openai.unity#323

Draft
