This repository was archived by the owner on May 14, 2026. It is now read-only.

[GSoC] Dynamic Text-to-Speech Playback #926

Description

@danylo-boiko

Feature description

In the current implementation, playback begins only after the entire text has been processed by the Google Text-to-Speech API, which causes inconvenient delays for long texts. Journey Voices provide real-time streaming with low latency for some common languages (search for "Journey" on the voices page). The goal of the project is to implement a flexible approach that supports both streaming and waiting for a complete response, depending on the language.
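
A rough sketch of how the two modes could be selected per language is given here in Python for illustration only. Every name in it (`PlaybackMode`, `STREAMING_LANGUAGES`, `choose_mode`, `audio_for_playback`) is hypothetical, the set of streaming-capable languages is a placeholder rather than the real list from the voices page, and the two synthesizer callables stand in for whatever actually calls the Text-to-Speech API.

```python
from enum import Enum
from typing import Callable, Iterable, Iterator


class PlaybackMode(Enum):
    STREAMING = "streaming"          # play chunks as they arrive
    FULL_RESPONSE = "full_response"  # wait for the complete audio


# Placeholder (assumption): languages with a streaming-capable voice.
# The real list would come from the voices page.
STREAMING_LANGUAGES = frozenset({"en-US"})


def choose_mode(language_code: str,
                streaming_languages: frozenset = STREAMING_LANGUAGES) -> PlaybackMode:
    """Pick streaming when the language supports it, otherwise fall back."""
    if language_code in streaming_languages:
        return PlaybackMode.STREAMING
    return PlaybackMode.FULL_RESPONSE


def audio_for_playback(
    text: str,
    language_code: str,
    synthesize_stream: Callable[[str, str], Iterable[bytes]],
    synthesize_full: Callable[[str, str], bytes],
) -> Iterator[bytes]:
    """Yield audio chunks for the player, whichever mode is used.

    Both synthesizers are hypothetical callables supplied by the caller.
    In streaming mode chunks are passed through as they arrive; in
    full-response mode the whole audio is yielded as a single chunk.
    """
    if choose_mode(language_code) is PlaybackMode.STREAMING:
        yield from synthesize_stream(text, language_code)
    else:
        yield synthesize_full(text, language_code)
```

Exposing both modes as one iterator of chunks would let the player stay unaware of which mode is in use: it starts playback on the first chunk either way, and only the delay before that first chunk differs.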

Expected outcomes

Users can listen to synthesized text in two modes, depending on language support.
