llaske / sugarizer

Sugarizer is a web implementation of the Sugar platform to run on any device or browser
https://sugarizer.org
Apache License 2.0
199 stars 422 forks source link

Enhancing story narrating experience by introducing audio in story books #1666

Open Monu2114 opened 1 month ago

Monu2114 commented 1 month ago

Add Read-Aloud Functionality for Storybooks

Description: It would be highly beneficial to introduce a "read-aloud" feature that allows children to listen to storybooks being narrated in a natural, human-like voice. This would enhance the user experience, especially for younger audiences who may prefer or require auditory learning. The goal is to provide a more immersive, humanized storytelling experience.

Proposed Implementation:

Text-to-Speech Integration:

We can leverage a Text-to-Speech (TTS) API to convert the story text into speech. These services offer natural-sounding voices that can mimic human narration.

Voice Customization: Users can choose between different voice types (e.g., gender, accent) to cater to different preferences and languages. User Interface:

Add a "Read Aloud" button on the storybook interface, which, when clicked, triggers the TTS engine to start narrating the story. Include basic playback controls (play, pause, stop) for better user control.

Performance Considerations: Caching the audio output for frequently accessed books to minimize API calls and improve performance. Ensuring the feature runs smoothly across all supported platforms. Benefits:

Enhances accessibility for children with visual impairments or reading difficulties. Provides a comforting, human-like storytelling experience. Supports auditory learners and adds a layer of engagement to the story-reading process.

llaske commented 1 month ago

It could be a good idea. It's what Speak activity do so it could be interesting to have this feature on e-books. BTW the implementation should respect two major rules of Sugarizer:

Monu2114 commented 1 month ago

Hey am fairly new to this open source. But I wanna work on this issue. And I wanna take part in gsoc 25 as well. Can I work on this ?

On Wed, Oct 9, 2024, 02:15 Lionel Laské @.***> wrote:

It could be a good idea. It's what Speak activity do so it could be interesting to have this feature on e-books. BTW the implementation should respect two major rules of Sugarizer:

  • It should work offline
  • It should use only Free/Libre Open Source library/service.

— Reply to this email directly, view it on GitHub https://github.com/llaske/sugarizer/issues/1666#issuecomment-2400788186, or unsubscribe https://github.com/notifications/unsubscribe-auth/A33RIC5SXY5KV4P5XGIYPZDZ2Q76BAVCNFSM6AAAAABPPDFIZSVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDIMBQG44DQMJYGY . You are receiving this because you authored the thread.Message ID: @.***>

llaske commented 1 month ago

@Monu2114 there is no need to ask permission to work on an issue, everyone can work on anything. See here. There is no plan for GSoC 2025 today and there is plenty of time before the next GSoC.