pluralitybook / plurality

Root repository for ⿻數位 Plurality: The Future of Collaborative Technology and Democracy by E. Glen Weyl, Audrey Tang and the Plurality Community
https://www.plurality.net
Creative Commons Zero v1.0 Universal
338 stars 140 forks

Add the sound information in text format for ⿻ symbols across the book #986

Open akinorioyama opened 2 months ago

akinorioyama commented 2 months ago

The current parts and sections write the ⿻ character only as a symbol and are missing its sound information. The proposed adjustment is to add pronunciation information for the ⿻ character. The index of ⿻ occurrences in the file attached in a subsequent comment provides the location of each occurrence, along with the sound information taken from the machine-synthesized audiobook. The text changes in the markdown files will be made as illustrated below.

The original text segment in the file english/0-1-About-the-authors.md,

dozens of members of the ⿻plurality community

changes to the aria-label enclosed segment

dozens of members of the <span aria-label="Plurality">⿻</span> community

This is intended to be read aloud by screen reader applications such as Windows Narrator. The adjustment will not necessarily cause the "Plurality" text to be read aloud by browser features such as Edge Read Aloud.
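The markdown change described above could be automated along these lines. This is only a minimal sketch, not the script used for the book: the function name `add_sound_label` is hypothetical, and it assumes that only the bare ⿻ symbol is wrapped while the surrounding words are left untouched. Occurrences that already carry the label are skipped, so running it twice does not nest spans.

```python
import re

SYMBOL = "⿻"
OPEN_TAG = '<span aria-label="Plurality">'
LABELLED = OPEN_TAG + SYMBOL + "</span>"

# Match a ⿻ that is NOT immediately preceded by the labelled opening tag.
# The lookbehind is a fixed-width literal, which Python's re module requires.
_BARE_SYMBOL = re.compile("(?<!" + re.escape(OPEN_TAG) + ")" + re.escape(SYMBOL))


def add_sound_label(text: str) -> str:
    """Wrap every unlabelled ⿻ in a span carrying aria-label="Plurality".

    Hypothetical helper: assumes the surrounding text should stay as-is.
    """
    return _BARE_SYMBOL.sub(LABELLED, text)


if __name__ == "__main__":
    print(add_sound_label("dozens of members of the ⿻ community"))
```

Whether adjacent words (such as a trailing "plurality") should also be removed is a per-occurrence editorial decision that this sketch does not attempt.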

akinorioyama commented 2 months ago

An interim index of the occurrences of ⿻ is attached: Indexof⿻_character.txt

The file has the following columns.

| sequence | header text | description |
| --- | --- | --- |
| 1 | section | the section of the book |
| 2 | before | text segment preceding the ⿻ character (in pluralitybook site text) |
| 3 | after | text segment following the ⿻ character (in pluralitybook site text) |
| 4 | segment_before_found | True if the column 2 text is found in the audiobook transcript |
| 5 | segment_after_found | True if the column 3 text is found in the audiobook transcript |
| 6 | before_found_string | X if the column 4 text is found; - if not found |
| 7 | after_found_string | X if the column 5 text is found; - if not found |
| 8 | all_found_before | location(s) where the column 2 text is found |
| 9 | all_latter_words_before | words found at the column 8 position, which are the sound in the audiobook transcript |
| 10 | found_before | text segment found with a fuzzier text search |
| 11 | all_found_after | text segment found in pluralitybook site text with a fuzzier text search |
| 12 | all_latter_words_after | words found in front of the after segment |
| 13 | found_afterfound_regex_before | N/A |
| 14 | found_regex_after | True if the after text segment is found with a fuzzier text search |
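To make the exact-match columns (4 to 8) concrete, here is a rough sketch of how such values could be derived. It is not the script that produced the attached file. The function names are hypothetical, matching is assumed to be plain substring search, and locations are assumed to be character offsets into the transcript; the fuzzier searches in the later columns are not covered.

```python
from typing import Dict, List, Union


def found_flags(before: str, after: str, transcript: str) -> Dict[str, Union[bool, str]]:
    """Derive columns 4-7 for one ⿻ occurrence (assumed: exact substring match)."""
    before_found = before in transcript
    after_found = after in transcript
    return {
        "segment_before_found": before_found,
        "segment_after_found": after_found,
        "before_found_string": "X" if before_found else "-",
        "after_found_string": "X" if after_found else "-",
    }


def find_all(segment: str, transcript: str) -> List[int]:
    """Return every character offset at which segment occurs (column 8 analogue)."""
    if not segment:
        return []  # an empty segment would otherwise match at every offset
    positions = []
    start = transcript.find(segment)
    while start != -1:
        positions.append(start)
        start = transcript.find(segment, start + 1)
    return positions


if __name__ == "__main__":
    transcript = "dozens of members of the Plurality community"
    print(found_flags("members of the", "community", transcript))
    print(find_all("members of the", transcript))
```

The words that follow each location returned by `find_all` would then be candidates for the spoken form of ⿻ at that position.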