Helen-CC commented 2 years ago

[x] Confirm: issues can't be used from GitHub Desktop??

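Lines 18-21: df_keyword %>% select(word2, sdg) %>% drop_na() %>% rename(word = word2)

[x] Explain how the drop_na() code works (I can roughly guess what it does)
[x] rename(word = word2): I know what word and word2 mean and do, but I couldn't immediately recall which one goes first in rename(). To rename a column, the new name goes first and the old column name goes after it.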
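A minimal sketch of the two steps asked about above, run on made-up data. `df_toy` and its values are hypothetical stand-ins, not the real `df_keyword`; only `select()`, `drop_na()` and `rename()` come from the script.

```r
library(dplyr)
library(tidyr)

# Hypothetical toy data standing in for df_keyword
df_toy <- tibble(
  word2 = c("poverty", NA, "clean water"),
  sdg   = c("SDG1", "SDG2", NA)
)

df_clean <- df_toy %>%
  select(word2, sdg) %>%
  drop_na() %>%          # drops every row that has an NA in any column
  rename(word = word2)   # new name on the left, old name on the right

print(df_clean)          # only the "poverty" / "SDG1" row survives
```

Called with no arguments, `drop_na()` checks all columns; passing column names (for example `drop_na(sdg)`) restricts the check to those columns.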

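Line 24: mutate(SDG_order = str_extract(sdg, pattern = "\\d+"),

[x] Notes on the regular expression \d+: \d matches a digit and + means one or more of them. It is written with two backslashes ("\\d+") because a backslash has to be escaped inside an R string.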
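To make the note concrete, here is a small sketch of `str_extract()` with `"\\d+"`. The `sdg` values below are invented for illustration and may not look like the real column.

```r
library(stringr)

# Hypothetical sdg labels
sdg <- c("SDG1", "SDG 13", "Goal 7: energy", "none")

# "\\d+" in R source is the regex \d+ : the first run of one or more digits
SDG_order <- str_extract(sdg, pattern = "\\d+")

print(SDG_order)
# "1" "13" "7" NA
```

The result is still a character vector, so it would need `as.integer()` before it can be sorted numerically.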

Lines 29-30 (to confirm):
# trim white spaces at both sides
mutate(word = str_trim(word, side = "both"))

[ ] str_trim just removes stray whitespace typed by accident at the start and end of the string, right? YES!!!
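A quick sketch of what `str_trim()` does and does not remove, using invented strings. `str_squish()` is not used in the script; it is shown only for contrast.

```r
library(stringr)

# Hypothetical keywords with stray whitespace
word <- c("  poverty", "clean  water ", "energy")

trimmed <- str_trim(word, side = "both")   # leading and trailing whitespace only
squished <- str_squish(word)               # also collapses repeated inner spaces

print(trimmed)
# "poverty" "clean  water" "energy"
print(squished)
# "poverty" "clean water" "energy"
```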

Lines 31-34:
# add regular expressions
mutate(word = str_replace_all(word, "\\*", ".?")) %>%
mutate(word = str_replace_all(word, " AND ", ".*?")) %>% # e.g. Economic Resource AND Access both appear in the same sentence, not necessarily right next to each other
mutate(word = str_split(word, "; ")) %>% # split entries that share one Excel cell, separated by a semicolon (;), into different rows, e.g. row 101
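For the last step above, a minimal sketch of how a semicolon-separated cell ends up on separate rows. It assumes an `unnest()` call follows the `str_split()` (the name `df_keyword_unnest` suggests so, but that line is not quoted in this thread); `df_toy` is hypothetical.

```r
library(dplyr)
library(stringr)
library(tidyr)

# Hypothetical data: one cell holds two keywords separated by "; "
df_toy <- tibble(
  sdg  = c("SDG1", "SDG3"),
  word = c("poverty; poor", "health")
)

df_unnest <- df_toy %>%
  mutate(word = str_split(word, "; ")) %>%  # word becomes a list column
  unnest(cols = word)                       # one row per keyword

print(df_unnest)   # 3 rows: poverty, poor, health
```

`str_split()` on its own only turns each cell into a list of pieces; it is the `unnest()` step that produces the extra rows.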

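[ ] Confirm that str_replace_all simply means "replace", right? Confirm the str_replace_all function and the regular expressions "\\*" and ".?"
[ ] I know str_replace_all swaps AND for .*?; confirm why we do this. The notes I wrote afterwards ended up confusing me.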
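A sketch that may help with both questions: it applies the two replacements to invented keywords and then tests the resulting patterns against invented sentences. Only the two `str_replace_all()` calls come from the script; the keywords, sentences and the `str_detect()` check are assumptions for illustration.

```r
library(stringr)

# Hypothetical keywords written with the spreadsheet's wildcard syntax
word <- c("Economic Resource AND Access", "disabilit*")

# Step 1: a literal "*" ("\\*" escapes it) becomes ".?" = zero or one of any character
step1 <- str_replace_all(word, "\\*", ".?")
# Step 2: " AND " becomes ".*?" = any text (as little as possible) in between
pattern <- str_replace_all(step1, " AND ", ".*?")

print(pattern)
# "Economic Resource.*?Access" "disabilit.?"

# The converted keywords now work as regular expressions
hit_in_order  <- str_detect("Economic Resource and equal Access", pattern[1])  # TRUE
hit_reversed  <- str_detect("Access to Economic Resource", pattern[1])         # FALSE
hit_wildcard  <- str_detect("disability", pattern[2])                          # TRUE
```

So the replacement turns the spreadsheet's own notation into regex that `str_detect()` can use. Note that `.*?` allows other words between the two terms but still expects them in the written order, which is why the reversed sentence does not match.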

Lines 39 & 44

Create a dataframe of keywords without spaces -> nspace

Create a dataframe of keywords with spaces

[ ] Confirm what the purpose of this is
[ ] without spaces -> nspace: it looks odd when I open it, e.g. in rows 84 & 127 "neglected tropical disease" has spaces in the middle (unless I have misunderstood?)

Environment after running line 50

[ ] I noticed that df_bind and df_keyword_unnest both have 3021 observations. Why do we need to combine the with-space and without-space versions?

Lines 66-71

Load the manual edited keyword mapping

https://docs.google.com/spreadsheets/d/1fZdE9WcFYI_d_sD4BgBpI5D1QhOYlRngsuSEtB7w694/edit#gid=470532546

df_manual <- read_excel("./data/raw_data/manual_edit_keywords.xlsx", sheet = "df_manual")
h <- hash(keys = df_manual$word, values = df_manual$word_new)
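A rough sketch of what the `hash()` lookup gives you, assuming `word` holds the original keyword and `word_new` the hand-edited replacement (a guess from the column names). The data frame below is a hypothetical stand-in for the Excel sheet.

```r
library(hash)

# Hypothetical stand-in for the "df_manual" sheet
df_manual <- data.frame(
  word     = c("Economic Resource.*?Access", "disabilit.?"),
  word_new = c("economic resource access", "disability"),
  stringsAsFactors = FALSE
)

# Build a key -> value lookup table, as in the script
h <- hash(keys = df_manual$word, values = df_manual$word_new)

looked_up <- h[["disabilit.?"]]          # value stored under this key
known     <- has.key("disabilit.?", h)   # is the key present?
unknown   <- has.key("not a keyword", h)

print(looked_up)
# "disability"
```

If the mapping is only ever used to swap one column for another, a `left_join()` on `word` would do the same job without the extra package; whether that fits depends on how `h` is used further down the script.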

[ ] I'm wondering whether this is still needed at all?
[ ] Not sure whether this is the reason the table in 06 later still has those long, ugly keywords

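Lines 71-79

[ ] Explain the function
[ ] Explain lines 92-94
[ ] Lines 96-99: let me try to explain these myself
[ ] Line 102: maybe we accidentally removed the word governmance, and that is why the heatmap colours are lighter later on??? Then I ran it both ways and found that with or without governance makes no difference at all, it is 3015 keywords either way, because governance was misspelled in the first place, haha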

boyiechen commented 2 years ago

Markdown example

This is the Markdown 101 lecture.

examples

To link to other github issues: #2
To use code-highlighted style
[x] todo list

# is the first level header

this is 3rd level header

boyiechen commented 2 years ago

Another example to point out the lines you want to know the details

https://github.com/a0981906660/Fortune500_SDG_Analysis/blob/main/code/01_1_keyword.R#L5-L20

boyiechen commented 2 years ago

Markdown example

italic Bold face bold face option 2

*italic*
**Bold face**
__bold face option 2__

boyiechen commented 2 years ago

Reference

https://docs.python.org/3/library/re.html

Helen-CC / Fortune500_SDG_Analysis

01_keyword code explanation #3
