boostcampaitech6 / level2-dkt-recsys-05

level2-dkt-recsys-05 created by GitHub Classroom

DKT

📌 Project Overview

project_info1 project_info2

DKT (Deep Knowledge Tracing) is a deep learning method for tracking a learner's knowledge state.

This competition focuses less on the knowledge state itself and more on predicting whether a learner will answer a given question correctly or incorrectly.

🥈 Project Results

Public

Public leader board

Private

Private leader board

📋 Project Procedure and Methods

Cal

EDA

Feature Engineering

Generated a variety of features based on the EDA above.
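As an illustration of the kind of feature such an EDA might motivate, here is a minimal sketch of a per-user running accuracy. The README does not list the features the team actually built, so the function `add_prior_accuracy`, the `(user_id, correct)` row layout, and the sample log are all hypothetical.

```python
from collections import defaultdict


def add_prior_accuracy(interactions):
    """Attach each user's accuracy over their earlier answers to every row.

    `interactions` is a time-ordered list of (user_id, correct) pairs, where
    `correct` is 1 or 0. This layout is an assumption for illustration, not
    the competition's actual schema.
    """
    answered = defaultdict(int)  # number of answers seen so far per user
    solved = defaultdict(int)    # number of correct answers seen so far per user
    rows = []
    for user_id, correct in interactions:
        # Use only past answers so the current row's label does not leak in.
        prior = solved[user_id] / answered[user_id] if answered[user_id] else 0.0
        rows.append((user_id, correct, prior))
        answered[user_id] += 1
        solved[user_id] += correct
    return rows


# Hypothetical interaction log, already sorted by time.
log = [("u1", 1), ("u1", 0), ("u2", 1), ("u1", 1)]
features = add_prior_accuracy(log)
```

Computing the feature from earlier rows only is a deliberate choice: including the current answer would leak the target into the input.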

Feature Selection

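from sklearn.feature_selection import VarianceThreshold

# Keep only features whose variance is greater than 0.8.
# X_train is the training feature DataFrame built in the previous step.
selector = VarianceThreshold(threshold=0.8)
selector.fit(X_train)
select_feat = selector.get_feature_names_out()  # names of the retained features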
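To make the selection rule concrete, the sketch below reproduces it with the standard library only: a feature is kept when its population variance is strictly greater than the threshold, which is the rule scikit-learn's `VarianceThreshold` applies. The helper `select_by_variance` and the toy column names are assumptions for illustration and do not come from the project.

```python
from statistics import pvariance


def select_by_variance(columns, threshold):
    """Return the names of columns whose population variance exceeds `threshold`."""
    return [name for name, values in columns.items() if pvariance(values) > threshold]


# Hypothetical toy features, not the competition data.
toy = {
    "elapsed": [1.0, 5.0, 9.0, 3.0],  # population variance 8.75, kept
    "is_weekend": [0, 0, 0, 1],       # population variance 0.1875, dropped
    "constant": [2, 2, 2, 2],         # population variance 0, dropped
}
kept = select_by_variance(toy, threshold=0.8)
```

Because variance depends on the unit of each feature, a fixed threshold such as 0.8 treats unscaled and scaled columns very differently, which is worth keeping in mind when reading the selected feature list.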

Modeling

Project Outcomes

Final Model

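| Model         | XGBoost | CatBoost | Last Query Transformer | SAINT+ |
|---------------|---------|----------|------------------------|--------|
| AUROC (LB)    | 0.8302  | 0.8253   | 0.8092                 | 0.8042 |
| Accuracy (LB) | 0.7661  | 0.7473   | 0.7366                 | 0.7258 |

AUROC (Public): 0.8316 (2nd)
AUROC (Private): 0.8529 (2nd)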
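For readers unfamiliar with the two leaderboard metrics, here is a minimal sketch of how AUROC and accuracy can be computed from predicted probabilities. The labels and scores are made up, and the 0.5 cutoff used for accuracy is an assumption, since the README does not state how predictions were thresholded.

```python
def accuracy(labels, scores, cutoff=0.5):
    """Share of rows where the thresholded score matches the 0/1 label."""
    predictions = [1 if s >= cutoff else 0 for s in scores]
    return sum(p == y for p, y in zip(predictions, labels)) / len(labels)


def auroc(labels, scores):
    """Probability that a random positive is scored above a random negative.

    Ties count as half a win. This pairwise form is quadratic in the number
    of rows, which is fine for a small illustration.
    """
    positives = [s for s, y in zip(scores, labels) if y == 1]
    negatives = [s for s, y in zip(scores, labels) if y == 0]
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0
        for p in positives
        for n in negatives
    )
    return wins / (len(positives) * len(negatives))


# Made-up example: 1 = answered correctly, 0 = answered incorrectly.
labels = [1, 0, 1, 1, 0]
scores = [0.9, 0.4, 0.6, 0.3, 0.5]
auc_value = auroc(labels, scores)
acc_value = accuracy(labels, scores)
```

AUROC depends only on how the scores rank the rows, while accuracy also depends on the chosen cutoff, so the two metrics can order models differently.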

score

🤖 Team Members

노관옥 박경원 이석규 이진원 장성준

 

📚 Report & Presentation

Wrap-up Report (PDF)

The project procedure, methods, results, final evaluation, and each team member's retrospective are covered in more detail in the wrap-up report.

Presentation (PPT)

Presentation slides for the project results.

Environment Setting Guide

After the server is allocated, update the package manager and configure the locale.

$ apt update
$ apt-get update
$ pip install --upgrade pip
$ apt install locales
$ locale-gen en_US.UTF-8
$ update-locale LANG=en_US.UTF-8

Running pyenv.sh on the allocated server installs pyenv.

$ bash pyenv.sh
$ source ~/.bashrc

Install poetry, then change its cache directory.

$ poetry config cache-dir /data/ephemeral/.cache/pypoetry