๐ŸณDev/Machine Learning

[Machine Learning] Learning Rate, Data Preprocessing, Overfitting and DataSet

fortune.00 2022. 1. 12. 15:22

๋ชจ๋‘๋ฅผ ์œ„ํ•œ ๋”ฅ๋Ÿฌ๋‹ (๊น€์„ฑํ›ˆ)
๋จธ์‹ ๋Ÿฌ๋‹(Machine Learning) 9์žฅ - Learning rate, data preprocessing, overfitting

ML07


1. Learning Rate

Gradient Descent ์•Œ๊ณ ๋ฆฌ์ฆ˜์€ ๊ธฐ์šธ๊ธฐ์— lr์„ ๊ณฑํ•ด์„œ, ๋‹ค์Œ W๋ฅผ ๊ฒฐ์ •ํ•œ๋‹ค. lr์ด ๋„ˆ๋ฌด ํฌ๋ฉด, ๋ฐ์ดํ„ฐ๊ฐ€ ๋ฌด์งˆ์„œํ•˜๊ฒŒ ์ดํƒˆํ•˜์—ฌ ์ตœ์ €์ ์— ์ˆ˜๋ ดํ•˜์ง€ ๋ชปํ•œ๋‹ค. ์ด๋ฅผ ์˜ค๋ฒ„์ŠˆํŒ…์ด๋ผ๊ณ  ํ•œ๋‹ค. lr์ด ๋„ˆ๋ฌด ์ž‘์œผ๋ฉด, ํ•™์Šต ์‹œ๊ฐ„์ด ์˜ค๋ž˜ ๊ฑธ๋ ค ์ตœ์ €์ ์— ์ˆ˜๋ ดํ•˜์ง€ ๋ชปํ•œ๋‹ค.

2. Data Preprocessing

๋งŒ์•ฝ ์ž…๋ ฅ๊ฐ’์ด ๋‘ ๊ฐœ ์กด์žฌํ•  ๋•Œ, ๊ฐ’์— ๋Œ€ํ•œ ๋ฒ”์œ„์˜ ์ฐจ์ด๊ฐ€ ๋งค์šฐ ํฌ๋ฉด, ์•„๋ž˜์™€ ๊ฐ™์ด ์™œ๊ณก๋œ ํ˜•ํƒœ์˜ ๊ทธ๋ž˜ํ”„๊ฐ€ ๋‚˜ํƒ€๋‚œ๋‹ค. ๋”ฐ๋ผ์„œ ํ•œ ์ถ•์œผ๋กœ๋Š” ๊ฐ„๊ฒฉ์ด ์ข๊ณ , ๋‹ค๋ฅธ ์ถ•์€ ๊ฐ„๊ฒฉ์ด ๋„“์–ด lr์ด ์กฐ๊ธˆ๋งŒ ์ปค๋„ ์ตœ์ €์ ์— ์ˆ˜๋ ดํ•˜์ง€ ๋ชปํ•œ๋‹ค. ์ด๋ฅผ ํ•ด๊ฒฐํ•˜๊ธฐ ์œ„ํ•ด Preprocessing์„ ์ด์šฉํ•˜์—ฌ ์ •๊ทœํ™”๋ฅผ ์‹œ์ผœ์ค€๋‹ค.

3. Overfitting

๋จธ์‹ ๋Ÿฌ๋‹์ด ํ•™์Šต ๋ฐ์ดํ„ฐ์— ๋„ˆ๋ฌด ๋”ฑ ๋งž์œผ๋ฉด, ์‹ค์ œ ๋ฐ์ดํ„ฐ์—์„œ ํ‹€๋ฆด ์ˆ˜ ์žˆ๋Š” ํ™•๋ฅ ์ด ๋†’๋‹ค. ์ด๋ฅผ ํ•ด๊ฒฐํ•˜๊ธฐ ์œ„ํ•ด์„œ๋Š” ๋จผ์ € ํ•™์Šต ๋ฐ์ดํ„ฐ์˜ ์–‘์„ ๋Š˜๋ฆฌ๊ฑฐ๋‚˜, regulation์œผ๋กœ ์ผ๋ฐ˜ํ™”๋ฅผ ์‹œํ‚ค๋Š” ๋ฐฉ๋ฒ•์ด ์žˆ๋‹ค. regulation์€ Weight์˜ ๊ฐ’์„ ๋งค์šฐ ํฌ์ง€ ์•Š๊ฒŒ ๋งŒ๋“œ๋Š” ๋ฐฉ๋ฒ•์ด๋‹ค. Weight๊ฐ€ ํฌ๋ฉด ๊ทธ๋ž˜ํ”„๊ฐ€ ๊ตฌ๋ถ€๋Ÿฌ์ง„ ํ˜•ํƒœ๋ฅผ ๊ฐ€์ง€๋ฉฐ, ์ž‘์œผ๋ฉด ์„ ํ˜•์ ์ธ ๋ชจ์Šต์„ ๋ˆ๋‹ค. ์ด๋ฅผ ์‚ฌ์šฉํ•˜๊ธฐ ์œ„ํ•ด cost function ๋’ค์— regulation์„ ์œ„ํ•œ ์‹์„ ์ถ”๊ฐ€์‹œ์ผœ์ค€๋‹ค. ๋ชจ๋“  weight๊ฐ’์˜ ์ œ๊ณฑ์„ ๋”ํ•˜๋ฉด, cost function์„ ํ†ตํ•ด ์ตœ์†Œํ™”๋œ ๊ฐ’์„ ์ฐพ๊ฒŒ ๋˜๊ณ , weight ๊ฐ’์ด ์ž‘์€ ๊ฐ’์„ ๊ฐ€์ง€๋„๋ก ์„ค๊ณ„๋œ๋‹ค.

4. DataSet

์ „์ฒด DataSet์˜ 70ํผ์„ผํŠธ๋Š” Training Set์œผ๋กœ, 30ํผ์„ผํŠธ๋Š” Testing Set์œผ๋กœ ๋‚˜๋ˆˆ๋‹ค. ๊ทธ๋ž˜์„œ Traning Set์œผ๋กœ ๋ชจ๋ธ์„ ํ•™์Šต์‹œํ‚ค๊ณ , Test Set์œผ๋กœ๋Š” ๋จธ์‹ ์˜ ์„ฑ๋Šฅ์„ ์ธก์ •ํ•œ๋‹ค. ๋˜ํ•œ ๋ชจ๋ธ์„ ํ•™์Šต ์‹œํ‚ฌ ๋•Œ, lr์™€ ฮป๊ฐ’์„ ์กฐ์ ˆํ•  ์ˆ˜ ์žˆ๋‹ค. ๋”ฐ๋ผ์„œ Training Set์„ Validation์œผ๋กœ ์ผ๋ถ€ ๋” ๋‚˜๋ˆŒ ์ˆ˜ ์žˆ๋‹ค.