๐Ÿ“– Theory/AI & ML

[ML] Bias์™€ Variance : ๋จธ์‹ ๋Ÿฌ๋‹ ๋ชจ๋ธ ํ‰๊ฐ€ ๋ฐฉ๋ฒ•

๋ญ…์ฆค 2022. 3. 22. 23:08
๋ฐ˜์‘ํ˜•

 

  • Bias : ๋ชจ๋ธ์˜ ์ถœ๋ ฅ์œผ๋กœ ์–ป์€ ์˜ˆ์ธก๊ฐ’๊ณผ ์ •๋‹ต(Ground Truth) ์™€์˜ ์ฐจ์ด์˜ ํ‰๊ท 
  • Variance : ๋‹ค์–‘ํ•œ ๋ฐ์ดํ„ฐ ์…‹์—์„œ ์˜ˆ์ธก๊ฐ’์ด ์–ผ๋งˆ๋‚˜ ๋ณ€ํ™”ํ•  ์ˆ˜ ์žˆ๋Š”์ง€์— ๋Œ€ํ•œ ๊ฐ’

 

๋จธ์‹ ๋Ÿฌ๋‹์—์„œ bias์™€ variance๋Š” ๋ชจ๋ธ์ด ์–ผ๋งˆ๋‚˜ ์ž˜ ํ•™์Šต๋˜์—ˆ๋Š”์ง€ ํ™•์ธํ•˜๋Š” ์ฒ™๋„ ์ค‘ ํ•˜๋‚˜๋กœ, ๊ฐ€์žฅ ์ข‹์€ ๊ฒฝ์šฐ๋Š” bias์™€ variance๊ฐ€ ๋ชจ๋‘ ๋‚ฎ์€ ๊ฒฝ์šฐ์ด๋‹ค. (์•„๋ž˜ ๊ทธ๋ฆผ ์ฐธ๊ณ )

๊ทธ๋Ÿฐ๋ฐ ์œ„์˜ ๋‚ด์šฉ์€ ๋„ˆ๋ฌด ๋‹น์—ฐํ•œ ๋‚ด์šฉ์ด์ž ๊ฒฐ๊ณผ๋ก ์ ์ธ ์ด์•ผ๊ธฐ์ด๊ณ , bias์™€ variance๋Š” ๋ชจ๋ธ ํ•™์Šต๊ณผ ์—ฐ๊ด€์ง€์–ด ์ƒ๊ฐํ•  ํ•„์š”๊ฐ€ ์žˆ๋‹ค.

 

ํ•™์Šต์ด ๋œ๋œ underfitting ๊ตฌ๊ฐ„์—๋Š” ํ•™์Šต๋ฐ์ดํ„ฐ ์…‹์˜ ์˜ˆ์ธก๊ฐ’๋„ ๋งŽ์ด ํ‹€๋ฆฌ๊ธฐ ๋•Œ๋ฌธ์— bias๊ฐ€ ๋†’์€ ์ƒํƒœ์ด๊ณ , ์ ์ ˆํ•œ ํ•™์Šต ์ข…๋ฃŒ ์ง€์ ์„ ์ง€๋‚œ ๊ตฌ๊ฐ„์—์„œ๋Š” ํ•™์Šต ๋ฐ์ดํ„ฐ์…‹๊ณผ ๋ชจ๋ธ์˜ loss๋ฅผ ์ตœ์†Œํ™” ํ•˜๊ธฐ ์œ„ํ•ด overfitting์ด ๋ฐœ์ƒํ•˜์—ฌ variance๊ฐ€ ๋†’์€ ์ƒํƒœ๊ฐ€ ๋˜๋Š” ๊ฒƒ์ด๋‹ค.

 

์•„๋ž˜ ๊ทธ๋ฆผ๋“ค์„ ๋ณด๋ฉด ์กฐ๊ธˆ ๋” ์ดํ•ด๊ฐ€ ์‰ฌ์›Œ์ง„๋‹ค.

 

Underfitting, Overfitting๊ณผ Bias, Variance ์™€์˜ ๊ด€๊ณ„
Underfitting, Overfitting๊ณผ Bias, Variance ์™€์˜ ๊ด€๊ณ„

 

Underfitting๊ณผ Overfitting์€ ํ•™์Šต ์ข…๋ฃŒ์‹œ์ ์— ๋”ฐ๋ผ ๋ฐœ์ƒํ•  ์ˆ˜๋„ ์žˆ์ง€๋งŒ, ๋จธ์‹ ๋Ÿฌ๋‹ ๋ชจ๋ธ๊ณผ task(dataset)์™€์˜ ๊ด€๊ณ„์™€๋„ ๊ด€๋ จ์ด ์žˆ๋‹ค.

 

  • High Bias / underfitting : ์ฃผ๋กœ underfitting์€ ํ’€์–ด์•ผํ•  task์— ๋น„ํ•ด ๋ชจ๋ธ์˜ capability๊ฐ€ ์•ฝํ•œ ๊ฒฝ์šฐ์— ๋ฐœ์ƒ
  • High Variance / overfitting : ์ฃผ๋กœ overfitting์€ ํ’€์–ด์•ผํ•  task์— ๋น„ํ•ด ๋ชจ๋ธ์˜ capability๊ฐ€ ๋„ˆ๋ฌด ์ข‹์€ ๊ฒฝ์šฐ์— ๋ฐœ์ƒ

 

๋•Œ๋ฌธ์— Low bais, Low variance ๊ฒฐ๊ณผ๋ฅผ ๊ฐ€์ง€๋Š” ๋ชจ๋ธ์„ ํ•™์Šต์‹œํ‚ค๊ธฐ ์œ„ํ•ด์„œ๋Š” 1. validation set ์„ ์‚ฌ์šฉ, 2. task ๋Œ€๋น„ ์ ์ ˆํ•œ ์‚ฌ์ด์ฆˆ์˜ ๋ชจ๋ธ ์‚ฌ์šฉ, 3. ์ ์ ˆํ•œ regularization ์‚ฌ์šฉํ•˜๋Š” ๊ฒƒ์ด ์ค‘์š”ํ•˜๋‹ค.

 

๋ฐ˜์‘ํ˜•