๋ณธ๋ฌธ ๋ฐ”๋กœ๊ฐ€๊ธฐ
math4ai/Probability & Statistics

That's a Wrap! - Prob&Stat

by ์žผ๋ฏผai 2024. 8. 9.

์ถœ์ฒ˜: https://jrc-park.tistory.com/259

  • ๐Ÿง Central Limit Theorem ์ด๋ž€ ๋ฌด์—‡์ธ๊ฐ€? ํ‘œ๋ณธํ‰๊ท ์„ ์ •๊ทœํ™”ํ–ˆ์„ ๋•Œ ํ‘œ๋ณธ์˜ ํฌ๊ธฐ๊ฐ€ ์ฆ๊ฐ€ํ•จ์— ๋”ฐ๋ผ ๊ทธ ๋ถ„ํฌ๊ฐ€ ํ‘œ์ค€์ •๊ทœ๋ถ„ํฌ์— ์ˆ˜๋ ดํ•œ๋‹ค๋Š” ์ •๋ฆฌ.
  • ๐Ÿง Central Limit Theorem์€ ์–ด๋””์— ์“ธ ์ˆ˜ ์žˆ๋Š”๊ฐ€? ๋ชจ์ง‘๋‹จ์˜ ํ‘œ์ค€ํŽธ์ฐจ๋ฅผ ์•Œ๊ณ  ์žˆ๋Š” ์ƒํ™ฉ์—์„œ ํ‘œ๋ณธ์กฐ์‚ฌ๋ฅผ ํ†ตํ•œ ํ†ต๊ณ„์  ์ถ”์ •์„ ํ•  ๋•Œ CLT์— ๋”ฐ๋ฅด๋ฉด ํ‘œ๋ณธํ‰๊ท ์€ ๋ชจํ‰๊ท ์— ์ˆ˜๋ ดํ•˜๊ณ  ํ‘œ๋ณธํ‰๊ท ์„ ์ •๊ทœํ™”ํ•œ ๋ถ„ํฌ๋Š” ํ‘œ์ค€์ •๊ทœ๋ถ„ํฌ๋ฅผ ๋”ฐ๋ฅด๋ฏ€๋กœ, ์‹ ๋ขฐ๊ตฌ๊ฐ„์„ ์ด์šฉํ•œ ๋ชจํ‰๊ท ์˜ ์ถ”์ • ๋“ฑ์— ํ™œ์šฉ๋  ์ˆ˜ ์žˆ๋‹ค. ๋‹จ ์ด๋•Œ, ๋ถ„์‚ฐ์ด ์œ ํ•œํ•˜๊ณ , ํ‘œ๋ณธ๋“ค์ด ๊ฐ™์€ ํ™•๋ฅ ๋ถ„ํฌ๋ฅผ ๋”ฐ๋ฅด๋Š” ์ง‘๋‹จ์—์„œ ๋…๋ฆฝ์ถ”์ถœ๋กœ ์–ป์–ด์ง„ ๊ฒƒ๋“ค์ด์–ด์•ผ ํ•œ๋‹ค๋Š” ์กฐ๊ฑด์ด ์žˆ๋‹ค!
  • ๐Ÿง ํฐ์ˆ˜์˜ ๋ฒ•์น™์ด๋ž€? ๋นˆ๋„์ฃผ์˜์  ๊ด€์ ์—์„œ, ์–ด๋–ค ์‚ฌ๊ฑด์ด ์ผ์–ด๋‚  ํ†ต๊ณ„์  ํ™•๋ฅ ์€ ์ƒ˜ํ”Œ ์‚ฌ์ด์ฆˆ๊ฐ€ ์ปค์ง€๋ฉด ์ˆ˜ํ•™์  ํ™•๋ฅ ์— ์ˆ˜๋ ดํ•œ๋‹ค!
  • ๐Ÿง ํ™•๋ฅ ์ด๋ž‘ ํ†ต๊ณ„๋ž‘ ๋‹ค๋ฅธ ์ ์€?
  • ๐Ÿง Marginal Distribution์ด๋ž€ ๋ฌด์—‡์ธ๊ฐ€?
  • ๐Ÿง Conditional Distribution์ด๋ž€ ๋ฌด์—‡์ธ๊ฐ€?  
  • ๐Ÿง Bias๋ž€ ๋ฌด์—‡์ธ๊ฐ€? ๊ฐ€์„ค๋กœ์จ ์˜ˆ์ธกํ•œ ๊ฐ’์˜ ๊ธฐ๋Œ“๊ฐ’๊ณผ ์‹ค์ œ๊ฐ’์˜ ์ฐจ์ด๋ฅผ ์˜๋ฏธํ•จ
  • ๐Ÿง Biased/Unbiased estimation์˜ ์ฐจ์ด๋Š”? ํŒŒ๋ผ๋ฏธํ„ฐ ์ถ”์ •๊ฐ’์˜ ๊ธฐ๋Œ“๊ฐ’์ด ์‹ค์ œ ๋ชจ์ง‘๋‹จ์˜ ๊ฐ’๊ณผ ์ผ์น˜ํ•˜๋ฉด unbiased, ๋ถˆ์ผ์น˜ํ•˜๋ฉด biased๋ผ๊ณ  ํ•ฉ๋‹ˆ๋‹ค. unbiased์˜ ๋Œ€ํ‘œ์ ์ธ ๊ฒƒ์ด MLE๋กœ ๊ณ„์‚ฐ๋œ ํ‰๊ท , biased์˜ ๋Œ€ํ‘œ์ ์ธ ๊ฒƒ์ด ์ด MLE ํ‰๊ท ์œผ๋กœ๋ถ€ํ„ฐ ์œ ๋„๋œ ๋ถ„์‚ฐ!
  • ๐Ÿง Bias, Variance, MSE๋ž€? ๊ทธ๋ฆฌ๊ณ  ๊ทธ๋“ค์˜ ๊ด€๊ณ„๋Š” ๋ฌด์—‡์ธ๊ฐ€? Bias-Variance Decomposition์˜ ๊ฐœ๋…์— ์ด๋“ค์˜ ๊ด€๊ณ„๊ฐ€ ํ•จ์ถ•๋˜์–ด ์žˆ๋Š”๋ฐ, ์‹ค์ œ๊ฐ’ $y$์™€ ์šฐ๋ฆฌ์˜ ๊ฐ€์„ค๋กœ๋ถ€ํ„ฐ ์˜ˆ์ธกํ•œ ๊ฐ’ $y^\prime$์˜ mean-squared-error๋Š” bias^2, variance, ๊ทธ๋ฆฌ๊ณ  unavoidable error๋กœ ๋ถ„ํ•ด(?)ํ•  ์ˆ˜ ์žˆ๋‹ค๋Š” ๊ฒƒ.. ์ด๋•Œ variance๋Š” ๋ชจ๋ธ์˜ ๋ณต์žก๋„(?)๋ฅผ ๋‚˜ํƒ€๋‚ด๋Š” ์ง€ํ‘œ..์ธ๋ฐ ๊ฐ€์„ค์˜ ์˜ˆ์ธก์„ h(x)๋ผ๊ณ  ํ•œ๋‹ค๋ฉด, h(x)์˜ ๋ถ„์‚ฐ์„ ์˜๋ฏธํ•œ๋‹ค๊ณ  ์„ค๋ช…..ํ•ด์•ผ ํ•  ๊ฑฐ ๊ฐ™๋‹ค.
  • ๐Ÿง Sample Variance๋ž€ ๋ฌด์—‡์ธ๊ฐ€?
  • ๐Ÿง Variance๋ฅผ ๊ตฌํ•  ๋•Œ, N๋Œ€์‹ ์— N-1๋กœ ๋‚˜๋ˆ ์ฃผ๋Š” ์ด์œ ๋Š” ๋ฌด์—‡์ธ๊ฐ€? Unbiased estimator๋กœ ๋งŒ๋“ค์–ด์ฃผ๊ธฐ ์œ„ํ•ด์„œ์ž„. N์œผ๋กœ ๋‚˜๋ˆ„๋ฉด, ์ด estimator์˜ ๊ธฐ๋Œ“๊ฐ’์ด ๋ชจ์ˆ˜์˜ ํŒŒ๋ผ๋ฏธํ„ฐ์™€ ๋‹ค๋ฆ„..! ๊ทธ๋ž˜์„œ n-1๋กœ ๋‚˜๋ˆ ์ค˜์•ผ ํ•˜๋Š”๋ฐ, ์—ฌ๊ธฐ์—๋Š” ํ†ต๊ณ„ํ•™์—์„œ์˜ ์ž์œ ๋„ ๊ฐœ๋…์ด ๋‚ดํฌ๋˜์–ด ์žˆ๊ณ  sample variance๋ฅผ ๊ตฌํ•˜๋Š” ๋ฐ ์žˆ์–ด ์‚ฌ์šฉ๋˜๋Š” ํ‘œ๋ณธํ‰๊ท ์˜ DoF๊ฐ€ n-1์ด๋ผ ๊ทธ๋ ‡๋‹ค๊ณ  ์•Œ๊ณ  ์žˆ์œผ๋‚˜ ์ด ์ด์ƒ์€...ใ…œใ…œ
  • ๐Ÿง Gaussian Distribution์—์„œ MLE์™€ Sample Variance ์ค‘์— ์–ด๋–ค ๊ฑธ ์‚ฌ์šฉํ•ด์•ผ ํ•˜๋Š”๊ฐ€? ?? ๋ชฌ์†Œ๋ฆฌ์•ผ
  • ๐Ÿง Unbiased Estimation์€ ๋ฌด์กฐ๊ฑด ์ข‹์€๊ฐ€?
  • ๐Ÿง Unbiaed Estimation์˜ ์žฅ์ ์€ ๋ฌด์—‡์ธ๊ฐ€? ๋ชจ์ง‘๋‹จ์„ ์ž˜ ๋Œ€ํ‘œํ•ด ์ค€๋‹จ ๊ฑฐ..?
  • ๐Ÿง Binomial, Bernoulli, Multinomial, Multinoulli ๋ž€ ๋ฌด์—‡์ธ๊ฐ€?
  • ๐Ÿง Beta Distribution๊ณผ Dirichlet Distribution์ด๋ž€ ๋ฌด์—‡์ธ๊ฐ€?
  • ๐Ÿง Gamma Distribution์€ ์–ด๋””์— ์“ฐ์ด๋Š”๊ฐ€? ์ง€์ˆ˜ ๋ถ„ํฌ๋ž‘ ํ‘ธ์•„์†ก ๋ถ„ํฌ์˜ Conjugate prior๋กœ์„œ ๋งŽ์ด ํ™œ์šฉ๋จ.
  • ๐Ÿง Possion distribution์€ ์–ด๋””์— ์“ฐ์ด๋Š”๊ฐ€? ํŠน์ • ์‹œ๊ฐ„ ๋™์•ˆ ์‚ฌ๊ฑด์ด ๋ช‡ ๋ฒˆ ์ผ์–ด๋‚˜๋Š”์ง€์— ๋Œ€ํ•œ ์ด์‚ฐํ™•๋ฅ ๋ถ„ํฌ๋กœ ๋งŽ์ด ํ™œ์šฉ๋˜๊ณ , ์ด๋•Œ ์‚ฌ์šฉ๋˜๋Š” ํŒŒ๋ผ๋ฏธํ„ฐ $\lambda$๊ฐ€ ๊ทธ average๋ฅผ ๋‚˜ํƒ€๋ƒ„.
  • ๐Ÿง Bias and Varaince Trade-Off ๋ž€ ๋ฌด์—‡์ธ๊ฐ€? ์œ„์—์„œ ๋งํ•จ~
  • ๐Ÿง Conjugate Prior๋ž€?
  • ๐Ÿง Confidence Interval์ด๋ž€ ๋ฌด์—‡์ธ๊ฐ€? 
  • ๐Ÿง covariance/correlation ์ด๋ž€ ๋ฌด์—‡์ธ๊ฐ€?
    - ๊ณต๋ถ„์‚ฐ: ๋‘ ํ™•๋ฅ ๋ณ€์ˆ˜ ์‚ฌ์ด์˜ ์„ ํ˜• ๊ด€๊ณ„๋ฅผ ๋‚˜ํƒ€๋‚ด๋Š” ์ง€ํ‘œ๋กœ, ํ•˜๋‚˜๊ฐ€ ์ฆ๊ฐ€ํ•  ๋•Œ ๋‹ค๋ฅธ ํ•˜๋‚˜๋„ ๊ฐ์†Œํ•˜๋Š” ๊ฒฝํ–ฅ์ด ์žˆ์œผ๋ฉด ๊ณต๋ถ„์‚ฐ์ด ์–‘์˜ ๊ฐ’์„ ๊ฐ€์ง€๊ณ  ๋…๋ฆฝ์ด๋ฉด 0, ์šฐํ•˜ํ–ฅ(?)์ด๋ฉด ์Œ์˜ ๊ฐ’์„ ๊ฐ€์ง!
    - ์ƒ๊ด€๊ณ„์ˆ˜: ๊ณต๋ถ„์‚ฐ/๋‘๋ณ€์ˆ˜์˜std๊ณฑ; ์–˜๋„ ๋งˆ์ฐฌ๊ฐ€์ง€์˜ ๊ธฐ๋Šฅ์ž„
  • ๐Ÿง Total variation ์ด๋ž€ ๋ฌด์—‡์ธ๊ฐ€?
  • ๐Ÿง Explained variation ์ด๋ž€ ๋ฌด์—‡์ธ๊ฐ€?
  • ๐Ÿง Uexplained variation ์ด๋ž€ ๋ฌด์—‡์ธ๊ฐ€
  • ๐Ÿง Coefficient of determination ์ด๋ž€? (๊ฒฐ์ •๊ณ„์ˆ˜)
  • ๐Ÿง Total variation distance์ด๋ž€ ๋ฌด์—‡์ธ๊ฐ€?
  • ๐Ÿง P-value๋ž€ ๋ฌด์—‡์ธ๊ฐ€?
  • ๐Ÿง likelihood-ratio test ์ด๋ž€ ๋ฌด์—‡์ธ๊ฐ€?

 

728x90

'math4ai > Probability & Statistics' ์นดํ…Œ๊ณ ๋ฆฌ์˜ ๋‹ค๋ฅธ ๊ธ€

M4AI) Probability & Statistics  (1) 2024.08.09