
🏛 Research/Detection & Segmentation (14)

[Paper Review] Pyramid Scene Parsing Network / PSPNet / Pyramid Pooling This paper was published at CVPR 2017 and proposes PSPNet, which took first place in the ImageNet Scene Parsing Challenge 2016. Many stronger methods have appeared since, but I am writing this review to summarize the Pyramid Pooling Module, which is used to bring global contextual information into semantic segmentation. Motivation The paper points out three problems with existing segmentation algorithms (the figure above compares against FCN). 1) Mismatched Relationship: pixels are classified in a way that does not fit the surrounding environment (contextual information), for example a car near a lake or a boat on a road.. 2021. 12. 5.
[Paper Review] Unified Perceptual Parsing for Scene Understanding / UperNet / Multi-task learning This paper, published at ECCV 2018, proposes a new task called Unified Perceptual Parsing, which recognizes a variety of visual concepts at once (multi-task learning). Introduction In the figure above, a living room (scene) is made up of various objects (object) such as a table, a painting, and a wall, while the table itself consists of legs, a top, an apron (part), and so on. The table is also made of wood (material), and the sofa surface is knitted (texture). These categories have each been handled independently, in scene understanding and in object/material/part/texture recognition tasks... 2021. 12. 4.