๋ณธ๋ฌธ ๋ฐ”๋กœ๊ฐ€๊ธฐ
728x90

๐Ÿ› Research/Detection & Segmentation14

[๋…ผ๋ฌธ ๋ฆฌ๋ทฐ] End-to-End Object Detection with Transformers | DETR ์„ค๋ช… ์˜ค๋Š˜์€ 2020๋…„์— Meta์—์„œ ๊ณต๊ฐœํ•œ DETR ๋ชจ๋ธ(ECCV 2020)์„ ๋ฆฌ๋ทฐํ•ด ๋ณด๊ณ ์ž ํ•œ๋‹ค. ํ”ผ ์ธ์šฉ์ˆ˜๊ฐ€ 9000ํšŒ์— ์œก๋ฐ•ํ•˜๋ฉฐ, ์ตœ๊ทผ ๊ณต๊ฐœ๋˜๋Š” ๊ฐ์ฒด ๊ฒ€์ถœ ๋…ผ๋ฌธ๋“ค์„ ๋ณด๋ฉด DETR ๊ธฐ๋ฐ˜์˜ ์—ฐ๊ตฌ๋„ ์‹ฌ์‹ฌ์น˜ ์•Š๊ฒŒ ๋ณผ ์ˆ˜ ์žˆ๋‹ค. Deformable DETR, Conditional DETR, Group DETR, Co-DETR, ... DETR (DEtection TRansformer) DETR์€ ํŠธ๋žœ์Šคํฌ๋จธ์™€ ์ด๋ถ„ ๋งค์นญ(Bipartite-matching) ๊ธฐ๋ฐ˜์˜ ์ƒˆ๋กœ์šด ๊ฒ€์ถœ ๋ฐฉ์‹์„ ๋„์ž…ํ•˜์—ฌ RPN, NMS์™€ ๊ฐ™์€ hand-crafted ํ•œ ์—”์ง€๋‹ˆ์–ด๋ง์ด ํ•„์š”์—†๋Š” ๋ชจ๋ธ ๊ตฌ์กฐ๋ผ๊ณ  ํ•œ๋‹ค. ๊ตฌ์กฐ์ ์œผ๋กœ ๊ต‰์žฅํžˆ ๊ฐ„๋‹จํ•˜๋ฉด์„œ ๋‹ค๋ฅธ task์— ๋Œ€ํ•œ ํ™•์žฅ์„ฑ๋„ ์ข‹๊ณ , ์–ดํ…์…˜ ๋ฉ”์ปค๋‹ˆ์ฆ˜์„ ์ด์šฉํ•˜๊ธฐ ๋•Œ๋ฌธ์— ํฐ ๊ฐ์ฒด๋ฅผ ๊ฒ€์ถœ ๋Šฅ๋ ฅ์ด Faste.. 2023. 11. 25.
[๋…ผ๋ฌธ ๋ฆฌ๋ทฐ] Fast Segment Anything | Fast SAM | SAM์˜ ๊ฒฝ๋Ÿ‰ํ™” SAM (Segment Anything Model) ์„ค๋ช… ๋ฐ ์‚ฌ์šฉ ๋ฐฉ๋ฒ• [Meta AI] SAM (Segment Anything Model) ์‚ฌ์šฉ ๋ฐฉ๋ฒ• | ๋ชจ๋“  ๊ฐ์ฒด๋ฅผ ๋ถ„ํ• ํ•˜๋Š” Vision AI ๋ชจ๋ธ SAM (Segment Anything Model) Meta ์—์„œ SAM (Segment Anything Model) ์ด๋ผ๋Š” ์–ด๋–ค ๊ฒƒ์ด๋“  ๋ถ„ํ• ํ•  ์ˆ˜ ์žˆ๋Š” ๋ชจ๋ธ์„ ๊ณต๊ฐœํ–ˆ๋‹ค. ๋…ผ๋ฌธ ์ œ๋ชฉ ์ž์ฒด๊ฐ€ 'Segment Anything' ์ธ๋ฐ ๊ต‰์žฅํžˆ ์ž์‹ ๊ฐ ๋„˜์น˜๋Š” ์›Œ๋”ฉ์ด๋‹ค. ๊ฐ„๋‹จํ•œ ์„ค๋ช…์„ mvje.tistory.com Meta AI์˜ Segment Anything Model (SAM)์ด ๊ณต๊ฐœ๋œ์ง€ ์–ผ๋งˆ๋‚˜ ๋๋‹ค๊ณ  ๋ฒŒ์จ Fast SAM์ด๋ผ๋Š” ์†๋„๊ฐ€ ํ–ฅ์ƒ๋œ ๋ฒ„์ „์˜ SAM์ด ๊ณต๊ฐœ๋˜์—ˆ๋‹ค. ๋น…ํ…Œํฌ ๊ธฐ์—…์—์„œ ํ˜์‹ ์ ์ธ AI ๋ชจ๋ธ์„ ์ง€์†์ .. 2023. 7. 2.
[๋…ผ๋ฌธ ์†Œ๊ฐœ] TAM (Track Anything Model) | ์–ด๋–ค ๊ฒƒ์ด๋“  ์ถ”์ ํ•˜๋Š” Vision AI ๋ชจ๋ธ | Sagment Anything ๋น„๋””์˜ค ๋ฒ„์ „ Track Anything: Segment Anything Meets Videos ์„ธ์ƒ ์ฐธ ๋น ๋ฅด๋‹ค. Meta AI์˜ SAM (Segment Anything Model)์ด ๋‚˜์˜จ์ง€ ์–ผ๋งˆ๋‚˜ ๋๋‹ค๊ณ  SAM์„ ๋น„๋””์˜ค์— ์ ์šฉํ•ด tracking task๋ฅผ ์ˆ˜ํ–‰ํ•˜๋Š” TAM (Tracking Anything Model) ๋…ผ๋ฌธ์ด ๋‚˜์™”๋‹ค๊ณ  ํ•œ๋‹ค. Track-Anything์€ ๋น„๋””์˜ค ๊ฐ์ฒด ์ถ”์  ๋ฐ ๋ถ„ํ• ์„ ์œ„ํ•œ ์œ ์—ฐํ•œ ๋Œ€ํ™”ํ˜• ๋„๊ตฌ๋กœ Segment Anything์—์„œ ๊ฐœ๋ฐœ๋˜์—ˆ์œผ๋ฉฐ ์‚ฌ์šฉ์ž ํด๋ฆญ์„ ํ†ตํ•ด์„œ๋งŒ ์ถ”์  ๋ฐ ์„ธ๊ทธ๋จผํŠธํ™”ํ•  ํ•ญ๋ชฉ์„ ์ง€์ •ํ•  ์ˆ˜ ์žˆ๋‹ค. ์ถ”์ ํ•˜๋Š” ๋™์•ˆ ์‚ฌ์šฉ์ž๋Š” ์ถ”์ ํ•˜๋ ค๋Š” ๊ฐœ์ฒด๋ฅผ ์œ ์—ฐํ•˜๊ฒŒ ๋ณ€๊ฒฝํ•˜๊ฑฐ๋‚˜ ๋ชจํ˜ธํ•œ ๋ถ€๋ถ„์ด ์žˆ๋Š” ๊ฒฝ์šฐ ๊ด€์‹ฌ ์˜์—ญ์„ ์ˆ˜์ •ํ•  ์ˆ˜๋„ ์žˆ๋‹ค. ์ด๋Ÿฌํ•œ ํŠน์„ฑ์„ ํ†ตํ•ด Track-Anything์€ ๋‹ค์Œ๊ณผ ๊ฐ™์€ ์ž‘์—….. 2023. 4. 30.
[๋…ผ๋ฌธ ์†Œ๊ฐœ] DINOv2 - Self-supervised Vision Transformer | Meta AI | ๋ ˆ์ด๋ธ” ๋ฐ์ดํ„ฐ ์—†์ด ๊ฐ•๋ ฅํ•œ ์„ฑ๋Šฅ์„ ๋‚ด๋Š” Vision AI ๋ชจ๋ธ DINOv2 ๋…ผ๋ฌธ ์ œ๋ชฉ : DINOv2: Learning Robust Visual Features without Supervision GitHub Demo 23๋…„ 4์›” Meta AI์—์„œ self-supervised learning์„ ์‚ฌ์šฉํ•˜์—ฌ ๊ณ ์„ฑ๋Šฅ ์ปดํ“จํ„ฐ๋น„์ „ ๋ชจ๋ธ์„ ํ•™์Šตํ•˜๋Š” ์ƒˆ๋กœ์šด ๋ฐฉ๋ฒ•์ธ DINOv2๋ฅผ ๊ณต๊ฐœํ–ˆ๋‹ค. LLM(Large Language Model) ํ•™์Šต์—๋„ ํ™œ์šฉ๋˜๋Š” self-supervised learning ๋ฐฉ๋ฒ•์€ ๋ชจ๋ธ ํ•™์Šต ์‹œ ๋งŽ์€ ์–‘์˜ ๋ ˆ์ด๋ธ”์ด ์ง€์ •๋œ ๋ฐ์ดํ„ฐ๊ฐ€ ํ•„์š”ํ•˜์ง€ ์•Š๊ธฐ ๋•Œ๋ฌธ์— AI ๋ชจ๋ธ์„ ํ•™์Šตํ•˜๋Š” ๊ฐ•๋ ฅํ•˜๊ณ  ์œ ์—ฐํ•œ ๋ฐฉ๋ฒ•์ด๋‹ค. ๋…ผ๋ฌธ์— ๋”ฐ๋ฅด๋ฉด ์ตœ๊ทผ ๋ช‡๋…„ ๋™์•ˆ ์ปดํ“จํ„ฐ๋น„์ „ ์ž‘์—…์˜ ํ‘œ์ค€ ์ ‘๊ทผ ๋ฐฉ์‹์ด์—ˆ๋˜ ์ด๋ฏธ์ง€-ํ…์ŠคํŠธ๋ฅผ ํŽ˜์–ด๋กœ ํ•™์Šตํ•˜๋Š” ๋ฉ€ํ‹ฐ๋ชจ๋‹ฌ ๋ฐฉ์‹์˜ ํ•™์Šต ๋ฐฉ๋ฒ•์—์„œ๋Š” ์ด๋ฏธ์ง€์˜ ์บก์…˜ ์ •๋ณด์— ์˜์กดํ•œ.. 2023. 4. 29.
[๋…ผ๋ฌธ ๋ฆฌ๋ทฐ] SegFormer: Simple and Efficient Design for Semantic Segmentation with Transformers ๋ณธ ๋…ผ๋ฌธ์€ NeurIPS 2021 ์— ๊ณต๊ฐœ๋˜์—ˆ๊ณ , ์‹ฌํ”Œํ•˜๊ณ  ๊ฐ•๋ ฅํ•œ semantic segmentation task ์šฉ Transformer ์ธ SegFormer ๋ฅผ ์ œ์•ˆํ•˜๋Š” ๋…ผ๋ฌธ์ž…๋‹ˆ๋‹ค. Abstract ๋ณธ ๋…ผ๋ฌธ์—์„œ๋Š” ํšจ์œจ์ ์ธ Segmentation task ์ˆ˜ํ–‰์„ ์œ„ํ•œ ๊ฐ„๋‹จํ•˜๊ณ  ํšจ์œจ์ ์ด๋ฉด์„œ ๊ฐ•๋ ฅํ•œ semantic segmentation ํ”„๋ ˆ์ž„์›Œํฌ์ธ SegFormer ๋ฅผ ์ œ์•ˆํ•ฉ๋‹ˆ๋‹ค. SegFormer ๋Š” 1) multi-scale feature ๋ฅผ ์ถ”์ถœํ•˜๋Š” ์ƒˆ๋กœ์šด hierarchically structured Transformer encoder ๋กœ ๊ตฌ์„ฑ๋˜๊ณ , positional encoding์ด ํ•„์š”ํ•˜์ง€ ์•Š๊ธฐ ๋•Œ๋ฌธ์— ํ…Œ์ŠคํŠธ ์ด๋ฏธ์ง€์˜ ํ•ด์ƒ๋„๊ฐ€ ํ•™์Šต ์ด๋ฏธ์ง€์˜ ํ•ด์ƒ๋„์™€ ๋‹ค๋ฅผ ๋•Œ ์„ฑ๋Šฅ์ด ์ €ํ•˜๋˜๋Š” positiona.. 2022. 8. 9.
[๋…ผ๋ฌธ ๋ฆฌ๋ทฐ] Deep Learning for Large-Scale Traffic-Sign Detection and Recognition / ๊ตํ†ต ํ‘œ์ง€ํŒ ๊ฒ€์ถœ ๋ณธ ํฌ์ŠคํŒ…์—์„œ๋Š” Traffic sign detection (๊ตํ†ต ํ‘œ์ง€ํŒ ๊ฐ์ง€) ์— ๋Œ€ํ•œ ๋…ผ๋ฌธ 2๊ฐœ๋ฅผ ์†Œ๊ฐœํ•ฉ๋‹ˆ๋‹ค. Traffic-Sign Detection and Classification in the Wild / CVPR 2016 Deep Learning for Large-Scale Traffic-Sign Detection and Recognition / IEEE T-ITS 2019 Traffic sign detection ์€ object detection์˜ ํ•˜์œ„ task๋กœ ๋ณผ ์ˆ˜ ์žˆ๊ณ , ์ž์œจ ์ฃผํ–‰ ๋ฐ ๋„๋กœ ์ •๋ณด๋ฅผ ์ƒ์„ฑํ•˜๋Š”๋ฐ ํ•„์ˆ˜์ ์œผ๋กœ ์‚ฌ์šฉ๋ฉ๋‹ˆ๋‹ค. ๊ต‰์žฅํžˆ ์ž‘์€ ๊ฐ์ฒด๋ฅผ ๊ฐ์ง€ํ•˜๋Š” ๋ฐฉ๋ฒ•๋“ค์ด ๊ถ๊ธˆํ–ˆ์—ˆ๋Š”๋ฐ, traffic sign detection ๋…ผ๋ฌธ๋“ค์ด ๋„์›€์ด ๋˜๋Š” ๊ฒƒ ๊ฐ™์Šต๋‹ˆ๋‹ค. "Traffic-Sign De.. 2022. 7. 8.
728x90