๋ณธ๋ฌธ ๋ฐ”๋กœ๊ฐ€๊ธฐ
728x90

๐Ÿ› Research58

[๋…ผ๋ฌธ ๋ฆฌ๋ทฐ] Data Augmentation for Scene Text Recognition ํ…์ŠคํŠธ ์ธ์‹์— ํฌ์ปค์Šค๊ฐ€ ๋งž์ถฐ์ง„ augmentation์ด ์žˆ์„๊นŒ ์‹ถ์–ด ๋…ผ๋ฌธ์„ ์ฐพ๋˜์ค‘ ICCV 2021 ํ•™ํšŒ์—์„œ ๋ฐœํ‘œ๋œ STR์—์„œ์˜ Data augmentation ๋…ผ๋ฌธ์ด ์žˆ์–ด์„œ ์ •๋ฆฌํ•˜๋ ค ํ•œ๋‹ค. Abstract ์ผ๋ถ€ Scene Text Recognition(STR) ๋ชจ๋ธ์€ ์‹ค์ œ ๋ฐ์ดํ„ฐ๋ฅผ ์‚ฌ์šฉํ•ด์„œ ํ‰๊ฐ€ํ•˜๊ธฐ ๋•Œ๋ฌธ์— ํ•™์Šต ๋ฐ์ดํ„ฐ์™€ ํ…Œ์ŠคํŠธ ๋ฐ์ดํ„ฐ ๋ถ„ํฌ ๊ฐ„์˜ ๋ถˆ์ผ์น˜๋Š” ์ฃผ๋กœ nosie, artifacts, geometry, structure ๋“ฑ์˜ ์˜ํ–ฅ์„ ๋ฐ›์•„์„œ ์„ฑ๋Šฅ ์ €ํ•˜๋กœ ์ด์–ด์ง„๋‹ค. ๋ณธ ๋…ผ๋ฌธ์—์„œ๋Š” ์ด๋ฅผ ๊ฐœ์„ ํ•˜๊ธฐ ์œ„ํ•ด 36๊ฐœ์˜ image augmenation function์œผ๋กœ ๊ตฌ์„ฑ๋œ STRAug๋ฅผ ์†Œ๊ฐœํ•œ๋‹ค. ๊ฐ ํ•จ์ˆ˜๋Š” ์ž์—ฐ ์žฅ๋ฉด์—์„œ ์ฐพ์„ ์ˆ˜ ์žˆ๊ฑฐ๋‚˜ ์นด๋ฉ”๋ผ ์„ผ์„œ์— ์˜ํ•ด ๋ฐœ์ƒํ•˜๊ฑฐ๋‚˜ ์‹ ํ˜ธ ์ฒ˜๋ฆฌ ์ž‘์—… ์ค‘ ๋ฐœ์ƒํ•˜๋Š” ์ด๋ฏธ์ง€ ์†์„ฑ.. 2023. 3. 11.
[์›น ๋ฐ๋ชจ] ๋„ค์ด๋ฒ„ ํด๋กœ๋ฐ” OCR ๋ฐ๋ชจ OCR์€ ์ด๋ฏธ์ง€ ์†์—์„œ ํ…์ŠคํŠธ๋ฅผ ์ฐพ๊ณ  ์ฝ์–ด๋‚ด๋Š” ๊ธฐ์ˆ ๋กœ ์ตœ๊ทผ์—๋Š” ์›ํ•˜๋Š” ํ…์ŠคํŠธ ์ •๋ณด๋งŒ์„ ์ถ”์ถœํ•˜๋Š” ์ˆ˜์ค€๊นŒ์ง€ ๋„๋‹ฌํ–ˆ๊ณ , ์ด ๋ถ„์•ผ์—์„œ๋Š” ๋„ค์ด๋ฒ„๊ฐ€ ์—…๊ณ„ ์ตœ๊ณ  ์ˆ˜์ค€์˜ ๊ธฐ์ˆ ๋ ฅ์„ ๊ฐ€์ง€๊ณ  ์žˆ๋‹ค. ๋„ค์ด๋ฒ„๋Š” CVPR 2019์—์„œ ๋ฐœํ‘œํ•œ Text detection ๋ชจ๋ธ์ธ CRAFT, 21๋…„์— ๋ฐœํ‘œํ•œ end-to-end document understanding ๋ชจ๋ธ์ธ Donut ๊ทธ๋ฆฌ๊ณ  ๊ฐ€์žฅ ์ตœ๊ทผ์ธ 22๋…„์— ๋ฐœํ‘œํ•œ DEER ๋ชจ๋ธ๊นŒ์ง€ OCR ๋ถ€๋ถ„์—์„œ ๋งŽ์€ ๋…ผ๋ฌธ์„ ๋‚ด๊ณ  ์žˆ๋‹ค. ๋…ผ๋ฌธ์—์„œ์˜ ์ˆ˜์น˜์ ์€ ์„ฑ๋Šฅ์ด ์šฐ์ˆ˜ํ•œ ๊ฒƒ์€ ์•Œ๊ฒ ๋Š”๋ฐ, ์‹ค์ œ๋กœ ์–ผ๋งˆ๋‚˜ ์ž˜ ๋™์ž‘ํ•˜๋Š” ๋ชจ๋ธ์ผ๊นŒ? ๋„ค์ด๋ฒ„ ํด๋กœ๋ฐ”๋Š” OCR ์›น ๋ฐ๋ชจ๋ฅผ ์ œ๊ณตํ•˜๊ณ  ์žˆ์–ด ๋ˆ„๊ตฌ๋‚˜ ์‚ฌ์šฉํ•ด ๋ณผ ์ˆ˜ ์žˆ๋‹ค. (๋งํฌ) ๋„ค์ด๋ฒ„ ํด๋กœ๋ฐ” OCR ์›น ๋ฐ๋ชจ ํŽ˜์ด์ง€์—์„œ General OCR, ์˜์ˆ˜์ฆ, ์‹ ์šฉ์นด๋“œ ๋“ฑ .. 2023. 3. 1.
[์—ฐ๊ตฌ ์†Œ๊ฐœ] ๋ฌธ์„œ ์ด๋ฏธ์ง€ ๊ทธ๋ฆผ์ž์ œ๊ฑฐ / ๋ฌธ์„œ OCR ๊ฒฐ๊ณผ๋ฅผ ํ–ฅ์ƒ์‹œํ‚ค๊ธฐ ์œ„ํ•ด ์š”์ฆ˜์€ ๋ฌธ์„œ๋ฅผ ์‚ฌ์ง„์œผ๋กœ ์ฐ์–ด์„œ ํšŒ์‚ฌ๋‚˜ ๊ณต๊ณต ๊ธฐ๊ด€์— ์ œ์ถœํ•˜๋Š” ๊ฒฝ์šฐ๊ฐ€ ๋งŽ๋‹ค. ์ด ๋•Œ ํšŒ์‚ฌ๋Š” ๋ฐ›์€ ๋ฌธ์„œ์—์„œ OCR ๊ธฐ์ˆ ์„ ์‚ฌ์šฉํ•ด์„œ ํ…์ŠคํŠธ๋ฅผ ๋””์ง€ํ„ธํ™”์‹œ์ผœ์„œ ์ €์žฅํ•˜๊ฒŒ ๋œ๋‹ค. ๊ทธ๋Ÿฐ๋ฐ ํœด๋Œ€ํฐ์œผ๋กœ ๋ฌธ์„œ ์‚ฌ์ง„์„ ์ฐ๋Š” ๊ฒฝ์šฐ ๊ทธ๋ฆผ์ž๊ฐ€ ๋งŽ์ด ์ƒ๊ฒจ์„œ ์ด๋ฏธ์ง€์˜ ํ€„๋ฆฌํ‹ฐ๊ฐ€ ๋–จ์–ด์ง€๋Š” ๊ฒฝ์šฐ๊ฐ€ ๋งŽ๊ณ  ์ด๋Š” ํ…์ŠคํŠธ ์ธ์‹ ์˜ค๋ฅ˜๋ฅผ ๋ฐœ์ƒํ•˜๊ฒŒ ํ•œ๋‹ค. ๊ทธ๋Ÿฐ๋ฐ... ์ด๋ฏธ์ง€์—์„œ ๊ทธ๋ฆผ์ž๋ฅผ ์ œ๊ฑฐํ•˜๋Š” ์—ฐ๊ตฌ๊ฐ€ ์กด์žฌํ•œ๋‹ค๊ณ  ํ•œ๋‹ค. ์—ญ์‹œ ์„ธ์ƒ ์‚ฌ๋žŒ๋“ค์€ ์ฐธ ๋˜‘๋˜‘ํ•˜๊ณ  ์—†๋Š” ๊ฒŒ ์ž˜ ์—†๋‹ค... Paper : BEDSR-Net A Deep Shadow Removal Network from a Single Document Image / CVPR 2020 github : https://github.com/IsHYuhi/BEDSR-Net_A_Deep_Shadow_Removal_.. 2022. 12. 20.
[์˜คํ”ˆ ์†Œ์Šค] EasyOCR ํ…์ŠคํŠธ ๊ฒ€์ถœ/์ธ์‹ AI ๋ชจ๋ธ์„ ๋ฌด๋ฃŒ๋กœ ์‰ฝ๊ฒŒ ์‚ฌ์šฉํ•ด๋ณด์ž https://github.com/JaidedAI/EasyOCR GitHub - JaidedAI/EasyOCR: Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chines Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc. - GitHub - JaidedAI/EasyOCR: Ready-to-use OCR with 80+ ... github.com OCR(Optical Character.. 2022. 12. 16.
[์˜คํ”ˆ ์†Œ์Šค] BERT๋ฅผ ์ด์šฉํ•œ ํ•œ๊ตญ์–ด ๊ฐœ์ฒด๋ช… ์ธ์‹ | NER (Named Entity Recognition) NER(Named Entity Recognition) Named Entity Recognition (NER)์€ ์ž์—ฐ์–ด ์ฒ˜๋ฆฌ ๊ธฐ์ˆ  ์ค‘ ํ•˜๋‚˜๋กœ, ๋ฌธ์žฅ ๋‚ด์—์„œ ํŠน์ •ํ•œ ์œ ํ˜•์˜ ๋ช…์นญ(๊ฐœ์ฒด)์„ ์ธ์‹ํ•˜๋Š” ์ž‘์—…์ด๋‹ค. ์˜ˆ๋ฅผ ๋“ค์–ด, "Steve Jobs๋Š” Apple์˜ ์ฐฝ์—…์ž์ž…๋‹ˆ๋‹ค" ๋ผ๋Š” ๋ฌธ์žฅ์ด ์žˆ๋‹ค๋ฉด, "Steve Jobs"๋Š” ์ธ๋ฌผ(person), "Apple"์€ ์กฐ์ง(organization)์ด๋ผ๋Š” ์œ ํ˜•์˜ ๊ฐœ์ฒด๋กœ ์ธ์‹๋œ๋‹ค. ์ด์™ธ์—๋„ ์žฅ์†Œ, ์‹œ๊ฐ„ ๋“ฑ ๋‹ค์–‘ํ•œ ๊ฐœ์ฒด๋ฅผ ์ธ์‹ํ•  ์ˆ˜ ์žˆ๋‹ค. ์ด๋Ÿฌํ•œ NER์€ ์ •๋ณด ์ถ”์ถœ, ์งˆ์˜ ์‘๋‹ต, ๋ฆฌ๋ทฐ ๋ถ„์„, ๊ธฐ๊ณ„๋ฒˆ์—ญ ๋“ฑ ๋‹ค์–‘ํ•œ ๊ณณ์—์„œ ํ™œ์šฉ๋  ์ˆ˜ ์žˆ๋‹ค. ์ „ํ˜€ ์ƒ๊ฐํ•˜์ง€ ๋ชปํ–ˆ๋˜ ํ™œ์šฉ์ฒ˜๋Š” ๊ธฐ๊ณ„๋ฒˆ์—ญ ๋ถ„์•ผ์ด๋‹ค. ์˜์–ด๋ฅผ ํ•œ๊ตญ์–ด๋กœ ๋ฒˆ์—ญํ•  ๋•Œ ๊ธฐ์—…์„ ์ง€์นญํ•˜๋Š” "Apple"์€ "์‚ฌ๊ณผ"๊ฐ€ ์•„๋‹Œ "์• ํ”Œ"๋กœ ๋ฒˆ์—ญํ•ด์•ผ .. 2022. 12. 15.
[์˜คํ”ˆ ์†Œ์Šค] ๋ฌธ์„œ ์Šค์บ๋„ˆ / ๋ฌธ์„œ ์ •๋ฉด ๋ทฐ ๋ณ€ํ™˜ / ๋ฌธ์„œ ์ด๋ฏธ์ง€ Perspective Transformation https://github.com/andrewdcampbell/OpenCV-Document-Scanner GitHub - andrewdcampbell/OpenCV-Document-Scanner: An interactive document scanner built in Python using OpenCV featuring automat An interactive document scanner built in Python using OpenCV featuring automatic corner detection, image sharpening, and color thresholding. - GitHub - andrewdcampbell/OpenCV-Document-Scanner: An i... github.co.. 2022. 12. 15.
728x90