[NLP] ํ…์ŠคํŠธ ์ž„๋ฒ ๋”ฉ ๋ชจ๋ธ ์„ค๋ช… | Huggingface sentence-transformers, OpenAI

2025. 5. 13. 23:15ยท๐Ÿ› Research/NLP & LLM
๋ฐ˜์‘ํ˜•

OpenAI

 

์ž์—ฐ์–ด์ฒ˜๋ฆฌ(NLP) ๋ถ„์•ผ์—์„œ ํ…์ŠคํŠธ ์ž„๋ฒ ๋”ฉ(Text Embedding)์ด๋ž€, ๋ฌธ์žฅ์ด๋‚˜ ๋ฌธ์„œ๋ฅผ ์ปดํ“จํ„ฐ๊ฐ€ ์ดํ•ดํ•  ์ˆ˜ ์žˆ๋„๋ก ๊ณ ์ฐจ์›์˜ ๋ฒกํ„ฐ๋กœ ๋ณ€ํ™˜ํ•˜๋Š” ์ž‘์—…์„ ๋งํ•œ๋‹ค. ์ด ๋ฒกํ„ฐ๋Š” ๋‹จ์–ด ๊ฐ„, ๋ฌธ์žฅ ๊ฐ„์˜ ์˜๋ฏธ์  ์œ ์‚ฌ์„ฑ์„ ์ˆ˜์น˜์ ์œผ๋กœ ํ‘œํ˜„ํ•  ์ˆ˜ ์žˆ๋„๋ก ๋„์™€์ค€๋‹ค.

ํ…์ŠคํŠธ ์ž„๋ฒ ๋”ฉ์€ ๋‹ค์Œ๊ณผ ๊ฐ™์€ ์ž‘์—…์—์„œ ํ•ต์‹ฌ ์—ญํ• ์„ ํ•œ๋‹ค.

  • ๋ฌธ์žฅ ์œ ์‚ฌ๋„ ๋ถ„์„
  • ์˜๋ฏธ ๊ธฐ๋ฐ˜ ๊ฒ€์ƒ‰ (semantic search)
  • ํ…์ŠคํŠธ ๋ถ„๋ฅ˜ ๋ฐ ๊ตฐ์ง‘ํ™”
  • ์ถ”์ฒœ ์‹œ์Šคํ…œ
  • ์งˆ๋ฌธ-์‘๋‹ต(QA) ๋งค์นญ
  • ์ •๋ณด ๊ฒ€์ƒ‰ (Retrieval-Augmented Generation ๋“ฑ)

์ž„๋ฒ ๋”ฉ ๋ฒกํ„ฐ๊ฐ€ ์ž˜ ๋งŒ๋“ค์–ด์กŒ๋‹ค๋Š” ๊ฒƒ์€, ์˜ˆ๋ฅผ ๋“ค์–ด "๋‚˜๋Š” ์˜ํ™”๋ฅผ ์ข‹์•„ํ•œ๋‹ค"์™€ "์˜ํ™” ๋ณด๋Š” ๊ฑธ ์ฆ๊ธด๋‹ค" ๊ฐ™์€ ๋ฌธ์žฅ์ด ์„œ๋กœ ๊ฐ€๊นŒ์šด ์œ„์น˜์˜ ๋ฒกํ„ฐ๋กœ ํ‘œํ˜„๋œ๋‹ค๋Š” ๋œป์ด๋‹ค. ์ด์ฒ˜๋Ÿผ ์œ ์‚ฌํ•œ ์˜๋ฏธ์˜ ๋ฌธ์žฅ์ด ๊ฐ€๊นŒ์ด ์œ„์น˜ํ•˜๋ฉด ๊ฒ€์ƒ‰, ๋ถ„๋ฅ˜, ์ถ”์ฒœ์˜ ์„ฑ๋Šฅ๋„ ์˜ฌ๋ผ๊ฐ€๊ฒŒ ๋œ๋‹ค.


Hugging Face์—์„œ ์‚ฌ์šฉ ๊ฐ€๋Šฅํ•œ ํ…์ŠคํŠธ ์ž„๋ฒ ๋”ฉ ๋ชจ๋ธ๋“ค

์ตœ๊ทผ์—๋Š” Hugging Face Transformers์™€ sentence-transformers ๋ผ์ด๋ธŒ๋Ÿฌ๋ฆฌ๋ฅผ ํ†ตํ•ด ๊ฐ„ํŽธํ•˜๊ฒŒ ํ…์ŠคํŠธ ์ž„๋ฒ ๋”ฉ ๋ชจ๋ธ์„ ์‚ฌ์šฉํ•  ์ˆ˜ ์žˆ๋‹ค. ์•„๋ž˜๋Š” ํ˜„์—…๊ณผ ์—ฐ๊ตฌ์—์„œ ๋งŽ์ด ์‚ฌ์šฉ๋˜๋Š” ๋ชจ๋ธ๋“ค์ด๋‹ค.

1. all-MiniLM-L6-v2

  • sentence-transformers/all-MiniLM-L6-v2
  • ํŠน์ง•: ์ž‘๊ณ  ๋น ๋ฅด์ง€๋งŒ ๊ฝค ๊ฐ•๋ ฅํ•œ ๋ฌธ์žฅ ์ž„๋ฒ ๋”ฉ ์„ฑ๋Šฅ ์ œ๊ณต
  • ๋ฒกํ„ฐ ํฌ๊ธฐ: 384์ฐจ์›
  • ์ ํ•ฉํ•œ ์šฉ๋„: ๋น ๋ฅธ ๋ฌธ์žฅ ์œ ์‚ฌ๋„ ๋ถ„์„, ๊ฒ€์ƒ‰ ์‹œ์Šคํ…œ, ๋ฒกํ„ฐ ๊ธฐ๋ฐ˜ ํด๋Ÿฌ์Šคํ„ฐ๋ง
  • ์žฅ์ : ์†๋„์™€ ์„ฑ๋Šฅ์˜ ๊ท ํ˜•์ด ๋›ฐ์–ด๋‚จ

2. all-mpnet-base-v2

  • sentence-transformers/all-mpnet-base-v2
  • ํŠน์ง•: ํ˜„์žฌ ๊ฐ€์žฅ ๋งŽ์ด ์‚ฌ์šฉ๋˜๋Š” SOTA ์ž„๋ฒ ๋”ฉ ๋ชจ๋ธ ์ค‘ ํ•˜๋‚˜
  • ๋ฒกํ„ฐ ํฌ๊ธฐ: 768์ฐจ์›
  • ์ ํ•ฉํ•œ ์šฉ๋„: ๊ณ ์ •๋ฐ€ ์˜๋ฏธ ๊ธฐ๋ฐ˜ ๊ฒ€์ƒ‰, ๋ถ„๋ฅ˜, QA ๋งค์นญ
  • ์žฅ์ : ์„ฑ๋Šฅ์ด ๋งค์šฐ ๋›ฐ์–ด๋‚˜๋ฉฐ ์—ฌ๋Ÿฌ ํƒœ์Šคํฌ์—์„œ ์ข‹์€ ๊ฒฐ๊ณผ

3. e5-base / e5-large

  • intfloat/e5-base, intfloat/e5-large
  • ํŠน์ง•: "query:"์™€ "passage:" ํ”„๋ฆฌํ”ฝ์Šค๋ฅผ ๋ถ™์—ฌ ๊ฒ€์ƒ‰์— ์ตœ์ ํ™”๋œ ์ž„๋ฒ ๋”ฉ์„ ์ƒ์„ฑ
  • ์ ํ•ฉํ•œ ์šฉ๋„: RAG, ๊ฒ€์ƒ‰ ๊ธฐ๋ฐ˜ QA, ๋ฌธ์„œ ๊ฒ€์ƒ‰
  • ์žฅ์ : query-passage ๊ตฌ๋ถ„์„ ํ†ตํ•ด ๊ฒ€์ƒ‰ ํŠนํ™” ์„ฑ๋Šฅ ๊ทน๋Œ€ํ™”
  • ์ถ”๊ฐ€ ํŒ: E5 ๋ชจ๋ธ์„ ์‚ฌ์šฉํ•  ๋• ๋ฐ˜๋“œ์‹œ ํ”„๋ฆฌํ”ฝ์Šค๋ฅผ ๋ถ™์—ฌ์•ผ ์„ฑ๋Šฅ์ด ์ œ๋Œ€๋กœ ๋‚˜์˜จ๋‹ค.
# ์˜ˆ์‹œ
query = "query: ๋‚ ์”จ๊ฐ€ ์ข‹์€ ๋‚  ํ• ๋งŒํ•œ ์•ผ์™ธ ํ™œ๋™"
docs = ["passage: ๋“ฑ์‚ฐ์€ ์ข‹์€ ์šด๋™์ž…๋‹ˆ๋‹ค.", "passage: ์‹ค๋‚ด ๋ณด๋“œ๊ฒŒ์ž„๋„ ์žฌ๋ฏธ์žˆ์Šต๋‹ˆ๋‹ค."]

4. gte-base / gte-large

  • ๋ชจ๋ธ ๊ฒฝ๋กœ: thenlper/gte-base, thenlper/gte-large
  • ํŠน์ง•: E5์™€ ๋น„์Šทํ•œ ๊ตฌ์กฐ์ง€๋งŒ ํ”„๋ฆฌํ”ฝ์Šค ์—†์ด๋„ ์‚ฌ์šฉ ๊ฐ€๋Šฅ
  • ์žฅ์ : ์ผ๋ฐ˜์ ์ธ ๋ฌธ์žฅ ์ž„๋ฒ ๋”ฉ ํƒœ์Šคํฌ์— ์ ํ•ฉํ•˜๋ฉฐ ์‚ฌ์šฉ์ด ๊ฐ„ํŽธ
  • ์ ํ•ฉํ•œ ์šฉ๋„: ๋ฌธ์žฅ ๋ถ„๋ฅ˜, ์œ ์‚ฌ๋„ ๊ณ„์‚ฐ, ์ถ”์ฒœ ์‹œ์Šคํ…œ

5. OpenAI Text Embedding Models

  • ๋Œ€ํ‘œ ๋ชจ๋ธ: text-embedding-ada-002 (API ๊ธฐ๋ฐ˜)
  • ํŠน์ง•: OpenAI์—์„œ ์ œ๊ณตํ•˜๋Š” ๊ณ ์„ฑ๋Šฅ ํด๋ผ์šฐ๋“œ ๊ธฐ๋ฐ˜ ์ž„๋ฒ ๋”ฉ ๋ชจ๋ธ
  • ๋ฒกํ„ฐ ํฌ๊ธฐ: 1536์ฐจ์›
  • ์ ํ•ฉํ•œ ์šฉ๋„: ๋Œ€๊ทœ๋ชจ ๊ฒ€์ƒ‰, RAG, ์œ ์‚ฌ๋„ ๋ถ„์„, ๋ถ„๋ฅ˜
  • ์žฅ์ 
    • ๋ฒกํ„ฐ ํ’ˆ์งˆ์ด ๋งค์šฐ ๋›ฐ์–ด๋‚˜๋ฉฐ ๋‹ค์–‘ํ•œ ์–ธ์–ด์—์„œ๋„ ๋†’์€ ์ผ๊ด€์„ฑ
    • API ํ˜ธ์ถœ ํ•œ ๋ฒˆ์œผ๋กœ ์†์‰ฝ๊ฒŒ ๋ฒกํ„ฐ๋ฅผ ์–ป์„ ์ˆ˜ ์žˆ์Œ
  • ์ฃผ์˜์‚ฌํ•ญ: ์œ ๋ฃŒ API ๊ธฐ๋ฐ˜์ด๋ฏ€๋กœ ์‚ฌ์šฉ๋Ÿ‰์— ๋”ฐ๋ผ ์š”๊ธˆ์ด ๋ฐœ์ƒ

 

์–ด๋–ค ๋ชจ๋ธ์„ ์„ ํƒํ•ด์•ผ ํ• ๊นŒ?

๋ชฉ์  ์ถ”์ฒœ๋ชจ๋ธ
๋น ๋ฅด๊ณ  ๊ฐ€๋ฒผ์šด ๋ฌธ์žฅ ์ž„๋ฒ ๋”ฉ all-MiniLM-L6-v2
๊ณ ์„ฑ๋Šฅ ๋ฌธ์žฅ ์ž„๋ฒ ๋”ฉ all-mpnet-base-v2
๊ฒ€์ƒ‰ ์ตœ์ ํ™” (Query/Document) intfloat/e5-base or e5-large
๋ฒ”์šฉ์„ฑ + ๊ฐ„ํŽธํ•จ thenlper/gte-base
API ๊ธฐ๋ฐ˜ ๊ณ ํ’ˆ์งˆ ์ž„๋ฒ ๋”ฉ OpenAI text-embedding-ada-002

 


 

ํ…์ŠคํŠธ ์ž„๋ฒ ๋”ฉ์€ ๋‹จ์ˆœํ•œ ๋ฌธ์žฅ ํ‘œํ˜„์„ ๋„˜์–ด, NLP ์‹œ์Šคํ…œ์˜ ๊ฑฐ์˜ ๋ชจ๋“  ์˜์—ญ์—์„œ ํ•ต์‹ฌ ๊ธฐ๋ฐ˜ ๊ธฐ์ˆ ๋กœ ์‚ฌ์šฉ๋˜๊ณ  ์žˆ๋‹ค. ์ข‹์€ ์ž„๋ฒ ๋”ฉ ๋ชจ๋ธ์„ ์„ ํƒํ•˜๊ณ  ์ž˜ ํ™œ์šฉํ•˜๋ฉด, ๊ฒ€์ƒ‰ ํ’ˆ์งˆ์ด ๋น„์•ฝ์ ์œผ๋กœ ํ–ฅ์ƒ๋˜๊ณ  ์ถ”์ฒœ์ด๋‚˜ ์œ ์‚ฌ๋„ ๊ธฐ๋ฐ˜ ๊ธฐ๋Šฅ ๊ตฌํ˜„์ด ์‰ฌ์›Œ์ง„๋‹ค.

 

Hugging Face์™€ sentence-transformers, OpenAI API๋ฅผ ํ™œ์šฉํ•˜๋ฉด ์†์‰ฝ๊ฒŒ SOTA ๋ชจ๋ธ๋“ค์„ ์‚ฌ์šฉํ•  ์ˆ˜ ์žˆ์œผ๋‹ˆ, ์‹คํ—˜์ ์œผ๋กœ ๋‹ค์–‘ํ•œ ๋ชจ๋ธ์„ ์ ์šฉํ•ด๋ณด๊ณ  ๋‚ด ๋ฌธ์ œ์— ๊ฐ€์žฅ ์ž˜ ๋งž๋Š” ์ž„๋ฒ ๋”ฉ์„ ์ฐพ์•„๋ณด๋Š” ๊ฒƒ์ด ์ข‹๋‹ค.

๋ฐ˜์‘ํ˜•

'๐Ÿ› Research > NLP & LLM' ์นดํ…Œ๊ณ ๋ฆฌ์˜ ๋‹ค๋ฅธ ๊ธ€

[AI/LLM] Transformer์˜ ์ธ์ฝ”๋”์™€ ๋””์ฝ”๋” ์‰ฝ๊ฒŒ ์ดํ•ดํ•˜๊ธฐ  (0) 2024.11.06
[AI/LLM] Transformer Attention ์ดํ•ดํ•˜๊ธฐ: Q, K, V์˜ ์—ญํ• ๊ณผ ๋™์ž‘ ์›๋ฆฌ  (0) 2024.11.06
[ํ”„๋กฌํ”„ํŠธ ์—”์ง€๋‹ˆ์–ด๋ง] (5) ํ”„๋กฌํ”„ํŠธ ๋ณด์•ˆ : LLM ์ทจ์•ฝ์ ๊ณผ ๋ณด์™„ ๋ฐฉ๋ฒ•  (0) 2024.07.27
[ํ”„๋กฌํ”„ํŠธ ์—”์ง€๋‹ˆ์–ด๋ง] (4) ๊ณ ๊ธ‰ ๊ธฐ๋ฒ• : Expert prompting, Generated knowledge prompting, RAG, Tree-of-Thought, Plan-and-solve prompting, Automatic prompt engineer  (0) 2024.07.27
[ํ”„๋กฌํ”„ํŠธ ์—”์ง€๋‹ˆ์–ด๋ง] (3) ๊ณ ๊ธ‰ ๊ธฐ๋ฒ•: Few-shot, Chain-of-thought, Self-consistency, Selection-inference, Least-to-most, ReAct, Self-evaluation  (0) 2024.07.27
'๐Ÿ› Research/NLP & LLM' ์นดํ…Œ๊ณ ๋ฆฌ์˜ ๋‹ค๋ฅธ ๊ธ€
  • [AI/LLM] Transformer์˜ ์ธ์ฝ”๋”์™€ ๋””์ฝ”๋” ์‰ฝ๊ฒŒ ์ดํ•ดํ•˜๊ธฐ
  • [AI/LLM] Transformer Attention ์ดํ•ดํ•˜๊ธฐ: Q, K, V์˜ ์—ญํ• ๊ณผ ๋™์ž‘ ์›๋ฆฌ
  • [ํ”„๋กฌํ”„ํŠธ ์—”์ง€๋‹ˆ์–ด๋ง] (5) ํ”„๋กฌํ”„ํŠธ ๋ณด์•ˆ : LLM ์ทจ์•ฝ์ ๊ณผ ๋ณด์™„ ๋ฐฉ๋ฒ•
  • [ํ”„๋กฌํ”„ํŠธ ์—”์ง€๋‹ˆ์–ด๋ง] (4) ๊ณ ๊ธ‰ ๊ธฐ๋ฒ• : Expert prompting, Generated knowledge prompting, RAG, Tree-of-Thought, Plan-and-solve prompting, Automatic prompt engineer
๋ญ…์ฆค
๋ญ…์ฆค
AI ๊ธฐ์ˆ  ๋ธ”๋กœ๊ทธ
    ๋ฐ˜์‘ํ˜•
  • ๋ญ…์ฆค
    CV DOODLE
    ๋ญ…์ฆค
  • ์ „์ฒด
    ์˜ค๋Š˜
    ์–ด์ œ
  • ๊ณต์ง€์‚ฌํ•ญ

    • โœจ About Me
    • ๋ถ„๋ฅ˜ ์ „์ฒด๋ณด๊ธฐ (198)
      • ๐Ÿ“– Fundamentals (33)
        • Computer Vision (9)
        • 3D vision & Graphics (6)
        • AI & ML (15)
        • NLP (2)
        • etc. (1)
      • ๐Ÿ› Research (64)
        • Deep Learning (7)
        • Image Classification (2)
        • Detection & Segmentation (17)
        • OCR (7)
        • Multi-modal (4)
        • Generative AI (6)
        • 3D Vision (2)
        • Material & Texture Recognit.. (8)
        • NLP & LLM (11)
        • etc. (0)
      • ๐ŸŒŸ AI & ML Tech (7)
        • AI & ML ์ธ์‚ฌ์ดํŠธ (7)
      • ๐Ÿ’ป Programming (85)
        • Python (18)
        • Computer Vision (12)
        • LLM (4)
        • AI & ML (17)
        • Database (3)
        • Apache Airflow (6)
        • Docker & Kubernetes (14)
        • ์ฝ”๋”ฉ ํ…Œ์ŠคํŠธ (4)
        • C++ (1)
        • etc. (6)
      • ๐Ÿ’ฌ ETC (3)
        • ์ฑ… ๋ฆฌ๋ทฐ (3)
  • ๋งํฌ

  • ์ธ๊ธฐ ๊ธ€

  • ํƒœ๊ทธ

    Python
    segmentation
    ํ”„๋กฌํ”„ํŠธ์—”์ง€๋‹ˆ์–ด๋ง
    VLP
    Image Classification
    pandas
    Text recognition
    material recognition
    ๊ฐ์ฒด ๊ฒ€์ถœ
    OpenAI
    ChatGPT
    OpenCV
    3D Vision
    AI
    pytorch
    OCR
    ์ปดํ“จํ„ฐ๋น„์ „
    ํŒŒ์ด์ฌ
    LLM
    multi-modal
    ๋”ฅ๋Ÿฌ๋‹
    object detection
    nlp
    ๋„์ปค
    GPT
    ๊ฐ์ฒด๊ฒ€์ถœ
    deep learning
    airflow
    CNN
    Computer Vision
  • ์ตœ๊ทผ ๋Œ“๊ธ€

  • ์ตœ๊ทผ ๊ธ€

  • hELLOยท Designed By์ •์ƒ์šฐ.v4.10.3
๋ญ…์ฆค
[NLP] ํ…์ŠคํŠธ ์ž„๋ฒ ๋”ฉ ๋ชจ๋ธ ์„ค๋ช… | Huggingface sentence-transformers, OpenAI
์ƒ๋‹จ์œผ๋กœ

ํ‹ฐ์Šคํ† ๋ฆฌํˆด๋ฐ”