[pandas] ํŠน์ • ์ปฌ๋Ÿผ์˜ ๊ฐ’์ด ๊ณต๋ฐฑ์ธ ํ–‰์„ ์ œ์™ธํ•˜๋Š” ๋ฐฉ๋ฒ• | dropna

2023. 11. 17. 08:24ยท๐Ÿ’ป Programming/Python
๋ฐ˜์‘ํ˜•


ํŒ๋‹ค์Šค์˜ ๋ฐ์ดํ„ฐํ”„๋ ˆ์ž„์—์„œ ํŠน์ • ์ปฌ๋Ÿผ์˜ ๊ฐ’์ด ๊ณต๋ฐฑ(๋˜๋Š” ๋นˆ ๋ฌธ์ž์—ด)์ธ ํ–‰์„ ์ œ์™ธํ•˜๊ณ  ์‹ถ์„ ๋•Œ ์–ด๋–ป๊ฒŒ ํ•˜๋ฉด ๋˜๋Š”์ง€ ์•Œ์•„๋ณด์ž.

 

 

ํŠน์ • ์ปฌ๋Ÿผ์˜ ๊ณต๋ฐฑ์ธ ํ–‰ ์ œ๊ฑฐ

import pandas as pd

# ์ƒ˜ํ”Œ ๋ฐ์ดํ„ฐํ”„๋ ˆ์ž„ ์ƒ์„ฑ
data = {'A': [1, 2, 3, 4],
        'B': ['apple', 'banana', '', 'orange']}
df = pd.DataFrame(data)

# 'B' ์ปฌ๋Ÿผ์˜ ๊ฐ’์ด ๊ณต๋ฐฑ์ธ ํ–‰ ์ œ์™ธ
df_no_empty_values = df[df['B'].str.strip() != '']

# ๊ฒฐ๊ณผ ์ถœ๋ ฅ
print(df_no_empty_values)
  • ์œ„ ์ฝ”๋“œ์—์„œ df['B'].str.strip() != '' ๋ถ€๋ถ„์€ 'B' ์ปฌ๋Ÿผ์˜ ๊ฐ ๊ฐ’์— ๋Œ€ํ•ด ์ขŒ์šฐ์˜ ๊ณต๋ฐฑ์„ ์ œ๊ฑฐํ•˜๊ณ  ๋นˆ ๋ฌธ์ž์—ด๊ณผ ๋น„๊ตํ•˜์—ฌ ๊ณต๋ฐฑ์ธ์ง€ ์—ฌ๋ถ€๋ฅผ ํ™•์ธ ๊ฐ€๋Šฅ
  • ์ด ์กฐ๊ฑด์„ ์‚ฌ์šฉํ•˜์—ฌ ๋ฐ์ดํ„ฐํ”„๋ ˆ์ž„์„ ํ•„ํ„ฐ๋งํ•˜๋ฉด 'B' ์ปฌ๋Ÿผ์˜ ๊ฐ’์ด ๊ณต๋ฐฑ์ธ ํ–‰์ด ์ œ์™ธ๋œ ๊ฒฐ๊ณผ๋ฅผ ์–ป์„ ์ˆ˜ ์žˆ๋‹ค.

 

๋ชจ๋“  ์ปฌ๋Ÿผ์˜ ๊ณต๋ฐฑ์ธ ํ–‰ ์ œ๊ฑฐ

import pandas as pd

# ์ƒ˜ํ”Œ ๋ฐ์ดํ„ฐํ”„๋ ˆ์ž„ ์ƒ์„ฑ
data = {'A': [1, 2, 3, 4],
        'B': ['apple', 'banana', '', 'orange']}
df = pd.DataFrame(data)

# ๊ณต๋ฐฑ์„ NaN์œผ๋กœ ๋Œ€์ฒด
df = df.replace('', pd.NA)

# NaN์ด ์žˆ๋Š” ํ–‰ ์ œ๊ฑฐ
df_no_empty_values = df.dropna()

# ๊ฒฐ๊ณผ ์ถœ๋ ฅ
print(df_no_empty_values)
  • dropna() ํ•จ์ˆ˜๋Š” ๊ธฐ๋ณธ์ ์œผ๋กœ NaN(Not a Number) ๊ฐ’์ด ์žˆ๋Š” ๋ชจ๋“  ํ–‰์„ ์ œ๊ฑฐํ•œ๋‹ค.
  • ๋งŒ์•ฝ ๋ฐ์ดํ„ฐํ”„๋ ˆ์ž„์—์„œ ๊ณต๋ฐฑ์ด๋‚˜ ๋นˆ ๋ฌธ์ž์—ด์ด ์žˆ๋Š” ํ–‰์„ dropna() ํ•จ์ˆ˜๋กœ ์ œ๊ฑฐํ•˜๊ณ  ์‹ถ๋‹ค๋ฉด, replace() ํ•จ์ˆ˜๋ฅผ ์‚ฌ์šฉํ•˜์—ฌ ๊ณต๋ฐฑ์„ NaN์œผ๋กœ ๋ณ€๊ฒฝํ•œ ํ›„์— dropna()๋ฅผ ์ ์šฉํ•  ์ˆ˜ ์žˆ๋‹ค.
  • ํŠน์ • ์ปฌ๋Ÿผ์—์„œ ๊ณต๋ฐฑ์ด ์žˆ๋Š” ํ–‰๋งŒ ์ œ๊ฑฐํ•˜๊ณ  ์‹ถ๋‹ค๋ฉด, ํŠน์ • ์ปฌ๋Ÿผ์˜ ๊ณต๋ฐฑ๋งŒ NaN์œผ๋กœ ๋ณ€๊ฒฝํ›„ dropna() ํ•จ์ˆ˜๋ฅผ ์‚ฌ์šฉํ•  ์ˆ˜๋„ ์žˆ๋‹ค.

 

๋ฐ˜์‘ํ˜•

'๐Ÿ’ป Programming > Python' ์นดํ…Œ๊ณ ๋ฆฌ์˜ ๋‹ค๋ฅธ ๊ธ€

[python] ๋ฉ€ํ‹ฐํ”„๋กœ์„ธ์‹ฑ Pool ์‚ฌ์šฉ๋ฒ• ๋ฐ ์ฝ”๋“œ ์˜ˆ์‹œ | multiprocessing.Pool | python ์†๋„ ํ–ฅ์ƒ  (0) 2024.01.07
[pandas] ํŠน์ • ์ปฌ๋Ÿผ์—์„œ ํŠน์ • ๋ฌธ์ž์—ด์ด ํฌํ•จ๋œ ํ–‰ ์ฐพ๊ธฐ | str.contains  (0) 2023.11.17
[pandas] ํŠน์ • ์ปฌ๋Ÿผ์—์„œ ์ค‘๋ณต๋œ ๊ฐ’ ์ œ๊ฑฐ | drop_duplicates  (1) 2023.11.17
[pandas] DataFrame ์„ค๋ช… | ๋ฐ์ดํ„ฐ ์กฐ์ž‘, ํ•„ํ„ฐ๋ง, ์‹œ๊ฐํ™”, ํ†ต๊ณ„ ๋ถ„์„  (0) 2023.11.16
[pandas] 2์ฐจ์› ๋ฆฌ์ŠคํŠธ๋ฅผ ๋ฐ์ดํ„ฐํ”„๋ ˆ์ž„์œผ๋กœ ๋ณ€ํ™˜ํ•˜๊ธฐ | pd.DataFrame  (0) 2023.11.16
'๐Ÿ’ป Programming/Python' ์นดํ…Œ๊ณ ๋ฆฌ์˜ ๋‹ค๋ฅธ ๊ธ€
  • [python] ๋ฉ€ํ‹ฐํ”„๋กœ์„ธ์‹ฑ Pool ์‚ฌ์šฉ๋ฒ• ๋ฐ ์ฝ”๋“œ ์˜ˆ์‹œ | multiprocessing.Pool | python ์†๋„ ํ–ฅ์ƒ
  • [pandas] ํŠน์ • ์ปฌ๋Ÿผ์—์„œ ํŠน์ • ๋ฌธ์ž์—ด์ด ํฌํ•จ๋œ ํ–‰ ์ฐพ๊ธฐ | str.contains
  • [pandas] ํŠน์ • ์ปฌ๋Ÿผ์—์„œ ์ค‘๋ณต๋œ ๊ฐ’ ์ œ๊ฑฐ | drop_duplicates
  • [pandas] DataFrame ์„ค๋ช… | ๋ฐ์ดํ„ฐ ์กฐ์ž‘, ํ•„ํ„ฐ๋ง, ์‹œ๊ฐํ™”, ํ†ต๊ณ„ ๋ถ„์„
๋ญ…์ฆค
๋ญ…์ฆค
AI ๊ธฐ์ˆ  ๋ธ”๋กœ๊ทธ
    ๋ฐ˜์‘ํ˜•
  • ๋ญ…์ฆค
    moovzi’s Doodle
    ๋ญ…์ฆค
  • ์ „์ฒด
    ์˜ค๋Š˜
    ์–ด์ œ
  • ๊ณต์ง€์‚ฌํ•ญ

    • โœจ About Me
    • ๋ถ„๋ฅ˜ ์ „์ฒด๋ณด๊ธฐ (209)
      • ๐Ÿ“– Fundamentals (34)
        • Computer Vision (9)
        • 3D vision & Graphics (6)
        • AI & ML (16)
        • NLP (2)
        • etc. (1)
      • ๐Ÿ› Research (72)
        • Deep Learning (7)
        • Perception (19)
        • OCR (7)
        • Multi-modal (4)
        • Image•Video Generation (17)
        • 3D Vision (4)
        • Material • Texture Recognit.. (8)
        • NLP • LLM (6)
        • etc. (0)
      • ๐Ÿ› ๏ธ Engineering (8)
        • Distributed Training & Infe.. (5)
        • AI & ML ์ธ์‚ฌ์ดํŠธ (3)
      • ๐Ÿ’ป Programming (92)
        • Python (18)
        • Computer Vision (12)
        • LLM (4)
        • AI & ML (18)
        • Database (3)
        • Distributed Computing (6)
        • Apache Airflow (6)
        • Docker & Kubernetes (14)
        • ์ฝ”๋”ฉ ํ…Œ์ŠคํŠธ (4)
        • C++ (1)
        • etc. (6)
      • ๐Ÿ’ฌ ETC (3)
        • ์ฑ… ๋ฆฌ๋ทฐ (3)
  • ๋งํฌ

  • ์ธ๊ธฐ ๊ธ€

  • ํƒœ๊ทธ

    ๋”ฅ๋Ÿฌ๋‹
    generative ai
    Image generation
    ํŒŒ์ด์ฌ
    pytorch
    ml
    AI
    3D Vision
    ์ปดํ“จํ„ฐ๋น„์ „
    LLM
    ๊ฐ์ฒด ๊ฒ€์ถœ
    airflow
    pandas
    OpenCV
    material recognition
    pyspark
    multi-modal
    Text recognition
    T2i
    Computer Vision
    segmentation
    OpenAI
    object detection
    OCR
    nlp
    ๋„์ปค
    deep learning
    Python
    ๊ฐ์ฒด๊ฒ€์ถœ
    diffusion
  • ์ตœ๊ทผ ๋Œ“๊ธ€

  • ์ตœ๊ทผ ๊ธ€

  • hELLOยท Designed By์ •์ƒ์šฐ.v4.10.3
๋ญ…์ฆค
[pandas] ํŠน์ • ์ปฌ๋Ÿผ์˜ ๊ฐ’์ด ๊ณต๋ฐฑ์ธ ํ–‰์„ ์ œ์™ธํ•˜๋Š” ๋ฐฉ๋ฒ• | dropna
์ƒ๋‹จ์œผ๋กœ

ํ‹ฐ์Šคํ† ๋ฆฌํˆด๋ฐ”